Abstract
We study parabolic quasi-variational inequalities (QVIs) of obstacle type. Under appropriate assumptions on the obstacle mapping, we prove the existence of solutions of such QVIs by two methods: one by time discretisation through elliptic QVIs and the second by iteration through parabolic variational inequalities. Using these results, we show the directional differentiability (in a certain sense) of the solution map which takes the source term of a parabolic QVI into the set of solutions, and we relate this result to the contingent derivative of the aforementioned map. We finish with an example where the obstacle mapping is given by the inverse of a parabolic differential operator.
1 Introduction
Quasi-variational inequalities (QVIs) are versatile models that are used to describe many different phenomena from fields as varied as physics, biology, finance and economics. QVIs were first formulated by Bensoussan and Lions [8, 32] in the context of stochastic impulse control problems and since then have appeared in many models where nonsmoothness and nonconvexity are present, including superconductivity [4, 7, 29, 37, 40], the formation and growth of sandpiles [5, 7, 36, 38, 39] and networks of lakes and rivers [6, 36, 38], generalised Nash equilibrium problems [17, 24, 34], and more recently in thermoforming [2].
In general, QVIs are more complicated than variational inequalities (VIs) because solutions are sought in a constraint set which depends on the solution itself. This is an additional source of nonlinearity and nonsmoothness and creates considerable issues in the analysis and development of solution algorithms to QVI and the associated optimal control problems.
We focus in this work on parabolic QVIs with constraint sets of obstacle type and we address the issues of existence of solutions and directional differentiability for the solution map taking the source term of the QVI into the set of solutions. This extends to the parabolic setting our previous work [2] where we provided a differentiability result for solution mappings associated to elliptic QVIs. Literature for evolutionary QVIs is scarce in the infinite-dimensional setting: among the few works available, we refer to [25] for a study of parabolic QVIs with gradient constraints, [26] for existence and numerical results in the hyperbolic setting, [18] for QVIs arising in hydraulics, [28] for state-dependent sweeping processes, and evolutionary superconductivity models in [40], as well as the work [19]. Differentiability analysis for parabolic (non-quasi) VIs was studied in [27] and [15].
Let us now enter into the specifics of our setting. Let \(V \subset H \subset V^*\) be a Gelfand triple of separable Hilbert spaces with a compact embedding. Furthermore, we assume that there exists an ordering to elements of H via a closed convex cone \(H_+\) satisfying \(H_+ = \{h \in H : (h,g) \ge 0 \quad \forall g \in H_+\}\); the ordering then is \(\psi _1 \le \psi _2\) if and only if \(\psi _2-\psi _1 \in H_+\). An example to have in mind is \(H=L^2(\varOmega )\) with \(H_+\) the set of almost everywhere non-negative functions in H. This also induces an ordering for V and \(V^*\) as well as for the associated Bochner spaces \(L^2(0,T;H)\), \(L^2(0,T;V)\) and so on. We define \(V_+ := \{ v \in V : v \ge 0\}\). We write \(v^+\) for the orthogonal projection of \(v \in H\) onto the space \(H_+\) and we have the decomposition \(v = v^+ - v^-\). Suppose that \(v \in V\) implies that \(v^+ \in V\) and that there exists a constant \(C >0\) such that for all \(v \in V\),
An example of such a space V is \(V=W^{1,p}(\varOmega )\) for \(1 \le p \le \infty \); we refer to [1] for a definition of the Sobolev space \(W^{1,p}(\varOmega )\) over a domain \(\varOmega \).
Let \(A:V \rightarrow V^*\) be a linear, symmetric, bounded and coercive operator which is T-monotone, which means that
and let \(\varPhi :L^2(0,T;H) \rightarrow L^2(0,T;V)\) be a mapping which is increasing, i.e.,
Further assumptions will be introduced later as and when required. We consider parabolic QVIs of the following form.
QVI Problem: Given \(f \in L^2(0,T;H)\) and \(z_0 \in V\), find \(z \in L^2(0,T;V)\) with \(z' \in L^2(0,T;V^*)\) such that for all \(v \in L^2(0,T;V)\) with \(v(t) \le \varPhi (z)(t)\),
We write the solution mapping taking the source term into the (weak or strong) solution as \(\mathbf {P}_{z_0}\) so that (1) reads \(z \in \mathbf {P}_{z_0}(f)\) (sometimes we omit the subscript). In this paper, we contribute three main results associated to this QVI:
-
Existence of solutions to (1) via time-discretisation (Theorem 9): we show that solutions to (1) can be formulated as the limit of a sequence constructed from considering time-discretised elliptic QVI problems. This result makes use of the theory of sub- and supersolutions and the Tartar–Birkhoff fixed point method.
-
Approximation of solution to (1) by solutions of parabolic VIs (Theorem 16): we define an iterative sequence of solutions of parabolic VIs and show that the sequence converges in a monotone fashion to a solution of the parabolic QVI; this is another QVI existence result. The existence for the aforementioned parabolic VIs comes from either Theorem 9 or from a certain result of Brezis (which will be given in the relevant section), giving rise to two different sets of assumptions under which the theorem holds.
-
Directional differentiability for the source-to-solution mapping \(\mathbf {P}\) (Theorem 40): we prove that the map \(\mathbf {P}\) is directionally differentiable in a certain sense using Theorem 16 and some technical lemmas related to the expansion formulae for parabolic VI solution mappings.
Thus we give two existence results and a differential sensitivity result. It should be noted that the differentiability result essentially gives a characterisation of the contingent derivative (a concept frequently used in set-valued analysis) of \(\mathbf {P}\) (between appropriate spaces) in terms of a parabolic QVI; see Proposition 41 for details.
1.1 Notations and layout of paper
We shall usually write the duality pairing between V and \(V^*\) as \(\langle \cdot , \cdot \rangle \) rather than \(\langle \cdot , \cdot , \rangle _{V^*,V}\) for ease of reading. We use the notations \(\hookrightarrow \) and to represent continuous and compact embeddings respectively. Let us define the Sobolev–Bochner spaces
and defining the linear parabolic operator \({L}:W(0,T) \rightarrow L^2(0,T;V^*)\) by \({L}v := v'+ Av\), we also define the following space
Note the relationships \(W_s(0,T) \hookrightarrow W(0,T)\) and \(W_{\mathrm{r}}(0,T) \subset W(0,T)\).
If \(z \in W(0,T)\) satisfies (1), we say that it is a weak solution or simply a solution. A weaker notion of solution is given by transferring the time regularity of the solution onto the test function and requiring only \(z \in L^2(0,T;V) \cap L^\infty (0,T;H)\) to satisfy
and we call z a very weak solution. Note that the initial data also has been transferred onto the test function (indeed, the weak solution is not sufficiently regular to have a prescribed initial data).
The paper is structured as follows. In Sect. 2 we consider the existence of solutions to (1) via the method of time discretisation: we characterise a solution of the parabolic QVI as the limit of solutions of elliptic QVIs. In Sect. 3, we approximate solutions of the QVI by a sequence of solutions of parabolic VIs that are defined iteratively. In Sect. 4 we consider parabolic VIs and directional differentiability with respect to perturbations in the obstacle and we make use of this in Sect. 5 to prove that the source-to-solution map \(\mathbf {P}\) is directionally differentiable in a particular sense. We highlight some possible alternative approaches in Sect. 6 and finish with an example in Sect. 7.
2 Existence for parabolic QVIs through time discretisation
We prove existence to (1) by the method of time discretisation via elliptic QVIs of obstacle type. This is in contrast to [26] where the discretisation for evolutionary QVI was done in such a way as to yield a sequence of elliptic VIs, and as far as we are aware, our approach is novel in these type of problems.
We make the following basic assumption.
Assumption 1
Let \(\varPhi (0) \ge 0\) and
Let \(N \in \mathbb {N}\), \(h^N:=T/N\) and for \(n=0, 1, ..., N\), \(t_n^N := nh^N\). This divides the interval [0, T] into N subintervals of length \(h^N\); we will usually write h for \(h^N\). We approximate the source term by
and we consider the following elliptic QVI problem.
Discretised problem: Given \(z_0^N:=z_0\), find \(z_n^N \in V\) such that
This problem is sensible since by (3), for fixed t, the mapping \(v \mapsto \varPhi (v)(t)\) is well defined from H into V, since we may consider \(H \subset L^2(0,T;H)\) (elements of H can be thought of constant-in-time elements of \(L^2(0,T;H)\)), and \(\varPhi (v)\) can be evaluated pointwise in time. We consider first the existence of solutions to (4).
2.1 Existence and uniform estimates for the elliptic approximations
The inequality (4) can be rewritten as
We write the solution of (4) or (5) as \(\mathbf {Q}_{t_{n-1}^N}(hf_n^N + z_{n-1}^N) \ni z_n^N\). Related to (5) is the following VI:
denote by \(E_{t,h}(g,\psi )=v\) the solution of this problem. For fixed t and h and sufficiently smooth data, this mapping is well defined since the obstacle \(\varPhi (\psi )(t) \in V\). To ease notation, we write \(E_{t_n^N}(g,\psi )\) instead of \(E_{t_n^N, h^N}(g,\psi )\) (where \(h^N\) is the step size) because specifying \(h^N\) is redundant.
Let us make an observation which follows from the theory of Birkhoff–Tartar [12, 42]. Suppose there exists a subsolution \(z_{\mathrm{sub}}\) and a supersolution \(z^{sup}\) to (4), i.e., \(z_{\mathrm{sub}} \le E_{t_{n-1}^N}(hf_n^N+z_{n-1}^N, z_{\mathrm{sub}})\) and \(z^{sup} \ge E_{t_{n-1}^N}(hf_n^N+z_{n-1}^N, z^{sup})\). Then the QVI problem (4) has a solution \(z_n^N = E_{t_{n-1}^N}(hf_n^N+z_{n-1}^N, z_n^N) \in [z_{\mathrm{sub}},z^{sup}]\). The next lemma applies this idea.
Remark 1
In fact, the theory of Birkhoff and Tartar tells us not only that there exist in general multiple solutions in \([z_{\mathrm{sub}},z^{sup}]\) but also that there exist minimal and maximal solutions (a minimal solution \(z_{m}\) is a solution that satisfies \(z_m \le z\) for every solution z; maximal solutions are defined with the opposite inequality).
First, let us set \(A_h w := w + hAw\) (this \(A_h\) is the elliptic operator appearing in (5)) and define \(\bar{z}_{n,N}\) as the solution of the following elliptic PDE:
Lemma 2
Suppose that
Then the approximate problem (4) has a non-negative solution \(z_n^N \in [z_{n-1}^N,\bar{z}_{n,N}]\). Thus the sequence \(\{z_n^N\}_{n \in \mathbb {N}}\) is increasing.
Proof
We first show that \(z_0^N=z_0\) is a subsolution for the QVI for \(z_1^N\). Indeed, let \(w := E_{t_0^N}(hf_1^N + z_0^N, z_0)\) which satisfies
The choice \(v = w + (z_0-w)^+\) is a feasible test function due to the upper bound on the initial data from assumption (D2). This leads to
again by assumption (D2). This shows that \(z_0 \le E_{t_0^N}(hf_1^N+z_0^N, z_0)\) is indeed a subsolution.
The function \(\bar{z}_{1,N}\) defined through \(A_h \bar{z}_{1,N} = hf_1^N + z_0\) is a supersolution since \(\bar{z}_{1,N}=E_{t_{n-1}^N}(hf_1^N + z_0, \infty ) \ge E_{t_{n-1}^N}(hf_1^N + z_0, \bar{z}_{1,N})\) (thanks to the fact that \(\varPhi \) is increasing). Then we apply the theorem of Birkhoff–Tartar to obtain existence of \(z_1^N \in V\) with \(z_1^N \in [z_0^N,\bar{z}_{1,N}]\).
Suppose that \(z_n^N \in [z_{n-1}^N, \bar{z}_n^N] \cap \mathbf {Q}_{t_{n-1}^N}(f_n^N + z_{n-1}^N\)). Observe that
with the final inequality because the obstacle associated to \(t_{n}^N\) is greater than or equal to the obstacle associated to \(t_{n-1}^N\) by assumption (6). We also have
that is, \(z_n^N\) and \(\bar{z}_{n+1}^N\) are sub- and super-solutions respectively for \(\mathbf {Q}_{t_{n}^N}(f_{n+1}^N + z_n^N)\) (and the supersolution is greater than the subsolution). Therefore, by Birkhoff–Tartar there exists a \(z_{n+1}^N \in \mathbf {Q}_{t_{n}^N}(f_{n+1}^N + z_n^N)\) in the interval \([z_n^N, \bar{z}_{n+1}^N]\). \(\square \)
It appears that we may select any solution \(z^N_n\) as given by the Birkhoff–Tartar theorem in the previous lemma, regardless of how we choose \(z^N_{n-1}\). For example, we may choose \(z^N_{n-1}\) to be the minimal solution on its corresponding interval (with endpoints given by the sub- and supersolution) and \(z^N_n\) to be the maximal solution on its corresponding interval, with no effect on the resulting analysis (though of course, different choices may lead to different solutions of the original parabolic QVI in question in the end).
We now obtain some bounds on the sequence \(\{z_n^N\}\). For this, we use the fact that if \(f \in L^2(0,T;H)\), then
Lemma 3
Under the assumptions of Lemma 2, the following uniform bounds hold:
(The final bound needs symmetry of A and the increasing property of the sequence \(\{z^N_n\}_{n \in \mathbb {N}}\)).
Proof
In this proof, we omit the superscript N in various quantities for clarity.
We follow the argumentation of [20, Sect. 6.3.3]. Testing the QVI (5) with \(v=0\) (which is valid since \(0 \le \varPhi (0)(t_{n-1}) \le \varPhi (z_n)(t_{n-1})\) by assumption on the non-negativity at zero and the second inequality by the increasing property of \(\varPhi \) and the fact that \(z_n \ge 0\)) gives
and using the relation
we find
with the last line due to Young’s inequality. This leads to
whence, summing up between \(n=1\) and \(n=m\) for some \(m \in \mathbb {N}\), and using (7),
This leads to the first two bounds stated in the lemma. For the final bound, testing (4) with \(z_{n-1}\), which is valid since by Lemma 2, \(z_{n-1} \le z_n \le \varPhi (z_n)(t_{n-1})\), we find
which leads to
and here we use
to get
Neglecting the second term and summing this up from \(n=1\) to \(n=m\) and using (7), we obtain
as desired. \(\square \)
2.2 Interpolants
We define the piecewise constant interpolants
where \(\chi _A\) represents the characteristic function on the set A. In order to ease the presentation, we often use the notation \(T_n^{N}:= [t_n^N, t_{n+1}^N).\)
Corollary 4
Under the assumptions of Lemma 3, \(\{z^N\}\) and \(\{z^N_-\}\) are bounded uniformly in \(L^2(0,T;V) \cap L^\infty (0,T;H)\).
Proof
Since the \(T_n^N\) are disjoint, we see that
by (8), and
by (9). A similar argument leads to the bounds on \(z^N_-\). \(\square \)
To be able to handle the time derivative, it is useful to construct the interpolant
known as Rothe’s function, which also has the time derivative
Corollary 5
Under the assumptions of Lemma 3, \(\{\hat{z}^N\}\) is bounded uniformly in \(W_s(0,T)\).
Proof
We see that
with the last two inequalities by (9). Regarding the time derivative, using (10), we find
\(\square \)
2.3 Passing to the limit in the interpolants
By the previous subsection, we have the existence of \(z, \hat{z}\) such that, for a relabelled subsequence, the following convergences hold:
Lemma 6
We have \(\hat{z} \equiv z\). Furthermore,
Proof
Observe that
by (10). Thus as \(N \rightarrow \infty \), \(z^N_- - z^N \rightarrow 0\) in \(L^2(0,T;H)\). Since \(z^N \rightharpoonup z\) in \(L^2(0,T;V)\), \(z^N \rightarrow z\) in \(L^2(0,T;H)\) and we obtain \(z^N_- \rightarrow z\) in \(L^2(0,T;H)\) and weakly in \(L^2(0,T;V)\).
Now consider
with the final line by (10). This shows that \(z^N_- - \hat{z}^N \rightarrow 0\) in \(L^2(0,T;H)\), allowing us to identify \(\hat{z} = z\) as desired. \(\square \)
The convergence results above obviously imply that \(\hat{z}^N \rightarrow z\) in \(C^0([0,T];H)\) (due to Aubin–Lions), so that \(z_0 = \hat{z}^N(0) \rightarrow z(0)\), i.e., z has the right initial data. Let us now see that z is feasible.
Lemma 7
(Feasibility of the limit) Let the following conditions hold:
Then \(z(t) \le \varPhi (z)(t)\) for a.e. t.
Proof
Since by (6), for \(t \in T_{n-1}^N\), \(z^N_n \le \varPhi (z^N_n)(t^N_{n-1}) \le \varPhi (z^N_n)(t)\), we have
Using the assumption (12) applied to the right-hand side above, we have
Passing to the limit here using (13) gives the result. \(\square \)
Regarding the assumptions of the previous lemma, (13) is a mild continuity requirement on \(\varPhi \) whereas for (12), consider the superposition case \(\varPhi (v)(t) := \hat{\varPhi }(t,v(t))\). Then if \(\{v_n\}_{n \in \mathbb {N}} \subset V\), we have
and now supposing \(t \in T_{j-1}^N\) for some j, this becomes
and since j is arbitrary, this holds for all t. Hence (12) holds with equality.
In order to pass to the limit in the inequality satisfied by the interpolant \(z^N\), we have to be able to approximate test functions in the limiting constraint set. This is possible as the next lemma shows.
Lemma 8
(Recovery sequence) Assume the condition
Then for every \(v \in L^2(0,T;V)\) with \(v(t) \le \varPhi (z)(t)\), there exists a \(v^N \in L^2(0,T;V)\) such that
Proof
Let \(v \in L^2(0,T;V)\) with \(v(t) \le \varPhi (z)(t)\) and define
which satisfies \(v_n^N(t) \le \varPhi (z_n^N)(t_{n-1}^N)\) and set
Take \(\epsilon > 0\). We have
by assumption (14) as long as \(N \ge N_0\). This shows that \(v^N \rightarrow v\). \(\square \)
Let us consider two cases under which the assumption (14) of the previous lemma holds.
1. Superposition case. In case where \(\varPhi (v)(t) := \hat{\varPhi }(v(t))\), (14) translates to a complete continuity assumption. Indeed, the sum term in (14) is
so that (14) simply asks for \(\varPhi (w_N) \rightarrow \varPhi (w)\) in \(L^2(0,T;V)\) whenever \(w_N \rightharpoonup w\) in \(L^2(0,T;V)\) and weakly-* in \(L^\infty (0,T;H)\).
2. VI case. When \(\varPhi (v)(t) \equiv \psi (t)\) for some obstacle \(\psi \) and if \(\psi \in C^0([0,T];V)\) then for every \(\epsilon > 0\), there exists \(\delta > 0\) such that \(|t-s| \le \delta \) implies \(\left\Vert \psi (t)-\psi (s)\right\Vert _{V} \le \sqrt{\epsilon /T}.\) When \(t \in T_{n-1}^N\), we have \(|t^N_{n-1} - t| \le |t^N_{n-1}-t^N_n| \le h^N \rightarrow 0\) as \(N \rightarrow \infty \). So for sufficiently large N, say \(N \ge N_0\), we have \(|t^N_{n-1}-t| \le \delta \) and thus
and so (14) also holds.
Theorem 9
Let Assumption 1, (D1), (D2), (6), (12), (13) and (14) hold. Then there exists a non-negative solution \(z \in W_s(0,T)\) to (1) which is the limit of the interpolants \(\{z^N\}, \{\hat{z}^N\}.\) Furthermore, the map \(t \mapsto z(t)\) is increasing.
Proof
Let \(v \in L^2(0,T;V)\) satisfy \(v(t) \le \varPhi (z)(t)\) and let us take \(v^N\) as defined in the proof of Lemma 8. Then by (4),
Writing the duality product involving the time derivative as an inner product, we have, using the convergences in (11) and the weak lower semicontinuity of the bilinear form generated by A,
so that \(z \in \mathbf {P}(f) \cap W_s(0,T)\). Since \(\{z_n^N\}\) are non-negative, it follows that z is too.
By (11), it follows that \(z^{N_j}(t) \rightarrow z(t)\) in H for almost every \(t \in [0,T]\). Let now \(s \le r\) and suppose that \(s \in T_{m-1}^{N_j}\) and \(r \in T_{n-1}^{N_j}\) with \(m \le n\). Then we have \(z^{N_j}(s) = z^{N_j}_m \le z^{N_j}_n = z^{N_j}(t)\) since the sequence \(\{z^{N_j}_i\}_{i \in \mathbb {N}}\) is increasing. Passing to the limit, we find for almost every r and s with \(s \le r\) that \(z(s) \le z(r)\), i.e., \(t \mapsto z(t)\) is increasing. \(\square \)
3 Parabolic VI iterations of parabolic QVIs
In this section, we will show the existence of sequences of solutions to VIs that converge to solutions of QVIs. We begin with collecting some facts regarding parabolic VIs.
Consider the parabolic VI
We write the solution as \(z:=\sigma _{z_0}(f, \psi )\) when it exists. Given \(f \in L^2(0,T;H)\) and \(\psi \in V\) independent of time, the solution \(z \in W_s(0,T)\) exists uniquely, see e.g. [3, 9, 11].
The problem (15) can be transformed to a parabolic VI with zero initial data with right-hand side \(f-Az_0\) and obstacle \(\psi -z_0\) with the substitution \(u(t)=z(t)-z_0\):
We often write simply \(\sigma \) rather than \(\sigma _{z_0}\) when we do not need to emphasise the initial data. The next lemma shows that \(\sigma \) is increasing in its arguments.
Lemma 10
(I. Comparison principle for parabolic VIs) Suppose for \(i=1, 2\) that \(z_i \in W(0,T)\) is a solution of (15) with data \(f_i \in L^2(0,T;V^*)\) and obstacle \(\psi _i\) such that \(f_1 \le f_2\) and \(\psi _1 \le \psi _2\). Then \(z_1 \le z_2\).
Proof
The \(z_i\) satisfy the inequalities
for all \(v_1, v_2 \in L^2(0,T;V)\) such that \(v_1(t) \le \psi _1(t)\) and \(v_2(t) \le \psi _2(t)\). Let us take here \(v_1 = z_1 - (z_1-z_2)^+\), which is clearly a feasible test function, and take \(v_2= z_2 + (z_1-z_2)^+\) which is also feasible since
This gives us
Adding yields
whence using T-monotonicity and coercivity, we obtain \(z_1 \le z_2\). \(\square \)
We start by giving some existence results for (15) now in the general case when \(\psi \) is not necessarily independent of time. The first one is a result of applying Theorem 9 of Sect. 2 using the obstacle mapping \(\varPhi (v)(t) \equiv \psi (t)\) (it can be seen that all of the assumptions of the theorem are satisfied, refer also to the remarks below the proof of Lemma 8).
Proposition 11
(Existence via time discretisation) Let \(\psi \in C^0([0,T];V)\) be non-negative with \(t \mapsto \psi (t)\) increasing and let (D1) and
hold. Then (15) has a unique non-negative solution \(z=\sigma (f,\psi ) \in W_s(0,T)\) which is increasing in time.
The next proposition applies a result due to Brezis–Stampacchia (see [30, Sect. 2.9.6.1, p. 286]) to obtain existence of a very weak solution and then a further argument is required to obtain additional regularity.
Proposition 12
(Existence II) Let \(\psi \in W_{\mathrm{r}}(0,T)\) be such that \(t \mapsto \psi (t)\) is increasing with \(z_0 \le \psi (0)\). Then (15) has a unique solution \(z=\sigma (f,\psi ) \in W_{\mathrm{r}}(0,T)\).
Proof
First observe that since \(z_0 \le \psi (0)\) and \(t \mapsto \psi (t)\) is increasing, Theorem 7.1 of [41, Sect. III] gives the existence of a weak solution \(z \in L^2(0,T;V) \cap L^\infty (0,T;H)\) (see the introduction for the definition):
Indeed, if for simplicity we take \(z_0 \equiv 0\), the domain of L is \(D(L):=\{ v \in H^1(0,T;V^*) : v(0) = 0\}\) and the condition (7.5) of [41, Sect. III.7] follows since \(\psi \) is increasing in time (see also [41, p. 150]) and condition (7.7) of [41, Sect. III.7] holds for the function \(v_0 := \psi \). Hence the aforementioned theorem is applicable.
Now, given \(v \in L^2(0,T;V)\) with \(v(t) \le \psi (t)\), consider the PDE
which, by standard parabolic theory, has a strong solution \(v_\epsilon \in W_s(0,T)\) thanks to the regularity on \(\psi \). A rearrangement and adding and subtracting the same term leads to
Testing the equation above with \((v_\epsilon -\psi )^+\) and using the non-negativity of the right-hand side,
we find
which implies that \(v_\epsilon (t) \le \psi (t)\). Then [41, Sect. III, Proposition 7.2] implies that the solution is actually strong, i.e., (15) holds and it belongs to W(0, T) with the additional regularity \({L}z \in L^2(0,T;H)\), i.e., \(z \in W_{\mathrm{r}}(0,T)\). \(\square \)
Some related regularity results given sufficient smoothness for f can be found in e.g. [10, Theorem 2.1].
3.1 Parabolic VIs with obstacle mapping
Take \(\varPhi \) as in the introduction and fix \(\psi \in L^2(0,T;H)\). Define the map
that is, \(z=S_{z_0}(f,\psi )\) solves
We will often omit the subscript \(z_0\) in \(S_{z_0}(f,\psi )\) and simply write \(S(f,\psi )\). Let us translate the content of Propositions 11 and 12 to this setting.
Proposition 13
(Existence for (16)) Let
and either

or
Then (16) has a solution \(z=S(f, \psi )\in W(0,T)\) with the regularity that, in the first case, \(z \in W_s(0,T)\) is non-negative and \(t \mapsto z(t)\) is increasing, whereas in the second case, \(z \in W_{\mathrm{r}}(0,T).\)
Proof
The first set of assumptions imply that the hypotheses of Proposition 11 hold for the obstacle \(\varPhi (\psi )\), whilst under the second set of assumptions, we apply Proposition 12. \(\square \)
Remark 2
In this section, it suffices for \(\varPhi \) to be defined on \(L^2(0,T;V)\) rather than \(L^2(0,T;H)\) since that assumption was necessarly only to apply the Birkhoff–Tartar theory in the previous section.
The next lemma states that the solution mapping \(S(f,\psi )=z\) is increasing with respect to the arguments. This follows simply by using Lemma 10 and the fact that \(\varPhi \) is increasing.
Lemma 14
(II. Comparison principle for parabolic VIs) Suppose for \(i=1, 2\) that \(z_i \in W(0,T)\) is a solution of (16) with data \(f_i \in L^2(0,T;V^*)\) and obstacle \(\psi _i\) such that \(f_1 \le f_2\) and \(\psi _1 \le \psi _2\). Then \(z_1 \le z_2\).
3.2 Iteration scheme to approximate a solution of the parabolic QVI
We say that a function \(z_{\mathrm{sub}} \in L^2(0,T;H)\) is a subsolution for (1) if \(z_{\mathrm{sub}} \le S(f, z_{\mathrm{sub}})\), and a supersolution is defined with the opposite inequality.
Remark 3
Let \(\varPhi (0) \ge 0\). If \(f \ge 0\), the function 0 is a subsolution, and the function \(\bar{z}\) defined by
is a supersolution to (1). Both claims follow by the comparison principle: the first claim is clear and the second follows upon realising that \(\bar{z} = S(f, \infty ) \ge S(f,\bar{z})\). We need the sign condition on f for \(z_{\mathrm{sub}}=0 \le z^{sup}=\bar{z}\).
Lemma 15
If \(d \ge 0\), any subsolution for \(\mathbf {P}(f)\) is a subsolution for \(\mathbf {P}(f+sd)\) where \(s \ge 0.\)
Proof
This is obvious: if w is a subsolution for \(\mathbf {P}(f)\), then \(w \le S(f,w) \le S(f+sd, w)\). \(\square \)
The previous lemma tells us in particular that any element of \(\mathbf {P}(f)\) is a subsolution for \(\mathbf {P}(f+sd)\). The next theorem, which shows that a solution of the QVI can be approximated by solutions of VIs defined iteratively with respect to the obstacle, is based on an iteration idea of Bensoussan and Lions in [11, Chapter 5.1] (there, the authors consider \(\varPhi \) to be of impulse control type). The theorem and its sister result Theorem 17 show in particular that the approximating sequences converge to extremal (the smallest or largest) solutions of the QVI in certain intervals.
Theorem 16
(Increasing approximation of the minimal solution of QVI by solutions of VIs) Let \(z_{\mathrm{sub}} \in L^2(0,T;V)\) be a subsolution for \(\mathbf {P}(f)\) such that \(\varPhi (z_{sub }) \ge 0\) and
Let either

or

hold. Then the sequence \(\{z^n\}_{n \in \mathbb {N}}\) denoted by
is well defined, monotonically increasing and satisfies
Furthermore,
-
in the first case, \(z^n \in W_s(0,T)\) with \(z^n \ge 0\) , \(\partial _t z^n \rightharpoonup \partial _t z\) in \(L^2(0,T;H)\) and \(z \in W_s(0,T)\) is a strong solution (i.e. it satisfies (1) with additional regularity on the time derivative), and both \(z^n\) and z are increasing in time
-
or, in the second case, \(z^n \in W_{\mathrm{r}}(0,T)\).
Before we proceed, let us observe that
-
1.
since \(\varPhi \) is increasing, (O1a) is equivalent to \(\psi \ge 0 \implies \varPhi (\psi ) \ge 0\)
-
2.
(O3b) implicitly implies that \(\varPhi (w) \in W(0,T)\).
Proof
The proof is split into five steps.
1. Monotonicity of \(\{z^n\}\). The \(z^n\) (if they exist) satisfy for all \(v \in L^2(0,T;V)\) with \(v(t) \le \varPhi (z^{n-1})(t)\) the inequality
Since \(z^0\) is a subsolution, \(z^1 = S(f,z^0) \ge z^0\). Suppose that \(z^j \ge z^{j-1}\) for some j; then \(z^{j+1} = S(f,z^j) \ge S(f,z^{j-1}) = z^j\) (the inequality due to Lemma 14). This shows that \(z^n\) is a monotonically increasing sequence.
2. Existence of \(\{z^n\}\). For the actual existence, we apply Proposition 13 as we see now.
First case. In case of (D1), (22), (19), (24), (23) and since \(\varPhi (z_{\mathrm{sub}}) \ge 0\), Proposition 13 tells us that \(z^1=S(f,z_{\mathrm{sub}}) \in W_s(0,T)\). We find \(z^1 \ge S(0,0) = 0\) by Lemma 14, and hence by (O1a), \(\varPhi (z^1) \ge 0\).
Let us also see why \(z_0 \le \varPhi (z^1)(0)\). The monotonicity above implies that \(\varPhi (z_{sub}) \le \varPhi (z^1)\). As \(z^1 \in W_s(0,T)\), (26) implies that \(\varPhi (z^1) \in C^0([0,T];V)\), which along with (24) implies that we can take the trace of the previous inequality at time 0, giving \(z_0 \le \varPhi (z_{sub})(0) \le \varPhi (z^1)(0)\) where the first inequality is with the aid of (18). Making use of (26), the increasing property and (25) (which tells us that \(t \mapsto \varPhi (z^1)(t)\) is increasing, since \(t \mapsto z^1(t)\) is), Proposition 13 is again applicable and we use it to obtain \(z^2\).
By bootstrapping this argument we get that \(z^n \in W_s(0,T)\) is well defined. Furthermore we have \(z^n \ge 0\) by the sign condition on the data.
Second case. In case of (27) and (28), (18), (17) and (21) and (18) hold for the obstacle \(z_{\mathrm{sub}}\) and we get \(z^1 = S(f, z_{sub}) \in W_{\mathrm{r}}(0,T)\). Applying (O2b) to this, \(\varPhi (z^1) \in W_{\mathrm{r}}(0,T)\). Suppose that the first part of (29) holds. Then since \(z^1(0) = z_0\), we find that \(z_0 \le \varPhi (z^1)(0)\) and then again (D2) is satisfied for the obstacle \(\varPhi (z^1)\), giving existence of \(z^2=S(f,z^1)\). Repeating this, we get \(z^n \in W_{\mathrm{r}}(0,T)\). If instead the second part of (29) holds, we get by monotonicity that \(z^0 \le z^1\) and using the increasing property of \(\varPhi \), \(z_0 \le \varPhi (z^0)(0) \le \varPhi (z^1)(0)\) (the first inequality by (22)) and again we can apply the existence and proceed in this manner for general n.
3. Uniform bounds on \(\{z^n\}\). By (O1a) and the fact that \(z^n \ge 0\), or by (O1b), we find that 0 is a valid test function in (30) and testing with it yields
which immediately leads to bounds in \(L^\infty (0,T;H)\) and \(L^2(0,T;V)\) giving the weak convergences stated in the theorem to some z, for the full sequence (and not a subsequence) thanks to the monotonicity property.
3.1. Uniform bounds on \(\partial _t z^n\) under first set of assumptions. In this case, we can obtain a bound on the time derivative. Indeed, due to the work on the time discretisations in Sect. 2, from (11) and Lemma 6 we know that for each j, the interpolant \((z^j)^N\) of the time-discretised solutions is such that \((z^j)^N \rightharpoonup z^{j}\) in \(L^2(0,T;V)\) and \(\partial _t (\hat{z}^j)^N \rightharpoonup \partial _t z^{j}\) in \(L^2(0,T;H)\). One should bear in mind that the \(\{(z^j)^N\}_N\) are interpolants constructed from solutions of elliptic VIs and not QVIs since the \(\{z^j\}_{j \in \mathbb {N}}\) are solutions of VIs. One observes the bound
where the constant C ultimately arises from (10). Evidently, it only depends on the initial data and the source term. Hence, in this case,
4. Passage to the limit. Either of the conditions (O2a) and (O3b) allow us to pass to the limit in \(z^n(t) \le \varPhi (z^{n-1})(t)\) to deduce the feasibility of z since order is preserved in norm convergence. Now let \(v^* \in L^2(0,T;V)\) be a test function such that \(v^* \le \varPhi (z)\). We use
as the test function in the VI (30).
4.1. Under first set of assumptions. In this case, using the strong convergence \(v^n \rightarrow v^*\) assured by (O2a), we can use (31) and pass to the limit after writing the duality pairing for the time derivative as an inner product to get the inequality
4.2. Under second set of assumptions. In this case, we take the limiting test function \(v^* \in W(0,T)\) with \(v^*(0)=z_0\) and rewrite (30), using the monotonicity of the time derivative and assumption (O3b) (which guarantees that \(\varPhi (z) \in W(0,T)\), and hence \(v^n \in W(0,T)\)) as
By (O3b), we find that \(v^n \rightarrow v^*\) in W(0, T), and hence we can pass to the limit in the above to obtain (2).
5. Minimality of the solution. Suppose that \(z^* \in \mathbf {P}(f)\) is the minimal solution on the interval \([z_{\mathrm{sub}}, \infty )\), which in particular implies \(z^* \le z.\) We see by the comparison principle and since \(z^0=z_{\mathrm{sub}}\) is a subsolution that
By this, we find \(z^1=S(f,z^0) \le S(f,z^*) = z^*\). Similarly, \(z^2 = S(f,z^1) \le S(f,z^*) = z^*\), and thus
Passing to the limit shows that \(z \le z^*\) so that \(z=z^*\). \(\square \)
Remark 4
Note that the compactness assumptions (O2a) and (O3b) are only required for identifying the limit point z and showing that it is feasible.
Theorem 17
(Decreasing approximation of the maximal solution of QVI by solutions of VIs) Let \(z^0 := z^{\sup }\) be a supersolution of \(\mathbf {P}(f)\) and assume that
Under the assumptions of the previous theorem (except (29)) except with \(z_{\mathrm{sub}}\) replaced with \(z^{sup}\) and (27) replaced with
the sequence \(\{z^n\}\) is monotonically decreasing and converges to a solution \(z \in \mathbf {P}(f)\) with the same regularity and convergence results as stated in Theorem 16. Furthermore, z is the maximal solution of (1) or (2) in the interval \((-\infty ,z^{sup}]\).
Proof
It follows that \(z \in \mathbf {P}(f) \cap (-\infty , z^{sup}]\) by the same argumentation as in the proof of the previous theorem. Let us prove the claim of the maximality of the solution. Suppose that there exists a maximal solution \(z^* \in (-\infty , z^{sup}]\) so that \(z^* \ge z\) where \(z = \lim _n z^n \) with \(z^0 = z^{sup}.\) We have \(z^0=z^{sup} \ge S(f,z^{sup}) \ge S(f,z^*) = z^*\), and thus \(z^1 = S(f,z^0) \ge S(f,z^*) = z^*.\) Iterating shows that
whence passing to the limit, \(z \ge z^*\), and thus \(z = z^*\) is the maximal solution. \(\square \)
3.3 Transformation of VIs with obstacle to zero obstacle VIs
It will become useful to relate solutions of the parabolic VI (16) with non-trivial obstacle to solutions of parabolic VIs with zero (lower) obstacle. We achieve this as follows. Take \(w_0 \ge 0\) and define \(\bar{S}_{w_0}:L^2(0,T;H) \rightarrow W_s(0,T)\) by \(\bar{S}_{w_0}(g):=w\) the solution to the parabolic VI with lower obstacle
Omitting when convenient the subscript, we obtain the following estimate for \(w_i = \bar{S}(g_i)\):
which then leads to
and (due to Young’s inequality applied to the right-hand side)
hence also
Let us note that (32) in particular implies
The relationship between solutions of VIs with non-trivial obstacles and VIs with zero obstacle is given in the next result.
Proposition 18
Let (17), (18) hold and let \(g \in L^2(0,T;H)\), \(z_0 \in V\) and \(\psi \in L^2(0,T;V)\) be such that \(\varPhi (\psi ) \in W_{\mathrm{r}}(0,T)\).
Then
holds in \(W_{\mathrm{r}}(0,T)\) where \(w_0 = \varPhi (\psi )(0) - z_0\).
Furthermore, if (D1), (19), and (20) hold, then in fact \(S_{z_0}(g,\psi ) \in W_s(0,T)\) and hence the spatial regularity \(AS_{z_0}(g,\psi ) \in L^2(0,T;H)\).
Proof
Under the hypotheses, Proposition 13 can be applied to deduce that \(z:=S_{z_0}(g,\psi )\) is well defined in \(W_{\mathrm{r}}(0,T)\) and it solves the VI (16). Set \(w := \varPhi (\psi )- z\) (which belongs to \(W_{\mathrm{r}}(0,T)\)) and observe that
The upper bound on \(z_0\) implies that \(w_0 \ge 0\). Now define \(\varphi (t) := \varPhi (\psi )(t) - v(t)\). Then the above reads
This shows the desired identity \(\bar{S}_{w_0}({L}\varPhi (\psi )-g)=w=\varPhi (\psi )-z = \varPhi (\psi ) - S_{z_0}(g,\psi )\). Under the additional assumptions, Proposition 13 yields \(z \in W_s(0,T)\) and \(w = \varPhi (\psi ) - z \in W_{\mathrm{r}}(0,T) + W_s(0,T)\). \(\square \)
4 Expansion formula for variations in the obstacle and source term
The aim in this section is obtain differential expansion formulae for the solution mapping of parabolic VIs with respect to perturbations on the source term and the obstacle. This will form the backbone of our QVI differentiability result in the next section.
4.1 Definitions and cones from variational analysis
To state directional differentiability results for VIs, we need some concepts and notation which we shall collect in this subsection. Let us define the lower obstacle sets
and for \(y \in \mathbb {K}_0\), the radial cone at y
We shall consider \(\mathbb {K}_0\) as a subset of the Banach space \(L^2(0,T;V)\). The tangent cone is defined as the closure of the radial cone:
Obviously, \(T^{\mathrm{rad}}_{\mathbb {K}_0}(y) \subset T_{\mathbb {K}_0, L^2(0,T;V)}^{\mathrm{tan}}(y).\) We now show that the tangent cone is contained in a set which has a convenient description (see also the discussion after Remark 5.6 in [15]).
We will need the notions of capacity of sets, quasi-continuity of functions and related concepts, consult [33, Sect. 3], [22, Sect. 3] and [13, Sect. 6.4.3] for more details. Below, we shall use the abbreviation ‘q.e.’ to mean quasi-everywhere; a statement holds quasi-everywhere if it holds everywhere except on a set of capacity zero.
Lemma 19
The tangent cone of \(\mathbb {K}_0\) can be characterised as
where \(\bar{y}(t)\) is a quasi-continuous representative of y(t).
Proof
If \(w \in T_{\mathbb {K}_0}^{\mathrm{rad}}(y)\), then from (36), \(w \in L^2(0,T;V)\) and there exists \(\rho ^* \ge 0\) such that \(y(t) + \rho w(t) \in K_0\) for almost every t and for all \(\rho \in [0,\rho ^*)\), meaning that \(w(t) \in T_{K_0}^{\mathrm{rad}}(y(t)) \subset T_{{K}_0}^{\mathrm{tan}}(y(t))\) for almost every t. This shows that
and if we take the closure in \(L^2(0,T;V)\) on both sides,
Suppose that \(\{w_n\} \subset L^2(0,T;V)\) is a sequence that belongs to the set on the right-hand side above with \(w_n \rightarrow w\) in \(L^2(0,T;V)\). Thus, for a subsequence, \(w_{n_j}(t) \rightarrow w(t)\) in V and \(w_{n_j}(t) \in T_{K_0}^{\mathrm{tan}}(y(t))\) for almost every t. Since the tangent cones are closed sets, the limit point \(w(t) \in T_{K_0}^{\mathrm{tan}}(y(t))\). Hence \(w \in \{ w \in L^2(0,T;V) : w(t) \in T_{{K}_0}^{\mathrm{tan}}(y(t)) \text { a.e. } t \in [0,T]\}\) and the closure can be omitted on the right-hand side of (38).
From [33, Lemma 3.2], Mignot proves the following description of the tangent cone of \(K_0\):
with \(\bar{y}\) a quasi-continuous representative of the function y. This provides the characterisation stated in the lemma. \(\square \)
The set \(\mathbb {K}_0\) is said to be polyhedric at \((y,\lambda ) \in \mathbb {K}_0 \times T_{\mathbb {K}_0, L^2(0,T;V)}^{\mathrm{tan}}(y)^{\circ }\) if
where
is the known as the polar cone. The set is polyhedric at y if it is polyhedric at \((y,\lambda )\) for all \(\lambda \), and it is polyhedric if it is polyhedric at y for all y. The concept of polyhedricity is useful because it is a sufficient condition guaranteeing the directional differentiability of the metric projection associated to that set (see [13, 23, 33] and also [43]) and this fact ultimately enables one to obtain directional differentiability for solution mappings of variational inequalities.
Now, the set \(\mathbb {K}_0\) is not polyhedric as a subset of the space W(0, T) since W(0, T) lacks certain smoothness properties due to the low regularity of the time derivative. However, \(\tilde{\mathbb {K}}_0 := \mathbb {K}_0 \cap W_s(0,T)\) is indeed polyhedric.
Lemma 20
The set \(\tilde{\mathbb {K}}_0\) is polyhedric as a subset of \(W_s(0,T)\) and for \((y,\lambda ) \in \tilde{\mathbb {K}}_0 \times T_{\tilde{\mathbb {K}}_0, W_s(0,T)}^{\mathrm{tan}}(y)^\circ \),
where \(\bar{y}\) is a quasi-continuous representative of y and
Proof
First note that if \(v \in W_s(0,T)\), \(\partial _t(v^+) = \chi _{\{v \ge 0\}}\partial _t v\) by the chain rule and hence we have the bound
which shows that \((\cdot )^+:W_s(0,T) \rightarrow W_s(0,T)\) is a bounded map. It follows that \(W_s(0,T)\) is a vector lattice in the sense of Definition 4.6 of [43] when associated to the cone \(\tilde{\mathbb {K}}_0\). The boundedness of \((\cdot )^+ :W_s(0,T) \rightarrow W_s(0,T)\) and Lemma 4.8 and Theorem 4.18 of [43] imply that \(\tilde{\mathbb {K}}_0\) is polyhedric in \(W_s(0,T)\) and hence
The space \(W_s(0,T)\) is also a Dirichlet space in the sense of [33, Definition 3.1] on the set \([0,T]\times \bar{\varOmega }\) and so, due to the characterisation of the tangent cone in [33, Lemma 3.2], we find
\(\square \)
4.2 Directional differentiability for VIs
We now specialise to the case where the pivot space H is a Lebesgue space, a restriction which is needed for the results of [15].
Assumption 21
Set \(H:=L^2(\varOmega , \mu )\) where \((\varOmega , \varSigma , \mu )\) is a complete measure space and let \(V \subset H\) be a separable Hilbert space.
Theorem 4.1 of [15] states that for \(w_0 \in V_+\), the map \(\bar{S}_{w_0}:L^2(0,T;H) \rightarrow L^p(0,T;H)\) (defined in Sect. 3.3) is directionally differentiable for all \(p \in [1,\infty )\), i.e.,
where \(s^{-1}o(s) \rightarrow 0\) in \(L^p(0,T;H)\) as \(s \rightarrow 0^+\), and \(\delta :=\bar{S}_{w_0}'(g)(d) \in L^2(0,T;V) \cap L^\infty (0,T;H)\) satisfies, with \(w=\bar{S}_{w_0}(g)\), the inequality
Using this, we shall first work to deduce a differentiability formula for the map S under perturbations of the right-hand side source with a fixed obstacle.
Remark 5
Our notation emphasises the fact that the higher-order term in (39) depends on the base point g. This is important because the behaviour of the higher-order terms is in general unknown with respect to the base points, such as for example whether there is any kind of uniformity of the convergence of the higher-order terms on compact or bounded subsets of the base points. Such uniform convergence does hold in cases where the map has more smoothness, namely if it possesses the so-called uniform Hadamard differentiability property, but it is not clear whether this is the case for us when such issues become relevant in Sect. 6.
This is in stark contrast to the dependence on the direction: we know that the terms converge uniformly on compact subsets of the direction since \(\bar{S}\) is Hadamard (and hence compactly) differentiable.
If \(d(s) \rightarrow d\), we write
Let us see why \(\hat{o}\) above is a higher-order term. Let \(h:(0,1) \rightarrow L^2(0,T;H)\) and take \(d(s) = d + s^{-1}h(s)\) and \(p \in [1,\infty )\). Subtracting (39) from (41), we obtain from the Lipschitz nature of \(\bar{S}\),
using \(L^2(0,T;H) \hookrightarrow L^1(0,T;H)\). The estimate
follows from applying instead the Lipschitz estimate (33) to the first line of the above calculation.
The next proposition guarantees (under certain assumptions) the directional differentiability of one or both of the maps
Proposition 22
Let \(f,d \in L^2(0,T;H)\), \(\psi \in L^2(0,T;V)\) with \(\varPhi (\psi ) \in W_{\mathrm{r}}(0,T)\) and let (17) and (18) hold. Then
where, with \(w_0 := \varPhi (\psi )(0) - z_0\),
belongs to \(L^2(0,T;V) \cap L^\infty (0,T;H)\), and h is a higher-order term in \(L^p(0,T;H)\) for \(p \in [1,\infty )\) whose convergence is uniform in d on compact subsets of \(L^2(0,T;H)\).
The directional derivative \(\alpha :=\partial S_{z_0}(f,\psi )(d)\) satisfies
If additionally, for \(s \ge 0,\)

then (43) holds in \(W_s(0,T)\cap W_{\mathrm{r}}(0,T).\)
Proof
The assumptions imply that Proposition 13 applies and we obtain existence of the left-hand side and the first term on the right-hand side of (43). We can via Proposition 18 utilise (35) with the source terms f and \(f+sd\) to write \(S_{z_0}(f,\psi )\) and \(S_{z_0}(f+sd,\psi )\) in terms of \(\bar{S}_{w_0}\), and then with the aid of the expansion formula (39), we find
\(\square \)
4.3 Differentiability with respect to the obstacle and the source term
We clearly need some differentiability for the obstacle mapping to proceed the study further and this comes in the following assumption which we take to stand for the rest of the paper.
Assumption 23
Suppose that \(\varPhi :L^2(0,T;H) \rightarrow W(0,T)\) is Hadamard differentiable.
It is necessary for \(\varPhi \) to be defined on \(L^2(0,T;H)\) and differentiable as stated because it implies the uniform convergence with respect to compact subsets of the direction in \(L^2(0,T;H)\) of the difference quotients to the directional derivative of \(\varPhi \), which is a fact that we will use later in Sect. 5.5 in the analysis of some higher-order terms that arise.
We write, for \(\rho (s) \rightarrow \rho \),
Remark 6
Assumption 23 implies, thanks to \(W(0,T) \hookrightarrow C^0([0,T];H) \hookrightarrow L^p(0,T;H)\), that
In this section, we could have merely assumed (49) instead of Assumption 23 and most results would carry through all the way up to the identification of the term r in Proposition 25 as a higher-order term (and hence the differentiability)
Now if \(h:(0,1) \rightarrow L^2(0,T;H)\) satisfies \(s^{-1}h(s) \rightarrow 0\) as \(s \rightarrow 0^+\), then from (49) and the mean value theorem [35, Sect. 2, Proposition 2.29],
which leads to the estimate
Recall that \(Lv:=v' + Av\).
Lemma 24
The map \({L}(\varPhi (\cdot )):L^2(0,T;H) \rightarrow L^2(0,T;V^*)\) is Hadamard differentiable with derivative \({L}(\varPhi '(\psi )(\rho ))\) at the point \(\psi \) in direction \(\rho \), and its higher-order term \({L}(l(s,\rho ;\psi ))\) satisfies
Proof
Applying the operator L to (47) we get the following equality in \(L^2(0,T;V^*)\):
Due to the estimate
we see that \({L}\varPhi \) is Hadamard differentiable in the stated spaces since \(\varPhi :L^2(0,T;H) \rightarrow W(0,T)\) is Hadamard differentiable. Subtracting the expansion
from the equality above, we obtain
Since the composite mapping \({L}\varPhi :L^2(0,T;H) \rightarrow L^2(0,T;V^*)\) is Hadamard differentiable, the mean value theorem applied to the right-hand side above implies
\(\square \)
Proposition 25
Assume (17) and let \(f, d \in L^2(0,T;H)\), \(\psi , \rho \in L^2(0,T;V)\) and \(h:(0,1) \rightarrow L^2(0,T;V)\) with \(h(0) = 0\) be such that, for \(s \ge 0\),
Then
holds in \(W_{\mathrm{r}}(0,T)\) where
and \(\alpha :=S_{z_0}'(f,\psi )(d,\rho ) \in \varPhi '(\psi )(\rho ) + L^2(0,T;V) \cap L^\infty (0,T;H)\) satisfies the VI
If additionally

then the formula above holds in \(W_s(0,T)\).
If also for \(p \in [1,\infty )\),

then
that is, \(S_{z_0}:L^2(0,T;H) \times L^2(0,T;V) \rightarrow L^p(0,T;H)\) is directionally differentiable.
Proof
Due to assumptions (51) and (52), the left-hand side of the expansion formula to be proved can be written using (35) in Proposition 18:
where \(w_0 = \varPhi (\psi +s\rho +h(s))(0)-z_0\). The first term on the right-hand side here can be expanded through the formula (48) for \(\varPhi \):
This is an equality in \(L^p(0,T;H)\) for all p (and in fact in W(0, T) by assumption). Note that (51) implies that we can apply L to all terms in (57) and doing so yields
Using this and the expansion formula (41) for \(\bar{S}\), the second term on the right-hand side of (56) becomes
where the second equality holds since every term inside \(\bar{S}_{w_0}\) on the left-hand side is in \(L^2(0,T;H)\) and so (39) applies. Now, plugging (57) and (58) into (56) we find
where for the final equality, we again used the formula (35) which is applicable because (52) implies that \(z_0 \le \varPhi (\psi )(0)\), and we used the relation (44) between the directional derivatives of \(\bar{S}\) and S:
We then set \(\alpha := \varPhi '(\psi )(\rho )+\partial S(f,\psi )(d-{L}\varPhi '(\psi )(\rho ))\). From (40), (43), (45), the function \(\delta :=\partial S(f,\psi )(d-{L}\varPhi '(\psi )(\rho )) \in L^2(0,T;V)\cap L^\infty (0,T;H)\) satisfies
where \(w=\bar{S}({L}\varPhi (\psi )-f) = \varPhi (\psi )-S(f,\psi )\). Recalling the definition of \(\alpha \) and making the substitution \(\varphi := \varPhi '(\psi )(\rho )+ z\) in the above variational formulation for \(\delta \) yields the formulation for \(\alpha \) stated in the proposition. If additionally (\(\hbox {D}_{\mathrm{s}}\)), (54), (53), then Proposition 18 gives the stated regularity.
Dropping now the dependence on the base points for clarity, we estimate the remainder term r (which is defined as in the statement of this proposition) as follows, making use of (42) and Lemma 24,
Dividing by s and taking the limit \(s \rightarrow 0^+\), we see that the remainder term vanishes in the limit due to assumption (55).
Furthermore the convergence to zero is uniform in d on compact subsets since d appears only in the final term which we know has the same property as \(S(\cdot , \psi )\) is Hadamard differentiable. \(\square \)
Remarks 26
The assumption \(h(0)=0\) in the proposition implies that all assumptions that hold for the perturbed data also hold for the non-perturbed data (i.e. at \(s=0\)). Without this assumption, we would have to assume in addition \(\varPhi (\psi ) \in W_{\mathrm{r}}(0,T)\) and (20) and (D1) along with (53).
5 Directional differentiability
Fix \(f,d \in L^2(0,T;H)\). We begin by choosing an element of \(\mathbf {P}(f)\) with sufficient regularity as described in the following assumption.
Assumption 27
Take \(u_0 \in V_+\) and let \(u \in \mathbf {P}_{u_0}(f) \cap W(0,T)\) be such that \(t \mapsto \varPhi (u)(t)\) is increasing.
Refer to Theorems 9 and 16 which give conditions for the existence of such a \(u \in \mathbf {P}_{u_0}(f)\) and for the increasing property of \(t \mapsto u(t)\); if \(\varPhi \) then preserves the increasing-in-time property then one would have the satisfaction of this assumption.
Picking \(u \in \mathbf {P}_{u_0}(f)\) satisfying Assumption 27, define the sequence
Our aim will be to apply Theorems 16 or 17 to this sequence in order to show that, under additional assumptions, it is well defined and has the right convergence properties. Furthermore, we also want to obtain expansion formulae for each \(u^n_s\).
5.1 Expansion formula for the VI iterates
The following sets of assumptions are to ensure that Theorem 16 (or Theorem 17) can be applied for our sequence \(\{u_s^n\}_{n \in \mathbb {N}}\).
Assumption 28
Assume
if \(\rho \in L^2(0,T;V),\) and \(h:(0,1) \rightarrow L^2(0,T;H)\) is a higher-order term, then for some \(p \in [1,\infty )\), as \(s \rightarrow 0^+\),
and either

or

Remark 7
Regarding (59), observe that if \(d \ge 0\), the initial element \(u_s^0 = u\) is a subsolution for \(\mathbf {P}(f+sd)\) since \(u = S(f,u) \le S(f+sd, u)\) (see also Lemma 15) whilst if \(d \le 0\) then \(u_s^0\) is instead a supersolution.
Define \(\alpha ^1 = \delta ^1 := \partial S(f,u)(d)\) and for \(n\ge 2\), we make the recursive definitions:
To ease the notation on the higher-order terms, we did not write the base point u in the r term (which originates from Proposition 25) above. We now give two results (with varying assumptions) in the next proposition concerning convergence behaviour and an expansion formula for the sequence \(\{u_s^n\}\).
Proposition 29
Let Assumption 28 hold. Then \(\{u_s^n\}_{n \in \mathbb {N}} \subset W(0,T)\) is a well defined non-negative monotone sequence (increasing if \(d \ge 0\), decreasing if \(d \le 0\)) such that
where
-
(1)
\(\alpha ^n\) satisfies the VI
$$\begin{aligned}&\alpha ^n - \varPhi '(u)(\alpha ^{n-1}) \in T^{\tan }_{\mathbb {K}_0, L^2}(\varPhi (u)-u)\cap [u' + Au - f]^\perp :\nonumber \\&\quad \int _0^T \langle \varphi ' + A\alpha ^n - d, \alpha ^n - \varphi \rangle \le \frac{1}{2} \left\Vert \varphi (0)-\varPhi '(u)\alpha ^{n-1}(0)\right\Vert _{H}^2 \nonumber \\&\quad \forall \varphi : \varphi - \varPhi '(u)(\alpha ^{n-1}) \in \mathrm {cl}_{W}(T^{\mathrm{rad}}_{\mathbb {K}_0}(\varPhi (u)-u)\cap [u' + Au - f]^\perp ); \end{aligned}$$(66) -
(2)
\({L}\varPhi (u_s^n) {\in } L^2(0,T;H)\), \(\alpha ^n-\varPhi '(u)(\alpha ^{n-1}) {\in } L^2(0,T;V) \cap L^\infty (0,T;H)\), \({L}\varPhi '(u)(\alpha ^{n-1}) {\in } L^2(0,T;H);\)
-
(3)
\(s^{-1}o^n(s) \rightarrow 0\) in \(L^p(0,T;H)\) as \(s \rightarrow 0^+\) with \(p \in [1,\infty )\) from Assumption 28;
-
(4)
under the first set of assumptions,
$$\begin{aligned} u_s^n \rightharpoonup u_s \text { in } W_s(0,T) \text { where } u_s^n \ge 0 \text { and } u_s \in \mathbf {P}_{u_0}(f+sd)\text { is a non-negative solution}; \end{aligned}$$ -
(5)
under the second set of assumptions, \(u_s^n \in W_{\mathrm{r}}(0,T)\) and
$$\begin{aligned}&u_s^n \rightharpoonup u_s \text { in } L^2(0,T;V) \text { and weakly-star in } L^\infty (0,T;H) \text { where}\\&u_s \in \mathbf {P}_{u_0}(f+sd)\text { is a very weak solution.} \end{aligned}$$
Proof
Let us first show monotonicity of the sequence assuming existence. First take \(d \ge 0\). Then since u is a subsolution, \(u \le S(f,u) \le S(f+sd, u) = u^1_s\). By the comparison principle, we find again that \(u^1_s = S(f+sd, u) \le S(f+sd, u^1_s) = u^2_s\) and in this we obtain that \(\{u^n_s\}\) is an increasing sequence. Likewise if \(d \le 0\), the sequence is decreasing.
1. First case. Observe that since \(u \le \varPhi (u)\) and \(\varPhi (u) \in W_s(0,T)\), by (O3a), we can take the trace at \(t=0\) to get \(\varPhi (u)(0) \ge u_0\). Since (\(\hbox {D}_{\mathrm{s}}\)), (L1a) and (60) hold and as (20) is satisfied for the obstacle u (thanks to (L1a) and (O3a)), Proposition 13 implies that \(u_s^1=S(f+sd,u) \in W_s(0,T)\) exists and is increasing in time and non-negative.
Regarding the upper bound for the initial data in terms of \(u_s^1\), we argue as follows. For \(d \ge 0\), we may apply \(\varPhi \) to the inequality \(u_s^1 \ge S(f,u) = u\) and use the regularity offered by (O3a) to obtain \(u_0 \le \varPhi (u)(0) \le \varPhi (u_s^1)(0).\) If \(d \le 0\), we use the condition (61) to obtain the same conclusion. Then making use of (O1a), (O3a), and (O4a) we apply Proposition 13 to obtain the existence for each \(u^2_s\) and subsequently, using these arguments, existence for each \(u^n_s\).
We now show the expansion formula (65) by induction.
1.1 Base case. Using Proposition 22 to expand \(u^1_s = S(f+sd, u)\), which is applicable due to (L1a), (O3a), and the increasing-in-time property and the non-negativity of \(u_0\) from Assumption 27, we obtain a \(\delta ^1 := \partial S(f,u)(d) \in L^2(0,T;V) \cap L^\infty (0,T;H)\) such that
where
and
is clearly a higher-order term. Finally, assumption (O3a) implies that \({L}\varPhi (u_s^1) \in L^2(0,T;H)\), whilst (L2) gives \({L}(\varPhi '(u)(\delta ^1)) \in L^2(0,T;H)\).
1.2 Inductive step. Now assume the statement is true for \(n=k\). By definition,
This object is again non-negative since \(u_s^k \ge 0\) implies that \(\varPhi (u_s^k) \ge \varPhi (0) \ge 0\) by (O1a). We have \(u_s^k \in W_s(0,T)\), and thus by (O3a), \(\varPhi (u_s^k) \in W_{\mathrm{r}}(0,T)\), and since \(\alpha ^k = \varPhi '(u)(\alpha ^{k-1}) + \delta ^k \in W_{\mathrm{r}}(0,T) + L^2(0,T;V)\cap L^\infty (0,T;H)\), (L2) implies that \(\varPhi '(u)(\alpha ^k) \in W_{\mathrm{r}}(0,T)\). Hence (51) holds for obstacle u, direction \(\alpha ^k\) and higher-order term \(o^k(s)\). By (O4a), the obstacle \(\varPhi (u^k_s)\) is increasing in time. Then, since \(\varPhi \) is increasing,
Proposition 25 can now be applied and we find
with \(u_s^{k+1} \in W_s(0,T) \cap W_{\mathrm{r}}(0,T)\) and \(\delta ^{k+1}=\alpha ^{k+1}-\varPhi '(u)(\alpha ^k) \in L^\infty (0,T;H)\cap L^2(0,T;H)\) (we already argued above that \({L}\varPhi '(u)(\alpha ^{k}) \in L^2(0,T;H)\)), meaning that \(\alpha ^{k+1} \in W_{\mathrm{r}}(0,T) + L^2(0,T;V)\cap L^\infty (0,T;H)\) as desired. Under Assumption (L3), by the same argument as in the proof of Proposition 25, \(o^{k+1}\) is a higher-order term given that \(o^k\) is.
Regarding the expression for the derivative, we know that \(\alpha ^{k+1}-\varPhi '(u)(\alpha ^k) = \partial S(f,u)(d-L\varPhi '(u)(\alpha ^k))\) solves the VI that appears in Proposition 22, i.e.,
whence setting \(\varphi := \phi + \varPhi '(u)(\alpha ^k)\) yields
as desired.
2. Second case. We will not repeat some of the same techniques used in the above case and simply focus on the differences under the different set of assumptions. Due to (L1b), \(u^1_s\) exists by Proposition 13. Using (62) we find \(u_0 \le \varPhi (u^1_s)(0)\). The monotonicity of \(\{u_s^n\}_{n \in \mathbb {N}}\) and (62) shows this bound for all \(u^n_s\). Using (O2b), (O4b) and (62), we infer the existence for all \(u_s^n\) by the same proposition.
We prove the remaining claims again by induction. For the base case, we can expand \(u_s^1=S(f+sd,u)\) by using (L1b) and Proposition 22 directly and we obtain \(\delta ^1 := \partial S(f,u)(d) \in L^2(0,T;V) \cap L^\infty (0,T;H)\) such that (67) holds. Furthermore, \(u^1_s \in W_{\mathrm{r}}(0,T)\). Now assume the statement is true for \(n=k\) so that (68) holds. Since \(u^k_s \in W_{\mathrm{r}}(0,T)\), \({L}\varPhi (u_s^k) \in L^2(0,T;H)\), and by (L2), since
we have \({L}\varPhi '(u)(\alpha ^k) \in L^2(0,T;H)\) and so assumption (51) of Proposition 25 holds, and as does (52) as shown above, and the proposition can applied to give
(just like in the proof of the first case) with \(u_s^{k+1} \in W_{\mathrm{r}}(0,T)\) and \(\alpha ^{k+1}-\varPhi '(u)(\alpha ^k) \in L^2(0,T;V)\cap L^\infty (0,T;H)\) and \({L}\varPhi '(u)(\alpha ^{k}) \in L^2(0,T;H)\) as desired. Note that we used the fact that (O4b) implies the increasing property of all obstacles considered in the proof.
3. Conclusion. The claim of the VI satisfied by the \(\alpha ^n\) follows from Proposition 25 whilst the convergence behaviour stated in the result is a consequence of either Theorem 16 (if \(d \ge 0\)) or Theorem 17 (if \(d \le 0\)), using the fact that (59) implies that \(u^0_s\) is either a subsolution or supersolution. \(\square \)
Remark 8
Everything up to the convergence of the \(\{u_s^n\}\) stated in the above result holds if we do not assume (59) and either (O2a) or (O3b) respectively. Also, (L3) was necessary only to prove that each \(o^n\) is a higher-order term.
5.2 Properties of the iterates
In this section, we give some basic attributes of the directional derivatives \(\alpha ^n\) and the higher-order terms \(o^n\). One should not forget that these objects are time-dependent, and we will always denote the time component by t; this should not be confused with the perturbation parameter s.
Lemma 30
The following properties hold:
-
1.
For a.e. \(t \in [0,T]\),
$$\begin{aligned} \begin{aligned} \alpha ^1(t)&\ge 0,&\text { q.e. on } \{u(t)=\varPhi (u)(t)\}, \\ \alpha ^n(t)&\ge \varPhi '(u)(\alpha ^{n-1})(t),&\text { q.e. on } \{u(t)=\varPhi (u)(t)\}\text { for } n > 1. \end{aligned} \end{aligned}$$ -
2.
The sequences
$$\begin{aligned} \{\alpha ^n\}_{n \in \mathbb {N}} \text { and } \{\alpha ^n + s^{-1}o^n(s)\}_{n \in \mathbb {N}} \end{aligned}$$are monotone (increasing if \(d \ge 0\) and decreasing if \(d \le 0\)) and have the same sign as d.
-
3.
\((\alpha ^n+s^{-1}o^n(s))|_{t=0} = 0.\)
-
4.
\(\varPhi '(u)(\alpha ^n)\) has the same sign as d.
Proof
The first claim follows from the set that the \(\alpha ^n\) belong to and the characterisation of Lemma 19. The second claim is true since the sequence \(u^n_s\) is increasing or decreasing in n and due to (65) and the vanishing behaviour of \(s^{-1}o^n(s)\) and the fact that \(u^n_s-u\) has the same sign as d (since if \(d \ge 0\), \(u^n_s\) is increasing and hence greater than u whilst if \(d \le 0\), \(u^n_s\) is decreasing and smaller than u). For the third claim, in (65), take the trace \(t=0\) (which is valid since \(u^n_s, u\) were defined to have trace \(u_0\) at \(t=0\)) to obtain
Finally, we have that
where the limit is in \(L^p(0,T;H)\), and hence, passing to a subsequence, the limit also holds at almost every time strongly in H:
which is either greater than or less than zero depending on the sign of \(\alpha ^n\) which in turn depends on the sign of d (see part 2 of this lemma). \(\square \)
The first part of the previous result tells us about the quasi-everywhere behaviour of the directional derivatives on the coincidence set. We can say a little more about them in an almost everywhere sense.
Lemma 31
We have
If \(\varPhi \) is a superposition operator, then for each n,
Proof
From \(s\alpha ^n = u^n_s-u-o^n(s)\), since \(u^n_s \le \varPhi (u^{n-1}_s) \le ... \le \varPhi ^n(u^0_s) = \varPhi ^n(u)\), we find
On the set \(\{u=\varPhi (u)\}\), we get \(s\alpha ^1 \le -o^1(s)\) and dividing here by s and sending to zero, we see by the sandwich theorem that if \(d \ge 0\), \(\alpha ^1 = 0\) on \(\{u=\varPhi (u)\}\). If \(\varPhi \) is a superposition operator, observe that if t is such that \(u(t) = \varPhi (u(t))\), then in fact
Using this fact on the right-hand side of (69) gives us the result. \(\square \)
5.3 Uniform bounds on the iterates
We give a result on the boundedness of the directional derivatives \(\alpha ^n\) under two different sets of assumptions. The first set requires some boundedness conditions on the obstacle mapping including a smallness condition, whilst the second requires instead some regularity and complementarity (for the latter, see [21, Sect. 7.3.1] for the parabolic VI case) for the system.
Assumption 32
Suppose that either

or

where all constants are independent of n.
Regarding the fulfillment of assumption (L5b), Lemma 30 may be helpful for certain choices of \(\varPhi \) in applications.
Proposition 33
(Bound on \(\{\alpha ^n\}\)) Let Assumption 32 hold. Then \(\alpha ^n \rightharpoonup \alpha \text { in } L^2(0,T;V)\).
Proof
1. Under boundedness assumptions. First suppose that (L4a) — (L7a) hold. In (66) take \(\varphi =\varPhi '(u)(\alpha ^{n-1})\) as test function (this is admissible since zero is contained in the radial cone and the orthogonal space that the test function space is obtained from) to get
We can neglect the term \((d,\varPhi '(u)(\alpha ^{n-1}))_H\) due to part 4 of Lemma 30 which tells us that both d and \(\varPhi '(u)(\alpha ^n)\) have the same sign. Hence the above inequality becomes
Defining \(a_n := \left\Vert \alpha ^n\right\Vert _{L^2(0,T;V)}\), this reads
which we write as
where we have denoted
Solving this recurrence inequality leads to
We evidently need \(A_2 < A_1\) for this sequence to be bounded uniformly, that is,
i.e., (L7a). Under this condition, the bound is uniform and there is a weak limit for a subsequence of \(\{\alpha ^n\}_{n \in \mathbb {N}}\). Since the \(\alpha ^n\) are monotone, they have a pointwise a.e. monotone limit which must agree with \(\alpha \) so indeed \(\alpha ^n \rightharpoonup \alpha \) in \(L^2(0,T;V)\).
2. Under regularity assumptions. Now assume instead that (L4b)–(L6b) hold. We want to show that \(\varphi \equiv 0\) is a valid test function in the VI (66) for \(\alpha ^n\). Thus we need to prove that \(-\varPhi '(u)(\alpha ^{n-1}) \in \mathrm {cl}_{W}(T^{\mathrm{rad}}_{\mathbb {K}_0}(\varPhi (u)-u)\cap [u'+Au-f]^\perp )\). On this note, observe that

The assumption (L5b) implies that \(-\varPhi '(u)(\alpha ^{n-1}) \ge 0\) a.e. on \(\{u=\varPhi (u)\}\) and thus it belongs to \(T^{\mathrm{rad}}_{\mathbb {K}_0}(u-\varPhi (u)) \cap [u'+Au-f]^\perp \) (see (36)) and this is obviously a subset of its closure in W. Therefore, 0 is a valid test function in (66) and testing with this we find
which easily leads to the desired bound due to the assumption (L6b). As before, the monotonicity of the sequence implies the convergence for the whole sequence. \(\square \)
5.4 Characterisation of the limit of the directional derivatives
We now want to study the limiting objects associated to the sequences \(\{\alpha ^n\}\) and \(\{\delta ^n\}\). First, we introduce the notation \(B_\epsilon ^q(u)\) to stand for a closed ball in \(L^q(0,T;H)\) around u of radius \(\epsilon \)
and introduce some assumptions that will be of use here and in further sections.
Assumption 34
Suppose that
and assume that there exists \(\epsilon > 0\) such that for all \(z \in L^2(0,T;H) \cap B_\epsilon ^p(u) + B_\epsilon ^2(0)\) and \(v \in L^2(0,T;V)\cap L^p(0,T;H)\),
where
Here, \(p \in [1,\infty )\) is as in Assumption 28.
Regarding (L8), it may be helpful to note that Assumption 23 implies that \(\varPhi :L^2(0,T;V) \rightarrow W(0,T)\) is completely continuous. As a precursor to characterising the directional derivative \(\alpha \), we study the limit of \(\{\delta ^n\}_{n \in \mathbb {N}}\) in the next lemma. Since \(\delta ^n = \alpha ^n + \varPhi '(u)(\alpha ^{n-1})\), if \(\varPhi '(u)(\cdot ):L^2(0,T;V) \rightarrow L^p(0,T;H)\) is bounded, we can find a subsequence of \(\{\delta ^n\}_{n \in \mathbb {N}}\) such that \(\delta ^{n_j} \rightharpoonup \delta \) in \(L^p(0,T;H)\) for some \(\delta \). In fact under additional assumptions the convergence holds for the full sequence as shown below.
Lemma 35
If (L8) holds, then \(\delta ^n \rightharpoonup \delta \) in \(L^2(0,T;V)\).
Proof
With the aid of Proposition 33, we can pass to the limit in (63), which is \(\alpha ^{n+1}=\varPhi '(u)(\alpha ^{n}) - \delta ^{n+1}\), to find the weak convergence in \(L^2(0,T;V)\) of the whole sequence \(\{\delta ^n\}\) to some \(\delta \in L^2(0,T;V)\). \(\square \)
To characterise \(\delta \) as the solution of a VI in itself, it becomes useful to define the set
Lemma 36
Under the conditions of the previous lemma, \(\delta \) satisfies
Proof
From (45), \(\delta ^n\) satisfies the VI
We can pass to the limit here using the convergence result of the previous lemma and the continuity of \(\varPhi '(u)(\cdot ):L^2(0,T;H) \rightarrow W(0,T)\) (noting that \(\alpha ^n \rightarrow \alpha \) in \(L^2(0,T;H)\) thanks to ) and the limiting object \(\delta \) satisfies the inequality stated in the lemma.
We must check that \(\delta \in C_{\mathbb {K}_0}(\varPhi (u)-u) \cap [u' + Au - f]^\perp \) too. It is clear that the orthogonality condition is satisfied due to the convergence in the previous lemma. By (37),
Due to Mazur’s lemma, there is a convex combination
of \(\{\delta ^n\}_{n \in \mathbb {N}}\) such that \(v_k \rightarrow \delta \) in \(L^2(0,T;V)\). Since this convergence is strong, for a subsequence, \(v_{k_m}(t) \rightarrow \delta (t)\) in V and hence pointwise q.e. for a.e. \(t \in [0,T]\).
By definition, \(\delta ^n(t) \ge 0\) everywhere on \(\{u(t)=\varPhi (u)(t)\} \setminus A_n(u)(t)\) where \(A_n(u)(t) \subset \{u(t)=\varPhi (u)(t)\}\) is a set of capacity zero; this implies that
and using the fact that a countable union of capacity zero sets has capacity zero and the inequality (70), we can pass to the limit to deduce that \(\delta (t) \ge 0\) quasi-everywhere on \(\{u(t)=\varPhi (u)(t)\}\) for a.e. \(t \in [0,T]\). \(\square \)
Proposition 37
Under the conditions of the previous lemma, \(\alpha \) satisfies the QVI
Proof
From the definition of \(\alpha ^n\) in terms of \(\delta ^n\) in (63), we obtain
Using this fact in the QVI for \(\delta \) given in Lemma 36 yields
which translates into the desired result after setting \(w := \varPhi '(u)(\alpha ) + z\). \(\square \)
5.5 Dealing with the higher-order term
We come to the final part which consists in showing that the limit of the higher-order terms \(o^n\) is indeed a higher-order term itself. The idea is to be able to commute the two limits
and this can be done typically under a uniform convergence of one of the limits. Such a uniform convergence is assured by the next proposition but first we need a technical result which tells us that the sequence \(\{u^n_s\}\) stays close to u for small s. Define
Lemma 38
Under assumptions (L10)–(L13), if \(\bar{\epsilon }\le \epsilon \) and
then \(\{u^n_s\}_{n \in \mathbb {N}}\subset B_{\bar{\epsilon }}^p(u)\).
Proof
The continuous dependence estimate (33) for the map \(\bar{S}\) along with \(L^\infty (0,T) \hookrightarrow L^q(0,T)\) for any \(q \ge 1\) yields the following estimate for \(f, g \in L^2(0,T;H)\) and \(\psi , \phi \in L^2(0,T;V)\) with \(\varPhi (\psi ), \varPhi (\phi ) \in W_{\mathrm{r}}(0,T)\):
Since \(u \in W(0,T)\), Assumption 28 implies that \(\varPhi (u) \in W_{\mathrm{r}}(0,T)\) and we can apply (71) for \(u^1_s = S(f+sd,u)\) and \(u=S(f,u)\) to get
so that \(u^1_s \in B^q_{\bar{\epsilon }}(u)\) if s is sufficiently small. Let us fix \(q=p\). Applying the mean value theorem to the first three terms on the right-hand side of (71) and taking in addition \(\phi , \psi \in L^2(0,T;H) \cap B_{\bar{\epsilon }}^p(u)\) so that for any \(\lambda \in [0,1]\), \(\lambda \phi + (1-\lambda )\psi \in L^2(0,T;H) \cap B_{\bar{\epsilon }}^p(u)\) as well and we can use assumptions (L10)–(L12) to get
for all such \(\phi , \psi \). Since \(u^1_s, u \in L^2(0,T;H) \cap B_{\bar{\epsilon }}^p(u)\) with \(L\varPhi (u_s^n) \in L^2(0,T;H)\) from Proposition 29, the above formula is applicable for \(u^2_s = S(f+sd,u^1_s)\) and u, and we get for sufficiently small s that
i.e., \(u_s^2 \in B_{\bar{\epsilon }}(u)\). Arguing by induction, the general case satisfies the estimate
and by solving the above recurrence inequality, we find
where we used the formula for geometric series (recall that (L13) says precisely that \(C^* < 1\)). \(\square \)
Proposition 39
Suppose Assumptions 32 and 34 hold. Then \(s^{-1}o^n(s) \rightarrow 0\) in \(L^p(0,T;H)\) as \(s \rightarrow 0^+\) uniformly in n and thus the limit \(\lim _{n \rightarrow \infty } o^n\) is also a higher-order term.
Proof
The proof is in four steps.
Step 1. We first collect some estimates. From the expansions
we get, from subtracting one from the other and using the mean value theorem,
Now take
(the supremum over n is sensible due to Proposition 33). Then \(u_s^{n-1} \in B_{\epsilon }^p(u)\) for all n by Lemma 38, so that \(\lambda u_s^{n-1} + (1-\lambda )u \in B_{\epsilon }^p(u)\) for \(\lambda \in (0,1)\), and we also have \(s\alpha ^{n-1} \in B_{\epsilon }^2(0)\) and hence \(\lambda u^{n-1}_s + (1-\lambda )(u+s\alpha ^{n-1}) \in B_{\epsilon }^p(u) + B_{\epsilon }^2(0)\). Thus assumption (L11) is applicable on the right-hand side of (72) and we obtain
Secondly, arguing as before, by differentiating in time the expansions for \(\varPhi (u+s\alpha ^{n-1})\) and \(\varPhi (u+s\alpha ^{n-1}+o^{n-1}(s))\), taking the difference and using the mean value theorem (bearing in mind that \((\partial _t \varPhi )'(z)(b) = \partial _t \varPhi '(z)(b)\) by linearity) and applying assumption (L12),
Step 2. By definition of \(o^n\) in (64),
and using the boundedness estimates (50), (42) and assumption (L10),
We estimate the third term above using (73) and (74) of Step 1:
Inserting this back above, we find
for some \(C<1\) by (L13). The above can be recast as
where
The recurrence inequality can be solved for \(a^n\) in terms of \(a_1\) and the \(b_i\):
Step 3. Let us see why
By the weak convergence of \(\{\alpha ^n\}\) in \(L^2(0,T;V)\) (and hence strong convergence in \(L^2(0,T;H)\)), \(\{\alpha ^{n}\}\) is a compact set in \(L^2(0,T;H)\). Since \(\varPhi :L^2(0,T;H) \rightarrow W(0,T)\) is Hadamard differentiable, it is compactly differentiable, meaning that \(s^{-1}l(s,\alpha ^n) \rightarrow 0\) in W(0, T) uniformly, thus the first three terms in the definition of \(b_n\) are taken care of. For the final term, we note that \(\{{L}\varPhi '(u)(\alpha ^{n})\}_{n \in \mathbb {N}}\) is also compact in \(L^2(0,T;H)\) by (L9). Therefore, for any \(\epsilon > 0\), there exists a \(\tau _1 > 0\) independent of j such that
Step 4. As \(o_1(s) = r(s,0,0)=o(s,d)\) is a higher-order term, we know that there is a \(\tau _2 > 0\) such that
Now recalling (75), for \(s \le \min (\tau _0, \tau _1,\tau _2)\),
This shows that \(o^n(s)/s\) tends to zero uniformly in n. \(\square \)
We are finally in position to state the main theorem which condenses the various intermediary results we obtained above.
Theorem 40
Let Assumptions 21 and 23 hold and take \(u \in \mathbf {P}_{u_0}(f)\) satisfying Assumption 27. Let also Assumptions 28, 32 and 34 hold.
Then there exists a \(u_s \in \mathbf {P}(f+sd)\) such that, under the first set of assumptions of Assumption 28, \(u_s \in W_s(0,T)\) is a (strong) solution whereas under the second set of assumptions of Assumption 28, \(u_s \in L^2(0,T;V) \cap L^\infty (0,T;H)\) is a very weak solution, and \(u_s\) satisfies
where
with \(p \in [1,\infty )\) is as in Assumption 28, and \(\alpha \) satisfies the parabolic QVI
Let us relate this result to notions of differentiability commonly used in set-valued and multi-valued analysis. Recall that the map \(\mathbf {P}:F \rightrightarrows U\) has a contingent derivative \(\beta \) at (f, u) (with \(u \in \mathbf {P}(f)\)) in the direction d, written \(\beta \in D\mathbf {P}(f,u)(d)\), if there exist sequences \(\beta _n \rightarrow \beta \), \(d_n \rightarrow d\) and \(s_n \rightarrow 0\) such that
We claim that \(\mathbf {P}\) does indeed possess contingent derivatives and our main results furnishes us with one such contingent derivative.
Proposition 41
The map \(\mathbf {P}:F \rightrightarrows U\) has a contingent derivative \(\alpha \in D\mathbf {Q}(f,u)(d)\) given by the previous theorem with either of the following choices:
(where \(L^2_{inc}(0,T;X)\) means increasing-in-time elements of \(L^2(0,T;X)\)) or
Proof
Indeed, given \(u \in \mathbf {P}(f)\) and sequences \(s_n \rightarrow 0\) and \(d_n \equiv d\), we can, by Theorem 40, find \(u_n^s \in \mathbf {P}(f+s_nd)\) such that
where the sequence
is such that \(\beta _n \rightarrow \alpha \) as \(n \rightarrow \infty \) because the term \(o(s_n;d)\) is higher order. \(\square \)
6 Other approaches to differentiability
We now discuss some possible alternative approaches to deriving Theorem 40 (or a variant of this theorem). The idea here is to bootstrap by using the already-achieved results on elliptic QVIs in [2], however, we shall see that this is not at all straightforward.
6.1 Time discretisation and elliptic QVI theory
One idea is to apply the elliptic QVI theory of [2] to the time-discrete problems and then pass to the limit there. With the definition
recall the notations defined in Sect. 2; in particular \(\mathbf {Q}_{t_{n}^N}\) is the solution mapping defined as \(z \in \mathbf {Q}_{t_n^N}(g)\) if and only if
With \(u_0^N := u_0 \in V\) for a given initial data, set
Under the assumptions on the source f and the direction d in Sect. 2, we know by [2, Theorem 1] that there exists \(u_{s,1}^N \in \mathbf {Q}_{t_{0}^N}(hf_1^N + u_0^N + shd_1^N)\) and \(\gamma _1^N := \mathbf {Q}'_{t_{0}^N}(hf_1^N + u_0)(hd_1^N)\) such that
where \(o_1^N(s,hd_1^N;hf_1^N+u_0^N)\) is a higher-order term. Since \(u_{s,0}^N = u_0^N\),
In a similar way, since \(u_2^N = \mathbf {Q}_{t_{1}^N}(hf_2^N + u_1^N)\), we know that there exists a function \(u_{s,2}^N \in \mathbf {Q}_{t_{1}^N}(hf_2^N + u_1^N + s(hd_2^N + \alpha _1^N) + o_1^N(s))\) and \(\gamma _2^N = \mathbf {Q}_{t_{1}^N}'(hf_2^N+u_1^N)(hd_2^N + \gamma _1^N)\) such that
where \(o_2^N(s, hd_2^N + \alpha _1^N, o_1^N(s);hf_2^N + u_1^N)\) is a higher-order term and since \(u_1^N + s\alpha _1^N + o_1^N(s) = u_{s,1}^N\),
Along these lines, we obtain the following lemma.
Lemma 42
Given \(u_n^N\) defined as above, there exists \(u_{s,n}^N \in \mathbf {Q}_{t_{n-1}^N}(hf_n^N + u_{n-1}^N + shd_n^N)\) and \(\gamma _n^N\) such that
where \(\gamma _n^N\) satisfies
and
is a higher-order term. Here, \(\mathcal {A}(u_n^N) = \{u_n^N = \varPsi _{n-1,N}(u_n^N)\}.\)
Proof
We prove this by induction. The base case has been shown above. Suppose it holds for the nth case. Then given \(u_{n+1}^N = \mathbf {Q}_{t_{n-1}^N}(hf_{n+1}^N+u_n^N)\), there exists
(thanks to the formula relating \(u_{s,n}^N\) and \(u_n^N\)) such that
which ends the proof. \(\square \)
By definition, \(u_{s,n}^N\) solves the QVI
for all \(v \in V\) s.t. \(v \le \varPhi (u_{s,n}^N)(t_{n-1}^N)\). It is then clear from (80) (since the structure of the time-discretised QVI is the same as those considered in Sect. 2 and we can apply the same argumentation as there) that the interpolants \(u_s^N\), \(\hat{u}_s^N\) constructed from \(\{u_{s,n}^N\}_{n \in \mathbb {N}}\) and which satisfy
are bounded in the appropriate spaces and hence there exists a \(u_s\) such that \(u_s^N \rightarrow u_s\) and
i.e., \(u_s \in \mathbf {P}(f+sd)\). Clearly, similar claims are true for \(u^N\), \(\hat{u}^N\) constructed as interpolants from \(\{u^N_n\}_{n \in \mathbb {N}}\). We now need to obtain uniform bounds on the \(\gamma _n^N\) and the higher-order term.
Lemma 43
Suppose that \(v=0\) is a valid test function in (79) (which is the case in the VI setting). Then
and hence
Proof
Since 0 is feasible, the proof for the boundedness of the first two estimates follows that of the first two estimates of Lemma 3. The proof of Corollary 4 shows that these bounds imply the boundedness in the Bochner spaces above. \(\square \)
Multiplying (78) by \(\chi _{T_{n-1}^N}\) and summing up over N, we find
Due to the bounds on the left-hand side and the first two terms on the right-hand side, we can pass to the limit in \(o^N\) too and we find
for some \(\gamma ^* \in L^2(0,T;V) \cap L^\infty (0,T;H)\), and the equality also holds in this space. It remains to be seen that \(o^*\) is a higher-order term and to characterise the term \(\gamma ^*\).
Here we come to a roadblock since we do not know how the terms \(o^N\) depend on the moving base point and thus we cannot say anything about the uniform convergence of the \(o^N\) and are unable to identify the limit of the higher-order terms as higher order; recall that we warned the reader of this issue in Remark 5. Alternatively, if we were able to identify \(\gamma ^*\) as \(\alpha \) from the previous sections, this would suffice to show the higher-order behaviour. But it seems difficult to handle the constraint set satisfied by \(\gamma ^N_n\) and obtain a type of Mosco convergence for recovery sequences to approximate the limiting test function space.
6.2 Elliptic regularisation of the parabolic (Q)VI
Another possible approach is through regularising the parabolic QVI as an elliptic QVI, applying the elliptic theory of [2] to the regularised problem and then passing to the limit in the regularisation parameter. This elliptic regularisation of parabolic problems can be seen in the work of Lions [31, p. 407].
The idea is to include in the parabolic inequality a term involving the second time derivative of the solution, i.e., \(-\epsilon u''\) with \(\epsilon > 0\); more precisely we wish to consider the QVI
The ‘final time’ condition is necessary to have a well defined problem. Define \(\mathcal {V} := \{v \in W_s(0,T) : v(0) = 0\}.\) Integrating by parts in (81) and using the initial and final conditions on u and the test function space, we obtain the weak form: find \(u \in \mathcal {V}\) with \( u(t) \le \varPhi (u)(t)\) such that
Thinking of the time component as simply another space dimension, the resulting elliptic operator \(\hat{A}:\mathcal {V} \rightarrow \mathcal {V}^*\) is
which is clearly T-monotone, bounded and coercive in the Sobolev–Bochner space \(\mathcal {V}\); for the latter, observe that
Clearly the solution of (82) depends on \(\epsilon \) so let us write it as \(u^\epsilon \). Working formally, we may apply [2, Theorem 1.6] and we find existence of a \(u_s^\epsilon \) solving (82) with source term \(f+sd\) and a \(\alpha ^\epsilon \) such that
where \(o^\epsilon \) is a higher-order term. As for \(\alpha ^\epsilon \), it satisfies \(\alpha ^{\epsilon } \in \mathbb {K}^{u^\epsilon }(\alpha ^\epsilon )\) and
Testing (82) with \(v=0\), we obtain the bound
so \(u^\epsilon \) is bounded in at least \(L^2(0,T;V) \cap L^\infty (0,T;H)\) uniformly in \(\epsilon \). In a similar way, \(u^\epsilon _s\) is also bounded in this space uniformly in \(\epsilon \) and s. Let us also suppose that we have a uniform bound on \(\alpha ^\epsilon \). Then it remains to be seen that the limit of the \(o^\epsilon \) is a higher-order term, and this is where we once again run into problems. A monotonicity in \(\epsilon \) result on the solutions of (82) would be useful.
It is worth remarking that this technical issues also arises in the VI case and in the simpler unconstrained (PDE) case.
7 Example
We study here an example where the obstacle mapping is given by the inverse of a parabolic differential operator. This example is motivated by applications in thermoforming (which is a process that manufactures shapes such as car panels from a given mould shape; see [2] and references therein for more information): in [2], we studied an elliptic nonlinear PDE-QVI model that describes such a thermoforming process but in reality, the full model is a highly complicated evolutionary system of equations and QVIs involving obstacle maps that are inverses of differential operators. As a first initial step on the road to studying the full problem, we consider a simplification and study the following example.
Let \(H=L^2(0,\omega )\) and \(V=H^1_0(0,\omega )\) and consider the following nonlinear heat equation:
where
-
(1)
\(g :\mathbb {R} \rightarrow \mathbb {R}\) is \(C^1\), non-negative and increasing,
-
(2)
\(g, g' \in L^\infty (\mathbb {R})\),
-
(3)
\(g, g':L^2(0,T;H) \rightarrow L^2(0,T;H)\) are continuous,
-
(4)
\(g :L^2(0,T;H) \rightarrow L^2(0,T;H)\) is directionally differentiable,
-
(5)
\(B:V \rightarrow V^*\) is a linear, bounded, coercive and T-monotone operator giving rise to a differentiable bilinear form,
-
(6)
\(w_0 \in V\) and \(\psi \in L^2(0,T;V)\) are given data.
We denote by \(C^B_b\) and \(C^B_a\) the constants of boundedness and coercivity for the operator B while \(A:V \rightarrow V^*\) denotes an operator which is assumed to satisfy all assumptions given in the introduction of this paper and it should also give rise to a bilinear form which is smooth. Letting \(W=H^2(\varOmega )\), we further assume that
-
(7)
\(A:L^2(0,T;W) \rightarrow L^2(0,T;H)\),
-
(8)
the following norms are equivalent:
$$\begin{aligned} \left\Vert u\right\Vert _{L^2(0,T;H)} + \left\Vert Au\right\Vert _{L^2(0,T;H)},\quad \left\Vert u\right\Vert _{L^2(0,T;H)} + \left\Vert Bu\right\Vert _{L^2(0,T;H)},\quad \left\Vert u\right\Vert _{L^2(0,T;W)}. \end{aligned}$$
This equivalence of norms assumption is related to (boundary) regularity results which follow given enough smoothness of the boundary and depending on the specific elliptic operators, see [16, Theorem 4, Sect. 6.3]. The equivalence of norms implies
Define \(\varPhi (\psi ):=w\) as the solution of (83). By the standard theory of parabolic PDEs, \(\varPhi \) maps \(L^2(0,T;H)\) into \(W_s(0,T)\). This regularity implies (using the equation itself) that for \(\psi \in L^2(0,T;H)\), \(B\varPhi (\psi ) \in L^2(0,T;H)\) and hence by (84) and the assumption on the range of the operator A, \(\varPhi (\psi ) \in L^2(0,T;W)\) and \(A\varPhi (\psi ) \in L^2(0,T;H)\). It follows then that \({L}\varPhi (\psi ) \in L^2(0,T;H)\) too (recall that \(L=\partial _t + A\)). Since \(\varPhi (\psi ) \in L^2(0,T;W) \cap H^1(0,T;H)\), the theorem of Aubin–Lions [14, Theorem II.5.16] gives the continuity in time \(\varPhi (\psi ) \in C^0([0,T];V)\).
We proceed now with checking the assumptions that will eventually enable us to use Theorem 40.
Lemma 44
Assumption 23 on the Hadamard differentiability of \(\varPhi \) holds and the directional derivative satisfies
where \(\varPhi _0\) is the solution mapping associated to the equation (83) with zero initial condition (i.e. \(\varPhi _0\) is the same as \(\varPhi \) except for the initial condition).
Proof
Take \(\psi , h \in L^2(0,T;H)\) and denote \(w_s := \varPhi (\psi + sh)\) and \(w:=\varPhi (\psi )\). Their difference satisfies
Consider the solution \(\eta \) of the PDE
Letting \((w_s-w)/s =: \eta _s\) we observe
whence, through standard energy estimates,
which implies that \(\eta _s \rightarrow \eta \) in \(L^2(0,T;V) \cap L^\infty (0,T;H)\) due to the differentiability assumption on g. We also have
and hence \(\eta _s \rightarrow \eta \) in W(0, T) showing that \(\varPhi :L^2(0,T;H) \rightarrow W(0,T)\) is differentiable as desired with \(\varPhi '(\psi )(h) = \varPhi _0(g'(\psi )(h))\) being the solution of the same PDE with source term \(g'(\psi )(h)\) and zero initial data. Along the same lines as the previous arguments, \(\varPhi '(\psi )(h) \in W_s(0,T)\). \(\square \)
Given a source term \(f \in L^2(0,T;H_+)\) and initial condition \(u_0 \in V_+\) satisfying \(u_0 \le \varPhi (0)(0) = w_0\) and (60), we fix a solution u solving the QVI
The fact that such a solution indeed exists will be shown later.
Lemma 45
Let \(d \in L^2(0,T;H_+)\) be such that \(f+sd\) is increasing in time and let \(w_0 \in V_+\) and \(Bw_0 \le 0\). Then Assumption 28 holds.
Proof
We showed in the previous paragraph the satisfaction of (O3a), and the condition (61) is irrelevant since \(d \ge 0\). Let us check the remaining assumptions.
-
Regarding the increasing property of \(\varPhi \) in time, we argue as follows. Making the substitution \(\bar{w} = w- w_0\) in (83) in order to remove the initial condition, the transformed PDE is
$$\begin{aligned} \begin{aligned} \bar{w}' + B\bar{w}&= g(\psi ) -Bw_0&\text {on } (0,T) \times (0,\omega ),\\ \bar{w}(\cdot ,0)&= \bar{w}(\cdot ,\omega ) = 0&\text {on } (0,T),\\ \bar{w}(0,\cdot )&= 0&\text {on } (0,\omega ). \end{aligned} \end{aligned}$$The solution of this can be written in terms of the Green’s function
$$\begin{aligned} G(x,y,t) := \frac{2}{L}\sum _{n=1}^\infty \sin \left( \frac{n\pi x}{L}\right) \sin \left( \frac{n\pi y}{L}\right) e^{-\frac{n^2\pi ^2t}{L^2}} \end{aligned}$$through the integral representation
$$\begin{aligned} \bar{w}(t,x) := \int _0^t \int _0^L (g(\psi (s,y))-Bw_0(y))G(x,y,t-s)\;\mathrm {d}y\mathrm {d}s. \end{aligned}$$Take \(\psi \) to be increasing in time. For \(t_1 \le t_2\),
$$\begin{aligned}&\bar{w}(t_1,x) - \bar{w}(t_2,x)\\&\quad = \int _0^{t_1} \int _0^L (g(\psi (s,y))-Bw_0(y))G(x,y,t_1-s)\;\mathrm {d}y\mathrm {d}s\\&\qquad - \int _0^{t_2} \int _0^L (g(\psi (s,y))-Bw_0(y))G(x,y,t_2-s)\;\mathrm {d}y\mathrm {d}s\\&\quad = \int _0^{t_1} \int _0^L (g(\psi (t_1-r,y))-Bw_0(y))G(x,y,r)\;\mathrm {d}y\mathrm {d}r \\&\qquad - \int _0^{t_2} \int _0^L (g(\psi (t_2-r,y))-Bw_0(y))G(x,y,r)\;\mathrm {d}y\mathrm {d}r\\&\qquad \qquad \qquad \qquad \qquad (\hbox {where } r:=t_i -s)\\&\quad = \int _0^{t_1} \int _0^L (g(\psi (t_1-r,y))-g(\psi (t_2-r,y))G(x,y,r)\;\mathrm {d}y\mathrm {d}r\\&\qquad - \int _{t_1}^{t_2} \int _0^L (g(\psi (t_2-r,y))-Bw_0(y))G(x,y,r)\;\mathrm {d}y\mathrm {d}r\\&\quad \le 0 \end{aligned}$$since \(\psi \) is increasing in time and g is increasing which implies that the first term is less than zero, and the second term is also clearly non-positive as its integrand is non-negative (due to the assumption on \(w_0\)). This shows that \(t \mapsto \bar{w}(t) = w(t) - w_0 = \varPhi (\psi )(t) - w_0\) is increasing, implying (O4a).
-
If \(\psi _1 \le \psi _2\) and \(w_i:=\varPhi (\psi _i)\), the difference solves
$$\begin{aligned} (w_1-w_2)' + B(w_1-w_2) = g(\psi _1)-g(\psi _2), \end{aligned}$$with zero initial data. Testing with \((w_1-w_2)^+\) and using the fact that g is increasing, the right-hand side is non-positive, showing that \(\varPhi (\psi _1) \le \varPhi (\psi _2)\).
-
Multiplying the equation by \(w^-\) leads to
$$\begin{aligned} \frac{1}{2} \frac{d}{dt} \left\Vert w^-\right\Vert _{H}^2 + C_a^B\left\Vert w^-\right\Vert _{V}^2 \le (g(\psi ), -w^-)_H \le 0, \end{aligned}$$since \(g \ge 0\), which shows that \(\varPhi (\psi )\) is non-negative too as long as \(w_0 \in V_+\), thus (O1a) and (L1a).
-
Let \(\psi _n \rightharpoonup \psi \) in \(L^2(0,T;V)\) and set \(w_n := \varPhi (\psi _n)\) and \(w:= \varPhi (\psi )\) so that
$$\begin{aligned} w_n' + Bw_n = g(\psi _n) \quad \text {and} \quad w' + Bw = g(\psi ). \end{aligned}$$Subtracting one from the other and testing with the difference, we obtain
$$\begin{aligned} \frac{1}{2} \frac{d}{dt} \left\Vert w_n-w\right\Vert _{H}^2 + C_a^B\left\Vert w_n-w\right\Vert _{V}^2 \le \left\Vert g(\psi _n)-g(\psi )\right\Vert _{H}\left\Vert w_n-w\right\Vert _{H}. \end{aligned}$$Making use of Young’s inequality on the right-hand side and then integrating over time, we find due to the continuity of g in \(L^2(0,T;H)\) that \(w_n \rightarrow w\) in \(L^2(0,T;V) \cap L^\infty (0,T;H)\) and so (O2a) is also valid.
-
If \(h,\psi \in L^2(0,T;V)\), \(\varPhi '(\psi )(h) = \varPhi _0(g'(\psi )(h)) \in W_s(0,T)\) and by the same reasoning as above, we find \({L}\varPhi '(\psi )(h) \in L^2(0,T;H)\), giving (L2). The property (L3) will be shown below.
\(\square \)
Let us return now to the QVI (85).
Lemma 46
A solution u of (85) exists with all the stated properties. Furthermore, the assumptions on u in Assumption 27 also hold.
Proof
We aim to apply Theorem 16. Indeed, taking the function 0 as a subsolution (by the comparison principle), we showed above that \(\varPhi (\psi ) \in C^0([0,T];V)\) for any \(\psi \in L^2(0,T;H)\) which ensures (24) and (26). We also proved in the same lemma the validity of (O1a) and (O2a) as well as the increasing-in-time properties (23), (25) for \(\varPhi \), hence there is a \(u \in W_s(0,T)\) solving the above QVI and Assumption 27 is met. \(\square \)
Lemma 47
The local hypotheses comprising Assumptions 32 and 34 hold.
Proof
Let \(h \in L^2(0,T;V)\) and \(\eta := \varPhi '(u)(h)\) so that
-
From the standard energy estimate
$$\begin{aligned} \frac{1}{2} \left\Vert \eta (r)\right\Vert _{H}^2 + C_a^B\int _0^r \left\Vert \eta \right\Vert _{V}^2 \le \int _0^r (g'(u)h, \eta )_H \le \left\Vert g'\right\Vert _{\infty }\left\Vert \eta \right\Vert _{L^\infty (0,r;H)}\left\Vert \eta \right\Vert _{L^1(0,r;H)}, \end{aligned}$$we find
$$\begin{aligned} \frac{1}{2} \left\Vert \eta \right\Vert _{L^\infty (0,T;H)}^2 + C_a^B \left\Vert \eta \right\Vert _{L^2(0,T;V)}^2 \le \left\Vert g'\right\Vert _{\infty }\left\Vert \eta \right\Vert _{L^\infty (0,T;H)}\left\Vert \eta \right\Vert _{L^1(0,T;H)}, \end{aligned}$$which leads to
$$\begin{aligned} \left\Vert \varPhi '(u)(h)\right\Vert _{L^2(0,T;V)}^2 \le \frac{T\left\Vert g'\right\Vert _{\infty }^2}{2C_a^B}\left\Vert h\right\Vert _{L^2(0,T;H)}^2, \end{aligned}$$(86)i.e., (L4a) after using the continuity of the embedding \(V \hookrightarrow H\).
-
The second estimate above also leads to
$$\begin{aligned} \left( \frac{1}{2}-\epsilon \right) \left\Vert \eta \right\Vert _{L^\infty (0,T;H)}^2 + C_a^B \left\Vert \eta \right\Vert _{L^2(0,T;V)}^2&\le \left\Vert g'\right\Vert _{\infty }^2C_\epsilon \left\Vert h\right\Vert _{L^1(0,T;H)}^2\\&\le \left\Vert g'\right\Vert _{\infty }^2C_\epsilon T \left\Vert h\right\Vert _{L^2(0,T;H)}^2, \end{aligned}$$whence (L6a):
$$\begin{aligned} \left\Vert \varPhi '(u)(h)\right\Vert _{L^\infty (0,T;H)} \le \left\Vert g'\right\Vert _{\infty }\sqrt{\frac{TC_\epsilon }{1/2-\epsilon }}\left\Vert h\right\Vert _{L^2(0,T;H)}. \end{aligned}$$(87) -
For (L5a), we simply use the equation itself and (86):
$$\begin{aligned} \left\Vert \partial _t \varPhi '(u)(h)\right\Vert _{L^2(0,T;V^*)}&\le \left\Vert g'\right\Vert _{\infty }\left\Vert h\right\Vert _{L^2(0,T;V^*)} + C_b^B\left\Vert \varPhi '(u)(h)\right\Vert _{L^2(0,T;V)}\nonumber \\&\le \left\Vert g'\right\Vert _{\infty }\left( 1+C_b^B\sqrt{\frac{T}{2C_a^B}}\right) \left\Vert h\right\Vert _{L^2(0,T;H)}. \end{aligned}$$(88)Therefore, if T and/or \(\left\Vert g'\right\Vert _{\infty }\) is sufficiently small, assumption (L7a) holds.
-
The estimate (86) yields the first property in (L3) whilst (86) and (88) yield the second property.
-
Regarding the completely continuity requirements of Assumption 34, let us take \(h_n \rightharpoonup h\) in \(L^2(0,T;V)\) and define \(w_n := \varPhi '(u)(h_n) = \varPhi _0(g'(u)(h_n))\) and \(w:=\varPhi '(u)(h) = \varPhi _0(g'(u)h)\). We see that
$$\begin{aligned} w_n' + Bw_n = g'(u)h_n \quad \text {and}\quad w' + Bw = g'(u)h, \end{aligned}$$which immediately implies
$$\begin{aligned} C_a\left\Vert w_n-w\right\Vert _{L^2(0,T;V)}^2 \le \int _0^T (g'(u)h_n - g'(u)h, w_n-w)_H, \end{aligned}$$on which using the boundedness of \(g'\) and the strong convergence \(h_n \rightarrow h\) in \(L^2(0,T;H)\), we get that \(w_n \rightarrow w\) in \(L^2(0,T;V)\), giving (L8).
-
We also have that \({L}\varPhi '(u)(h_n) = (\partial _t + A)\varPhi _0(g'(u)h_n) = (\partial _t + A)w_n = (\partial _t + B)w_n + Aw_n - Bw_n\), which implies that
$$\begin{aligned}&\left\Vert {L}\varPhi '(u)(h_n)-{L}\varPhi '(u)(h)\right\Vert _{L^2(0,T;H)}\nonumber \\&\quad = \left\Vert g'(u)h_n - g'(u)h + (A-B)(w_n-w)\right\Vert _{L^2(0,T;H)}\nonumber \\&\quad \le \left\Vert g'(u)(h_n - h)\right\Vert _{L^2(0,T;H)} + \left\Vert A(w_n-w)\right\Vert _{L^2(0,T;H)}\nonumber \\&\qquad + \left\Vert B(w_n-w)\right\Vert _{L^2(0,T;H)}. \end{aligned}$$(89)The first term on the right-hand side converges to zero because \(g'\) is bounded. For the third term, we note the following standard estimate for linear parabolic PDEs (using the differentiability of B):
$$\begin{aligned} \left\Vert w_n-w\right\Vert _{W_s(0,T)} \le C\left\Vert g'(u)(h_n-h)\right\Vert _{L^2(0,T;H)} \end{aligned}$$whence
$$\begin{aligned} \left\Vert Bw_n - Bw\right\Vert _{L^2(0,T;H)}&\le \left\Vert g'(u)(h_n - h)\right\Vert _{L^2(0,T;H)} + \left\Vert w_n'-w'\right\Vert _{L^2(0,T;H)}\\&\le C\left\Vert g'(u)(h_n - h)\right\Vert _{L^2(0,T;H)}. \end{aligned}$$Regarding the second term of (89), we manipulate using the equivalence of norms as following:
$$\begin{aligned} \left\Vert A(w_n-w)\right\Vert _{L^2(0,T;H)}&\le C_1\left\Vert w_n-w\right\Vert _{L^2(0,T;W)}\\&\le C_2\left( \left\Vert w_n-w\right\Vert _{L^2(0,T;H)} + \left\Vert B(w_n-w)\right\Vert _{L^2(0,T;H)}\right) \end{aligned}$$and we apply again the estimate from the previous step on the right-hand side and this eventually leads to (L9).
-
The bounds (86), (87) and (88) imply (L10), (L11), (L12) and the smallness condition (L13) holds again if T or \(\left\Vert g'\right\Vert _{\infty }\) is sufficiently small.
\(\square \)
Having met all the requirements, we may apply Theorem 40 to infer that the solution mapping taking \(f \mapsto u\) in (85) is directionally differentiable in the stated sense and the associated derivative solves the QVI
Change history
10 July 2021
A Correction to this paper has been published: https://doi.org/10.1007/s00526-021-02026-1
References
Adams, R.A., Fournier, J.J.F.: Sobolev Spaces, 2nd edn. Academic Press, London (2003)
Alphonse, A., Hintermüller, M., Rautenberg, C.N.: Directional differentiability for elliptic quasi-variational inequalities of obstacle type. Calc. Var. Partial Differ. Equ. 58(1), 39–47 (2019). https://doi.org/10.1007/s00526-018-1473-0
Barbu, V.: Optimal Control of Variational Inequalities, Research Notes in Mathematics, vol. 100. Pitman (Advanced Publishing Program), Boston (1984)
Barrett, J.W., Prigozhin, L.: A quasi-variational inequality problem in superconductivity. Math. Models Methods Appl. Sci. 20(5), 679–706 (2010). https://doi.org/10.1142/S0218202510004404
Barrett, J.W., Prigozhin, L.: A quasi-variational inequality problem arising in the modeling of growing sandpiles. ESAIM Math. Model. Numer. Anal. 47(4), 1133–1165 (2013). https://doi.org/10.1051/m2an/2012062
Barrett, J.W., Prigozhin, L.: Lakes and rivers in the landscape: a quasi-variational inequality approach. Interfaces Free Bound. 16(2), 269–296 (2014). https://doi.org/10.4171/IFB/320
Barrett, J.W., Prigozhin, L.: Sandpiles and superconductors: nonconforming linear finite element approximations for mixed formulations of quasi-variational inequalities. IMA J. Numer. Anal. 35(1), 1–38 (2015). https://doi.org/10.1093/imanum/drt062
Bensoussan, A., Goursat, M., Lions, J.L.: Contrôle impulsionnel et inéquations quasi-variationnelles stationnaires. C. R. Acad. Sci. Paris Sér. A-B 276, A1279–A1284 (1973)
Bensoussan, A., Lions, J.L.: Controle impulsionnel et inéquations quasi-variationnelles d’évolutions. C. R. Acad. Sci. Paris 276, 1333–1338 (1974)
Bensoussan, A., Lions, J.L.: Applications of variational inequalities in stochastic control, Studies in Mathematics and its Applications, vol. 12. North-Holland Publishing Co., Amsterdam-New York (1982). Translated from the French
Bensoussan, A., Lions, J.L.: Impulse control and quasivariational inequalities. \(\mu \). Gauthier-Villars, Montrouge; Heyden & Son, Inc., Philadelphia, PA (1984). Translated from the French by J. M. Cole
Birkhoff, G.: Lattice Theory, American Mathematical Society Colloquium Publications, vol. 25, 3rd edn. American Mathematical Society, Providence (1979)
Bonnans, J.F., Shapiro, A.: Perturbation Analysis of Optimization Problems. Springer Series in Operations Research. Springer, New York (2000). https://doi.org/10.1007/978-1-4612-1394-9
Boyer, F., Fabrie, P.: Mathematical Tools for the Study of the Incompressible Navier–Stokes Equations and Applied Mathematical Sciences. Springer, Berlin (2012)
Christof, C.: Sensitivity analysis and optimal control of obstacle-type evolution variational inequalities. SIAM J. Control Optim. 57(1), 192–218 (2019). https://doi.org/10.1137/18M1183662
Evans, L.C.: Partial Differential Equations. American Mathematical Society, Providence (1998)
Facchinei, F., Kanzow, C.: Generalized nash equilibrium problems. 4OR 5(3), 173–210 (2007). https://doi.org/10.1007/s10288-007-0054-4
Friedman, A., Jensen, R.: A parabolic quasi-variational inequality arising in hydraulics. Annali della Scuola Normale Superiore di Pisa - Classe di Scienze Ser. 4, 2(3), 421–468 (1975)
Friedman, A., Kinderlehrer, D.: A class of parabolic quasi-variational inequalities. J. Differ. Equ. 21(2), 395–416 (1976). https://doi.org/10.1016/0022-0396(76)90128-5
Glowinski, R., Lions, J.P., Trémolières, R.: Numerical Analysis of Variational Inequalities. North-Holland, Amsterdam (1981)
Gupta, S.C.: The Classical Stefan Problem. Elsevier, Amsterdam (2018). Basic Concepts, Modelling and Analysis with Quasi-analytical Solutions and Methods, New edition of [ MR2032973]
Haraux, A.: How to differentiate the projection on a convex set in Hilbert space: some applications to variational inequalities. J. Math. Soc. Jpn. 29(4), 615–631 (1977)
Haraux, A.: How to differentiate the projection on a convex set in Hilbert space: some applications to variational inequalities. J. Math. Soc. Jpn. 29(4), 615–631 (1977). https://doi.org/10.2969/jmsj/02940615
Harker, P.T.: Generalized nash games and quasi-variational inequalities. Eur. J. Oper. Res. 54(1), 81–94 (1991). https://doi.org/10.1016/0377-2217(91)90325-P
Hintermüller, M., Rautenberg, C.N.: Parabolic quasi-variational inequalities with gradient-type constraints. SIAM J. Optim. 23(4), 2090–2123 (2013)
Hintermüller, M., Rautenberg, C.N., Strogies, N.: Dissipative and non-dissipative evolutionary quasi-variational inequalities with gradient constraints. Set-Valued and Variational Analysis (2018). https://doi.org/10.1007/s11228-018-0489-0
Jarušek, J., Krbec, M., Rao, M., Sokolowski, J.: Conical differentiability for evolution variational inequalities. J. Differ. Equ. 193(1), 131–146 (2003). https://doi.org/10.1016/S0022-0396(03)00136-0
Kunze, M., Monteiro Marques, M.D.P.: On parabolic quasi-variational inequalities and state-dependent sweeping processes. Topol. Methods Nonlinear Anal. 12(1), 179–191 (1998). https://doi.org/10.12775/TMNA.1998.036
Kunze, M., Rodrigues, J.F.: An elliptic quasi-variational inequality with gradient constraints and some of its applications. Math. Methods Appl. Sci. 23(10), 897–908 (2000). https://doi.org/10.1002/1099-1476(20000710)23:10<897::AID-MMA141>3.0.CO;2-H
Lions, J.L.: Quelques Méthodes de Résolutions des Problémes aux Limites non Linéaires. Dunod, Gauthier-Villars (1969)
Lions, J.L.: Perturbations singulières dans les problèmes aux limites et en contrôle optimal. Lecture Notes in Mathematics, vol. 323. Springer, Berlin (1973)
Lions, J.L.: Sur le côntrole optimal des systemes distribuées. Enseigne 19, 125–166 (1973)
Mignot, F.: Contrôle dans les inéquations variationelles elliptiques. J. Funct. Anal. 22(2), 130–185 (1976)
Pang, J.S., Fukushima, M.: Quasi-variational inequalities, generalized nash equilibria, and multi-leader-follower games. Comput. Manag. Sci. 2(1), 21–56 (2005). https://doi.org/10.1007/s10287-004-0010-0
Penot, J.P.: Calculus Without Derivatives. Graduate Texts in Mathematics, vol. 266. Springer, New York (2013). https://doi.org/10.1007/978-1-4614-4538-8
Prigozhin, L.: Sandpiles and river networks: extended systems with non-local interactions. Phys. Rev. E 49, 1161–1167 (1994)
Prigozhin, L.: On the Bean critical-state model in superconductivity. Eur. J. Appl. Math. 7, 237–247 (1996)
Prigozhin, L.: Sandpiles, river networks, and type-II superconductors. Free Boundary Probl. News 10, 2–4 (1996)
Prigozhin, L.: Variational model of sandpile growth. Eur. J. Appl. Math. 7(3), 225–235 (1996). https://doi.org/10.1017/S0956792500002321
Rodrigues, J.F., Santos, L.: A parabolic quasi-variational inequality arising in a superconductivity model. Ann. Scuola Norm. Sup. Pisa Cl. Sci. (4) 29(1), 153–169 (2000)
Showalter, R.E.: Monotone Operators in Banach Space and Nonlinear Partial Differential Equations. American Mathematical Society, Providence (1997)
Tartar, L.: Inéquations quasi variationnelles abstraites. CR Acad. Sci. Paris Sér. A 278, 1193–1196 (1974)
Wachsmuth, G.: A guided tour of polyhedric sets: basic properties, new results on intersections and applications. To appear in J. Convex Anal. (2018)
Author information
Authors and Affiliations
Corresponding author
Additional information
Communicated by L. Caffarelli.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
The original online version of this article was revised due to a retrospective Open Access order.
This research was carried out in the framework of MATHEON supported by the Einstein Foundation Berlin within the ECMath projects OT1, SE5 and SE19 and is funded by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) under Germany’s Excellence Strategy—The Berlin Mathematics Research Center \(\hbox {MATH}^+\) (EXC-2046/1, Project ID: 390685689), and the MATHEON project A-AP24. The authors further acknowledge the support of the DFG through the DFG–SPP 1962: Priority Programme ‘Non-smooth and Complementarity-based Distributed Parameter Systems: Simulation and Hierarchical Optimization’ within Projects 10 and 11.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Alphonse, A., Hintermüller, M. & Rautenberg, C.N. Existence, iteration procedures and directional differentiability for parabolic QVIs. Calc. Var. 59, 95 (2020). https://doi.org/10.1007/s00526-020-01732-6
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s00526-020-01732-6