Abstract
In this article, we establish the existence of weak solutions to the shallow medium equation. We proceed by an approximation argument. First, we truncate the coefficients of the equation from above and below. Then, we prove convergence of the solutions of the truncated problem to a solution to the original equation.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
In this work, we prove the existence of weak solutions to the Cauchy–Dirichlet problem associated with the doubly nonlinear equation
where \(\Omega \subset \mathbb {R}^n\) is an open bounded set, \(z:\Omega \rightarrow \mathbb {R}\) and \(f:\Omega _T\rightarrow \mathbb {R}\) are given sufficiently regular functions, and the parameters \(\alpha \) and p are restricted to the range
The solution u is required to satisfy \(u\ge z\) so that the factor \((u-z)^\alpha \) is well defined. The term doubly nonlinear refers to the fact that the diffusion part depends nonlinearly both on the gradient and on the solution itself. In this article, we consider all \(n\ge 2\), although the equation is typically used in models with only two spatial dimensions.
In the case \(z=0\), (1.1) reduces to a standard doubly nonlinear equation which was studied already in [27] and [21], although using a different notion of weak solution. If we furthermore set \(\alpha =0\), we recover the parabolic p-Laplace equation, whereas taking \(p=2\) we obtain the porous medium equation, after a formal application of the chain rule. Thus, (1.1) can be seen as a generalization of many previously studied equations.
Local boundedness of weak solutions to (1.1) has been proved in [24] for sufficiently regular f and z in the slow diffusion case \(\alpha +p >2\). The article focused on the case \(p<2\), but the value of p does not really play a role in the arguments. In [25], local Hölder continuity was established for \(p>2\). Existence of solutions to a Cauchy–Dirichlet problem corresponding to (1.1) was proved in the case of trivial topography \(z=0\) and slow diffusion \(\alpha + p >2\) in [3]. See also [4, 7, 15, 26] which study the existence of solutions to the Cauchy–Dirichlet problem for doubly nonlinear equations with general structure conditions. Existence, comparison properties and asymptotic behavior of solutions to a nonlinear parabolic problem related to our case have been studied in [8].
In the range \(p > 2\), Eq. (1.1) is used to describe the dynamics of glaciers in the so-called shallow ice approximation; see for example Chapter 2 of [20] and the classic work [19]. Laboratory experiments, theoretical considerations and field measurements suggest that in these applications \(p\approx 4\) provides the best description. This corresponds to the value 3 for the exponent appearing in the flow law for polycrystalline ice which was established in [13]. In the range \(p<2\), Eq. (1.1) is used to model shallow water dynamics in situations such as floods and dam breaks; see [3, 11, 14]. Due to the variety of applications, we follow the terminology from [25] and use the term diffusive shallow medium equation or more concisely, DSM equation, to describe Eq. (1.1) for all \(p>1\).
The given function z describes the elevation of the land on top of which the water or ice is moving, measured with respect to some arbitrarily chosen ground level. The value of u represents the height of the medium measured with respect to the same level. This provides a physical motivation for the condition \(u\ge z\) which was imposed earlier. Even though u and z depend on the ground level, their difference \(v:=u-z\) is invariant. Therefore, it is natural to reformulate the equation in terms of v; see (2.3) and Definition 2.1 of Sect. 2. As has been established in [24, 25], this formulation is also mathematically convenient.
The right-hand side f is a source term which can account for snowfall in the case of glaciers, and rainfall, evaporation or infiltration in the shallow water setting. Thus, its sign may vary in applications. However, in this paper we restrict ourselves to nonnegative source terms, which is necessary for the comparison principle argument utilized in Lemma 5.3. In physical applications, it is reasonable to assume that f is bounded, but our existence result requires only that the source term has sufficiently high integrability; see Sect. 2 for details.
The function z is required to belong to a first-order Sobolev space with sufficiently high exponent; see (2.1). In fact, the exponent is higher than the dimension of \(\Omega \), which means that z is locally Hölder continuous. The space of admissible z contains for example all Lipschitz functions, which include the topographies occurring in realistic diffusion models.
Although the shallow medium equation has been studied in the literature, to our knowledge existence of weak solutions has not been established except for the trivial topography \(z=0\). Our aim in this article is to close this gap. Our main result, Theorem 2.2, ensures the existence of weak solutions to the Cauchy–Dirichlet problem of the shallow medium equation (1.1).
2 Setting and main result
In the following, we explain the precise setting and present the natural definition of weak solutions which was introduced in [24]. For the right-hand side \(f :\Omega _T \rightarrow \mathbb {R}_{\ge 0}\) and the known function \(z:\Omega \rightarrow \mathbb {R}\) and the initial datum \(\psi :\Omega \rightarrow \mathbb {R}\), we assume
for some \(\sigma >\frac{n+p}{p}\) and where \(\beta \) is given by (2.4). Here, \(p'=\frac{p}{p-1}\) denotes the Hölder conjugate of p. We consider the Cauchy–Dirichlet problem
for \(u:\Omega _T\rightarrow \mathbb {R}\) with \(u\ge z\). In order to motivate the natural definition of weak solutions, we reformulate (2.2)\(_1\) in terms of \(v:=u-z\). Formally applying the chain rule as in [24], we can write Eq. (2.2)\(_1\) in the form
where
Throughout the article, we will focus on this form of the equation. In order to simplify the notation, we denote the vector field appearing in the diffusion part of (2.3) as
for \(v\ge 0\) and \(\xi \in \mathbb {R}^n\), so that the Cauchy–Dirichlet problem (2.2) can be rewritten as
We arrive at the following definition of weak solutions by multiplying (2.3) by a smooth test function and integrating formally by parts.
Definition 2.1
Suppose that f, z and \(\psi \) satisfy (2.1). A function \(u:\Omega _T\rightarrow \mathbb {R}\) is a weak solution to Eq. (1.1) if and only if \(v:=u-z\) is nonnegative, \(v^\beta \in L^p(0,T;W^{1,p}(\Omega ))\) and
for all \(\varphi \in C^\infty _0(\Omega _T)\). Moreover, in this case we say that v is a solution to (2.3). If in addition \(v\in C([0,T];L^{\beta +1}(\Omega ))\), \(v^\beta \in L^p(0,T;W^{1,p}_0(\Omega ))\) and if \(v(0)=\psi -z\), we say that v is a solution to the Cauchy–Dirichlet problem (2.6) and u is a solution to the Cauchy–Dirichlet problem (2.2).
Now, we can state our main existence theorem.
Theorem 2.2
Suppose that f, z and \(\psi \) satisfy (2.1). Then, there exists a weak solution to the Cauchy–Dirichlet problem (2.2) in the sense of Definition 2.1.
The main difficulty in the proof of Theorem 2.2 stems from the degeneracy of the shallow medium equation with respect to the solution and from the topography z which in a certain sense plays the role of an obstacle. For the construction of our solution, we proceed by an approximation argument. More precisely, we truncate the vector field A with respect to v from above and from below. For \(k>1\), we let
for \(v\in \mathbb {R}\) and \(\xi \in \mathbb {R}^n\), where the truncation \(T_k:\mathbb {R}\rightarrow \mathbb {R}\) is given by
Then, we consider approximating solutions \(v_k\) to a Cauchy–Dirichlet problem associated with the vector fields \(A_k\) and with initial and lateral boundary data \(\frac{1}{k}\), respectively, \(\frac{1}{k}+\Psi \). The next crucial step is to investigate the properties of the approximating solutions. We prove the lower bound \(v_k\ge \frac{1}{k}\) and an upper bound for \(v_k\) independent of k. For large k, this allows us to conclude that
Next, we prove a uniform bound for the \(L^p(\Omega _T)\)-norms of the gradients \(\nabla v_k^\beta \). This shows that there exists a weakly convergent subsequence of \(v_k^\beta \) in \(L^p(0,T;W^{1,p}(\Omega ))\) to some function w. By a delicate compactness argument (see Sect. 4), we identify the pointwise limit of \(v_k\) as \(v:=w^{\frac{1}{\beta }}\). In the next step, we use the differential equations for \(v_k\), the ellipticity of the vector field A and the previously proved convergences to conclude pointwise convergence of the gradients \(\nabla v_{k}^\beta \rightarrow \nabla v^\beta \) for a subsequence. This finally allows us to pass to the limit in the differential equations for \(v_k\) and in turn to conclude that \(u=v+z\) is the desired weak solution to the shallow medium equation.
We remark that there are also other ways of utilizing compactness to obtain the existence. For example, one could apply Corollary 7 of [23] to the sequence \((v_k^{\beta + 1})\), similarly as in the proof of Corollary 6.7 in [18]. One then needs to show that \(\partial _t v_k^{\beta +1}\) exists and is bounded in a suitable space, but this can be done by using the mollified formulation (5.9) with a test function involving \(([v_k]_h)^\beta \). The fact that \(v_k\) itself has a gradient makes the argument possible. While such an approach is rather elegant, the compactness results that we obtain have the advantage of being directly applicable to similar problems in future research.
3 Preliminaries
Here, we introduce some notation and present auxiliary tools that will be useful in the course of the paper.
3.1 Notation
With \(B_\varrho (y_o)\), we denote the open ball in \(\mathbb {R}^m\), \(m\in \mathbb {N}\) with radius \(\varrho \) at center \(y_o\in \mathbb {R}^m\). For \(v,w \ge 0\), we define
where \(\beta \) is introduced in (2.4). For convenience, we will sometimes use the shorthand notation \(w(t)=w(\cdot ,t)\) for \(t\in [0,T]\). By c, we denote a generic constant which may change from line to line.
3.2 Mollifications in time
Since weak solutions are not necessarily weakly differentiable with respect to time, we will use some mollification techniques. We first recall the definition of Steklov averages. Given a function \(v\in L^1(\Omega _T)\) and \(h\in (0,T)\), we define its Steklov mean by
and the reversed Steklov mean by
For some basic properties of Steklov averages, we refer to Chapter 1.3 of [9] and Sect. 9 of [10]. For porous medium-type equations and more generally doubly nonlinear equations, it is often necessary to work with the following so-called exponential time mollification, cf. [17]. For a function \(v\in L^1(\Omega _T)\) and \(h\in (0,T)\), we set
for \(x\in \Omega \) and \(t\in [0,T]\). Moreover, we define the reversed analogue by
For details regarding the properties of the exponential mollification, we refer to [17, Lemma 2.2], [6, Lemma 2.2], and [26, Lemma 2.9]. The properties of the mollification that we will use have been collected for convenience into the following lemma:
Lemma 3.1
Suppose that \(v \in L^1(\Omega _T)\), and let \(p\in [1,\infty )\). Then, the mollification \(\llbracket v\rrbracket _h\) defined in (3.4) has the following properties:
-
(i)
If \(v\in L^p(\Omega _T)\), then \(\llbracket v\rrbracket _h\in L^p(\Omega _T)\),
$$\begin{aligned} \Vert \llbracket v\rrbracket _h\Vert _{L^p(\Omega _T)}\le \Vert v\Vert _{L^p(\Omega _T)}, \end{aligned}$$and \(\llbracket v\rrbracket _h\rightarrow v\) in \(L^p(\Omega _T)\). A similar statement holds for \(\llbracket v\rrbracket _{\bar{h}}\).
-
(ii)
If \(v\in L^p(\Omega _T)\), then \(\llbracket v\rrbracket _h\) and \(\llbracket v\rrbracket _{\bar{h}}\) have weak time derivatives belonging to \(L^p(\Omega _T)\) given by
$$\begin{aligned} \partial _t \llbracket v\rrbracket _h=\tfrac{1}{h}(v-\llbracket v\rrbracket _h),\quad \partial _t \llbracket v\rrbracket _{\bar{h}}=\tfrac{1}{h}(\llbracket v\rrbracket _{\bar{h}}-v). \end{aligned}$$ -
(iii)
If \(v\in L^p(0,T;W^{1,p}(\Omega ))\), then \(\llbracket v\rrbracket _h\in L^p(0,T;W^{1,p}(\Omega ))\) and \(\llbracket v\rrbracket _h\rightarrow v\) in
\(L^p(0,T;W^{1,p}(\Omega ))\) as \(h\rightarrow 0\). Moreover, if \(v\in L^p(0,T;W_0^{1,p}(\Omega ))\), then also
\(\llbracket v\rrbracket _h\in L^p(0,T;W_0^{1,p}(\Omega ))\). Similar statements hold for \(\llbracket v\rrbracket _{\bar{h}}\).
-
(iv)
If \(v\in L^p(0,T;L^{p}(\Omega ))\), then \(\llbracket v\rrbracket _h,\,\llbracket v\rrbracket _{\bar{h}} \in C([0,T];L^{p}(\Omega ))\).
3.3 Elementary inequalities
We now recall some elementary inequalities that will be used later, and start by some useful estimates for the quantity \(\mathfrak {b}[v,w]\) that was defined in (3.1). The proof can be found in [5, Lemma 2.2 and Lemma 2.3].
Lemma 3.2
Let \(v,w \ge 0\) and \(\beta > 1\). Then, there exists a constant \(c\ge 1\) depending only on \(\beta \) such that:
-
(i)
\(\tfrac{1}{c}\big | w^{\frac{\beta +1}{2}}-v^{\frac{\beta +1}{2}} \big |^2 \le \mathfrak {b}[v,w] \le c \big | w^{\frac{\beta +1}{2}}-v^{\frac{\beta +1}{2}} \big |^2\),
-
(ii)
\(\mathfrak {b}[v,w] \ge \tfrac{1}{c} |v-w|^{\beta +1}\),
-
(iii)
\(\mathfrak {b}[v,w] \le c |v^{\beta }-w^{\beta }|^{\frac{\beta +1}{\beta }}\).
The following algebraic lemma is standard and can be obtained by reasoning as in the proof of Lemma 5.1 in Chapter IX of [9].
Lemma 3.3
Let \(p>1\). Then, for any \(\xi ,\eta \in \mathbb {R}^n\) which in the case \(p\in (1,2)\) are not both zero there holds
where c is a constant depending only on p.
Remark 3.4
Let \(p>1\) and \(\xi ,\eta ,\zeta \in \mathbb {R}^n\) with either \(\xi \not =\zeta \) or \(\eta \not =\zeta \) in the case \(p\in (1,2)\). Then, by Lemma 3.3 there holds
If \(p\ge 2\), we use \(|\xi -\eta |^2\le 2(|\xi +\zeta |^2+|\eta +\zeta |^2)\) to conclude
Lemma 3.5
Let \(p>1\). Then, for any \(\xi ,\eta \in \mathbb {R}^n\) there holds
Proof
By Young’s inequality, we obtain
This is the claimed inequality. \(\square \)
Lemma 3.6
Let \(p>1\) and \(\xi ,\eta \in \mathbb {R}^n\). In the case \(p<2\), we also assume that either \(\xi \) or \(\eta \) is nonzero. Then, there exists a constant c depending only on p such that:
Proof
Applying [1, Lemma 2.2] with \(\mu =0\), we obtain
which is the desired estimate. \(\square \)
3.4 Auxiliary tools
The following lemma can be proven using an inductive argument; see for example [9, § I, Lemma 4.1].
Lemma 3.7
Let \((Y_j)^\infty _{j=0}\) be a sequence of positive real numbers such that
where \(C, b >1\) and \(\delta >0\). If
then \((Y_j)\) converges to zero as \(j\rightarrow \infty \).
Next, we recall a well-known parabolic Sobolev inequality, which can be found for example in [9].
Lemma 3.8
Let \(\Omega \subset \mathbb {R}^{n}\), \(T>0\) and \(\theta >0\). Suppose that \(q>0\), \(p>1\). Then, for every
we have
for a constant \(c=c(n,p,q,\Omega )\).
We now present a version of the Cauchy–Peano existence theorem which will be used in connection with Galerkin’s method.
Lemma 3.9
Let A be an invertible \(m\times m\) matrix, let \(g:\mathbb {R}^m\rightarrow \mathbb {R}^m\) be continuous, and let \(\tilde{f}:[0,T]\rightarrow \mathbb {R}^m\) be integrable. Let \(y_o\in \mathbb {R}^m\). Then, there exists a \(\delta >0\) and an absolutely continuous function \(y:[0,\delta ]\rightarrow \mathbb {R}^m\) such that
Proof
Since the differential equation is equivalent to
and since \(A^{-1}\circ g\) is continuous and \(A^{-1}\circ \tilde{f}\) is integrable, we may assume \(A=I_{m\times m}\). Fix \(r>0\) and set \(M:=\max _{\bar{B}_r(y_o)}|g|\). Define \(\delta _1:=\tfrac{r}{2M+1}\) and pick \(\delta _2>0\) so small that
We choose \(\delta =\min \{\delta _1,\delta _2\}\). Denote by \(c(y_o)\) the constant function \([0,\delta ]\rightarrow \mathbb {R}^m\) taking the value \(y_o\) for all \(t\in [0,\delta ]\), and define \(\widetilde{F}(t):=M+|\tilde{f}(t)|\). Consider the subset
of the vector space \(C([0,\delta ];\mathbb {R}^m)\) equipped with the sup-norm. The set K is convex and closed. Furthermore, the elements of K are equicontinuous pointwise bounded functions, so the Arzelà–Ascoli theorem guarantees that K is compact. We note that we have a well-defined map \(T:K\rightarrow K\) defined by
Since
the uniform continuity of g on the compact set \(\bar{B}_r(y_o)\) shows that T is continuous \(K\rightarrow K\). Thus, Schauder’s fixed point theorem guarantees that there exists at least one \(y\in K\) such that \(Ty=y\). But then y is a solution to the original problem. \(\square \)
3.5 Time continuity
We have taken a certain time continuity as part of the definition of a solution to the Cauchy–Dirichlet problem. Our next goal is to show that all weak solutions vanishing on the lateral boundary and having sufficiently high integrability automatically have this type of time continuity. The extra integrability condition \(v\in L^{\beta +1}(\Omega _T)\) occurring in the two following lemmas is redundant in the case \(\alpha +p\ge 2\), since then \(\beta p\ge \beta + 1\). Note that we will only apply the results to the bounded solution found in Sect. 6, which obviously satisfies the extra integrability condition.
Lemma 3.10
For all \(v\in L^{\beta +1}(\Omega _T;\mathbb {R}_{\ge 0})\) with \(v^\beta \in L^p(0,T;W^{1,p}_0(\Omega ))\) satisfying (2.7), all \(\zeta \in W^{1,\infty }([0,T];\mathbb {R}_{\ge 0})\) satisfying \(\zeta (0)=\zeta (T)=0\), and all w in
we have
Proof
We use (2.7) with the test function \(\varphi _h=\zeta (w^\beta -\llbracket v^\beta \rrbracket _h)\). This is possible since we can find \(\varphi _{h,j}\in C^\infty _0(\Omega _T)\) such that \(\varphi _{h,j}\rightarrow \varphi _h\) in \(L^p(0,T;W^{1,p}(\Omega ))\) and \(\partial _t \varphi _{h,j} \rightarrow \partial _t \varphi _h\) in \(L^\frac{\beta +1}{\beta }(\Omega _T)\). We see immediately that
and
The parabolic term is treated as follows:
In the third step, we have used Lemma 3.1 (ii) to conclude that the last term is nonpositive, and the second term has been integrated by parts. The limit of the parabolic term as \(h\rightarrow 0\) exists due to (3.6), (3.7) and (2.7) and satisfies the estimate
In the last step, we integrate the second term by parts. Combining (3.6), (3.7) and (3.8), we have verified “\(\le \)” in (3.5). The reverse inequality follows in a similar way by taking \(\varphi =\zeta (w^\beta -\llbracket v^\beta \rrbracket _{\bar{h}})\) as a test function.\(\square \)
Using the previous lemma, we can now conclude the time continuity.
Lemma 3.11
Every \(v\in L^{\beta +1}(\Omega _T;\mathbb {R}_{\ge 0})\) with \(v^\beta \in L^p(0,T;W^{1,p}_0(\Omega ))\) satisfying (2.7) has a representative in \(C([0,T];L^{\beta +1}(\Omega ))\).
Proof
Take \(\psi \in C^\infty (\mathbb {R};[0,1])\) such that \(\psi (t)=1\) for \(t\le \tfrac{1}{2} T\), \(\psi (t)=0\) for \(t>\tfrac{3}{4} T\) and \(|\psi '|\le \tfrac{8}{T}\). For \(\tau \in (0, \tfrac{1}{2} T)\) and \(\varepsilon >0\) so small that \(\tau +\varepsilon < \tfrac{1}{2} T\), define
Using identity (3.5) from Lemma 3.10 with \(\zeta =\chi ^\tau _\varepsilon \psi \) and \(w=(\llbracket v^\beta \rrbracket _{\bar{h}})^\frac{1}{\beta }\), we obtain
Here, we used Lemma 3.1 (ii) to conclude that the term involving \(\partial _t w^\beta \) is nonpositive. Also, we have made use of Lemma 3.2 (iii) to estimate \(\mathfrak {b}[v,w]\). Passing to the limit \(\varepsilon \rightarrow 0\), we see that
for all \(\tau \in (0,\tfrac{T}{2}){\setminus } N_h\) where \(N_h\) is a set of measure zero. Using Lemma 3.2 (ii), we find a lower bound for the integrand on the left-hand side:
Taking now a sequence \(h_j\downarrow 0\) and setting \(w_j:= ( [\![v^\beta ]\!]_{\bar{h}_j})^\frac{1}{\beta }\) and \(N:=\cup N_{h_j}\), we see that
Observe that every map of the form \(w=(\llbracket v^\beta \rrbracket _{\bar{h}})^\frac{1}{\beta }\) is continuous \([0,T]\rightarrow L^{\beta +1}(\Omega )\) since
and \(\llbracket v^\beta \rrbracket _{\bar{h}}\) is continuous \([0,T]\rightarrow L^\frac{\beta +1}{\beta }(\Omega )\) by Lemma 3.1 (iv). Because of the uniform limit (3.9), v has a representative which is continuous on \([0,\tfrac{T}{2}]{\setminus } N\) and since N has measure zero we find a representative which is continuous on \([0,\tfrac{T}{2}]\). The continuity on \([\tfrac{T}{2},T]\) follows from a similar argument, but taking \(w=(\llbracket v^\beta \rrbracket _h)^\frac{1}{\beta }\) and reflecting \(\chi ^\tau _\varepsilon \) and \(\psi \) on the interval [0, T]. \(\square \)
4 Compactness results
In this section, we provide some compactness results which could be of interest in their own. They are consequences of the well-known compactness results in [23]. Such results are necessary when dealing with porous medium-type equations and doubly nonlinear equations. For the purpose of this paper, we only need Corollary 4.6. Since there is not much extra effort to obtain other versions, we preferred to include them as well for possible future applications. Some results in this direction were previously obtained for uniformly bounded functions that are piecewise constant in time; see [16, Lemma 8] and the references therein.
For a family of functions \(F \subset L^{1}(0,T;L^1(\Omega ;\mathbb {R}^N))\) with \(N\ge 1\), we denote
We start with an Arzéla–Ascoli-type theorem. It will be the basis for all the other compactness results. For a map \(f:[0,T]\rightarrow X\), we define the translated function \(\tau _h f:[0,T-h]\rightarrow X\) by \(\tau _h f(t)=f(t+h)\). We can now formulate the following theorem [23, § 3, Theorem 1]:
Theorem 4.1
Let \(1\le p\le \infty \), \(T>0\) and B be a Banach space. Let \(F\subset L^p(0,T;B)\). Then, F is relatively compact in \(L^p(0,T;B)\) for \(1\le p<\infty \), or in C([0, T]; B) for \(p=\infty \) if and only if
and
Using Theorem 4.1, we obtain the following result.
Theorem 4.2
Let \(m \in (0,\infty )\), \(1\le p\le \infty \), \(1\le q\le \infty \). Define \(\theta :=\max \{1, m p\}\) and \(\mu :=\max \{1,mq\}\). Let \(T>0\) and let \(\Omega \subset \mathbb {R}^n\) be a bounded domain. Let X be a Banach space with compact embedding \(X\hookrightarrow L^q(\Omega ;\mathbb {R}^N)\). Moreover, let \(F \subset L^{\theta }(0,T;L^\mu (\Omega ;\mathbb {R}^N))\) such that \(F^m\subset L^1(0,T;X)\). We assume that
and
and
Then, \(F^m\) is relatively compact in \(L^{p}(0,T;L^q(\Omega ;\mathbb {R}^N))\) for \(1\le p<\infty \). For \(p=\infty \), it is relatively compact in \(C([0,T];L^q(\Omega ;\mathbb {R}^N))\).
Before proceeding with the proof, we note that the assumption \(F \subset L^{\theta }(0,T;L^\mu (\Omega ;\mathbb {R}^N))\) guarantees that \(F^m\subset L^p(0,T;L^q(\Omega ;\mathbb {R}^N))\). In the case \(m\ge 1\), we additionally need to assume that \(F^m\) is bounded in \(L^p(0,T;L^q(\Omega ;\mathbb {R}^N))\). The assumption \(F^m\subset L^1(0,T;X)\) is to be understood in the following sense: For every \(f\in F\), there is an element of \(L^1(0,T;X)\) whose composition with the inclusion \(X\hookrightarrow L^q(\Omega )\) coincides with \(f^m\in L^p(0,T;L^q(\Omega ;\mathbb {R}^N))\).
Proof of Theorem 4.2
Our aim is to apply Theorem 4.1 to the functions \(F^m\) with \(B=L^q(\Omega ;\mathbb {R}^N)\). From (4.4), we know that \(F^m\) is bounded in \(L^1(t_1,t_2;X)\) for any \(0<t_1<t_2<T\). Due to the compact embedding \(X\subset L^q(\Omega ;\mathbb {R}^N)\), this implies that \(\{\int _{t_1}^{t_2} f^m(t) \,\mathrm {d}t: f^m\in F^m\}\) is relatively compact in \(L^q(\Omega ;\mathbb {R}^N)\) for any \(0<t_1<t_2<T\). Hence, (4.1) is satisfied. Next, we verify assumption (4.2). Suppose first \(p<\infty \). If \(m\in (0,1)\), we have
Otherwise, if \(m\ge 1\), then \(\theta =mp\) and \(\mu =mq\) and we compute
This ensures that also (4.2) is satisfied. In the case \(p=\infty \), one must again consider the two ranges for m. The calculations are similar. Therefore, Theorem 4.1 yields the compactness of \(F^m\) in \(L^{p}(0,T;L^q(\Omega ;\mathbb {R}^N))\) for \(1\le p<\infty \), or in \(C(0,T;L^q(\Omega ;\mathbb {R}^N))\) for \(p=\infty \). \(\square \)
This interpolation lemma is from [23, § 8, Lemma 8].
Lemma 4.3
Let B, X, Y be Banach spaces with \(X\subset B\subset Y\) and compact embedding \(X \hookrightarrow B\). Then, for any \(\eta >0\) there exists \(M_\eta >0\) such that
Let X be a Banach space and let \(q,\mu \in [1,\infty )\). Throughout the rest of this section, we will consider compact embeddings \(T:X\rightarrow L^q(\Omega ;\mathbb {R}^N)\) and \(S:L^{\mu '}(\Omega ;\mathbb {R}^N)\rightarrow X'\) that are compatible in the following way. For any \(v\in X\) for which it happens that the function Tv belongs to \(L^\mu (\Omega ;\mathbb {R}^N)\) and for any \(w\in L^{\mu '}(\Omega ;\mathbb {R}^N)\), we have
where \(\langle \cdot ,\cdot \rangle \) denotes the dual pairing of X and \(X'\).
We need the following more complicated version of Lemma 4.3.
Lemma 4.4
Let \(m\in (0,\infty )\), \(p,q,\mu \in [1,\infty )\), \(\theta :=\max \{1,m p\}\), \(T>0\), and \(\Omega \subset \mathbb {R}^n\) be a bounded domain and X, Y be Banach spaces such that \(X\subset L^q(\Omega ;\mathbb {R}^N)\) and \(L^{\mu '}(\Omega ;\mathbb {R}^N)\subset X'\subset Y\) with compact embeddings \(T: X \hookrightarrow L^q(\Omega ;\mathbb {R}^N)\) and \(S: L^{\mu '}(\Omega ;\mathbb {R}^N)\hookrightarrow X'\) satisfying (4.6). Then, for any \(\eta >0\) there exists \(M_\eta >0\) such that
for any \(f\in L^{\theta }(0,T;L^{\mu '}(\Omega ;\mathbb {R}^N))\) with \(f^m\in L^p(0,T;X)\cap L^p(0,T;L^\mu (\Omega ;\mathbb {R}^N))\).
Note that the assumption \(f^m\in L^p(0,T;X)\) must be understood in the same sense as in Lemma 4.2. In the sequel, we do not explicitly write out the embeddings. It will be clear from the context when they should be present.
Proof
We consider \(v\in L^p(0,\tau ;X)\cap L^p(0,\tau ;L^\mu (\Omega ;\mathbb {R}^N))\) (i.e., the composition of v with the embedding T results in a function belonging to \(L^p(0,\tau ;L^\mu (\Omega ;\mathbb {R}^N))\)) and \(w\in L^{\theta }(0,\tau ;L^{\mu '}(\Omega ;\mathbb {R}^N))\) with \(\tau \in (0,T]\). Since \(L^{\mu '}(\Omega ;\mathbb {R}^N)\subset X'\), we know that \(\langle v(t),w(t)\rangle \) is defined for a.e. \(t\in (0,\tau )\), where \(\langle \cdot ,\cdot \rangle \) denotes the dual pairing of X and \(X'\). Hence, we may calculate
where in the last line we applied Hölder’s inequality. Here, we have omitted the embeddings S and T to simplify the notation. In particular, choosing \(v=f^m\) and \(w=f\) in (4.7) yields
For \(t\in [0,T-h]\), set \(\Omega ^t=\Omega \cap \{\tau _h f^m(t)\ne f^m(t)\}\). Applying in turn Hölder’s inequality, Lemma 3.3 and (4.7) with the choice \(v=\tau _h f^m-f^m\) and \(w=\tau _h f-f\) leads to
for some constant \(c=c(m)\ge 1\). The set \(\Omega ^t\) was introduced instead of \(\Omega \) to avoid dividing by zero in the case \(m>1\). Now, let \(\eta _1>0\). By Lemma 4.3, there exists \(M_{\eta _1}>0\) such that
where \(c=c(m,p)\). If \(m\le 1\), we therefore immediately conclude that
where \(c=c(m,p)\). If \(m> 1\), we use Hölder’s inequality to estimate \(\text{ I }\) from below and get with the abbreviation \(F:=(|\tau _h f^m| + |f^m|)^{\frac{1}{m}}\), inequality (4.8) and Young’s inequality that
We have also used the fact that \(\theta =m p\) when \(m>1\). Combining both cases and choosing \(\eta _1\) in a suitable way, we conclude that for any \(\eta _2>0\) there exists \(M_{\eta _2}>0\) such that
If \(q\le \frac{m+1}{m}\), the asserted inequality follows by an application of Hölder’s inequality. In the other case where \(q> \frac{m+1}{m}\), we apply Lemma 4.3 again and find that for \(\eta >0\) there exists \(M_{\eta }>0\) such that
At this point, the claimed inequality follows by choosing \(\eta _2\) so small that \(cM_\eta \eta _2\le \eta \). \(\square \)
With these prerequisites at hand, we are able to prove the following more refined version of Theorem 4.2, where assumption (4.5) is weakened in the sense that \(L^\mu (\Omega ;\mathbb {R}^N)\) is replaced by a Banach space Y with \(X'\subset Y\).
Theorem 4.5
Let \(m\in (0,\infty )\), \(p,q,\mu \in [1,\infty )\), \(\theta :=\max \{1,m p\}\), \(T>0\), and \(\Omega \subset \mathbb {R}^n\) be a bounded domain and X, Y be Banach spaces such that \(X\subset L^q(\Omega ;\mathbb {R}^N)\) and \(L^{\mu '}(\Omega ;\mathbb {R}^N)\subset X'\subset Y\) with compact embeddings \(X \hookrightarrow L^{q}(\Omega ;\mathbb {R}^N)\) and \(L^{\mu '}(\Omega ;\mathbb {R}^N)\hookrightarrow X'\) satisfying (4.6). Moreover, let \(F \subset L^{\theta }(0,T;L^{\mu '}(\Omega ;\mathbb {R}^N))\) such that \(F^m\subset L^p(0,T;X)\cap L^p(0,T;L^\mu (\Omega ;\mathbb {R}^N))\). We assume that
and
-
(i)
If
$$\begin{aligned} \Vert \tau _h f - f\Vert _{L^\theta (0,T-h;Y)}\rightarrow 0 \quad \text{ as } h\downarrow 0, \quad \text{ uniformly } \text{ for } f\in F \end{aligned}$$is satisfied, then \(F^m\) is relatively compact in \(L^{p}(0,T;L^q(\Omega ;\mathbb {R}^N))\).
-
(ii)
If
$$\begin{aligned} \Vert \tau _h f - f\Vert _{L^1(0,T-h;Y)}\rightarrow 0 \quad \text{ as } h\downarrow 0, \quad \text{ uniformly } \text{ for } f\in F \end{aligned}$$is satisfied, then \(F^m\) is relatively compact in \(L^{\tilde{p}}(0,T;L^q(\Omega ;\mathbb {R}^N))\) for any \(\tilde{p}\in [1,p)\).
Proof
Our aim is to apply Theorem 4.1 with \(B=L^q(\Omega ;\mathbb {R}^N)\) and \(F^m\) instead of F. From (4.10), we know that \(F^m\) is bounded in \(L^p(t_1,t_2;X)\) and hence in \(L^1(t_1,t_2;X)\) for any \(0<t_1<t_2<T\). Due to the compact embedding \(X\hookrightarrow L^{q}(\Omega ;\mathbb {R}^N)\), this implies that \(\{\int _{t_1}^{t_2} f^m(t) \,\mathrm {d}t: f^m\in F^m\}\) is relatively compact in \(L^q(\Omega ;\mathbb {R}^N)\) for any \(0<t_1<t_2<T\). Hence, (4.1) is satisfied. Next, we verify assumption (4.2). For \(\eta >0\), we denote by \(M_\eta >0\) the constant from Lemma 4.4. The application of the lemma yields that
holds true for any \(f\in F\). Assumptions (4.9) and (4.10) ensure the existence of a constant \(C>0\) such that \(\Vert f^m\Vert _{L^{p}(0,T;X)}\le C\) and \(\Vert f\Vert _{L^{\theta }(0,T;L^{\mu '}(\Omega ;\mathbb {R}^N))}\le C\) for any \(f\in F\). Hence, for \(\varepsilon >0\) we may choose \(\eta >0\) in the preceding inequality small enough such that
for a constant \(C_\varepsilon >0\) depending on \(\varepsilon \), but not on h. If the assumption of assertion (i) holds, then there is \(h_\varepsilon \) such that if \(0<h<h_\varepsilon \), then
for all \(f\in F\). But this means that
for all \(f\in F\) when \(h\in (0,h_\varepsilon )\). Since \(\varepsilon >0\) was arbitrary, this verifies Assumption (4.2) in Theorem 4.1. Therefore, the application of the theorem yields the compactness of \(F^m\) in \(L^{p}(0,T;L^q(\Omega ;\mathbb {R}^N))\) proving assertion (i).
The second assertion (ii) will follow by an interpolation argument. If \(\theta =1\), then the result already follows by (i). Therefore, it is enough to consider the case \(1<\theta =m p\). We consider \(\tilde{p}\in [1,p)\). Without loss of generality, we may assume that \(\tilde{\theta }:=m\tilde{p}>1\). We interpolate
Due to Assumption (4.9), \(\Vert f\Vert _{L^{\theta }(0,T;L^{\mu '}(\Omega ;\mathbb {R}^N))}\) is bounded independent of f and therefore we have that \(\Vert \tau _h f-f\Vert _{L^{\tilde{\theta }}(0,T-h;Y)}\rightarrow 0\) as \(h\downarrow 0\) uniformly for \(f\in F\). This allows us to apply Theorem 4.1 with \(\tilde{p}\) instead of p and thus yields assertion (ii). \(\square \)
Applying Theorem 4.5 with \(X=W^{1,p}_0(\Omega ;\mathbb {R}^N)\) yields the following Corollary.
Corollary 4.6
Let \(m\in (0,\infty )\), \(p\in [1,\infty )\), and \(T>0\), \(\Omega \subset \mathbb {R}^n\) be a bounded domain and Y be a Banach space such that \((W^{1,p}_0(\Omega ;\mathbb {R}^N))'\subset Y\). Moreover, let \(\theta :=\max \{1,m p\}\), \(q\in [p,\frac{np}{n-p})\), \(\mu \in [p,\frac{np}{n-p})\) if \(p<n\) and \(q,\mu \in [p,\infty )\) if \(p\ge n\) and consider \(F \subset L^{\theta }(0,T;L^{\mu '}(\Omega ;\mathbb {R}^N))\) such that \(F^m\subset L^p(0,T;W^{1,p}(\Omega ;\mathbb {R}^N))\). We assume that
and
-
(i)
If
$$\begin{aligned} \Vert \tau _h f - f\Vert _{L^\theta (0,T-h;Y)}\rightarrow 0 \quad \text{ as } h\downarrow 0, \quad \text{ uniformly } \text{ for } f\in F \end{aligned}$$is satisfied, then \(F^m\) is relatively compact in \(L^{p}(0,T;L^q(\Omega ;\mathbb {R}^N))\).
-
(ii)
If
$$\begin{aligned} \Vert \tau _h f - f\Vert _{L^1(0,T-h;Y)}\rightarrow 0 \quad \text{ as } h\downarrow 0, \quad \text{ uniformly } \text{ for } f\in F \end{aligned}$$is satisfied, then \(F^m\) is relatively compact in \(L^{\tilde{p}}(0,T;L^q(\Omega ;\mathbb {R}^N))\) for any \(\tilde{p}\in [1,p)\).
Proof
We restrict ourselves to the proof of case (i), since (ii) is completely analogous. Let \(\widetilde{\Omega }\Subset \Omega \) and \(\eta \in C_0^\infty (\Omega )\) be a nonnegative function with \(\eta \equiv 1\) in \(\widetilde{\Omega }\). We now consider the family of functions \(F_\eta :=\{\eta ^{\frac{1}{m}}f: f\in F\}\). Then, \(F_\eta \subset L^{\theta }(0,T;L^{\mu '}(\Omega ;\mathbb {R}^N))\) and \(F_\eta ^m \subset L^p(0,T;W^{1,p}_0(\Omega ;\mathbb {R}^N))\). Our goal is to apply Theorem 4.5 with \(X=W^{1,p}_0(\Omega ;\mathbb {R}^N)\) and \(F_\eta \) in place of F. Note that due to the parameter ranges, all elements of X are \(L^q\)-integrable, and the inclusion \(T:X \rightarrow L^q(\Omega ;\mathbb {R}^N)\) is compact. Similarly, we have a compact inclusion \(\tilde{T}:X \rightarrow L^\mu (\Omega ;\mathbb {R}^N)\). Furthermore, we have an embedding \(L^{\mu '}(\Omega ;\mathbb {R}^N)\hookrightarrow X'\) given by \(S:=\tilde{T}'\circ J\), where \(\tilde{T}': (L^\mu (\Omega ;\mathbb {R}^N))'\rightarrow X'\) is the adjoint of \(\tilde{T}\), and J is the standard isomorphism from \(L^{\mu '}(\Omega ;\mathbb {R}^N)\) to \((L^\mu (\Omega ;\mathbb {R}^N))'\). Since \(\tilde{T}\) is compact, Schauder’s theorem guarantees that also S is compact. The condition (4.6) follows directly from the definitions of S, T and \(\tilde{T}\). Notice that assumptions (4.11), (4.12) and (i) imply the corresponding assumptions in Theorem 4.5. Thus, all the assumptions of Theorem 4.5 are satisfied, and hence \(F_\eta ^m\) is relatively compact in \(L^{p}(0,T;L^q(\Omega ;\mathbb {R}^N))\). In particular, this implies that \(F^m\) is relatively compact in \(L^{p}(0,T;L^q(\widetilde{\Omega };\mathbb {R}^N))\). Since \(\widetilde{\Omega }\Subset \Omega \) was arbitrary, we may conclude the relative compactness of \(F^m\) in \(L^{p}(0,T;L^q(\Omega ;\mathbb {R}^N))\) by a diagonal argument. \(\square \)
In the previous proof, we choose X to consist of compactly supported Sobolev functions since the corresponding space of arbitrary Sobolev functions typically is not compactly embedded into \(L^q\). Such a result holds only if \(\Omega \) is sufficiently regular, for example if \(\Omega \) satisfies a cone condition; see [2, Theorem 6.3]. In that case, the proof is somewhat simpler.
Remark 4.7
The assumption on the uniform convergence of the time differences in Theorem 4.6 (i) and (ii) is satisfied for functions with integrable time derivative. More precisely, [23, § 5, Lemma 4] ensures that
implies assumption (i) of Theorem 4.6 and
implies assumption (ii) of Theorem 4.6.
5 The approximation scheme
For \(k>1\), we approximate the vector field A defined in (2.5) by vector fields \(A_k\) given by
for \(v\in \mathbb {R}\) and \(\xi \in \mathbb {R}^n\), where the truncation \(T_k:\mathbb {R}\rightarrow \mathbb {R}\) is defined by
Exploiting the definition of A, we can express \(A_k\) as
5.1 Properties of \(A_k\)
Using the properties of the vector field corresponding to the p-Laplace operator, we can verify the following useful basic properties of \(A_k\).
5.1.1 Monotonicity
Due to Remark 3.4, we know that there exists a constant c depending only on p such that
holds true for any \(v\in \mathbb {R}\) and \(\xi ,\eta \in \mathbb {R}^n\). If \(p<2\) and \(\xi \equiv \eta \equiv -\nabla z\), then the right-hand side has to be interpreted as zero.
5.1.2 Boundedness
For any \(v\in \mathbb {R}\) and \(\xi \in \mathbb {R}^n\), we have
5.1.3 Coercivity
Due to the definition of \(A_k\) and Lemma 3.5, we have that
for any \(v\in \mathbb {R}\) and \(\xi \in \mathbb {R}^n\).
5.2 Weak solutions of the approximating equation
In this section, we want to find weak solutions to the approximating problems
where \(\Psi =\psi -z\). By formally integrating by parts, we are led to the following definition.
Definition 5.1
A function \(v_k\in C([0,T]; L^2(\Omega ))\cap \frac{1}{k}+ L^p(0,T; W^{1,p}_0(\Omega ))\) is an admissible weak solution to the Cauchy–Dirichlet problem (5.7) if
for all \(\varphi \in C^\infty _0(\Omega _T)\) and \(v_k(\cdot ,0)=\tfrac{1}{k}+\Psi \) in \(\Omega \).
Using a test function of the form \([\varphi (x,t)\xi (t)]_{\bar{h}}\) where \(\varphi \in C^\infty (\bar{\Omega }\times [0,T])\) vanishes if x is outside a compact subset of \(\Omega \) and \(\xi \) is any smooth function compactly supported in \((0,T-h)\), one easily verifies that any solution \(v_k\) in the sense of Definition 5.1 satisfies the following equation with Steklov means \([\,\cdot \,]_h\) defined in (3.2):
for all \(0\le a < b \le T-h\). In fact, by approximation with smooth functions, one sees that all \(\varphi \in L^p(0,T;W^{1,p}_0(\Omega ))\cap L^\infty (\Omega _T)\) are admissible in (5.9). An analogous identity holds true for the Steklov averages \([\,\cdot \,]_{\bar{h}}\) and \(h \le a < b \le T\).
We now prove the existence of a solution to the regularized problem in the sense of Definition 5.1. We will follow the functional analytic approach of Showalter [22] making use of Galerkin’s method. In fact, one can reason as in the proof of [22, Theorem 4.1, Sect. III.4], despite the somewhat weaker coercivity condition in our case. For the reader’s convenience, we present the full argument below. We have opted to avoid the theory of operators of type M used by Showalter, exploiting instead the stronger monotonicity property of the vector field \(A_k\).
Lemma 5.2
For any \(k>1\), there exists at least one admissible weak solution \(v_k\) to (5.7) in the sense of Definition 5.1.
Proof
We fix \(k>1\) and consider the modified vector field
for \(w\in \mathbb {R}\) and \(\xi \in \mathbb {R}^n\). We prove the existence of a function \(w\in C([0,T];L^2(\Omega ))\cap L^p(0,T;W^{1,p}_0(\Omega ))\) satisfying
for all \(\varphi \in C^\infty _0(\Omega _T)\), and \(w(\cdot ,0)= \Psi \) in \(\Omega \). Then, \(v_k:= \tfrac{1}{k} + w\) is an admissible weak solution in the sense of Definition 5.1.
We denote \(V=L^2(\Omega )\cap W^{1,p}_0(\Omega )\) and define \(\mathcal {A}: V\rightarrow V'\) by
We define \(F\in L^{p'}(0,T;V')\) by setting
Recall that we have the inclusions \(V \hookrightarrow L^2(\Omega )\hookrightarrow V'\) with V being dense in \(L^2(\Omega )\). Then, (5.10) is equivalent to
This equation can be understood in the weak sense using the Bochner integral, or equivalently pointwise a.e. Pick a basis \((v_j)^{\infty }_{j=1}\) of V, and for each \(m\in \mathbb {N}\) vectors \(\psi _m \in \text {span} (v_1,\dots ,v_m) =:V_m \) converging to \(\Psi \) in \(L^2(\Omega )\), and consider for a fixed \(m\in \mathbb {N}\) the problem of finding a map \(w_m:[0,T]\rightarrow V_m\) satisfying
Here, \((\cdot ,\cdot )\) denotes the inner product in \(L^2(\Omega )\) and \(\langle \cdot ,\cdot \rangle \) denotes the dual pairing of \(V'\) and V. We define \(g:\mathbb {R}^m\rightarrow \mathbb {R}^m\) with components
and \(\tilde{f}:[0,T]\rightarrow \mathbb {R}^m\) with components
The dominated convergence theorem shows that g is continuous, \(\tilde{f}\) is evidently integrable and the matrix with components \((v_i,v_j)\), \(i,j\in \{1,\dots ,m\}\) is invertible. Therefore, Lemma 3.9 guarantees that the problem (5.11) has a solution \(w_m\) on some interval \([0,\delta ]\). This solution can be extended to a maximal interval \(J\subset [0,T]\). Multiplying (5.11) by the component function \(w_m^j(t)\) of \(w_m\) in the basis \((v_j)^m_{j=1}\) and summing over j, we have
The coercivity property (5.6) of \(A_k\) implies the same property for \(\tilde{A}_k\), from which we obtain for all \(v\in V\) that
with a constant \(c> 1\) depending on \(\alpha , p, k\) and \(\Omega \). Using this estimate in (5.12) shows that
for a.e. \(t\in J\). Here, we have extended F(t) to an element of \(W^{-1,p'}(\Omega )=(W^{1,p}_0(\Omega ))'\) using the same formula as before. Applying Young’s inequality to the last term, we find that
where \(c>1\) depends on \(\alpha , p, k\) and \(\Omega \). Integrating the last inequality, we have
for all \(t\in J\). This shows that \(\Vert w_m\Vert _{L^2(\Omega )}\) and the component functions \(w^i_m\) stay bounded on J, and from the system of equations we conclude that \(w_m\) is absolutely continuous on all of J. If J is not the interval [0, T], then \(J=[0,b)\) where \(b<T\). Then, the uniform continuity of \(w_m\) and the finite dimension of \(V_m\) show that there is a limit of \(w_m(t)\) as \(t\uparrow b\) which allows us to extend \(w_m\), thus contradicting maximality. Hence, \(w_m\) must indeed be defined on all of [0, T]. Moreover, since \(\psi _m\rightarrow \Psi \) in \(L^2(\Omega )\), the estimate (5.13) shows on the one hand that \((w_m)\) is a bounded sequence in \(L^\infty (0,T;L^2(\Omega ))\) and on the other hand that \((w_m)\) is a bounded sequence in \(L^p(0,T;W^{1,p}(\Omega ))\). Hence, we infer that \(w_m\) is a bounded sequence in \(L^p(0,T;V)\). From the definition of \(\mathcal {A}\) and (5.5), we see that
with \(c=c(\alpha ,p,k)\), and hence \((\mathcal {A}(w_m))\) is a bounded sequence in \(L^{p'}(0,T;V')\). By reflexivity, we have a subsequence still labeled as \((w_m)\) which converges weakly to \(w\in L^p(0,T;V)\) and for which \((\mathcal {A}(w_m))\) converges weakly to \(\xi \in L^{p'}(0,T;V')\). Furthermore, (5.13) shows that \((w_m(T))\) is bounded in \(L^2(\Omega )\), so we may assume that \((w_m(T))\) converges weakly to some \(w^*\in L^2(\Omega )\). Take now \(\varphi \in C^\infty ([0,T])\) and \(v\in V_m\). From (5.11), we see that
Integrating this identity, we obtain
Due to the weak convergences mentioned above, we obtain by taking \(m\rightarrow \infty \) that
for all \(v\in V_{m_o}\) for any \(m_o\in \mathbb {N}\), and by approximation for all \(v\in V\). This shows (by taking \(\varphi \in C^\infty _0(0,T)\)) that
in \(L^{p'}(0,T;V')\), and thus \(w\in C([0,T];L^2(\Omega ))\); see also Proposition 1.2 of Sect. III.1 in [22]. Next, we show that w satisfies the right initial condition. Using the test function
in (5.14) we have for all \(v\in V\),
where the integral on the left-hand side is taken in the Bochner sense of w as an \(L^2(\Omega )\)-valued map. By the density of V in \(L^2(\Omega )\), this implies
The right-hand side converges to zero as \(\varepsilon \downarrow 0\). On the other hand, since \(w\in C([0,T];L^2(\Omega ))\), we know that the limit of the integral average appearing on the left-hand side is w(0). Thus, we have confirmed that \(w(0)=\Psi \). It only remains to show that \(\xi = \mathcal {A}(w)\). Since \(\xi \) is the weak limit of \((\mathcal {A}(w_m))\), it is sufficient to show that \((\mathcal {A}(w_m))\) converges weakly to \(\mathcal {A}(w)\). In order to prove the weak convergence, we first show the \(L^p\)-convergence of \((\nabla w_m)\) to \(\nabla w\). From the monotonicity condition (5.4) satisfied by \(A_k\), it follows that
where \(c=c(p,k)\) and
In the case \(p<2\), by Hölder’s inequality we may estimate
The last factor is bounded independently of m since \((w_m)\) is bounded in \(L^p(0,T;V)\). Thus, setting \(\nu =\max \{1,\tfrac{2}{p}\}\) we have in any case that
for a constant c independent of m. In the last step, we have used (5.12) integrated over [0, T]. The weak convergences of \((w_m)\) to w and \((\mathcal {A}(w_m))\) to \(\xi \), the norm convergence of \((\psi _m)\), and the weak lower semicontinuity of the norm applied to the term \(\Vert w_m(T)\Vert ^2_{L^2(\Omega )}\) then show that
where in the last step we have used (5.15) applied to w and integrated over [0, T]. Thus, we have obtained the desired \(L^p\)-convergence of \(\nabla w_m\) to \(\nabla w\). To see that the weak convergence of \(\mathcal {A}(w_m)\) to \(\mathcal {A}(w) \) follows from this, we use Lemma 3.6 and obtain
where \(b_p = 0\) if \(p<2\) and \(b_p=1\) if \(p\ge 2\). Hölder’s inequality and the \(L^p\)-convergence of \(\nabla w_m\) show that the last expression converges to zero as \(m \rightarrow \infty \), so we have confirmed that \(\mathcal {A}(w) = \xi \). \(\square \)
5.3 Properties of the approximating solutions
In this section, we investigate the properties of the approximating solutions obtained in Lemma 5.2. We prove lower bounds for the solutions \(v_k\) in terms of k and upper bounds which are independent of k. Moreover, we obtain a uniform bound for the \(L^p(\Omega _T)\)-norms of the gradients \(\nabla v_k^\beta \).
Lemma 5.3
Let \(k>1\) and \(v_k\) be an admissible weak solution to the Cauchy–Dirichlet problem (5.7) in the sense of Definition 5.1. Then, we have \(v_k\ge \tfrac{1}{k}\) a.e. in \(\Omega _T\).
Proof
We use a comparison principle argument. Consider the version of (5.9) with Steklov means \([\,\cdot \,]_{\bar{h}}\), that is
for \(h<a<b<T\) and the test function \(\varphi =H_\delta \left( \tfrac{1}{k}-v_k\right) \), where
The test function is admissible since \(v_k-\tfrac{1}{k}\in L^p(0,T; W^{1,p}_0(\Omega ))\), and \(H_\delta \) and \(H_\delta '\) are bounded. We define \(G_\delta \) as
By the properties of the Steklov average and the convexity of \(G_\delta \), we have
Using this estimate and the fact that \(f\ge 0\) in the above identity, we obtain
Taking \(h\downarrow 0\), we find that
holds true for a.e. \(0<a<b<T\). Note that
Thus, whenever the second integrand is nonzero we have \(v_k<\tfrac{1}{k}\) and then
This shows that the second integral of (5.16) is nonnegative so we can drop it. Thus, we end up with
for a.e. \(0<a<b<T\). Since \(v_k\in C([0,T];L^2(\Omega ))\) and \(v_k(0)=\frac{1}{k}+ \Psi \), we obtain, by taking \(a\downarrow 0\) that
where the last equality follows from the fact that \(\Psi \ge 0\). Taking the limit \(\delta \rightarrow 0\), we obtain
for a.e. \(b \in (0,T)\). Thus, \(v_k\ge \frac{1}{k}\) a.e. in \(\Omega _T\). \(\square \)
We now intend to show that the approximative solutions \(v_k\) are bounded in the \(L^\infty \)-norm by a constant independent of k. In the proof, we will make use of the truncation defined in (5.2) and the function
Note that for \(s\ge \frac{1}{k}\), we have \(T_k(s)=\min \{s,k\}\) and hence \(\tilde{v}_k=\min \{v_k,k\}\) by Lemma 5.3. The proof is divided into three steps. First, we establish an energy estimate for \(\tilde{v}_k\). We use this result to show that the \(L^{\beta p}\)-norm of \(\tilde{v}_k\) is bounded independently of k. Finally, the energy estimate is utilized in a De Giorgi-type iteration to obtain a bound in terms of the \(L^{\beta p}\)-norm of \(\tilde{v}_k\), which by the previous observation concludes the proof.
Lemma 5.4
Let \(k>1\) and \(v_k\) be an admissible weak solution to the Cauchy–Dirichlet problem (5.7) in the sense of Definition 5.1, and let \(M\ge \sup _{\bar{\Omega }}\Psi +1\). Then, the function \(\tilde{v}_k\) defined in (5.17) satisfies
for a constant \(c=c(\beta ,p,\Omega )\) which does not depend on k.
Proof
In the case \(k\le M\), the inequality trivially holds, since \(\tilde{v}_k\le k\le M\). Therefore, we are left with the case \(k> M\). For \(0<\varepsilon<\tau<\tau +\varepsilon <T\) and \(\zeta _{\tau ,\varepsilon }\) we define
We use the mollified formulation (5.9) of the differential equation with \(a=0\), \(b=T-h\) and \(\varphi =\zeta _{\tau ,\varepsilon }\big (T_k([v_k]_h)^\beta -M^\beta \big )_+\). We take h so small that the factor \(\zeta _{\tau ,\varepsilon }\) is supported in \([0,T-h]\). Then, the boundedness of \(T_k([v_k]_h)\) and the chain rule imply that \(\varphi \in L^p(0,T;W^{1,p}_0(\Omega ))\cap L^\infty (\Omega _T)\), which means that \(\varphi \) is admissible as a test function.
Our goal is to pass to the limit \(h\downarrow 0\) in the mollified differential equation, possibly replacing the equality by a suitable estimate. For the elliptic term, we conclude that
The convergence of \(\nabla \varphi \) in the \(L^p\)-norm can be seen from the chain rule, and the fact that the outer function \(s\mapsto (T_k(s)^\beta -M^\beta )_+\) is piecewise \(C^1\) with bounded derivative. We now introduce the abbreviation
and observe that \(\varphi =\zeta _{\tau ,\varepsilon }g([v_k]_h)\). Recall that the boundary term \(\mathfrak {b}\) has been introduced in (3.1). This allows us to treat the parabolic term as
The limit of the second term vanishes since \(G(v_k)(x,0)=G(\frac{1}{k}+\Psi (x))=0\) and \(M\ge \sup _{\bar{\Omega }}\Psi +1\) by assumption. The limits exist since \(\int _\Omega G(v_k)(x,t) \mathrm {d}x\) is continuous with respect to t. The continuity can be concluded from the fact that G is Lipschitz and \(v_k\in C([0,T]; L^2(\Omega ))\). Therefore, we have after passing to the limits \(h\downarrow 0\) and \(\varepsilon \downarrow 0\) that
holds true for any \(\tau \in (0,T]\). Using the expression (5.3) for \(A_k\), the fact that \(\nabla (\tilde{v}_k^\beta -M^\beta )_+=0\) a.e. on the sets \(\{(x,t)\in \Omega _T:v_k(x,t)\ge k\}\) and \(\{(x,t)\in \Omega _T:v_k(x,t)\le M\}\), the chain rule and Lemma 3.5, we have
Moreover, using the definition of G and Lemma 3.2 (i), we can estimate
The last term in the integral on the right-hand side of (5.19) can be estimated using the Schwarz inequality, Young’s inequality and the fact that \(k>1\) as
The integral over the first term can be included in the first term on the left-hand side of (5.19), and we end up with
for any \(\tau \in (0,T]\) and a constant \(c=c(\beta ,p)\). Since \((\tilde{v}_k^\beta -M^\beta )_+\in L^p(0,T;W^{1,p}_0(\Omega ))\), we have by Young’s and Poincaré’s inequality for any \(\varepsilon \in (0,1)\) that
and
Due to the Poincaré inequality, the constant c depends on \(\Omega \). Choosing \(\varepsilon \) small enough, we can reabsorb the terms involving \(|\nabla (\tilde{v}^\beta -M^\beta )_+|^p\) into the left-hand side. This leads us to
In the first term on the right-hand side, we take the supremum over \(\tau \in [0,T]\), while in the second one we choose \(\tau =T\). Proceeding in this way, we end up with inequality (5.18). \(\square \)
We utilize the previous lemma to show that the integral of \(\tilde{v}_k^{\beta p}\) is bounded independently of k.
Corollary 5.5
Let \(k>1\) and \(v_k\) be an admissible weak solution to the Cauchy–Dirichlet problem (5.7) in the sense of Definition 5.1. Then, there is a constant c depending only on \(\beta \), p and the domain \(\Omega \) such that the function \(\tilde{v}_k\) defined in (5.17) satisfies
where
Proof
We define \(M_o=\sup _{\bar{\Omega }}\Psi +1\) and observe that \((\tilde{v}_k^\beta -M_o^\beta )_+\in L^p(0,T;W^{1,p}_0(\Omega ))\). Combining Poincaré’s inequality applied slice-wise for a.e. \(t\in (0,T)\) and the energy estimate from Lemma 5.4, we obtain
Thus, we have
with a constant \(c=c(\beta ,p,\Omega )\). This is the desired bound for the \(L^{\beta p}\)-norm of \(\tilde{v}_k\). \(\square \)
Now, we are ready to prove the boundedness result.
Lemma 5.6
Let \(k>1\) and \(v_k\) be an admissible weak solution to the Cauchy–Dirichlet problem (5.7) in the sense of Definition 5.1. Then, there is a constant \(L>0\) depending only on \(n,\beta , p, \Omega _T, f, \psi , z\), and \( \sigma \) (and thus independent of k) such that for every \(k> L\) we have
Proof
For \(M\ge \sup _{\bar{\Omega }}\Psi +1\), we define the sequences
where \(\tilde{v}_k\) is defined in (5.17). Furthermore, we denote \(m:=\frac{\beta +1}{\beta }\) and \(A_j:=\Omega _T \cap \{ \tilde{v}_k>M_j\}\). Note that
Thus,
Using Hölder’s inequality, Gagliardo Nirenberg’s inequality from Lemma 3.8, (5.21) and the energy estimate from Lemma 5.4, we infer that
With the abbreviation \(G:=|\nabla z|^{\beta p}+ f^{p'}\), we obtain
We can estimate the measure of \(A_{j+1}\) by noting that
We use this to estimate the second term in the square brackets of (5.22) and the last factor. The remaining instance of \(|A_{j+1}|\) is treated in the same way except that we drop the factor containing M, which is possible since \(M>1\). In this way, we obtain
In view of Corollary 5.5, this shows that
where K is defined in (5.20). Thus, we have verified the iterative estimate of Lemma 3.7 with the choices
Note that \(\delta > 0\) since \(\sigma >\frac{n+p}{p}\). In order to apply Lemma 3.7, we also need to have
Since \(Y_0\le \Vert \tilde{v}_k\Vert _{L^{\beta p}(\Omega _T)}^{\beta p}\le c\,K\), it is sufficient that
which, using the definitions of C and b, is equivalent to
for a constant \(\tilde{c}\) depending on \(n, p, \beta , \Omega \). We now choose
Thus, M is a constant depending only on \(n, p, \beta , \Omega _T, f, z, \sigma \). Since \(Y_j\rightarrow 0\) with this choice, we have that
for all k. But this means that for \(k>L\), we have
which proves the claim. \(\square \)
The previous lemma together with Lemma 5.3 and the definition of \(A_k\) show that
for large k. Thus, for large k, we deduce from the differential equation (5.8) that \(v_k\) satisfies also the equation
for any \(\varphi \in C_0^\infty (\Omega _T)\). We now show a generalization of this result for test functions which do not necessarily vanish at time zero. This result will be used to verify that once we have concluded the existence of a solution to the original equation, it will also satisfy the correct boundary value.
Lemma 5.7
For sufficiently large k and \(\varphi \in C^\infty (\bar{\Omega }\times [0,T])\) with support in \(K\times [0,\tau ]\) where \(K\subset \Omega \) is compact and \(\tau \in (0,T)\), we have
Proof
For \(\varphi \) as in the statement of the lemma, we apply (5.23) with the test function \(\zeta _\varepsilon \varphi \), where
and pass to the limit \(\varepsilon \downarrow 0\). This is possible since \(v_k \in C([0,T];L^2(\Omega ))\). \(\square \)
Similarly as in Lemma 3.10, we can show that the weak solution \(v_k\) also satisfies the following modified weak form.
Lemma 5.8
Let \(k>1\) and \(v_k\) be an admissible weak solution to the Cauchy–Dirichlet problem (5.7) in the sense of Definition 5.1. Then, there holds
for all \(w^\beta \in k^{-\beta } + L^p(0,T;W_0^{1,p}(\Omega ))\) with \(\partial _t w^\beta \in L^{\frac{\beta +1}{\beta }}(\Omega _T)\) and any \(\zeta \in W^{1,\infty }([0,T],\mathbb {R}_{\ge 0})\) with \(\zeta (0)=0=\zeta (T)\).
The next step is a uniform \(L^p\)-bound for the gradients of the functions \(v_k^\beta \).
Lemma 5.9
Let \(k>1\) and \(v_k\) be an admissible weak solution to the Cauchy–Dirichlet problem (5.7) in the sense of Definition 5.1. Then, there is a constant \(C>0\) depending on \(n,\beta , p, \Omega _T, f, \psi \), and z such that
for all \(k\in \mathbb {N}\).
Proof
For \(\delta \in (0,\frac{T}{2})\), we define
and choose \(\zeta =\zeta _\delta \) and the comparison function \(w=\frac{1}{k}\) in the modified weak form (5.25) of the differential equation. Since \(\partial _t w=0\), the first term on the right-hand side is zero. Our goal now is to pass to the limit \(\delta \downarrow 0\). Note that \(\nabla w = 0\) and thus
Using the definition of the vector field A in (2.5), Lemmas 3.5 and 5.6, we obtain
For the term on the left-hand side, we have
We were able to omit the integral over \([T-\delta ,T]\) since \(\mathfrak {b}[v,\tfrac{1}{k}]\) is always nonnegative. Passing to the limits in the remaining terms on the right-hand side presents no problems, and taking into account the previous estimates we end up with
where \(c=c(p,\beta )\). Using Young’s inequality in the last integral, we finally obtain
for a constant c depending on \(\beta \) and p. \(\square \)
6 Proof of the main result
Lemmas 5.6 and 5.9 show that \((v_k^\beta )\) is a bounded sequence in the reflexive Banach space \(L^p(0,T;W^{1,p}(\Omega ))\). Therefore, there is a subsequence converging weakly to an element of \(L^p(0,T;W^{1,p}(\Omega ))\). This element can be regarded as a nonnegative function on \(\Omega _T\) since every \(v_k^\beta \) is nonnegative, and hence we can write the limit as \(v^\beta \) for some nonnegative function v. (Mazur’s lemma provides us with a subsequence of nonnegative functions converging in \(L^p(\Omega _T)\) and from this we obtain yet another subsequence converging pointwise a.e. in \(\Omega _T\).) We even have a stronger form of convergence.
Lemma 6.1
Let \((v_k)\) be the weak solutions to the Cauchy–Dirichlet problems (5.7) in the sense of Definition 5.1. Then, there exists a subsequence \((k_j)_{j\in \mathbb {N}}\) with \(k_j\rightarrow \infty \) as \(j\rightarrow \infty \) and a nonnegative function \(v\in L^{\infty }(\Omega _T)\) with \(v^\beta \in L^p(0,T;W^{1,p}_0(\Omega ))\) such that
Proof
From Lemma 5.9, we know that \((v_k^\beta -k^{-\beta })\) is a bounded sequence in the reflexive Banach space \(L^p(0,T;W_0^{1,p}(\Omega ))\). Therefore, there is a subsequence \((k_j)_{j\in \mathbb {N}}\) with \(k_j\rightarrow \infty \) as \(j\rightarrow \infty \) and a function \(w\in L^p(0,T;W^{1,p}_0(\Omega ))\) such that \(v_{k_j}^\beta -k^{-\beta }\rightharpoondown w\) weakly in \(L^p(0,T;W^{1,p}(\Omega ))\). This implies that also
Our next aim is to ensure strong convergence of \((v_{k_j})\) and thereby to identify the limit function as the pointwise a.e. limit of the subsequence. For this purpose, we let \(\tau \in (0,T)\). For \(h\in (0,T-\tau )\) and \(\delta \in (0,\min \{\tau ,T-\tau -h\})\), we define
and consider \(\varphi \in C^\infty _0(\Omega )\). For \(k>1\), we use the weak formulation (5.23) with the test function \(\zeta _\delta \varphi \) and obtain
Passing to the limit \(\delta \rightarrow 0\), we see that for any \(\tau \in (0,T)\) and \(h\in (0,T-\tau )\) there holds
Regarding \(v_k\) for fixed times as an element of \((W^{1,p}_0(\Omega ))'\), and letting \(\langle \cdot , \cdot \rangle \) denote the dual pairing of \((W^{1,p}_0(\Omega ))'\) and \(W^{1,p}_0(\Omega )\), we thus have
where in the last line we used the bound for \(v_k\) from Lemma 5.6. By Hölder’s inequality, we continue to estimate
where \(c=c(\alpha ,p,L)\). In light of Lemma 5.9, the expression in the square brackets is bounded by a constant independent of k. Thus, by the density of \(C^\infty _0(\Omega )\) in \(W^{1,p}_0(\Omega )\), we have shown that for almost all \(\tau \in (0,T)\) and \(h\in (0,T-\tau )\) there holds
for a constant c independent of k. Moreover, we recall from Lemmas 5.6 and 5.9 that \(v_k\) is a bounded sequence in \(L^\infty (\Omega _T)\) and \(v_k^\beta \) in \(L^p(0,T;W^{1,p}(\Omega ))\). We have thus ensured that the assumptions of Corollary 4.6 (i) are satisfied with \(Y=(W^{1,p}_0(\Omega ))'\), \(m=\beta \) and \(q=\mu =p\). Therefore, the application of Corollary 4.6 (i) to \((v_{k_j})\) ensures that \((v^\beta _{k_j})\) is relatively compact in \(L^{p}(\Omega _T)\). In particular, there exists a strongly convergent subsequence and by virtue of (6.1) this implies the strong convergence of the whole sequence \((v^\beta _{k_j})\) to the limit function w, i.e., we have that
By passing to another subsequence, we also obtain that \(v_k^\beta \) converges to w pointwise a.e. in \(\Omega _T\) and the uniform boundedness of \(v_k\) ensures that also \(w\in L^\infty (\Omega _T)\). We now define \(v\in L^\infty (\Omega _T)\) via \(v^\beta =w\), so that \(v_k^\beta \rightharpoondown v^\beta \) weakly in \(L^p(0,T;W^{1,p}(\Omega ))\) and \(v_k^\beta \rightarrow v^\beta \) strongly in \(L^p(\Omega _T)\) by (6.1) and (6.2). The uniform boundedness of the sequence \(v_k\) together with the strong \(L^p\)-convergence \(v_k^\beta \rightarrow v^\beta \) implies that \(v_k\rightarrow v\) strongly in \(L^q(\Omega _T)\) for any \(q\ge 1\). \(\square \)
The next step of our argument is to show the strong convergence of the gradients.
Lemma 6.2
Let the assumptions of Lemma 6.1 be in force. Then, for the subsequence \((k_j)_{j\in \mathbb {N}}\) there additionally holds that
for any closed subinterval I of (0, T).
Proof
Due to Remark 3.4, we know that for any \(k>1\) there holds
for a constant \(c=c(p)\) and where
The expression \(V_k^{p-2}\) which appears in the case \(p<2\) is not defined if \(V_k=0\), but this can occur only if \(\nabla v^\beta =\nabla v_k^\beta \) and will therefore cause no problems. Now, let \(\zeta \in C^\infty _0((0,T);[0,1])\). When \(p<2\), Hölder’s inequality implies that
By Lemmas 5.6 and 5.9, the expression on the last row is bounded by a constant independent of k. The previous observations show that regardless of the value of p,
where \(\nu = \max \{1,\frac{2}{p}\}\) and the constant c is independent of k. From Lemma 6.1, we know that the subsequence \((v_{k_j}^\beta )\) converges weakly in \(L^p(0,T;W^{1,p}(\Omega ))\) to \(v^\beta \) and that \(v_{k_j}\) converges strongly in \(L^{q}(\Omega _T)\) for any \(q\ge 1\) and pointwise a.e. in \(\Omega _T\) to v. Since \(\zeta A(v,\nabla v^\beta )\in L^{p'}(\Omega _T;\mathbb {R}^n)\), the weak convergence \(v_{k_j}^\beta \rightharpoondown v^\beta \) in \(L^p(0,T;W^{1,p}(\Omega ))\) shows that the second term on the right-hand side of (6.3) vanishes in the limit \(j\rightarrow \infty \). More precisely, we have
To treat the third term on the right-hand side of (6.3), we note that Lemma 5.6, the pointwise a.e. convergence \(v_{k_j}\rightarrow v\) and the dominated convergence theorem guarantee that \(A(v_{k_j},\nabla v^\beta )\) converges to \(A(v,\nabla v^\beta )\) strongly in \(L^{p'}\). The quantity \(\nabla v_k^\beta -\nabla v^\beta \) stays bounded in \(L^p\) due to Lemma 5.9, so by Hölder’s inequality we conclude that
It only remains to control the first term on the right-hand side of (6.3). This term can be rewritten as
where \(\llbracket v^\beta \rrbracket _h\) denotes the exponential time mollification defined in (3.4). Since \(A(v_k,\nabla v_k^\beta )\) is bounded in \(L^{p'}\) uniformly in k, we can use Hölder’s inequality to estimate the second term on the right-hand side as
with a constant c independent of k. In order to treat the first term, we use the modified weak formulation (5.25) with the comparison function \(w_{h,k}\) defined by \(w_{h,k}^\beta =k^{-\beta }+\llbracket v^\beta \rrbracket _h\). The choice of comparison function requires some justification. We know that \(k^{-\beta }+\llbracket v^\beta \rrbracket _h\in k^{-\beta }+L^p(0,T;W_0^{1,p}(\Omega ))\) since \(v^\beta \in L^p(0,T;W_0^{1,p}(\Omega ))\). By Lemma 3.1 (iii), the exponential time mollification preserves this space. Moreover, by Lemma 3.1 (ii) and the boundedness of v we see that \(\partial _t w_{h,k}^\beta \in L^\frac{\beta +1}{\beta }(\Omega _T)\). Thus, the comparison function \(w_{h,k}\) is admissible, so that
By the convergence properties of \(v_{k_j}\), we obtain
where in the last line the term involving the time derivative has been omitted, since it is nonpositive by Lemma 3.1 (ii). Combining the last two estimates and joining the previously obtained bounds and convergence properties for \(\text{ I}_{k_j}\) – \(\text{ III}_{k_j}\) with (6.3), we end up with
By Lemma 3.1 (iii), we see that the right-hand side converges to zero as \(h\rightarrow 0\). Thus, we have shown that
This proves the claim of the lemma, since for any closed subinterval \(I \subset (0,T)\) we can choose \(\zeta \in C^\infty _0(0,T;[0,1])\) such that \(\xi |_I=1\). In this case, \(\chi _I\le \xi \) and the result follows. \(\square \)
Lemma 6.3
The function v obtained in Lemma 6.1 satisfies
for every \(\varphi \in C^\infty (\bar{\Omega }\times [0,T])\) with support contained in \(K\times [0,\tau ]\) where \(K\subset \Omega \) is compact and \(\tau \in (0,T)\).
Proof
We fix a test function \(\varphi \) as above, and note that it is sufficient to consider functions satisfying \(|\nabla \varphi |\le 1\). Recall that \(v_{k_j}\) satisfies (5.24) with \(k=k_j\) and j sufficiently large. The goal is to pass to the limit \(j\rightarrow \infty \). The limit
follows from the \(L^1\)-convergence of \(v_{k_j}\) to v and the limits
are trivial (recall that \(v_k(0)=\Psi +\tfrac{1}{k}\)). It remains to treat the elliptic term. For this, we abbreviate
and note that for any \(\delta \in (0,\tau )\) we can estimate
where we have used the Cauchy–Schwarz inequality and the bound on \(\nabla \varphi \). Let \(\varepsilon >0\). Due to the definition of A, and the uniform bounds obtained in Lemma 5.6 and Lemma 5.9, the integrand in the first term on the right-hand side of (6.5) is bounded in the \(L^{p'}\)-norm independently of k, so Hölder’s inequality allows us to conclude that for sufficiently small \(\delta >0\), the bound
is satisfied independently of \(k\in \mathbb {N}\). With such a fixed \(\delta \), we now estimate the second term on the right-hand side of (6.5) as
We will use Lemma 3.6 to treat both terms. This lemma shows that the integrand in the first term may be estimated as
where \(c=c(p)\) and \(b_p=0\) if \(p<2\) and \(b_p=1\) if \(p\ge 2\). Convergence of the integral of the first term is clear as \(k=k_j\rightarrow \infty \) due to Lemma 6.2. In the case \(p\ge 2\), the integral of the second term is treated using Lemmas 5.6, 6.2 and Hölder’s inequality:
The exponent of the integral is less than p, so the integral is a finite number. These considerations show that the first term of (6.7) converges to zero as \(k\in \{k_j\,|\,j\in \mathbb {N}\}\) approaches infinity. The integrand in the second term of (6.7) can be estimated using Lemma 3.6 as
where \(c=c(p)\). The terms on the right-hand side can be treated by the dominated convergence theorem. Hence, for \(k=k_j\) and for all sufficiently large j, we have
Taking into account (6.5), (6.6) and (6.8), we have shown that
Taking into account all these limits, we have confirmed (6.4). \(\square \)
Now, we are ready to prove the main theorem.
Proof of Theorem 2.2
We will show that the function v obtained in Lemma 6.1 is a weak solution to the Cauchy–Dirichlet problem (2.6) in the sense of Definition 2.1. Lemma 6.3 shows that (2.7) is valid, since for test functions \(\varphi \in C^\infty _0(\Omega _T)\) the second term on the right-hand side of (6.4) vanishes. Moreover, by Lemma 3.11, we know that \(v\in C([0,T];L^{\beta +1}(\Omega ))\). It only remains to show that \(v(0)=\Psi \). Using (6.4) with the test function \(\varphi (x)\zeta _\varepsilon (t)\) where \(\varphi \in C^\infty _0(\Omega )\) and
one obtains
On the other hand, since \(v\in C([0,T];L^{\beta +1}(\Omega ))\), we see that also
holds. Since \(\varphi \in C^\infty _0(\Omega )\) is arbitrary, it follows that \(v(0)=\Psi \).
References
E. Acerbi and N. Fusco: Regularity for minimizers of nonquadratic functionals: the case\(1<p<2\). J. Math. Anal. Appl. 140, no., 115–135, 1989.
R. Adams and J. Fournier: Sobolev Spaces, Second edition, Elsevier Ltd, 2003.
R. Alonso, M. Santillana and C. Dawson: On the diffusive wave approximation of the shallow water equations. European J. Appl. Math. 19, no. 5, 575–606, 2008.
H.W. Alt and S. Luckhaus: Quasilinear elliptic-parabolic differential equations, Math. Z. 183, no. 3, 311–341, 1983.
V. Bögelein, F. Duzaar, R. Korte and C. Scheven: The higher integrability of weak solutions of porous medium systems, Adv. Nonlinear Anal, 8 (1), 1004 – 1034, 2019.
V. Bögelein, F. Duzaar, P. Marcellini: Existence of evolutionary variational solutions via the calculus of variations, J. Differential Equations 256, no. 12, 3912–3942, 2014.
V. Bögelein, F. Duzaar, P. Marcellini and C. Scheven: Doubly Nonlinear Equations of Porous Medium Type, Arch. Ration. Mech. Anal, 229, 503–545, 2018.
J. I. Díaz and F. De Thelin: On a nonlinear parabolic problem arising in some models related to turbulent flows, Siam J. Math. Anal., 25 (4), 1085–1111, 1994.
E. DiBenedetto: Degenerate Parabolic Equations, Springer Verlag, 1993.
F. Duzaar and G. Mingione: Second order parabolic systems, optimal regularity, and singular sets of solutions. Ann. Inst. H. Poincaré Anal. Non Linéaire 22 no. 6, 705–751, 2005.
K. Feng and F.J. Molz: A 2-d diffusion based, wetland flow model, J. Hydrol. 196, 230–250, 1997.
M. Giaquinta and G. Modica: Partial regularity of minimizers of quasiconvex integrals. Ann. Inst. H. Poincaré Anal. Non Linéaire 3 no. 3, 185–208, 1986.
R. Hooke: Flow Law for Polycrystalline Ice in Glaciers: Comparison of Theoretical Predictions, Laboratory Data, and Field Measurements, Geophys. Space Phys., 19, 664-672, 1981.
T.V. Hromadka, C.E. Berenbrock, J.R. Freckleton and G.L. Guymon: A twodimensional dam-break flood plain model, Adv. Water Resour. 8, 7–14, 1985.
A.V. Ivanov, P.Z. Mkrtychyan, and W. Jäger: Existence and uniqueness of a regular solution of the Cauchy-Dirichlet problem for a class of doubly nonlinear parabolic equations, J. Math. Sci., 84:845–855, 1997.
A. Jüngel: Boundedness of weak solutions to cross-diffuson population systems with Laplacian structure, https://arxiv.org/abs/1812.08719v1
J. Kinnunen and P. Lindqvist: Pointwise behaviour of semicontinuous supersolutions to a quasilinear parabolic equation, Ann. Mat. Pura Appl. (4) 185(3): 411–435, 2006.
E. Lindgren and R. Monneau: Pointwise estimates for the heat equation. Application to the free boundary of the obstacle problem with Dini coefficients, Indiana University Mathematics Journal, vol. 62, no. 1, 171–199, 2013.
M.W. Mahaffy: A Three-Dimensional Numerical Model of Ice Sheets: Tests on the Barnes Ice Cap, Northwest Territories, J. Geophys. Res., vol. 81 no. 6, 1059-1066, 1976.
D. Partridge: Numerical Modelling of Glaciers: Moving Meshes and Data, PhD Thesis, University of Reading, 2013.
M.M. Porzio and V. Vespri: Holder Estimates for Local Solutions of Some Doubly Nonlinear Degenerate Parabolic Equations, Journal of Differential Equations, 103, 146–178, 1993.
R. E. Showalter: Monotone Operators in Banach Space and Nonlinear Partial Differential Equations, American Mathematical Society, 1997.
J. Simon: Compact sets in the space\(L^p(0,T;B)\), Ann. Mat. Pura Appl. (IV), 146, 65–96, 1986.
T. Singer and M. Vestberg: Local Boundedness of Weak Solutions to the Diffusive Wave Approximation of the Shallow Water Equations, Journal of Differential Equations, Vol. 266 no. 6, 3014-3033, 2019.
T. Singer and M. Vestberg: Local Hölder Continuity of Weak Solutions to a Diffusive Shallow Medium equation, Nonlinear Analysis, Vol. 185, 306-335, 2019.
S. Sturm: Existence of weak solutions of doubly nonlinear parabolic equations, J. Math. Anal. Appl. 455, no. 1, 842–863, 2017.
V. Vespri: On the local behaviour of solutions of a certain class of doubly nonlinear parabolic equations, Manuscripta Math. 75, no. 1, 65–80, 1992.
Acknowledgements
Open access funding provided by Aalto University. V. Bögelein has been supported by the FWF-Project P31956-N32 “Doubly nonlinear evolution equations.” M. Vestberg wants to express gratitude to the Academy of Finland.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Bögelein, V., Dietrich, N. & Vestberg, M. Existence of solutions to a diffusive shallow medium equation. J. Evol. Equ. 21, 845–889 (2021). https://doi.org/10.1007/s00028-020-00604-y
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00028-020-00604-y