Abstract
We propose a variational approach to solve Cauchy problems for parabolic equations and systems independently of regularity theory for solutions. This produces a universal and conceptually simple construction of fundamental solution operators (also called propagators) for which we prove \({\text {L}}^2\) off-diagonal estimates, which is new under our assumptions. In the special case of systems for which pointwise local bounds hold for weak solutions, this provides Gaussian upper bounds for the corresponding fundamental solution. In particular, we obtain a new proof of Aronson’s estimates for real equations. The scheme is general enough to allow systems with higher order elliptic parts on full space or second order elliptic parts on Sobolev spaces with boundary conditions. Another new feature is that the control on lower order coefficients is within critical mixed time-space Lebesgue spaces or even mixed Lorentz spaces.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
The classical treatment of parabolic problems begins with solving the Cauchy problem with or without forcing terms and representing solutions by what is called fundamental solutions. Here, we consider operators of the form \(\partial _{t}+{\mathcal {L}}\), where \({\mathcal {L}}\) is an elliptic operator in divergence form with possibly complex-valued coefficients. Coefficients depend on all space and time variables. We assume strongly (Gårding) elliptic and bounded higher order coefficients and unbounded lower order coefficients controlled in mixed Lebesgue and even Lorentz norms that are compatible with Sobolev embeddings for solutions. In particular, our treatment includes parabolic Schrödinger operators with Coulomb like potentials.
When the coefficients are regular, several methods are possible to construct the fundamental solution and the most efficient one is via a parametrix, using the so-called freezing point technique, which reduces the situation to space-independent coefficients for which fundamental solutions are explicit kernels \(\Gamma (t,x, s,y)\) with exponential decay in \((|x-y|^{2m}/|t-s|)^{1/(2m-1)}\), where 2m is the order of the elliptic operator [21]. When \(m=1\), this is the Gaussian decay.
When coefficients become irregular (measurable, unbounded), one goes through the theory of weak solutions that was developed in the 1950’s–60’s, culminating in the treatise by Ladyženskaja, Solonnikov and Ural’ceva [28]. In parallel, when the coefficients are real-valued and the elliptic operator has order two, Aronson constructed generalized fundamental solutions, using Riesz representation theorems as a consequence of well-posedness of Cauchy problems that generate bounded solutions. He also proved Gaussian upper and lower bounds [3, 4]. This supposedly closed the topic but here we shall reveal some new phenomena.
Our starting point is the guiding principle that many results on elliptic problems have counterparts in parabolic world, taking into account evolution with respect to time. However, elliptic problems are tackled using a coercive variational formulation, while parabolic problems are attacked via the Cauchy problem as mentioned above. The presence of the first order time derivative seems to forbid any possibility of coercivity as in the elliptic case.
Nevertheless, the heat kernel
can be seen as the kernel of the operator \((\partial _{t}-\Delta )^{-1}\). Here, the inverse can be computed using Fourier transform, but for more general parabolic operators this is not possible. Our question is whether some form of invertibility can still be implemented.
We show that indeed there is a variational formulation in the parabolic setting, too. That is, we find a variational space \({\mathcal {V}}\) such that if \(\partial _{t}+{\mathcal {L}}:{\mathcal {V}}\rightarrow {\mathcal {V}}'\), \({\mathcal {V}}'\) being its dual, is invertible, then one can represent the inverse by Green operators that eventually become the fundamental solution operators for the Cauchy problem (and whose kernels, whenever they exist in a pointwise sense, give a generalized fundamental solution). Invertibility and causality are then checked under appropriate coercivity requirements. In other words, we are reversing the order of the usual arguments. Our main conclusion for Cauchy problems in the case of coefficients in mixed Lebesgue spaces is in Theorem 2.54.
One may think this is a matter of cosmetic changes in the theory, but it is not. For instance, in the case of second order elliptic part, the usual energy space of weak solutions \({\text {L}}^2({\text {I}}; {\text {H}}^1)\cap {\text {L}}^\infty ({\text {I}}; {\text {L}}^2)\) or the smaller Lions’ space \({\text {L}}^2({\text {I}}; {\text {H}}^1)\cap {\text {H}}^1({\text {I}}; {\text {H}}^{-1})\)Footnote 1 cannot play the role of a variational space as above, as the dual is either not handy or too big. Thus, we have to renounce to a priori boundedness in \({\text {L}}^2\). Also, for symmetry reasons it is easier to let \({\text {I}}={\mathbb {R}}\) as this avoids boundary conditions for the time derivative. If one looks for a variational space candidate, the space \({\text {L}}^2({\mathbb {R}}; {\text {H}}^1)\) is unavoidable for weak solutions since it is mapped to its dual by the leading terms. Another Hilbert space, \({\text {H}}^{1/2}({\mathbb {R}}; {\text {L}}^2)\), is mapped to its dual by the time derivative. This space already appeared in the theory [28, 29] but rather in the regularity theory than with an instrumental role. Hence, the space
or its homogeneous version \({\dot{{\mathcal {V}}}}\), is a natural candidate and we are going to assume from the start the alluded \({\dot{{\mathcal {V}}}}\rightarrow {\dot{{\mathcal {V}}}}'\) invertibility of the parabolic operator. Even though \({\text {H}}^{1/2}({\mathbb {R}}) \subset {\text {L}}^\infty ({\mathbb {R}})\) fails, the homogeneous versions of these two spaces have the same scale invariance and therefore the homogeneous versions of the spaces \({\mathcal {V}}\) and \({\text {L}}^2({\mathbb {R}}; {\text {H}}^1)\cap {\text {L}}^\infty ({\mathbb {R}}; {\text {L}}^2)\) have the same embeddings into the mixed spaces \({\text {L}}^r({\mathbb {R}};{\text {L}}^q)\) except for the endpoint exponents \(r=\infty , q=2\). Regularity theory based on improvements of Lions’s embedding theorem allows us to introduce a class of solutions where one can uniquely solve
for \(\psi \in {\text {L}}^2\) and \(\delta _{s}\) the Dirac mass at s, and show that such solutions are continuously \({\text {L}}^2\)-valued except at s. This turns out to be precisely what is needed to define Green operators with the expected properties that can be used to represent solutions of the Cauchy problem. All boils down to proving invertibility of the parabolic operator, which uses an idea going back to [24] that has been rediscovered several times since. There, Kaplan showed for the first time, in absence of lower order coefficients, where coercivity of the parabolic operator \(\partial _{t}+{\mathcal {L}}\) hides even though \(\partial _{t}\) alone is not coercive in any sense since \({\text {Re}}\int _{{\mathbb {R}}}{\partial _{t}u} \, {{\overline{u}}}\, \text {d}t=0\) holds for any reasonable function u.
In summary, solutions being in \({\text {L}}^\infty ({\text {L}}^2)\) is an a priori requirement in most references to develop the theory and that the solutions belong to \({\text {C}}({\text {L}}^2)\) and \({\text {H}}^{1/2}({\text {L}}^2)\) is an a posteriori gain. Here, we use the invertibility on a space involving \({\text {H}}^{1/2}({\text {L}}^2)\) to construct (unique) solutions that are proved to be \({\text {C}}({\text {L}}^2)\cap {\text {L}}^\infty ({\text {L}}^2)\) by a regularity argument, hence that are usual weak solutions in the end.
Let us next describe the new findings that emerge from these conceptual changes.
Weaker assumptions on the coefficients. An advantage of using the variational space \({\mathcal {V}}\), as opposed to classical energy spaces, is that it not only embeds into mixed Lebesgue spaces \({\text {L}}^r({\mathbb {R}};{\text {L}}^q)\) for pairs (r, q) of exponents that we call admissible, but also into mixed Lorentz spaces \({\text {L}}^{r,2}({\mathbb {R}}; {\text {L}}^{q,2})\). As a consequence, this allows us to relax assumptions from \({\text {L}}^{{{\tilde{r}}}}({\mathbb {R}}; {\text {L}}^{{{\tilde{q}}}})\) to \({\text {L}}^{\tilde{r},\infty }({\mathbb {R}}; {\text {L}}^{{{\tilde{q}}},\infty })\) for the lower order coefficients for pairs \(({{\tilde{r}}}, {{\tilde{q}}})\) that we call compatible, as far as invertibility is concerned. Also causality can be proved under a weaker assumption, namely \({\text {L}}^{{{\tilde{r}}}}({\mathbb {R}}; {\text {L}}^{{{\tilde{q}}},\infty })\). This is explained in Sect. 2.15.
Adaptability of the approach. The “hidden coercivity” using the space \({\mathcal {V}}\) discovered in [24] had been explicitly appeared in several instances for other questions [5,6,7, 16, 23] concerning local regularity, maximal regularity or boundary value problems. The heart of the matter are Sects. 2.2–2.14. Once the framework is set up correctly, numerous, otherwise non-trivial extensions, will come effortlessly: Lower order coefficients in Lorentz spaces (Sect. 2.15), unbounded leading coefficients in \(\text {BMO}\) (Sect. 2.16), higher order systems on full space with integrability varying over the coefficients (Sect. 3), second order equations and systems with lateral boundary conditions (Sect. 4). We provide full details for the first two extensions and restrict ourselves to sketching the strategies for the latter two as the article is already quite long.
A self-contained theory with simpler proof techniques and improvement of Lions’ embedding theorem. Many results we prove here could seem “well-known” to experts at first glance but we produce all details of the second order case in full space in order to show that the method we develop is self-contained with no recourse to older literature. Some arguments require new techniques of proof, hopefully simpler and without using Steklov averages. In particular, our approach is a consequence of \({\text {L}}^2\) continuity in time (up to constant) of solutions of the heat operator \(\partial _{t}u-\Delta u=f\) or its adjoint when u a priori belongs to \({\text {L}}^2({\mathbb {R}};{\text {H}}^1)\) or its homogeneous version and f belongs to sums of mixed Sobolev spaces of \({\text {L}}^2\) type with negative indices. This seems new. In the end, this yields an improvement of Lions’ embedding theorem (Lemma 2.17).
A universal construction without approximation of coefficients. Lastly, our construction of propagators or fundamental solution operators avoids density arguments from operators with smooth coefficients or Galerkin methods. Uniqueness implies that our construction agrees with others under common hypotheses. In this sense it is universal and also constructive. In particular, we obtain a new proof of the Gaussian upper bound of Aronson as a consequence of \({\text {L}}^2\) off-diagonal estimates for fundamental solution operators, which hold in full generality (see Theorem 2.61). The latter is a new result in its own right.
Further details and precise assumptions are given in the course of the article.
2 Second order problems on full space
In what follows, we use \({\text {L}}^2\) spaces in both \({\mathbb {R}}^n\), \(n\ge 1\), and \({\mathbb {R}}^{n+1 }\), equipped with Lebesgue measures. We denote by \(\langle \psi ,{{\tilde{\psi }}} \rangle \) the complex inner product in the variable \(x \in {\mathbb {R}}^n\) and by \(\langle \langle \phi ,{{\tilde{\phi }}} \rangle \rangle \) the complex inner product in the variables \((t,x) \in {\mathbb {R}}\times {\mathbb {R}}^n =:{\mathbb {R}}^{n+1}\). (We prefer this order for the variables for practical reasons.) For a function f of the two variables, we set \(f(t): x\mapsto f(t,x)\) for any t.
We use the notation \({\mathcal {D}}\) and \({\mathcal {D}}'\) for the spaces of \({\text {C}}^\infty \) functions with compact support and of distributions, respectively, and \({\mathcal {S}}\) and \({\mathcal {S}}'\) for the spaces of Schwartz functions and tempered distributions, respectively. Variables will be indicated at the time of use. Duality brackets extend inner products on \({\text {L}}^2\) spaces, hence they are sesquilinear.
2.1 Variational space
As said in the introduction, we need a space, which can be thought of as \({\text {L}}^2_{t}{\dot{{\text {H}}}}^{1}_{x}\cap {\dot{{\text {H}}}}^{1/2}_{t}{\text {L}}^2_{x}\). However, some care must be taken because we use homogeneous norms.
Let the Fourier transform on \({\mathbb {R}}^{n+1}\) be the usual extension to tempered distributions of the integral defined on \({\text {L}}^1\) functions by
Remark that \((\tau ,\xi )\mapsto (|\xi |^2+|\tau |)^{-1/2}\) is locally square integrable in \({\mathbb {R}}\times {\mathbb {R}}^{n}\) with
Thus, for any \(g\in {\text {L}}^2_{t}{\text {L}}^2_{x}\) we have that \((|\xi |^2+|\tau |)^{-1/2}g\) is in both \({\text {L}}_{{\text {loc}}}^1({\mathbb {R}}^{n+1})\) and \({\mathcal {S}}'({\mathbb {R}}^{n+1})\). We define
Equipped with the norm
this is a Hilbert space. It is easy to see that it contains \({\mathcal {S}}({\mathbb {R}}^{n+1})\) as a dense subspace. Note also that constants do not belong to \({\dot{{\mathcal {V}}}}\).
Remark 2.1
For \(\varphi \in {\mathcal {S}}({\mathbb {R}}^{n+1})\), we have, using Plancherel’s formula,
Here, \({\text {D}}_{t}^\alpha \) is the Fourier multiplier with symbol \(|\tau |^\alpha \). In fact, the closure of \({\mathcal {S}}({\mathbb {R}}^{n+1})\) for the norm defined by the left-hand side is \({\dot{{\mathcal {V}}}}+{\mathbb {C}}\), seen as a subspace of \({\mathcal {S}}'({\mathbb {R}}^{n+1})/{\mathbb {C}}\). For a proof see Lemma 3.11 in [7]; this closure is denoted by \({\dot{{\text {E}}}}({\mathbb {R}}^{n+1})\) there. Hence \({\dot{{\mathcal {V}}}}\) is nothing but the realization of this closure within \({\mathcal {S}}'({\mathbb {R}}^{n+1})\) that eliminates constants. In particular, whenever \(u\in {\dot{{\mathcal {V}}}}\), then \(\nabla u\) and \({\text {D}}_{t}^{1/2}u\) exist as tempered distributions, belong to \({\text {L}}^2_{t}{\text {L}}^2_{x}\), and the identity above holds.
We let \({\dot{{\mathcal {V}}}}'\) be the dual of \({\dot{{\mathcal {V}}}}\) with respect to \(\langle \langle \cdot ,\cdot \rangle \rangle \). Thus, it is a subspace of \({\mathcal {S}}'({\mathbb {R}}^{n+1})\) and a distribution \(w \in {\mathcal {S}}'({\mathbb {R}}^{n+1})\) belongs to \({\dot{{\mathcal {V}}}}'\) if and only if \((|\tau |+ |\xi |^2)^{-1/2}{\hat{w}}\in {\text {L}}^2_{t}{\text {L}}^2_{x}\). It follows from Plancherel’s formula that \(w\in {\dot{{\mathcal {V}}}}'\) if and only if there exists a decomposition \({\hat{w}}= |\xi | g_{1}+|\tau |^{1/2}g_{2}\) with \(g_{1},g_{2}\in {\text {L}}^2_{t}{\text {L}}^2_{x}\) and that \(\Vert w\Vert _{{\dot{{\mathcal {V}}}}'}\sim \inf (\Vert g_{1}\Vert _{{\text {L}}^2_{t}{\text {L}}^2_{x}}+\Vert g_{2}\Vert _{{\text {L}}^2_{t}{\text {L}}^2_{x}})\) taken over all such decompositions. In this sense we write \({\dot{{\mathcal {V}}}}'={\text {L}}^2_{t}{\dot{{\text {H}}}}^{-1}_{x}+ {\dot{{\text {H}}}}^{-1/2}_{t}{\text {L}}^2_{x}\) with equivalent norms.
2.2 Embeddings
Recall that the homogeneous Sobolev space \({\dot{{\text {H}}}}^{1/2}({\mathbb {R}})={\dot{{\text {H}}}}^{1/2}_{t}\), that is, the closure of \({\mathcal {S}}({\mathbb {R}})\) for the norm \(\Vert {\text {D}}_{t}^{1/2}\!\varphi \Vert _{2}\), has the same scaling properties as \({\text {L}}^\infty ({\mathbb {R}})\). This results in continuous inclusions into mixed normed Lebesgue spaces for \({\dot{{\mathcal {V}}}}\) that, except for endpoints, are the same as for \({\text {L}}^2_{t}{\dot{{\text {H}}}}^1_{x}\cap {\text {L}}^\infty _{t}{\text {L}}^2_{x}\). We describe them next. We mention that \({\dot{{\text {H}}}}^{1/2}_{t}\) has an equivalent (semi-)norm, using difference quotients that is often used in the literature on this topic, see for instance [28]. We do not use them here.
We need the following mixed spaces. For pairs \((r,q)\in [1,\infty ]^2\) of exponents, intervals \({\text {I}}\subset {\mathbb {R}}\), and open sets \(\Omega \subset {\mathbb {R}}^n\), \(n\ge 1\), we write \({\text {L}}^r({\text {I}}; {\text {L}}^q(\Omega ))\) for the mixed norm space of measurable functions \(u:{\text {I}}\times \Omega \rightarrow {\mathbb {C}}\) with
and the usual changes if either \(r=\infty \) or \(q=\infty \). We set \({\text {L}}^{r}_{t}{\text {L}}^{q}_{ x}:={\text {L}}^r({\mathbb {R}}; {\text {L}}^q({\mathbb {R}}^n))\) with dummy variables in indices when \(t\in {\mathbb {R}}\) and \(x\in {\mathbb {R}}^n\).
We introduce the Banach space of tempered distributions
where the gradient is taken in the sense of distributions,Footnote 2 with norm
Duality theory for \({\dot{\Delta }}^{r,q}\) is easily understood by identifying \({\dot{\Delta }}^{r,q}\) with a closed subspace of \({\text {L}}^{r}_{t}{\text {L}}^{q}_{ x} \times {\text {L}}^{2}_{t}{\text {L}}^{2}_{x}\) through the map \(u\mapsto (u, \nabla u)\). As usual, the Hölder conjugate of \(q\in [1,\infty ]\) is \(q'=\frac{q}{q-1}\). Thus, for the duality \(\langle \langle \cdot \,,\cdot \rangle \rangle \) we have that if \((r,q)\in [1,\infty )^2\), then \(({\dot{\Delta }}^{r, q})' = {\text {L}}^2_{t}{\dot{{\text {H}}}}^{-1}_{x} + {\text {L}}^{r'}_{t}{\text {L}}^{q'}_{ x}\), the space of elements \({\text {div}}F + g\), with vector field \(F\in {\text {L}}^2_{t}{\text {L}}^2_{x}\) and scalar function \(g\in {\text {L}}^{r'}_{t}{\text {L}}^{q'}_{ x}\), equipped with usual infimum norm. In the same manner, when \((r,q)\in (1,\infty ]^2\), then \({\dot{\Delta }}^{r, q}\) is identified with the dual space of \({\text {L}}^2_{t}{\dot{{\text {H}}}}^{-1}_{x} + {\text {L}}^{r'}_{t}{\text {L}}^{q'}_{ x}\) and when \((r,q)\in (1,\infty )^2\), then it is reflexive. The duality theory implies that \({\mathcal {S}}({\mathbb {R}}^{n+1})\) is dense in \({\dot{\Delta }}^{r,q}\) when \((r,q) \in [1,\infty )^2\).
Definition 2.2
A pair (r, q) is admissible if \(\frac{1}{r}+\frac{n}{2q}= \frac{n}{4}\) with \(2\le r,q<\infty \).
Lemma 2.3
If (r, q) is an admissible pair, then \({\dot{{\mathcal {V}}}}\hookrightarrow {\dot{\Delta }}^{r, q}\) with continuous inclusion. In particular, elements of \({\dot{{\mathcal {V}}}}\) are locally integrable functions. By duality, this yields the continuous inclusion \( ({\dot{\Delta }}^{r, q})' \hookrightarrow {\dot{{\mathcal {V}}}}'\).
Proof
By density it suffices to work with \(\varphi \in {\mathcal {S}}({\mathbb {R}}^{n+1})\) and show the first inclusion. The proof relies on two ingredients. First, for \(\theta \in [0,1]\), using the convexity inequality
Fourier transform in the (t, x)-variable shows that
Next, Sobolev embeddings in \({\mathbb {R}}^n\) and \({\mathbb {R}}\) give us
where the first inequality holds exactly when \(\frac{1}{2} - \frac{1-\theta }{n} =\frac{1}{q}\) and \(2\le q<\infty \), and the second one exactly when \(\frac{\theta }{2}- \frac{1}{2}= -\frac{1}{r}\) and \(2\le r<\infty \). Note that the second embedding is the vector-valued extension of the scalar embedding on \({\mathbb {R}}\).
We can solve for \(\theta \in [0,1)\) if \(\frac{1}{r}+\frac{n}{2q}= \frac{n}{4}\) with \(2\le r<\infty \) and \(2\le q <\infty \), which is the definition of an admissible pair. Since the other part \(\Vert \nabla \varphi \Vert _{{\text {L}}^2_{t}{\text {L}}^2_{x}}\) of the \({\dot{\Delta }}^{r, q}\) norm is controlled by \(\Vert \varphi \Vert _{{\dot{{\mathcal {V}}}}}\), we are done. \(\square \)
Remark 2.4
As \({\dot{{\text {H}}}}^{1/2}({\mathbb {R}})\) does not embed into \({\text {L}}^\infty ({\mathbb {R}})\), the result fails for \((r,q)=(\infty ,2)\) and we excluded this pair from our admissible range although it satisfies the relation \(\frac{1}{r}+\frac{n}{2q}= \frac{n}{4}\). Similarly, the embedding \({\dot{{\text {H}}}}^{1-\theta }_{x}\subset {\text {L}}^q_{x}\) never holds when \(q=\infty \) and requires \(1-\theta < \frac{n}{2}\).
2.3 Variational approach
We study parabolic equations on \({\mathbb {R}}^{n+1} = {\mathbb {R}}\times {\mathbb {R}}^n\), namely
and its adjoint equation
where \({\mathcal {L}}\) and its adjoint are second order elliptic operators in divergence form perturbed with unbounded lower order terms. The equalities are taken in the sense of distributions, provided \({\mathcal {L}}u\) and \({\mathcal {L}}^*{{\tilde{u}}}\) are well-defined. To be precise, we consider
where the coefficients \(A,{{\textbf {a}}}, {{\textbf {b}}}, a\) depend on (t, x). Sometimes, we consider \({\mathcal {L}}\) as an operator acting on functions of the x-variable by freezing t: context will make things clear. The leading coefficient \(A=(a_{ij})\) is an \(n\times n\) matrix of bounded, possibly complex-valued, measurable functions on \({\mathbb {R}}^{n+1}\). Thus, the sesquilinear form corresponding to the leading part in (2.3) satisfies
with \(\Lambda :=\Vert A\Vert _{\infty }\). The lower order coefficients \({{\textbf {a}}}, {{\textbf {b}}}\) are n-vectors of complex-valued, measurable functions on \({\mathbb {R}}^{n+1}\), and a is a complex-valued, measurable functions on \({\mathbb {R}}^{n+1}\). The formal complex adjoint of \({\mathcal {L}}\) corresponds to
We introduce the following quantity.
Definition 2.5
For a pair \(({{\tilde{r}}}_{1},{{\tilde{q}}}_{1})\in [1,\infty ]^2\) let
and define \((r_{1}, q_{1})\in [2,\infty ]^2\) through the relations
Remark 2.6
In principle, we could let \(({{\tilde{r}}}_{1},{{\tilde{q}}}_{1})\) be different for each entry of \({{\textbf {a}}}\) and \({{\textbf {b}}}\), and a. At this point we do not go into this in order to simplify the exposition of ideas but we shall come back to the general version later on in Sect. 3.
Next, we introduce the sesquilinear pairings corresponding to the lower order terms in (2.3). We set
so that for appropriate u, v and for (almost) each \(t \in {\mathbb {R}}\), we write
and
The relation (2.7) guarantees that the formal pairings above are absolutely convergent Lebesgue integrals, as becomes apparent from the next lemma.
Lemma 2.7
Let \(({{\tilde{r}}}_{1}, {{\tilde{q}}}_{1}) \in [1,\infty ]^2\) and let \((r_{1}, q_{1})\) given by (2.7). Suppose that \(P_{\tilde{r}_{1},{{\tilde{q}}}_{1}}\) is finite. If \(u, v\in {\dot{\Delta }}^{r_{1}, q_{1}}\), then
In particular, \(\langle \beta u(t),v(t) \rangle \in {\text {L}}^1_{t}\) and
Proof
Use Hölder inequalities in the x and then t-variables, taking into account the relations
\(\square \)
It is of course natural to relate the choice of pairs to Sobolev embeddings.
Definition 2.8
A pair \(({{\tilde{r}}}_{1}, {{\tilde{q}}}_{1})\) is said compatible for lower order coefficients if \(\frac{1}{ {{\tilde{r}}}_{1}}+\frac{n}{2 {{\tilde{q}}}_{1}}= 1\) and \(1< {{\tilde{r}}}_{1}, {{\tilde{q}}}_{1} \le \infty \). In this case, \((r_{1},q_{1})\) given by (2.7) is its admissible conjugate pair.
This terminology is motivated by the following principle.
Lemma 2.9
A pair \(({{\tilde{r}}}_{1}, {{\tilde{q}}}_{1})\) is compatible for lower order coefficients if and only if \((r_{1},q_{1})\) is admissible.
Proof
We can see that
\(\square \)
Remark 2.10
The compatibility and admissibility conditions already appear in Chapter 3 of [28]. As in there (see p. 137), we include the case \({{\tilde{r}}}_{1}=\infty \), \({{\tilde{q}}}_{1}=\frac{n}{2}\), \(n\ge 3\), but not the case \({{\tilde{r}}}_{1}=1\) as the variational space is not contained in \( {\text {L}}^\infty _{t}{\text {L}}^2_{x}\).
We now introduce the variational setup. We use the Hilbert space \({\dot{{\mathcal {V}}}}\) and its dual \({\dot{{\mathcal {V}}}}'\) for \(\langle \langle \cdot ,\cdot \rangle \rangle \). Since \({\mathcal {S}}\) is dense in \({\dot{{\mathcal {V}}}}\), this pairing is consistent with the sesquilinear pairing of tempered distributions and Schwartz functions. We have seen in Sect. 2.2 that \({\dot{{\mathcal {V}}}}\subset {\dot{\Delta }}^{r,q} \) and \({\text {L}}^{r'}_{t}{\text {L}}^{q'}_{ x} \subset {\dot{{\mathcal {V}}}}'\) if (r, q) is admissible. For \(u \in {\dot{{\mathcal {V}}}}\) and \(v \in {\text {L}}^{r'}_{t}{\text {L}}^{q'}_{ x}\) the pairing \(\langle \langle u,v \rangle \rangle \) is therefore the Lebesgue integral \(\int _{{\mathbb {R}}^{n+1}}u(t,x)\overline{v}(t,x)\, \text {d}x\text {d}t\). This observation will be tacitly used throughout the section.
Proposition 2.11
Assume that \(P_{{{\tilde{r}}}_{1}, {{\tilde{q}}}_{1}}<\infty \) for some pair \(({{\tilde{r}}}_{1}, {{\tilde{q}}}_{1})\) compatible for lower order coefficients. Define the operator
through
for \(u,v \in {\dot{{\mathcal {V}}}}\). In the same fashion, define the dual operator
Then \({\mathcal {H}}, {\mathcal {H}}^*: {\dot{{\mathcal {V}}}}\rightarrow {\dot{{\mathcal {V}}}}'\) are well-defined, bounded and adjoint to one another.
Proof
For \(u\in {\dot{{\mathcal {V}}}}\) we have \((\partial _{t}u) \, \hat{}= |\tau |^{1/2} (i\tau |\tau |^{-1/2}{\hat{u}})\) and \(i\tau |\tau |^{-1/2}{\hat{u}}\in {\text {L}}^2_{t}{\text {L}}^2_{x}\), so that \(\partial _{t}u\in {\dot{{\mathcal {V}}}}'\) with \(\Vert \partial _{t}u\Vert _{{\dot{{\mathcal {V}}}}'}\le \Vert u\Vert _{{\dot{{\mathcal {V}}}}}\). It follows that \(\partial _{t}\), seen as an operator acting on tempered distributions in the two variables (t, x), maps \({\dot{{\mathcal {V}}}}\) into \({\dot{{\mathcal {V}}}}'\).
Next, for the admissible conjugate pair \((r_{1},q_{1})\) and \(u,v\in {\dot{\Delta }}^{ r_{1}, q_{1}}\), the pairing \(\langle \langle {\mathcal {L}}u,v \rangle \rangle \) is defined as
so that by Lemma 2.7,
By the embedding \({\dot{{\mathcal {V}}}}\hookrightarrow {\dot{\Delta }}^{ r_{1}, q_{1}}\) for the admissible pair \((r_{1},q_{1})\), we conclude that \(\langle \langle {\mathcal {L}}u,v \rangle \rangle \) is defined on \({\dot{{\mathcal {V}}}}\times {\dot{{\mathcal {V}}}}\) and that with \(C= C(n, r_{1},q_{1})\) we have
Eventually, for \(u,v\in {\dot{{\mathcal {V}}}}\), it follows using Fourier transform that
and by inspection that
Hence, \({\mathcal {H}}^*\) is the adjoint of \({\mathcal {H}}\) and its boundedness follows. \(\square \)
Remark 2.12
We shall often write \(\langle \langle {\mathcal {L}}u,v \rangle \rangle = \int _{{\mathbb {R}}} \langle {\mathcal {L}}u(t),v(t) \rangle \, \text {d}t\) to mean (2.11). When \(u,v\in {\dot{\Delta }}^{ r_{1}, q_{1}}\), we have also shown integrability in t together with the intermediate estimate
Hence, \({\mathcal {L}}u\in ({\dot{\Delta }}^{r_{1}, q_{1}})'\) with \(\Vert {\mathcal {L}}u\Vert _{({\dot{\Delta }}^{r_{1}, q_{1}})'}\le (\Vert A\Vert _{\infty } + P_{ \tilde{r}_{1}, {{\tilde{q}}}_{1}}) \Vert u\Vert _{{\dot{\Delta }}^{ r_{1}, q_{1}}}\). This implies in particular that \({\mathcal {L}}u\) is defined as a distribution when \(u\in {\dot{\Delta }}^{ r_{1}, q_{1}}\), and so is \(\partial _{t}u + {\mathcal {L}}u\).
This remark suggests the following notion of solution. We try to be very explicit in this regard in order not to confuse the reader by the versatile terminology of weak solution, and also because we work on \({\mathbb {R}}^{n+1}\).
Definition 2.13
We say that u is a \({\dot{\Delta }}^ {r_{1},q_{1}}\)-solution of \( \partial _{t}u+{\mathcal {L}}u= f\) in \({\mathbb {R}}^{n+1}\) if \(u\in {\dot{\Delta }}^{r_{1},q_{1}}\) and the equation is satisfied in the sense of distributions on \({\mathbb {R}}^{n+1}\), that is for all \({{\tilde{\phi }}}\in {\mathcal {D}}({\mathbb {R}}^{n+1})\),
where \(\langle \langle u,\partial _{t}{{\tilde{\phi }}} \rangle \rangle = \int _{}\hspace{-3.39996pt}\int _{{\mathbb {R}}^{n+1}} u(t,x) \overline{{\partial _{t}{{\tilde{\phi }}}}(t,x)} \, \text {d}x\text {d}t.\)
There is no regularity in time attached to this definition. For appropriate right hand side, we shall show that in fact a \({\dot{\Delta }}^{r_{1},q_{1}}\)-solution is (up to a constant) continuous and bounded in time valued in \({\text {L}}^2_{x}\), so that in the end we will be able to identify with weak solutions when considering the Cauchy problem, see Sect. 2.11.
2.4 Main regularity estimates
Our approach builds on the results of the next two sections. Note that the assumptions have a homogeneous flavor, which is necessary when the time interval is infinite.
We begin with results providing existence and uniqueness of specific solutions for the heat operator \(\partial _{t}-\Delta \) in \({\mathbb {R}}^{n+1}\). (We could also use \(\partial _{t}+\Delta \) as the choice of forward or backward time is irrelevant when \(t\in {\mathbb {R}}\).) This relies on Fourier transform arguments with tempered distributions.
Lemma 2.14
Let \(u\in {\mathcal {D}}'({\mathbb {R}}^{n+1})\) be a solution of \(\partial _{t}u - \Delta u=0\) in \({\mathcal {D}}'({\mathbb {R}}^{n+1})\) with \(\nabla u\in {\text {L}}^2_{t}{\text {L}}^2_{x}\). Then u is constant.
Proof
For each \(j\in \{1,\ldots , n\}\) we have \(u_{j} :=\partial _{x_{j}}u\in {\text {L}}^2_{t}{\text {L}}^2_{x}\subset {\mathcal {S}}'({\mathbb {R}}^{n+1})\) and \(\partial _{t}u_{j}-\Delta u_{j}=0\) in \({\mathcal {D}}'({\mathbb {R}}^{n+1})\), hence also in \({\mathcal {S}}'({\mathbb {R}}^{n+1})\). This implies that \(\langle \langle u_{j},\partial _{t}\phi +\Delta \phi \rangle \rangle =0\) for all \(\phi \in {\mathcal {S}}_{0}({\mathbb {R}}^{n+1})\), the space of Schwartz functions whose Fourier transforms vanish to infinite order at 0. Now, \(\partial _{t}+\Delta \), whose Fourier symbol is the polynomial \(i\tau -|\xi |^2\), is an automorphism of \( {\mathcal {S}}_{0}({\mathbb {R}}^{n+1})\). We obtain \(\langle \langle u_{j},\phi \rangle \rangle =0\) for all \(\phi \in {\mathcal {S}}_{0}({\mathbb {R}}^{n+1})\), so that \(u_{j}\) is a polynomial. As \(u_{j}\in {\text {L}}^2_{t}{\text {L}}^2_{x}\), this polynomial vanishes. We have shown that \(\nabla u=0\), hence \(\Delta u=0\), and \(\partial _{t}u=0\) follows from the equation. We conclude that u is constant. \(\square \)
Proposition 2.15
Let (r, q) be an admissible pair and \(w\in {\text {L}}^2_{t}{\dot{{\text {H}}}}^{-1}_{x}+ {\text {L}}^{r'}_{t}{\text {L}}^{q'}_{ x}\). Then there exists \(v\in {\text {C}}_{0}({\text {L}}^2_{x})\cap {\dot{{\mathcal {V}}}}\), unique in the class of distributions with \(\nabla v\in {\text {L}}^2_{t}{\text {L}}^2_{x}\), solution of \(\partial _{t}v-\Delta v=w\) in \({\mathcal {D}}'({\mathbb {R}}^{n+1})\). Moreover,
Uniqueness is provided by Lemma 2.14. The main work is to produce this solution. Using \({\mathbb {R}}\) as the time interval allows us to use embeddings for homogeneous Sobolev space on \({\mathbb {R}}\). For \(\theta \in [0,1)\), we introduce the space
equipped with the norm \(\Vert w\Vert _{{\dot{{\text {H}}}}_{t}^{{-\theta /2}}{\dot{{\text {H}}}}^{{\theta -1}}_{ x}} :=\Vert g\Vert _{{\text {L}}^2_{t}{\text {L}}^2_{x}}\). We recall that \({\text {D}}_{t}^\alpha \) is the Fourier multiplier with symbol \(|\tau |^\alpha \). For \(\theta =0\), we find \({\text {L}}^2_{t}{\dot{{\text {H}}}}^{-1}_{x}\). Elements in \({\dot{{\text {H}}}}_{t}^{{-\theta /2}}{\dot{{\text {H}}}}^{{\theta -1}}_{ x}\) are tempered distributions with locally square integrable Fourier transforms.
Lemma 2.16
If (r, q) is an admissible pair, then \({\text {L}}^{r'}_{t}{\text {L}}^{q'}_{ x} \hookrightarrow {\dot{{\text {H}}}}_{t}^{{-\theta /2}}{\dot{{\text {H}}}}^{{\theta -1}}_{ x}\) for \(\theta =1-\frac{2}{r}\).
Proof
Set
As \({\mathcal {S}}({\mathbb {R}}^{n+1})\) is dense in \({\text {L}}^2_{t}{\text {L}}^2_{x}\), we see that G is a dense subspace of \({\dot{{\text {H}}}}_{t}^{{-\theta /2}}{\dot{{\text {H}}}}^{{\theta -1}}_{ x}\). For any \(v\in {\text {L}}^{r'}_{t}{\text {L}}^{q'}_{ x}\) and \(g\in G\), we get using (2.2),
By density of G and the Riesz representation theorem there exists a unique \(w\in {\text {L}}^2_{t}{\text {L}}^2_{x}\) with \(\Vert w\Vert _{{\text {L}}^2_{t}{\text {L}}^2_{x}}\le c\Vert v\Vert _{{\text {L}}^{r'}_{t}{\text {L}}^{q'}_{ x}}\) such that \(\langle \langle v,M_{\theta }^{-1}g \rangle \rangle =\langle \langle w,g \rangle \rangle \) for all \(g\in G\), and we can see that \(v=M_{\theta }w\) in \({\mathcal {S}}'({\mathbb {R}}^{n+1})\). This concludes the proof. \(\square \)
Armed with this embedding, it suffices to prove the following stronger statement in purely \({\text {L}}^2\)-based mixed Sobolev spaces.
Lemma 2.17
Let \(\theta \in [0,1)\) and \(w\in {\dot{{\text {H}}}}_{t}^{{-\theta /2}}{\dot{{\text {H}}}}^{{\theta -1}}_{ x}\). Then there exists \(v\in {\text {C}}_{0}({\text {L}}^2_{x})\cap {\dot{{\mathcal {V}}}}\), solution of \(\partial _{t}v-\Delta v=w\) in the sense of tempered distributions in \({\mathbb {R}}^{n+1}\), with
Proof
Guided by the Duhamel formula, the function v(t) is formally defined by
and we need to make sense of this integral. To this end, we introduce a smaller dense subspace of \({\dot{{\text {H}}}}_{t}^{{-\theta /2}}{\dot{{\text {H}}}}^{{\theta -1}}_{ x}\).
Let \(M_\theta = {\text {D}}_{t}^{\theta /2}(-\Delta )^{(1-\theta )/2}\) as before and set \(G_{0}:=\{M_{\theta }g\, : \, g\in {\mathcal {S}}_{00}({\mathbb {R}}^{n+1})\}\), where \({\mathcal {S}}_{00}({\mathbb {R}}^{n+1})\) is the space of Schwartz functions whose Fourier transforms are supported away from \(\tau =0\) and \(\xi =0\). Note that \(M_{\theta }\) preserves \({\mathcal {S}}_{00}({\mathbb {R}}^{n+1})\) and that \(G_{0}\) is a dense subspace of \({\dot{{\text {H}}}}_{t}^{{-\theta /2}}{\dot{{\text {H}}}}^{{\theta -1}}_{ x}\). Call \(\tau _{t}\) the translation by \(t\in {\mathbb {R}}\): \(\tau _{t}g(s):=g( s+t)\), not indicating the x-variable as usual. It commutes with \(M_{\theta }.\)
Step 1: \(v(t) \in {\text {L}}^2_{x}\) with \(\Vert v(t)\Vert _{{\text {L}}^2_{x}}\le c(\theta )\Vert w\Vert _{{\dot{{\text {H}}}}_{t}^{{-\theta /2}}{\dot{{\text {H}}}}^{{\theta -1}}_{ x}}\). We begin with a preliminary estimate. Let \(\varphi \in {\text {L}}^2_{x}\) and define
where \(1_{A}\) denotes the indicator function of A. A classical calculation shows that \({\hat{h}}(\tau ,\xi )= (-i\tau +|\xi |^2)^{-1} {{\hat{\varphi }}}(\xi )\), where \({{\hat{\varphi }}}\) is the Fourier transform of \(\varphi \) on \({\mathbb {R}}^n\). Using Plancherel’s formula in \({\mathbb {R}}^{n}\) and \({\mathbb {R}}\), Fubini’s theorem and the change of variables \(\tau =\sigma |\xi |^2\) when \(\xi \ne 0\), we find
with \(c(\theta ) :=(2\pi )^{-1/2}\big (\int _{-\infty }^\infty \frac{|\sigma |^\theta }{1+\sigma ^2}\, \text {d}\sigma \big )^{1/2}<\infty \) since \(\theta \in [0,1)\). It follows that for any \(g\in {\text {L}}^2_{t}{\text {L}}^2_{x}\),
Now assume \(g\in {\mathcal {S}}_{00}({\mathbb {R}}^{n+1})\), set \(w :=M_{\theta }g\) and fix \(t\in {\mathbb {R}}\). Then \(w\in {\mathcal {S}}_{00}({\mathbb {R}}^{n+1})\), so in particular \(w \in {\text {L}}^1_{t}{\text {L}}^2_{x}\) and the integral in (2.13) converges as a Bochner integral in \({\text {L}}^2_{x}\) thanks to the contractivity of the heat semigroup on \({\text {L}}^2_{x}\). To get the appropriate estimate on v(t), we calculate
and (2.14) yields
In total, we have produced a bounded map
By density, it has a bounded extension \({\mathcal {M}}:{\dot{{\text {H}}}}_{t}^{{-\theta /2}}{\dot{{\text {H}}}}^{{\theta -1}}_{ x}\rightarrow {\text {L}}^\infty _{t}{\text {L}}^2_{x}\). Due to (2.15) this extension is defined weakly by
where as before \(w=M_{\theta }g\in {\dot{{\text {H}}}}_{t}^{{-\theta /2}}{\dot{{\text {H}}}}^{{\theta -1}}_{ x}\) and \(h(t,x)=1_{(-\infty ,0)}(t)(e^{-t\Delta }\varphi )(x)\) with \(\varphi \in {\text {L}}^2_{x}\).
Step 2: \(v \in {\text {C}}_{0}({\text {L}}^2_{x})\). By density of \(G_{0}\) in \({\dot{{\text {H}}}}_{t}^{{-\theta /2}}{\dot{{\text {H}}}}^{{\theta -1}}_{ x}\) and closedness of \({\text {C}}_{0}({\text {L}}^2_{x})\) in \({\text {L}}^\infty _{t}{\text {L}}^2_{x}\), it is enough by (2.16) to show that \(v={\mathcal {M}}w \in C_{0}({\text {L}}^2_{x})\) for all \( w\in G_{0}\). In that case we have seen that \(w\in {\text {L}}^1_{t}{\text {L}}^2_{x}\). For the continuity in time, write (2.13) as \(v(t)= \int _{-\infty }^0 \text {e}^{-s\Delta }w(s+t)\, \text {d}s\), so that continuity follows right away from the continuity of time translations in \({\text {L}}^1_{t}{\text {L}}^2_{x}\) and contractivity of the heat semigroup on \({\text {L}}^2_{x}\). For the limits \(v(t) \rightarrow 0\) as \(|t| \rightarrow \infty \), we use dominated convergence in (2.13) as follows. By contractivity of the heat semigroup, the integrand is bounded in \({\text {L}}^2_{x}\) by \(\Vert w(s)\Vert _{{\text {L}}^2_{x}}\) and this function is integrable with respect to s. The limit as \(t \rightarrow - \infty \) follows immediately, whereas for \(t \rightarrow \infty \) we additionally use that the heat semigroup tends to 0 strongly in \({\text {L}}^2_{x}\).
Step 3: \(v={\mathcal {M}}w\) is a solution of \(\partial _{t}v-\Delta v=w\) in \({\mathcal {S}}'({\mathbb {R}}^{n+1})\). Assume that \(w= M_\theta g \in G_0\) with \(g\in {\mathcal {S}}_{00}({\mathbb {R}}^{n+1})\). Since \(w, \Delta w \in {\text {L}}^1_{t}{\text {L}}^2_{x}\) and \(t\mapsto w(t)\) is continuous as an \({\text {L}}^2_{x}\)-valued function, we obtain \(v'(t)=w(t)+\Delta v(t)\) in \({\text {L}}^2_{x}\) for all \(t \in {\mathbb {R}}\). From this we conclude that \(\langle \langle v,-\partial _{t}\phi -\Delta \phi \rangle \rangle =\langle \langle w,\phi \rangle \rangle \) for all \(\phi \in {\mathcal {S}}({\mathbb {R}}^{n+1})\). It remains to argue by density, letting g approximate any element in \({\text {L}}^2_{t}{\text {L}}^2_{x}\) in this equality for fixed \(\phi \).
Step 4: \({v={\mathcal {M}}w\in {\dot{{\mathcal {V}}}}}\) with \(\Vert v\Vert _{{\dot{{\mathcal {V}}}}}\lesssim \Vert w\Vert _{{\dot{{\text {H}}}}_{t}^{{-\theta /2}}{\dot{{\text {H}}}}^{{\theta -1}}_{ x}}\). Again, it is enough to proceed by density after proving the claim for \(w = M_\theta g\) when \(g\in {\mathcal {S}}_{00}({\mathbb {R}}^{n+1})\) with a constant that does not depend on this assumption. By Fourier transform from the equation, \((i\tau +|\xi |^2){\hat{v}}= |\tau |^{\theta /2}|\xi |^{1-\theta }{\hat{g}}\) as tempered distributions so that \({\hat{v}}= (i\tau +|\xi |^2)^{-1} |\tau |^{\theta /2}|\xi |^{1-\theta }{\hat{g}}\). Here, the right-hand side is again a tempered distribution of the form \((|\tau |+|\xi |^2)^{-1/2}m{\hat{g}}\), so that
where
\(\square \)
We observe that there is a substitute result for the non admissible pair \((\infty ,2)\) that does not involve the variational space \({\dot{{\mathcal {V}}}}\).
Proposition 2.18
Let \(w\in {\text {L}}^2_{t}{\dot{{\text {H}}}}^{-1}_{x}+ {\text {L}}^{1}_{t}{\text {L}}^{2}_{x}\). Then there exists \(v\in {\text {C}}_{0}({\text {L}}^2_{x})\cap {\text {L}}^2_{t}{\dot{{\text {H}}}}^1_{x}\), unique in the class of distributions with \(\nabla v\in {\text {L}}^2_{t}{\text {L}}^2_{x}\), solution of \(\partial _{t}v-\Delta v=w\) in \({\mathcal {D}}'({\mathbb {R}}^{n+1})\). It has the estimates
Proof
Uniqueness is again provided by Lemma 2.14. The case where \(w=-{\text {div}}F\in {\text {L}}^2_{t}{\dot{{\text {H}}}}^{-1}_{x}\) is given by the solution v in Lemma 2.17 with \(\theta =0\), and checking the constants we have
Assuming now that \(w\in {\text {L}}^1_{t}{\text {L}}^2_{x}\), Steps 1 and 2 of the proof of Lemma 2.17 show that v given by (2.13) belongs to \({\text {C}}_{0}({\text {L}}^2_{x})\) with \(\Vert v(t)\Vert _{{\text {L}}^2_{x}} \le \Vert w\Vert _{{\text {L}}^{1}_{t}{\text {L}}^{2}_{x}}\). To show that \(\nabla v\in {\text {L}}^2_{t}{\text {L}}^2_{x}\), we take a vector field \({{\widetilde{\Phi }}}\in {\mathcal {S}}({\mathbb {R}}^{n+1})\) and \({{\tilde{v}}}\) given by \(\tilde{v}(s)=\int ^{\infty }_{s} \text {e}^{(t-s)\Delta }({\text {div}}{{\widetilde{\Phi }}})(t)\, \text {d}t\), \(s\in {\mathbb {R}}\), is the solution of Lemma 2.17 in the case \(\theta =0\) for the backward equation \(-\partial _{s}{{\tilde{v}}}-\Delta {{\tilde{v}}}= {\text {div}}{{\tilde{\Phi }}}\). In particular \({{\tilde{v}}} \in {\text {C}}_{0}({\text {L}}^2_{x})\) with \(\Vert {{\tilde{v}}}\Vert _{{\text {L}}^\infty _{t}{\text {L}}^2_{x}} \le c(0) \Vert {{\tilde{\Phi }}}\Vert _{{\text {L}}^2_{t}{\text {L}}^2_{x}}\) and \(c(0) = 2^{-1/2}\). As
we deduce that \( \Vert \nabla v\Vert _{{\text {L}}^2_{t}{\text {L}}^2_{x}}\le 2^{-1/2} \Vert w\Vert _{{\text {L}}^1_{t}{\text {L}}^2_{x}}\). Eventually, we get \(\partial _{t}v-\Delta v=w\) in \({\mathcal {D}}'({\mathbb {R}}^{n+1})\) as before. \(\square \)
Let us state consequences of these two propositions. The first one shows a set of lower bounds for the heat operator.
Corollary 2.19
For any distribution u with \(\nabla u \in {\text {L}}^2_{t}{\text {L}}^2_{x}\), there is a constant \(c\in {\mathbb {C}}\) such that \(u-c\in {\text {C}}_{0}({\text {L}}^2_{x})\) with
provided that (r, q) is admissible or \((r,q)=(\infty ,2)\) and that the right hand side is finite.
Proof
Let v be the solution of \(\partial _{t}v-\Delta v=\partial _{t}u-\Delta u\) given by Propositions 2.15 or 2.18. By uniqueness, \(v-u\) is a constant c so that \(u-c\in {\text {C}}_{0}({\text {L}}^2_{x})\). Optimizing over all possible decompositions of \(\partial _{t}u-\Delta u\) in \({\text {L}}^2_{t}{\dot{{\text {H}}}}^{-1}_{x}+ {\text {L}}^{r'}_{t}{\text {L}}^{q'}_{ x}\) yields the estimate. \(\square \)
The second one will be our fundamental regularity estimate in the following.
Corollary 2.20
Let \(u\in {\mathcal {D}}'({\mathbb {R}}^{n+1})\). Assume \(\nabla u\in {\text {L}}^2_{t}{\text {L}}^2_{x}\) and \(\partial _{t}u\in {\text {L}}^2_{t}{\dot{{\text {H}}}}^{-1}_{x}+ {\text {L}}^{r'}_{t}{\text {L}}^{q'}_{ x}\) for (r, q) an admissible pair or \((r,q)=(\infty ,2)\). Then there is a constant \(c\in {\mathbb {C}}\) such that \(u-c\in {\text {C}}_{0}({\text {L}}^2_{x})\) with
Moreover, if (r, q) is admissible, then \(u-c\in {\dot{{\mathcal {V}}}}\) with the same estimate on \(\Vert u-c\Vert _{{\dot{{\mathcal {V}}}}}\).
Proof
We see that \(\partial _{t}u-\Delta u \in {\text {L}}^2_{t}{\dot{{\text {H}}}}^{-1}_{x} + {\text {L}}^{r'}_{t}{\text {L}}^{q'}_{ x}\) with
We apply Corollary 2.19 to get the estimate. That \(u-c\in {\dot{{\mathcal {V}}}}\) when (r, q) is admissible is already in Proposition 2.15. \(\square \)
Remark 2.21
-
(i)
The case \(\theta =0\) of Lemma 2.17 on the half-space \((0,\infty )\times {\mathbb {R}}^{n}\) appears in [9]. It can be seen as the homogeneous version of Lions’ embedding theorem, which would apply had we assumed in addition \(u\in {\text {L}}^2_{t}{\text {L}}^2_{x}\). An important point is that the interval is infinite, otherwise the homogeneous version is wrong. Here, the statement with the time interval being the real line simplifies some matters of the proof in [9] when \(\theta =0\) and we extend it to \(\theta <1\). However, the statement does not hold when \(\theta =1\). When \(\theta >0\), the situation is not as symmetric as Lions’ embedding theorem since the hypothesis cannot be put into a form \(u\in E\), \(\partial _{t}u\in E'\), where E is a Banach space and \(E'\) its dual for the \({\text {L}}^2_{t}{\text {L}}^2_{x}\) duality. We relax on u since we do not want to impose more than \(u\in {\text {L}}^2_{t}{\dot{{\text {H}}}}^1_{x}\) for the spatial regularity. In any case, the conditions \(u\in {\dot{{\mathcal {V}}}}\) and \(\partial _{t}u\in {\dot{{\mathcal {V}}}}'\) do not imply u continuous into \({\text {L}}^2_{x}\). Indeed, we saw that the time derivative maps \({\dot{{\mathcal {V}}}}\) into \({\dot{{\mathcal {V}}}}'\), but \({\dot{{\mathcal {V}}}}\) is not contained in \({\text {C}}({\text {L}}^2_{x})\).
-
(ii)
In the hypotheses of Corollaries 2.19 and 2.20, one can take a finite sum of spaces of the same types with different pairs of exponents. This is a consequence of our constructive proof: \(u-c\) is the sum of solutions to the heat equation with right-hand sides equal to the various different components.
2.5 Integral equalities
Integral identities are well-known in our context when the time interval is finite [28]. When the time interval is infinite, they require additional care in both the assumptions and the proofs, and Lemma 2.17 becomes the essential instrument. Therefore, the next lemma is not a straightforward generalization of known results.
Lemma 2.22
Let \(u\in {\mathcal {D}}'({\mathbb {R}}^{n+1})\). Assume \(\nabla u\in {\text {L}}^2_{t}{\text {L}}^2_{x}\) and \(\partial _{t}u= -{\text {div}}F+ g\) with \(F\in {\text {L}}^2_{t}{\text {L}}^2_{x}\), \(g\in {\text {L}}^{r'}_{t}{\text {L}}^{q'}_{ x} \) and (r, q) an admissible pair or \((r,q)=(\infty ,2)\). Then, up to a constant, \(u\in {\text {C}}_{0}({\text {L}}^2_{x})\), \(t\mapsto \Vert u(t)\Vert _{{\text {L}}^2_{x}}^2 \) is absolutely continuous on \({\mathbb {R}}\) and for all \(\sigma <\tau \),
Remark 2.23
In the spirit of Remark 2.21 (ii), one can replace g by linear combinations of functions of the same type with different admissible pairs and the pair \((\infty ,2)\).
Proof
From Corollary 2.20 we know that \(u-c\in {\text {C}}_{0}({\text {L}}^2_{x})\cap {\text {L}}^{r}_{t}{\text {L}}^{q}_{ x}\) for some constant c. Adjusting the constant to 0, the integrand of (2.17) is well-defined and integrable on \({\mathbb {R}}\). It remains to prove the integral identity.
We begin with assuming that (r, q) is an admissible pair. Let \(\varphi _{\varepsilon }=\frac{1}{\varepsilon }\varphi (\frac{\cdot }{\varepsilon })\), \(\varepsilon >0\), be a mollifying sequence of \({\mathbb {R}}\) with \(\varphi \) smooth, compactly supported in \([-1,1]\) and \(\int _{\mathbb {R}}\varphi =1\), and let \(u_{\varepsilon } :=\varphi _{\varepsilon }\star u\), where convolution is in the t-variable. Clearly, \(t\mapsto u_{\varepsilon }(t)\) is of class \({\text {C}}^1\) as an \({\text {L}}^2_{x}\)-valued function. Thus,
Since \(u\in {\text {C}}_{0}({\text {L}}^2_{x})\), the left-hand side converges to the one of (2.17) as \(\varepsilon \rightarrow 0\). Next, to justify the convergence of the integral in (2.18) to the one in (2.17), a computation yields
where \(F_{\varepsilon } :=\varphi _{\varepsilon } \star F\) and \(g_{\varepsilon } :=\varphi _{\varepsilon } \star g\). Observe that \( (h_{1},h_{2})\mapsto \int _{\sigma }^\tau \langle h_{1}(t),h_{2}(t) \rangle \, \text {d}t\) is a sesquilinear continuous form on \(X\times Y\), where \((X,Y)=({\text {L}}^2_{t}{\text {L}}^2_{x},{\text {L}}^2_{t}{\text {L}}^2_{x})\) or \(({\text {L}}^{r'}_{t}{\text {L}}^{q'}_{ x}, {\text {L}}^{r}_{t}{\text {L}}^{q}_{ x})\) and that the pairings above are of this type as (r, q) is admissible. In each factor the convolution with \(\varphi _{\varepsilon }\) is uniformly bounded and converges strongly to the identity operator. The convergence follows.
If \((r,q)=(\infty ,2)\), then we repeat the above argument with \((X,Y)=({\text {L}}^{1}_{t}{\text {L}}^{2}_{x},{\text {C}}_{0}({\text {L}}^2_{x}))\). \(\square \)
The lemma above can be localized to a half-infinite time interval as follows.
Corollary 2.24
Let \({\text {I}}\) be an open half-infinite interval of \({\mathbb {R}}\) and \(u\in {\mathcal {D}}'({\text {I}}\times {\mathbb {R}}^{n})\). Assume \(\nabla u\in {\text {L}}^2({\text {I}};{\text {L}}^2_{x})\) and \(\partial _{t}u= -{\text {div}}F+ g\) with \(F\in {\text {L}}^2({\text {I}};{\text {L}}^2_{x})\), \(g\in {\text {L}}^{r'}({\text {I}};{\text {L}}^{q'}_{x} )\) and (r, q) an admissible pair or \((r,q)=(\infty ,2)\). Then, up to a constant, \(u\in {\text {C}}_{0}(\overline{{\text {I}}}; {\text {L}}^2_{x})\) and the function \(t\mapsto \Vert u(t)\Vert _{{\text {L}}^2_{x}}^2 \) is absolutely continuous on \({{\overline{{\text {I}}}}}\) with
for all \(\sigma , \tau \in {{\overline{{\text {I}}}}}\), \(\sigma <\tau \).
Proof
It is enough to consider the case \({\text {I}}=(a,\infty )\), \(a\in {\mathbb {R}}\). The strategy is to extend u by even reflection at \(t=a\) and F and g by odd reflection at \(t=a\) so that the assumptions of Lemma 2.22 apply to these extensions. The conclusion follows by restricting back to \({{\overline{{\text {I}}}}}\). However, some care needs to be taken since we do not assume a priori that u is a locally integrable function and we provide details for the convenience of the reader. Take \(a=0\) for simplicity. We construct a distribution \(v \in {\mathcal {D}}'({\mathbb {R}}^{n+1})\) (the even extension of u) verifying the hypothesis of Lemma 2.22 with the odd extensions of F and g, and whose restriction to \((0,\infty )\times {\mathbb {R}}^{n}\) is u.
For any \(\psi \in {\mathcal {D}}({\mathbb {R}}^n)\) we define the distribution \(\langle u(t),\psi \rangle \) on \((0,\infty )\) by
where \(f \otimes \psi \) is the tensor product. The equation \(\partial _{t}u= -{\text {div}}F+ g\) implies that for any \( \psi \in {\mathcal {D}}({\mathbb {R}}^n)\) we have \(\langle u(t),\psi \rangle '= \langle F(t),\nabla \psi \rangle +\langle g(t),\psi \rangle \) in \({\mathcal {D}}'(0,\infty )\). As the right-hand side is locally integrable from the assumptions on F and g, this shows that \(\langle u(t),\psi \rangle \) can be identified with a continuous function on \((0,\infty )\) that extends continuously to 0. We continue to use the suggestive notation \(t \mapsto \langle u(t),\psi \rangle \). For \(\psi \in {\mathcal {D}}({\mathbb {R}}^n)\) and \(f\in {\mathcal {D}}({\mathbb {R}})\) the formula
defines a distribution \(v \in {\mathcal {D}}'({\mathbb {R}}^{n+1})\). (Recall that distributions in \({\mathbb {R}}^{n+1}={\mathbb {R}}\times {\mathbb {R}}^n\) are uniquely determined on tensor products.) Taking f supported in \((0,\infty )\) gives that \(v=u\) in \( {\mathcal {D}}'((0,\infty )\times {\mathbb {R}}^{n})\). Next, integration by parts shows that
where \(F_{o}\) and \(g_{o}\) are the odd extensions of F and g, respectively. Thus, \(\partial _{t} v= -{\text {div}}F_{o}+ g_{o}\) in \({\mathcal {D}}'({\mathbb {R}}^{n+1})\). Lastly,
where \((\nabla u)_{e}\) is the even extension of \(\nabla u\), so that \(\nabla v= (\nabla u)_{e}\) in \({\mathcal {D}}'({\mathbb {R}}^{n+1})\). We have proved our claim. \(\square \)
The conclusion of Lemma 2.22 can be polarized, given two functions \(u, {\tilde{u}}\) that verify the assumptions (with possibly different pairs (r, q)). Due to the extendability that we have seen in the previous proof, the same also works with open, half-infinite intervals and the conclusion reads as follows.
Corollary 2.25
If \(u,{{\tilde{u}}}\) satisfy the same assumptions as in Corollary 2.24 on two open half-infinite intervals \({\text {I}}\) and \({\text {J}}\), then after eliminating a constant from each of u and \({{\tilde{u}}}\), the function \(t\mapsto \langle u(t),{{\tilde{u}}}(t) \rangle \) is absolutely continuous on \(\overline{{\text {I}}\cap {\text {J}}}\) and
whenever \(\sigma ,\tau \in \overline{{\text {I}}\cap {\text {J}}}\), \(\sigma <\tau \).
2.6 Existence and uniqueness results
We come back to the study of parabolic equations. Up until Sect. 2.9 included, we
Recall that Proposition 2.11 yields boundedness of an operator \({\mathcal {H}}:{\dot{{\mathcal {V}}}}\rightarrow {\dot{{\mathcal {V}}}}'\), which acts as \(\partial _{t}+{\mathcal {L}}\). We develop the existence and uniqueness theory under the hypothesis
This means that the constants in our estimates will also depend on \(\Lambda =\Vert A\Vert _{\infty }\), \(P_{{{\tilde{r}}}_{1}, {{\tilde{q}}}_{1}}\) and the norm of the inverse of \({\mathcal {H}}\). We do not make this dependence explicit until Sect. 2.9, where we discuss a sufficient condition for invertibility. So, we write for instance C(n, q, r) to mean dependence on n, q, r and possibly the aforementioned quantities.
In the following, we only state results involving the operator \(\partial _{t}+{\mathcal {L}}\). All results apply mutatis mutandis to \(-\partial _{t}+{\mathcal {L}}^*\) since both operators are indistinguishable from the assumptions at this stage. We are going to prove uniqueness and existence in the class of \({\dot{\Delta }}^{r_{1},q_{1}}\)-solutions that we introduced at the end of Sect. 2.3.
Proposition 2.26
Any \({\dot{\Delta }}^{r_{1},q_{1}}\)-solution of \(\partial _{t}u+{\mathcal {L}}u=f\in {\text {L}}^2_{t}{\dot{{\text {H}}}}^{-1}_{x}+ {\text {L}}^{r'}_{t}{\text {L}}^{q'}_{ x}\) for (r, q) an admissible pair or \((r,q)=(\infty ,2)\) belongs to \({\text {C}}_{0}({\text {L}}^2_{x})\) with
Proof
We have that \({\mathcal {L}}u \in ({\dot{\Delta }}^{r_{1},q_{1}})'={\text {L}}^2_{t}{\dot{{\text {H}}}}^{-1}_{x}+ {\text {L}}^{r_{1}'}_{t}{\text {L}}^{q_{1}'}_{ x}\) with bound controlled by \(\Vert u\Vert _{{\dot{\Delta }}^{r_{1},q_{1}}}\) according to Remark 2.12, so that \(\partial _{t}u=-{\mathcal {L}}u+ f \in {\text {L}}^2_{t}{\dot{{\text {H}}}}^{-1}_{x}+ {\text {L}}^{r_{1}'}_{t}{\text {L}}^{q_{1}'}_{ x}+ {\text {L}}^{r'}_{t}{\text {L}}^{q'}_{ x}\). Remark 2.21 (ii) yields the desired conclusion up to a constant for u and the constant must be 0 as \(u\in {\dot{\Delta }}^{r_{1},q_{1}}\). \(\square \)
Theorem 2.27
Assume (H\(_{\textbf{0}}\)). If u is a \({\dot{\Delta }}^{r_{1},q_{1}}\)-solution of \(\partial _{t}u+{\mathcal {L}}u= 0\), then \(u=0\).
Proof
We may apply Corollary 2.20, so that \(u-c\in {\dot{{\mathcal {V}}}}\cap {\text {C}}_{0}({\text {L}}^2_{x})\). The constant c vanishes as \(u\in {\text {C}}_{0}({\text {L}}^2_{x})\) from the above proposition. Thus, we have \(u\in {\dot{{\mathcal {V}}}}\) with \({\mathcal {H}}u=0\) and \(u=0\) follows by \(({{\textbf {H}}}_{{{\textbf {0}}}})\). \(\square \)
Theorem 2.28
Assume (H\(_{\textbf{0}}\)). Let (r, q) be any admissible pair, \(F\in {\text {L}}^2_{t}{\text {L}}^2_{x}\) and \(g\in {\text {L}}^{r'}_{t}{\text {L}}^{q'}_{ x}\). Then \(u :={\mathcal {H}}^{-1}(-{\text {div}}F+g)\in {\dot{{\mathcal {V}}}}\) is the unique \({\dot{\Delta }}^{r_{1},q_{1}}\)-solution of
Moreover, it belongs to \({\text {C}}_{0}({\text {L}}^2_{x})\) with
Proof
As \(-{\text {div}}F +g \in {\dot{{\mathcal {V}}}}'\) by Lemma 2.3, we see that u is well-defined in \({\dot{{\mathcal {V}}}}\) and belongs to \({\dot{\Delta }}^{r_{1},q_{1}}\) thanks to the same lemma. It is thus a \({\dot{\Delta }}^{r_{1},q_{1}}\)-solution of (2.21). By Theorem 2.27 it is unique and by Proposition 2.26 it belongs to \({\text {C}}_{0}({\text {L}}^2_{x})\). The estimate (2.22) follows from that. \(\square \)
The previous theorem does not apply when \(g \in {\text {L}}^{1}_{t}{\text {L}}^{2}_{x}\) since \((r,q)=(\infty ,2)\) is not admissible and \({\mathcal {H}}^{-1}g\) does not make sense. Yet, we can construct a \({\dot{\Delta }}^{r_1,q_1}\)-solution that falls outside of the variational \({\dot{{\mathcal {V}}}}- {\dot{{\mathcal {V}}}}'\) setting.
Theorem 2.29
Assume (H\(_{{\textbf{0}}}\)). For every \(g\in {\text {L}}^1_{t}{\text {L}}^2_{x}\), there exists a unique \({\dot{\Delta }}^{r_{1},q_{1}}\)-solution of
Moreover, this solution belongs to \({\text {L}}^{r}_{t}{\text {L}}^{q}_{ x}\) for any admissible pair (r, q) and to \({\text {C}}_{0}({\text {L}}^2_{x})\) with
and
Proof
The uniqueness follows from Theorem 2.27. For the existence, let \(T:{\text {L}}^1_{t}{\text {L}}^2_{x} \rightarrow {\mathcal {D}}'({\mathbb {R}}^{n+1})\) defined by
From Theorem 2.28 applied to \({\mathcal {H}}^*\), we have that the restriction \({{\mathcal {H}}^*}^{-1}:({\dot{\Delta }}^{ r, q})' \rightarrow {\text {C}}_{0}({\text {L}}^2_{x})\) is bounded for any admissible pair (r, q), so that
We next show that \(u :=Tg\) is a \({\dot{\Delta }}^{r_{1},q_{1}}\)-solution of (2.23). Observe that u agrees with \({\mathcal {H}}^{-1}g\) for \(g\in ({\dot{\Delta }}^{r,q})' \cap {\text {L}}^1_{t}{\text {L}}^2_{x}\), hence for g in a dense subspace, for example \({\mathcal {D}}({\mathbb {R}}^{n+1})\). For those functions g, we have that u is a \({\dot{\Delta }}^{r_{1},q_{1}}\)-solution of (2.23). Secondly, let \(g\in {\text {L}}^1_{t}{\text {L}}^2_{x}\) and \((g_{k})\) be a sequence in \({\mathcal {D}}({\mathbb {R}}^{n+1})\) that converges to g in \( {\text {L}}^1_{t}{\text {L}}^2_{x}\). Testing the equation for \(u_{k} :=Tg_{k}\) against a function \({{\tilde{\phi }}}\in {\mathcal {D}}({\mathbb {R}}^{n+1})\), we have
The estimate (2.27) shows that \(u_{k}\) converges to u in the norm of \({\dot{\Delta }}^{r_{1},q_{1}}\), and this allows us to pass to the limit on the left-hand side, using Remark 2.12 for the second term. This proves (2.23) and (2.24).
Eventually, \(u\in {\text {C}}_{0}({\text {L}}^2_{x})\) follows from Proposition 2.26 and we obtain the estimate (2.25) from the estimate in that proposition if we plug in (2.24) with \((r,q)=(r_{1},q_{1})\). \(\square \)
The next result is central for our constructive approach to fundamental solutions and Green operators.
Theorem 2.30
Assume (H\(_{{\textbf{0}}}\)). For all \(s\in {\mathbb {R}}\) and \(\psi \in {\text {L}}^2_{x}\) there exists a unique \({\dot{\Delta }}^{r_{1},q_{1}}\)-solution of
where \(\delta _{s}\) is the Dirac mass at \(t=s\) and \(\otimes \) denotes the tensor product. Moreover, this solution belongs to \({\dot{\Delta }}^{r,q}\) for any admissible pair (r, q) with
Furthermore, \(u\in {\text {C}}_{0}({\mathbb {R}}\setminus \{s\}; {\text {L}}^2_{x})\), it has \({\text {L}}^2_{x}\) limits \(u(s^{\pm })\) when \(t\rightarrow s^{\pm }\) with the jump relation
and \(t\mapsto \Vert u(t)\Vert _{{\text {L}}^2_{x}}^2\) has an absolutely continuous extension to each of \([s,\infty )\) and \((-\infty ,s]\) with estimate
where \(C(n,q_{1},r_{1})\) does not depend on s and \(\psi \). Eventually, on each of the intervals \((s,\infty )\) and \((-\infty ,s)\), the solution u is the restriction of a function in \({\dot{{\mathcal {V}}}}\).
Proof
The uniqueness follows from Theorem 2.27. For the existence, given \(s\in {\mathbb {R}}\), let \(T_{s}:{\text {L}}^2_{x} \rightarrow {\mathcal {D}}'({\mathbb {R}}^{n+1})\) be defined by
For any admissible pair (r, q), Theorem 2.28 applies to \({{\mathcal {H}}^*}^{-1}\) so that \({{\mathcal {H}}^*}^{-1}:({\dot{\Delta }}^{r,q})'\rightarrow {\text {C}}_{0}({\text {L}}^2_{x})\) is bounded and it follows that
In particular, \(u :=T_{s}\psi \) satisfies (2.29). It is no loss of generality to assume \(s=0\) as the general argument is the same (or can be deduced from \(s=0\) by a translation in time, which preserves all assumptions with uniform constants).
Step 1: u is a \({\dot{\Delta }}^{r_{1},q_{1}}\)-solution of (2.28). Let \((\varphi _{\varepsilon })\) be a mollifying sequence as in the proof of Lemma 2.22. We apply Theorem 2.29 to \(g_{\varepsilon } :=\varphi _{\varepsilon }\otimes \psi \in {\text {L}}^1_{t}{\text {L}}^2_{x}\). We test the equation for \(u_{\varepsilon } :=Tg_{\varepsilon }\) against a function \({{\tilde{\phi }}}\in {\mathcal {D}}({\mathbb {R}}^{n+1})\) and pass to the limit in this equation as \(\varepsilon \rightarrow 0\). First
where we used \({{\mathcal {H}}^*}^{-1}{{\tilde{\phi }}}\in {\text {C}}_{0}({\text {L}}^2_{x})\). Replacing \({{\tilde{\phi }}}\) by \({\partial _{t}{{\tilde{\phi }}}}\), we have that
and similarly
Eventually, we have seen that \(u_{\varepsilon }\) converges to \(u :=T_{0}\psi \) in \({\mathcal {D}}'({\mathbb {R}}^{n+1})\). The estimate (2.27) shows that \((u_{\varepsilon })\) is uniformly bounded in \({\dot{\Delta }}^{r_{1},q_{1}}\), which is a dual space (it is even reflexive). Hence, it converges also weakly-star in \({\dot{\Delta }}^{r_{1},q_{1}}\) to u and so Remark 2.12 shows that
This proves (2.28) and (2.29).
Step 2: Proof of (2.31). We can apply the integral equality (2.19) of Corollary 2.24, in which the right-hand side is \(-2{\text {Re}}\int _{\sigma }^\tau \langle {\mathcal {L}}u(t), u(t) \rangle \, \text {d}t\), when \(\sigma ,\tau \in (-\infty ,0)\) or \(\sigma ,\tau \in (0,\infty )\). Thus, letting \(\sigma \rightarrow -\infty \) in the first case or \(\tau \rightarrow \infty \) in the second case and using the estimate in Remark 2.12, we obtain
We conclude, using (2.29).
Step 3: Proof of (2.30). Let \({{\tilde{\phi }}}\in {\mathcal {D}}({\mathbb {R}}^{n+1})\) and \(\theta :{\mathbb {R}}\rightarrow {\mathbb {R}}\) even, smooth, 0 on [0, 1] and 1 on \([2,\infty )\). Set \(\theta _{\varepsilon }(t) :=\theta (t/\varepsilon )\) when \(\varepsilon >0\). Using \({{\tilde{\phi }}}\, \theta _{\varepsilon }\) as a test function for the equation,
Expanding the first term, we obtain
We now pass to the limit as \(\varepsilon \rightarrow 0\). Using again the duality between \({\dot{\Delta }}^{r_{1},q_{1}}\) and its pre-dual for the \({\text {L}}^2_{t}{\text {L}}^2_{x}\) duality, \(\langle \langle {\mathcal {L}}u,{{\tilde{\phi }}}\, \theta _{\varepsilon } \rangle \rangle \rightarrow \langle \langle {\mathcal {L}}u,{{\tilde{\phi }}} \rangle \rangle \) by dominated convergence. Similarly, \(\langle \langle u,(\partial _{t}{{\tilde{\phi }}})\, \theta _{\varepsilon } \rangle \rangle \rightarrow \langle \langle u,\partial _{t}{{\tilde{\phi }}} \rangle \rangle \). Hence, the left-hand side above converges to \(\langle \psi ,{{\tilde{\phi }}}(0) \rangle \). The right-hand side rewrites after a change of variable as
which, by dominated convergence and the existence of limits from the left and the right at 0 from Corollary 2.24, tends to
This proves (2.30).
Step 4: On the left and right of 0, u is a restriction of an element in \({\dot{{\mathcal {V}}}}\). It remains to see this last point. Consider w the even extension across \(t=0\) of the restriction of u to \((0,\infty )\times {\mathbb {R}}^n\). Using the same functions \({{\tilde{\phi }}}\) and \(\theta \) as above, we have
Since w and \(\theta _{\varepsilon }\) are even in t, the only contribution of \({{\tilde{\phi }}}\) is through its odd part \({{\tilde{\phi }}}_{o}(t) =\frac{1}{2}({{\tilde{\phi }}}(t)-{{\tilde{\phi }}}(-t))\) and
where \(\theta ^+_{\varepsilon }\) is the restriction of \(\theta _{\varepsilon }\) to \((0,\infty )\). The first term on the right-hand side equals \(2 \langle \langle {\mathcal {L}}u,{{\tilde{\phi }}}_{o}\, \theta ^+_{\varepsilon } \rangle \rangle \), which converges to \(2 \langle \langle {\mathcal {L}}u,{{\tilde{\phi }}}_{o} \rangle \rangle = \langle \langle v,{{\tilde{\phi }}} \rangle \rangle \), where v is the odd extension of \({\mathcal {L}}u \) restricted to \((0,\infty )\times {\mathbb {R}}^n\). It is clearly an element of \(({\dot{\Delta }}^{r_{1},q_{1}})'\). For the second term,
As \(\Vert u(t)\Vert _{{\text {L}}^2_{x}}\) is bounded and \(\Vert {{\tilde{\phi }}}_{o}(t)\Vert _{{\text {L}}^2_{x}} \le Ct\), we obtain a bound on the order of \(\varepsilon \) and this term tends to 0.
In conclusion, we proved \(\partial _t w \in ({\dot{\Delta }}^{r_{1},q_{1}})'\). Since \(w \in {\dot{\Delta }}^{r_{1},q_{1}}\subset {\text {L}}^2_{t}{\dot{{\text {H}}}}^1_{x}\), we obtain \(w \in {\dot{{\mathcal {V}}}}\) from Corollary 2.20 and since \(w|_{(0,\infty )} = u\), we are done.
The same argument can be done with the restriction of u to \((-\infty ,0)\times {\mathbb {R}}^n\). \(\square \)
2.7 Green operators
Theorem 2.30 can be used to construct Green operators for the parabolic equation and its adjoint. We borrow this terminology from [29].
Definition 2.31
Assume \(({\textbf{H}}_{{\textbf{0}}})\) and let \(s,t \in {\mathbb {R}}\) and \(\psi ,{{\tilde{\psi }}}\in {\text {L}}^2_x\).
-
(i)
For \(t\ne s\), define \(G(t,s)\psi \) as the value at time t in \({\text {L}}^2_{x}\) of the \({\dot{\Delta }}^{r_{1},q_{1}}\)-solution u of \(\partial _{t}u+{\mathcal {L}}u= \delta _{s}\otimes \psi \) in Theorem 2.30.
-
(ii)
For \(s\ne t\), define \({{\widetilde{G}}}(s,t){{\tilde{\psi }}}\) as the value at time s in \({\text {L}}^2_{x}\) of the \({\dot{\Delta }}^{r_{1},q_{1}}\)-solution \({{\tilde{u}}}\) of \(-\partial _{s}{{\tilde{u}}}+{\mathcal {L}}^*{{\tilde{u}}}= \delta _{t}\otimes {{\tilde{\psi }}}\) in Theorem 2.30.
The operators \(G(t,s)\) and \({{\widetilde{G}}}(s,t)\) are called the Green operators for the parabolic operator \(\partial _{t}+{\mathcal {L}}\) and the (adjoint) parabolic operator \(-\partial _{t}+{\mathcal {L}}^*\), respectively.
We recall that the orbit \(G(\cdot ,s)\psi \) was defined as \(T_s \psi \) in (2.32), which reads as the “double duality formula”
Indeed, \(G(\cdot ,s)\psi \) and \({{\mathcal {H}}^*}^{-1}{{\tilde{\phi }}}\) are solutions of parabolic problems for adjoint operators.
Rephrasing parts of Theorem 2.30 in terms of Green operators yields the following result.
Proposition 2.32
Assume (H\(_{{\textbf{0}}}\)).
-
(i)
Let \(s\in {\mathbb {R}}\) and \(\psi \in {\text {L}}^2_x\). The function \(t\mapsto G(t,s)\psi \) is in \({\text {C}}_{0}({\mathbb {R}}\setminus \{s\}; {\text {L}}^2_{x})\) and the following limits exist in \({\text {L}}^2_{x}\):
$$\begin{aligned} \Pi _{s}^{\pm } \psi :=\lim _{t\rightarrow s^{\pm }} G(t,s)\psi . \end{aligned}$$ -
(ii)
Let \(t \in {\mathbb {R}}\) and \({{\tilde{\psi }}}\in {\text {L}}^2_x\). The function \(s\mapsto {{\widetilde{G}}}(s,t){{\tilde{\psi }}}\) is in \({\text {C}}_{0}({\mathbb {R}}\setminus \{t\}; {\text {L}}^2_{x})\) and the following limits exist in \({\text {L}}^2_x\):
$$\begin{aligned} {{\widetilde{\Pi }}}_{t}^{\pm } {{\tilde{\psi }}}= \lim _{s\rightarrow t^{\pm }} {{\widetilde{G}}}(s,t){{\tilde{\psi }}}. \end{aligned}$$ -
(iii)
The operators \(G(t,s), {{\widetilde{G}}}(s,t)\) are uniformly bounded on \({\text {L}}^2_{x}\) with respect to (s, t) with \( t\ne s\).
Next, we list a number of properties involving the Green operators and their limits.
Theorem 2.33
Assume (H\(_{{\textbf{0}}}\)).
-
(i)
For each s,
$$\begin{aligned} \Pi _{s}^+ - \Pi _{s}^-={\text {Id}}, \quad {{\widetilde{\Pi }}}_{s}^+ - {{\widetilde{\Pi }}}_{s}^-=-{\text {Id}}\end{aligned}$$and \(\Pi _{s}^{\pm }\) and \({{\widetilde{\Pi }}}_{s}^{\mp }\) are adjoint operators, respectively.
-
(ii)
For \(s\ne t\), \(G(t,s)\) and \({{\widetilde{G}}}(s,t)\) are adjoint operators.
-
(iii)
If \(t>s\), then
$$\begin{aligned} \Pi _{t}^+G(t,s)&= G(t,s), \quad \Pi _{t}^-G(t,s) =0, \\ G(t,s)\Pi _{s}^+&= G(t,s), \quad G(t,s)\Pi _{s}^-=0. \end{aligned}$$ -
(iv)
If \(s>t\), then
$$\begin{aligned} \Pi _{t}^-G(t,s)&= G(t,s), \quad \Pi _{t}^+G(t,s)=0, \\ G(t,s)\Pi _{s}^-&= G(t,s), \quad G(t,s)\Pi _{s}^+=0. \end{aligned}$$ -
(v)
For s, r, t distinct reals with r between s and t,
$$\begin{aligned} G(t,s)=G(t,r)G(r,s). \end{aligned}$$
Remark 2.34
-
(i)
The reader familiar with parabolic problems expects causality, that is, \(G(t,s)=0\) when \(t<s\), and recovery of initial data, that is \(G(t,s)\psi \rightarrow \psi \) when \(t\rightarrow s^+\), meaning that \(\Pi _{s}^+={\text {Id}}\). At this stage however, there is yet no reason to expect these properties since all assumptions apply indifferently to the equation and its adjoint, going backward in time. In view of this, it is remarkable that we can prove the adjointness property (ii) and the Chapman–Kolmogorov formula (v) under the mere assumption \(({{\textbf {H}}}_{{{\textbf {0}}}})\).
-
(ii)
Properties (iii) and (iv) mean that the Green operators \(G(t,s)\) propagate the ranges of the limit operators \(\Pi _{t}^+\) when \(t>s\) and \(\Pi _{t}^-\) when \(t<s\) even though we do not have further information on the range and kernel of these operators at this point.
Proof
We proceed as follows.
Proof of (i). We may assume \(s=0\) as usual. The property \(\Pi _{0}^+ - \Pi _{0}^-={\text {Id}}\) is a rephrasing of the jump relation (2.30) and similarly \({{\widetilde{\Pi }}}_{0}^+ - {{\widetilde{\Pi }}}_{0}^-=-{\text {Id}}\), the negative sign coming from the fact that \(-\partial _{t}+{\mathcal {L}}^*\) is backward in time. Fix \(\psi ,{{\tilde{\psi }}}\in {\text {L}}^2_{x}\). We apply the integral identity of Corollary 2.25 to \(u:=G(\cdot , 0)\psi \) and \({{\tilde{u}}}:={{\widetilde{G}}}(\cdot , 0){{\tilde{\psi }}}\) in the intervals \((0,\infty )\) and \((-\infty ,0)\), knowing that the integrand vanishes almost everywhere in each interval and that \(\langle u(t),\tilde{u}(t) \rangle \rightarrow 0\) in the limit as \(t\rightarrow {\pm }\infty \). This gives us
that is,
The relations above yield
which shows that \((\Pi _{0}^+)^*={{\widetilde{\Pi }}}_{0}^-\). Since \(\Pi _{0}^+ - \Pi _{0}^-={\text {Id}}\), also \((\Pi _{0}^-)^*={{\widetilde{\Pi }}}_{0}^+\) follows.
Proof of (iii) and (ii) for \(t>s\). Fix \(\psi ,{{\tilde{\psi }}}\in {\text {L}}^2_{x}\). This time we can apply the integral identity of Corollary 2.25 to \(u:=G(\cdot , s)\psi \) and \({{\tilde{u}}}:={{\widetilde{G}}}(\cdot , t){{\tilde{\psi }}}\) in the intervals \((-\infty ,s)\), (s, t) and \((t,\infty )\), knowing that the integrand vanishes almost everywhere in each interval and that \(\langle u(\tau ),{{\tilde{u}}}(\tau ) \rangle \rightarrow 0\) in the limit as \(\tau \rightarrow {\pm }\infty \). We obtain
that is,
Subtracting the first and third equalities to the second one and using (i), we obtain
which proves (ii) in this case. From the adjoint relations in (i) we also see that \(\Pi _{t}^-G(t,s)\psi =0\) and \( G(t,s)\Pi _{s}^-\psi =0\). Again by (i) we conclude for the two missing relations in (iii).
Proof of (iv) and (ii) for \(s>t\). The argument is completely symmetric to the previous case.
Proof of (v). Let us first treat the case \(s<r<t\). Fix \(\psi \in {\text {L}}^2_{x}\). Let \(u:=G(\cdot , s)\psi \), \(v:=G(\cdot , r)(u(r))\) and define
Since the gluing procedure preserves \( {\dot{\Delta }}^{ r_{1}, q_{1}}\), the equality \(w=u\) follows by uniqueness provided we can show that w is a \({\dot{\Delta }}^{r_{1},q_{1}}\)-solution of
By translation we can assume \(r=0\) in order to simplify the exposition. Let \({{\tilde{\phi }}}\in {\mathcal {D}}'({\mathbb {R}}^{n+1})\). The argument is a reprise of the proof of the jump relation in Theorem 2.30. Using the same cut-off functions \(\theta _{\varepsilon }\) supported outside of \([-\varepsilon , \varepsilon ]\), we have, as before,
For the first two terms we see that \({{\tilde{\phi }}}\, \theta _{\varepsilon }\) is the sum of one test function \({{\tilde{\phi }}}_{+}\) supported in \((0,\infty )\times {\mathbb {R}}^n\) and another one \({{\tilde{\phi }}}_{-}\) supported in \((-\infty ,0)\times {\mathbb {R}}^n\). With the first one we use the equation for v and with the second the equation for u, so that we find
where the second step works provided \(\varepsilon \) is small enough in order to guarantee that \(\theta _{\varepsilon }(s)=1\). For the third term we can argue as in Step 3 of the proof of Theorem 2.30 to find
By definition and (iii), we have
so that altogether we have shown that
as desired. The proof when \(t<r<s\) is similar, using (iv) instead of (iii). \(\square \)
Remark 2.35
The adjointness property implies that \(s\mapsto G(t,s)\) is weakly continuous in \({\text {L}}^2_{x}\) on \({\mathbb {R}}\setminus \{t\}\). Similarly, \(t\mapsto {{\widetilde{G}}}(s,t)\) is weakly continuous in \({\text {L}}^2_{x}\) on \({\mathbb {R}}\setminus \{s\}\).
2.8 Representation with Green operators
In this section we shall detail how the Green operators can be seen as operator-valued Schwartz kernels for the inverse of \({\mathcal {H}}\). This illustrates nicely how we can re-discover objects of the classical theory for smooth coefficients as part of our ‘universal’ construction. While Proposition 2.36 is not used in other sections, a key ingredient in its proof implies representations for solutions (Theorem 2.38) that will be useful when we deal with Cauchy problems.
The operator-valued Schwartz kernels result is as follows.
Proposition 2.36
Assume (H\(_{{\textbf{0}}}\)). For any \(f,{{\tilde{f}}}\in {\mathcal {D}}({\mathbb {R}})\), \(\psi ,{{\tilde{\psi }}}\in {\mathcal {D}}({\mathbb {R}}^{n})\),
Before we come to the proof, let us interpret the result in the context of the Schwartz kernel representation [34]. Indeed, there exists a unique \(K\in {\mathcal {D}}'({\mathbb {R}}^{n+1}\times {\mathbb {R}}^{n+1})\) such that
We indicate the dummy variables for notational simplicity and the bracket with parentheses are the bilinear dualities. For example, building \({\mathcal {H}}\) from the heat operator \(\partial _{t}-\Delta \), we see that \(K_{t,x,s,y}\) can be identified with the heat kernel
Not all such operators may have kernels with pointwise bounds. In any case, we can also proceed by fixing \(\psi ,{{\tilde{\psi }}}\in {\mathcal {D}}({\mathbb {R}}^n)\) and looking at the bilinear map \((f,{{\tilde{f}}})\mapsto (( {\mathcal {H}}^{-1}(f\otimes \psi ),{{\tilde{f}}}\otimes {{\tilde{\psi }}} ))\) on \({\mathcal {D}}({\mathbb {R}})\times {\mathcal {D}}({\mathbb {R}})\). Again, the Schwartz kernel theorem provides a unique distribution \(K_{\psi ,{{\tilde{\psi }}}}\in {\mathcal {D}}'({\mathbb {R}}^2)\) such that
Thus, Theorem 2.36 establishes that \(K_{\psi ,{{\tilde{\psi }}},t,s}\) can be identified with a locally integrable function and we can set
the values on the diagonal being irrelevant. In particular, \(K_{\psi ,{{\tilde{\psi }}},t,s}\) agrees with a separately continuous function on \({\mathbb {R}}^2\setminus \{(t,t): t\in {\mathbb {R}}\}\) that vanishes if |s| or |t| tend to \(\infty \).
In order to prove Proposition 2.36, we begin with a pointwise variant for all \(t \in {\mathbb {R}}\).
Lemma 2.37
Assume \(({\textbf{H}}_{{\textbf{0}}})\). For any \(f\in {\mathcal {D}}({\mathbb {R}})\), \(t\in {\mathbb {R}}\) and \( \psi ,{{\tilde{\psi }}}\in {\mathcal {D}}({\mathbb {R}}^n)\),
Proof
In order to simplify the exposition, we assume \(t=0\). The general case follows by translation as usual.
Let \((\varphi _{\varepsilon })\) be a standard mollifying sequence in the t-variable. Since \({{\mathcal {H}}^*}^{-1}\) is the adjoint of \({\mathcal {H}}^{-1}\) and \(f \otimes \psi \), \(\varphi _{\varepsilon } \otimes {{\tilde{\psi }}}\) belong to \({\dot{{\mathcal {V}}}}\), we have
Since \(f \otimes \psi \) and \(\varphi _{\varepsilon } \otimes {{\tilde{\psi }}}\) are in \({\text {L}}^{1}_t {\text {L}}^{2}_x\), we know from Theorem 2.29 (and its proof) that \({\mathcal {H}}^{-1}(f \otimes \psi )\) and \({{\mathcal {H}}^*}^{-1}(\varphi _\varepsilon \otimes {{\tilde{\psi }}})\) belong to \( {\text {C}}_0({\text {L}}^2_x)\). Both duality pairings are given by absolutely convergent Lebesgue integrals and we can apply Fubini’s theorem on the right-hand side in order to write
For fixed s we use (2.34) in order to arrive at
Now, we pass to the limit as \(\varepsilon \rightarrow 0\) as follows.
Since \({\mathcal {H}}^{-1}(f\otimes \psi ) \in {\text {C}}_0({\text {L}}^2_x)\) as mentioned before, we see that the left-hand side of (2.38) tends to the left-hand side of (2.37) as \(\varepsilon \rightarrow 0\).
On the right-hand side we use the properties of the Green operators from Proposition 2.32. Since \(G(\cdot ,s)\psi \) is continuous with values in \({\text {L}}^2_x\) except at s, we have pointwise convergence
for all \(s \ne 0\). Moreover, \(G(\cdot ,s)\psi \) is bounded on \({\mathbb {R}}\setminus \{s\}\) with values in \({\text {L}}^2_x\) and the duality pairing on the right-hand side of (2.38) is another convergent Lebesgue integral that obeys the estimate
with C independent of s and \(\varepsilon \). Since f is integrable, the right-hand side of (2.38) tends to the right-hand side of (2.37) as \(\varepsilon \rightarrow 0\) by dominated convergence. \(\square \)
Proof of Proposition 2.36. We use the continuity of \(t\mapsto {\mathcal {H}}^{-1}(f\otimes \psi )(t)\) in \({\text {L}}^2_{x}\) to rewrite the left-hand side of (2.36) as a Lebesgue integral and use (2.37) for the integral in x in order to obtain
The remaining question is therefore whether the above iterated integral can be taken in the sense of Lebesgue on \({\mathbb {R}}^2\), so that we can use Fubini’s theorem to conclude the proof of (2.36).
Since \((s,t)\mapsto \langle G(t,s)\psi ,{{\tilde{\psi }}} \rangle \) is separately continuous on \({\mathbb {R}}^2\setminus \{(t,t)\, : \, t\in {\mathbb {R}}\}\), it is a (Borel) measurable function on \({\mathbb {R}}^2\setminus \{(t,t)\, : \, t\in {\mathbb {R}}\}\) and we can consider it as an almost everywhere defined measurable function on \({\mathbb {R}}^2\). Finally, uniform boundedness of the Green operators in \({\text {L}}^2_x\) yields
\(\square \)
Lemma 2.37 in turn implies a representation formula for solutions to equations with general right-hand side.
Theorem 2.38
Assume \(({\textbf{H}}_{{\textbf{0}}})\). Let (r, q) be an admissible pair or \((r,q)=(\infty , 2)\), \(F\in {\text {L}}^2_{t}{\text {L}}^2_{x}\), \(g\in {\text {L}}^{r'}_{t}{\text {L}}^{q'}_{ x}\), \(s\in {\mathbb {R}}\) and \(\psi \in {\text {L}}^2_{x}\). Then the \({\text {L}}^2_{x}\) value at time t of the unique \({\dot{\Delta }}^{r_{1},q_{1}}\)-solution u of
obtained by combining Theorems 2.28, 2.29 and 2.30, can be represented by the equality
when \(t\ne s\) for the first term and where the integrals are defined in the weak sense, that is,
for all \({{\tilde{\psi }}}\in {\text {L}}^2_{x}\).
Proof
We prove (2.42). By uniqueness, it suffices to consider the three terms in the right-hand side of the equation individually, assuming the other two vanish. Fix \(t\in {\mathbb {R}}\).
For the term involving \(\psi \), this is the definition of \(G(t,s)\psi \) if \(t\ne s\).
For the term involving g, Lemma 2.37 and the adjoint relation \({{\widetilde{G}}}(\sigma ,t)= G(t,\sigma )^*\) yield the result when \({{\tilde{\psi }}}\) is a test function and \(g = f \otimes \psi \) a tensor product of test functions (or a linear combinations of such tensor products). Hence, \({{\tilde{\psi }}}\) and g already describe dense subsets of \({\text {L}}^2_x\) and \({\text {L}}^{r'}_{t}{\text {L}}^{q'}_{ x}\), respectively. Consider \(g\in {\text {L}}^{r'}_{t}{\text {L}}^{q'}_{ x}\) and \({{\tilde{\psi }}}\in {\text {L}}^2_{x}\). If (r, q) is admissible, then (2.22) in Theorem 2.28 and Cauchy–Schwarz yield
and by (2.29) for the adjoint equation,
If \((r,q)=(\infty ,2)\), then Theorem 2.29 yields
and by (2.31) for the adjoint equation,
Under either assumption one can thus pass to the limit by density in (2.37), and (2.42) is proved in this case.
For the term involving F the proof is analogous. As before, for \(F\in {\text {L}}^2_{t}{\text {L}}^2_{x}\) and \({{\tilde{\psi }}}\in {\text {L}}^2_{x}\),
and by construction of \({{\widetilde{G}}}\) and Theorem 2.30, \(\nabla {{\widetilde{G}}}(\cdot ,t){{\tilde{\psi }}}\in {\text {L}}^2_{t}{\text {L}}^2_{x}\) and
Hence, the integral involving F on the right-hand side of (2.42) is defined and, by density, it remains to check that it agrees with \(-\langle {\mathcal {H}}^{-1}( {\text {div}}F)(t),{{\tilde{\psi }}} \rangle \) when \({{\tilde{\psi }}}\) is a test function and F is a tensor product \(f\otimes \psi \) of test functions. But in this case we have \({\text {div}}F= f\otimes {\text {div}}\psi \) and Lemma 2.37 along with the adjointness relations for \(G\) yields
as required. \(\square \)
Remark 2.39
We draw the reader’s attention to the following regularity result that is implicit from the equality proved in Theorem 2.38. The integral involving g is continuous in \({\text {L}}^2_{x}\) as a function of t, while its definition merely yields continuity for the weak \({\text {L}}^2_{x}\) topology. The same comment applies to the integral involving F.
2.9 Invertibility, causality
So far, we have not addressed sufficient conditions for invertibility of \({\mathcal {H}}\) and the question of causality. The true use of the space \({\dot{{\mathcal {V}}}}\) will become transparent here. The first two results require a smallness assumption on the lower order terms, hence are of perturbative nature from the purely second order case. The third one is non-perturbative and uses lower bounds.
At this stage, we eventually impose ellipticity to A in the sense of Gårding : for some \(\lambda >0\) we assume that for all \(u \in {\dot{{\mathcal {V}}}}\),
Note that this is equivalent to having for almost every t and every \(w\in {\dot{{\text {H}}}}^1({\mathbb {R}}^n)\) that
This lower bound would also be the one to assume for systems. When A(t) is a matrix with real measurable entries with respect to x, this is also equivalent to a pointwise lower bound for A(t, x), see [36]. However, this last observation is not valid for complex matrices or systems.
Theorem 2.40
(Invertibility) Assume that A is elliptic and bounded with parameters \(\lambda ,\Lambda \) as in (2.4), (2.43). There is \(\varepsilon _{0}>0\) small enough depending on \(\lambda , \Lambda , n, q_{1}, r_{1}\) such that \(P_{{{\tilde{r}}}_{1},{{\tilde{q}}}_{1}}\le \varepsilon _{0}\) implies that \({\mathcal {H}}\) is invertible.
Proof
For the sesquilinear form \(\beta \) corresponding to the lower order coefficients, we have seen in Lemma 2.7 that
By the embedding \({\dot{{\mathcal {V}}}}\hookrightarrow {\dot{\Delta }}^{ r_{1}, q_{1}}\), we have with \(C= C(n, r_{1}, q_{1})\),
Next, we let
be of the same type as \({\mathcal {H}}\) but without lower order terms. We use the classical hidden coercivity inequality
where \(\delta >0\) and \({\text {H}}_{t}\) is the Hilbert transform in the t-variable with symbol \(i\tau /|\tau |\), see [24]. This inequality follows from the factorization \(\partial _{t}= {\text {D}}_{t}^{1/2}{\text {H}}_{t}{\text {D}}_{t}^{1/2}\), so that, using also the skew-adjointness of the Hilbert transform and commutation,
Altogether, we get
As \(\Vert u\Vert _{{\dot{{\mathcal {V}}}}}^2= \Vert {\text {D}}_{t}^{1/2}\!u\Vert _{{\text {L}}^2_{t}{\text {L}}^2_{x}}^2 + \Vert \nabla u\Vert _{{\text {L}}^2_{t}{\text {L}}^2_{x}}^2\), we can now set \(\delta :=\lambda /(1+\Lambda )\) and define \(\varepsilon _0\) through \(C\varepsilon _{0} \sqrt{1+\delta ^2}= \delta /2\) in order to conclude that \(P_{\tilde{r}_{1},{{\tilde{q}}}_{1}}\le \varepsilon _{0}\) implies that
It follows from the Lax–Milgram lemma that \(({\text {Id}}+\delta {\text {H}}_{t})^*{\mathcal {H}}\) is invertible from \({\dot{{\mathcal {V}}}}\) to \({\dot{{\mathcal {V}}}}'\). As \({\text {Id}}+\delta {\text {H}}_{t}\) is also invertible on \({\dot{{\mathcal {V}}}}\) and its dual, this proves the invertibility of \({\mathcal {H}}\). \(\square \)
Theorem 2.41
(Causality) Assume that A is elliptic and bounded with parameters \(\lambda ,\Lambda \) as in (2.4), (2.43). There is \(\varepsilon _{0}>0\) small enough depending on \(\lambda , \Lambda , n, q_{1}, r_{1}\) such that \(P_{{{\tilde{r}}}_{1},{{\tilde{q}}}_{1}}\le \varepsilon _{0}\) implies that \({\mathcal {H}}\) is causal in the following sense:
-
(i)
If u is a \({\dot{\Delta }}^{r_{1},q_{1}}\)-solution of \(\partial _{t}u+{\mathcal {L}}u=-{\text {div}}F+g\) as in Theorems 2.28 or 2.29 or combination of both, and if F, g vanish on \((-\infty , s)\times {\mathbb {R}}^n\) for some \(s\in {\mathbb {R}}\), then \(u=0\) in \((-\infty ,s]\times {\mathbb {R}}^n\).
-
(ii)
If u is a \({\dot{\Delta }}^{r_{1},q_{1}}\)-solution of \(\partial _{t}u+{\mathcal {L}}u=\delta _{s}\otimes \psi \) as in Theorem 2.30, then \(u=0\) in \((-\infty ,s)\times {\mathbb {R}}^n\).
Proof
We begin with the proof of (i). We know that \(u\in {\text {C}}_{0}({\text {L}}^2_{x})\). As usual, we may assume \(s=0\) to simplify the exposition. We let \(S:=\sup _{t \le 0}\Vert u(t)\Vert _{{\text {L}}^2_{x}}^2\). The integral identities of Lemma 2.22 apply to u and for \( \sigma \le \tau \le 0\) we obtain
since F, g vanish in this range. We send \(\sigma \rightarrow -\infty \) and take some \(\tau \le 0\) at which the supremum S is attained. Then, we have
where \(I :=\int _{-\infty }^\tau \Vert \nabla u(t)\Vert _{{\text {L}}^2_{x}}^2\, \text {d}t. \) The standard mixed estimates, taking into account integration on \((-\infty ,\tau )\) in t, give us
see Sect. 5 for a very quick proof. Using the pair \(({{\tilde{r}}}_{1}, {{\tilde{q}}}_{1})\) for the lower order coefficients and Hölder’s inequality as in the proof of Lemma 2.7, we get
Altogether,
where the second step is by Young’s inequality, keeping in mind that \(2\le r_{1}< \infty \). If \(P_{{{\tilde{r}}}_{1},{{\tilde{q}}}_{1}}\le \varepsilon _{0}\) is small enough, then we can hide the contribution of S on the left and obtain that \(S\le 0\). Hence, \(u(t)=0\) for all \(t\le 0\).
For the proof of (ii), we know that \(u\in {\text {C}}_{0}(-\infty ,s; {\text {L}}^2_{x})\) with \({\text {L}}^2_{x}\) limit when \(t\rightarrow s^-\). In particular, we can argue with \(S= \sup _{t \le s}\Vert u(t)\Vert _{{\text {L}}^2_{x}}^2\) where u(s) means \(u(s^-)\) and the proof is the same. \(\square \)
We turn to lower bounds assumptions. We assume \(P_{{{\tilde{r}}}_{1}, {{\tilde{q}}}_{1}}\) finite but not necessarily small. Here, we do not explicitly need the lower bound (2.43) on A but it is hidden in checking the assumptions.
Theorem 2.42
(Invertibility through lower bounds)
-
(i)
Assume that there exists \(c>0\) such that
$$\begin{aligned}{\text {Re}}\langle \langle {\mathcal {L}}u,u \rangle \rangle \ge c \Vert \nabla u\Vert _{{\text {L}}^2_{t}{\text {L}}^2_{x}}^2\end{aligned}$$for all \(u\in {\dot{{\mathcal {V}}}}\). Then \({\mathcal {H}}\) is invertible.
-
(ii)
Assume that
$$\begin{aligned} {\text {Re}}\langle {\mathcal {L}}w,w \rangle \ge 0\end{aligned}$$almost everywhere for all \(w\in {\dot{{\text {H}}}}^1({\mathbb {R}}^n)\). Then \({\mathcal {H}}\) is causal in the sense of Theorem 2.41.
Proof
To prove (i), arguing as before and using the assumption, we have
with \(C :=\Vert A\Vert _{\infty } + P_{ {{\tilde{r}}}_{1}, \tilde{q}_{1}}.\)
When \(r_{1}=2\) (hence \(n\ge 3\) and \(q_{1}=\frac{2n}{n-2}\)), the Sobolev inequality gives us
Since the Hilbert transform is isometric on \({\text {L}}^2\) and commutes with the gradient, we see that if \(\delta >0\) is so small that \(c-c(n,q_{1},r_{1})C\delta >0\), then \({\mathcal {H}}\) is invertible.
When \(r_{1}>2\), we refine the first embedding of Lemma 2.3, by replacing (2.1) with
for \(\varepsilon >0\). As \(\Vert (-\Delta )^{1 /2} \varphi \Vert _{{\text {L}}^2_{t}{\text {L}}^2_{x}}= \Vert \nabla \varphi \Vert _{{\text {L}}^2_{t}{\text {L}}^2_{x}}\), we get
Using that \({\text {H}}_{t}\) is isometric on \({\dot{{\mathcal {V}}}}\), we obtain
with \(C'=c'(n,q_{1},r_{1})C\). One chooses first \(\varepsilon \) with \(0<C' \varepsilon ^{1/\theta } \theta <1\) and then \(\delta \) with \( 0<C'((1-\theta ) \varepsilon ^{-1/(1-\theta )}+1)\delta < c\). This yields the desired invertibility for \({\mathcal {H}}\).
The proof of (ii) is easy. Let u be a \({\dot{\Delta }}^{r_{1},q_{1}}\)-solution of \(\partial _{t}u+{\mathcal {L}}u=\delta _{s}\otimes \psi -{\text {div}}F+g\). Then we know that for \(\tau <s\),
We conclude right away that \(u(\tau )=0\). \(\square \)
Remark 2.43
In practice, the hypothesis in (i) follows from elliptic inequalities of the form \({\text {Re}}\langle \beta w,w \rangle \le (1-\gamma ){\text {Re}}\langle A\nabla w,\nabla w \rangle \) with \(\gamma <1\), when there is a lower bound \(\lambda >0\) for A as in (2.43). It is not so much the smallness of \(P_{\tilde{r}_{1},{{\tilde{q}}}_{1}}\) that matters (although its size can be used in proofs).
We obtain as a corollary the further identities for the Green operators that have been mentioned in Remark 2.34 (i) earlier on.
Corollary 2.44
If the previous results on invertibility and causality under smallness assumptions or lower bounds hold, then \(G(t,s)=0\) if \(t<s\) and \(G(t,s)\rightarrow I\) strongly as \(t\rightarrow s^+\).
Proof
In both cases, the solution u of (2.28) is given by \(G(\cdot ,s)\psi \), \(t\ne s\). Thus, \(G(t,s)=0\) if \(t<s\) and \(\lim _{t\rightarrow s^-}G(t,s)\psi =0\). Hence, \(\lim _{t\rightarrow s^+}G(t,s)\psi =\psi \) follows from (i) in Theorem 2.33. \(\square \)
2.10 Inhomogeneous assumptions on lower order terms
So far, we have put ourselves in the situation where the lower order terms bring a contribution that is homogeneous to the gradient in \({\text {L}}^2\). However, if the size of this contribution is not small enough, then invertibility of \({\mathcal {H}}\) is not clear and we have to consider inhomogeneous assumptions by adding a positive constant.
We define the inhomogeneous versions of \({\dot{{\mathcal {V}}}}\) and \({\dot{\Delta }}^{r, q}\). We set \( {\mathcal {V}}:={\dot{{\mathcal {V}}}}\cap {\text {L}}^2_{t}{\text {L}}^2_{x}\) with norm
and \(\Delta ^{r, q}: ={\dot{\Delta }}^{r, q} \cap {\text {L}}^2_{t}{\text {L}}^2_{x}= {\text {L}}^2_{t}{\text {H}}^{1}_{x} \cap {\text {L}}^{r}_{t}{\text {L}}^{q}_{ x}\) with norm
The continuous inclusion \({\mathcal {V}}\hookrightarrow \Delta ^{r, q}\) for admissible pairs follows from Lemma 2.3. For admissible pairs, we still miss the extreme cases \(r=\infty \) that one obtains when \({\text {L}}^{\infty }_{t}{\text {L}}^2_{x}\) replaces \({\text {H}}^{1/2}_{t}{\text {L}}^2_{x}\), and \(q=\infty \). The descriptions of the dual or pre-dual of \( \Delta ^{r, q}\) are similar.
We may as well enlarge the class of coefficients and assume from now on that
with \(({{\tilde{r}}}_{1},{{\tilde{q}}}_{1})\) being a compatible pair for lower order coefficients, as in Sect. 2.6. Recall that this means \(\frac{1}{{{\tilde{r}}}_{1}}+\frac{n}{2\tilde{q}_{1}}=1\) with \(({{\tilde{r}}}_{1},{{\tilde{q}}}_{1})\in (1,\infty ]^2\).
Remark 2.45
(Subcritical exponents) Let \(({{\tilde{r}}}, {{\tilde{q}}})\in [1,\infty ]^2\) satisfy the subcritical compatibility relation \(\frac{1}{{{\tilde{r}}}}+\frac{n}{2{{\tilde{q}}}}<1\). Coefficients with
have such a decomposition when, in addition,
Indeed, this is immediately seen from visualizing exponents in a \((\frac{1}{{{\tilde{r}}}}, \frac{1}{{{\tilde{q}}}})\)-plane and truncating coefficients at a fixed height. In [4], subcritical compatibility is assumed because the goal is to deal with bounded solutions. The same condition appears for Cauchy problems in [28]. See also [26].
We can define
and
where \({\mathcal {V}}'\) is the dual of \({\mathcal {V}}\) with respect to \({\text {L}}^2_{t}{\text {L}}^2_{x}\) duality. We use the same notation \({\mathcal {H}}\) as before, although \({\mathcal {V}}\) now is a smaller space. These operators are bounded and adjoint to one another. Instead of \(({{\textbf {H}}}_{{{\textbf {0}}}})\) we now work under the hypothesis that
Then we naturally work with \({\mathcal {H}}+\kappa \), which means that we add the assumption \(u\in {\text {L}}^2_{t}{\text {L}}^2_{x}\) in most statements. Again the estimates will depend on \(\Lambda =\Vert A\Vert _{\infty }\), the bound implicit in (2.45), with fixed compatible pair \(({{\tilde{r}}}_{1},\tilde{q}_{1})\) for lower order coefficients, and the norm of the inverse of \({\mathcal {H}}+\kappa \). Here is a description of changes in Sects. 2.1–2.9:
-
(i)
In the modification of Corollary 2.20 we assume \(u\in {\text {L}}^2_{t}{\text {H}}^1_{x}\) and \(\partial _{t}u\in {\text {L}}^2_{t}{\text {H}}^{-1}_{x}+ {\text {L}}^{r'}_{t}{\text {L}}^{q'}_{ x}+{\text {L}}^2_{t}{\text {L}}^2_{x}\). We obtain \(u\in {\mathcal {V}}\cap {\text {C}}_{0}({\text {L}}^2_{x})\) when (r, q) is an admissible pair or just \(u\in {\text {C}}_{0}({\text {L}}^2_{x})\) when \((r,q)=(\infty ,2)\). The proofs of the technical lemmas use estimates for the operator \(\partial _{t}- \Delta +1\). (Note that we still cannot use the Lions embedding theorem to deduce continuity because we do not, and do not want to, assume that \(u\in {\mathcal {V}}\).)
-
(ii)
In Lemma 2.22, we assume \(u\in {\text {L}}^2_{t}{\text {H}}^1_{x}\) and \(\partial _{t}u= -{\text {div}}F+ g+ h\), where \(F\in {\text {L}}^2_{t}{\text {L}}^2_{x}\), \(g\in {\text {L}}^{r'}_{t}{\text {L}}^{q'}_{ x} \) with (r, q) an admissible pair or \((r,q)=(\infty ,2)\) and \(h\in {\text {L}}^2_{t}{\text {L}}^2_{x}\). Then the conclusion is the same (without a constant) with the extra term \(\langle h(t),u(t) \rangle \) in (2.17). The statements that follow in the same section are adapted similarly.
-
(iii)
In all statements of Sect. 2.6, Assumption \(({{\textbf {H}}}_{{{\textbf {0}}}})\) is replaced by \(({{\textbf {H}}}_{\kappa })\) and the equation to solve is for the operator \(\partial _{t}+{\mathcal {L}}+\kappa \) in the sense of \(\Delta ^{r_{1},q_{1}}\)-solutions, which are defined by changing \({\dot{\Delta }}^{r,q}\) to \(\Delta ^{r,q}\) in Definition 2.13. Such solutions belong to \({\text {C}}_{0}({\text {L}}^2_{x})\). Uniqueness is in that class, and existence theorems through the inverse of \({\mathcal {H}}+\kappa \) or its adjoint are proved for this operator with possible addition of an extra term \(h\in {\text {L}}^2_{t}{\text {L}}^2_{x}\) in Theorem 2.28.
-
(iv)
In Sect. 2.7, if we assume \(({{\textbf {H}}}_{\kappa })\) instead of \(({{\textbf {H}}}_{{{\textbf {0}}}})\), we obtain exactly the same statements for the Green operators \(G_{\kappa }(t,s)\) and \({{\widetilde{G}}}_{\kappa }(s,t)\) of \(\partial _{t}+{\mathcal {L}}+\kappa \) and \(-\partial _{t}+{\mathcal {L}}^*+\kappa \), respectively. In particular, they are uniformly bounded operators on \({\text {L}}^2_{x}\) provided \(t\ne s\).
-
(v)
Section 2.8 can be adapted mutatis mutandis with the Green operators \(G_{\kappa }\) under \(({{\textbf {H}}}_{\kappa })\). In the statement corresponding to Theorem 2.38, one can add again an extra forcing term \(h\in {\text {L}}^2_{t}{\text {L}}^2_{x}\).
In order to check invertibility and causality, we introduce the following property on the lower order coefficients, where \(\varepsilon \ge 0\) will be chosen appropriately small later.
Assumption \(({{\textbf {D}}}_{\varepsilon })\). For some compatible pair \(({{\tilde{r}}}_{1},{{\tilde{q}}}_{1})\) for lower order coefficients, one can find a decomposition
with
Remark 2.46
By truncation at large height one can always do such a decomposition for any \(\varepsilon > 0\) starting from \(|{{\textbf {a}}}|^2+|{{\textbf {b}}}|^2+|a|\in {\text {L}}^{{{\tilde{r}}}_{1}}_{t}{\text {L}}^{{{\tilde{q}}}_{1}}_{ x}+ {\text {L}}^\infty _{t}{\text {L}}^\infty _{x}\), except if \({{\tilde{r}}}_{1}=\infty \). In this case, one needs further assumptions, such as that the part in \({\text {L}}^\infty _{t}{\text {L}}^{{{\tilde{q}}}_{1}}_{x}\) is uniformly continuous in time. For example, independence of time is a valid hypothesis. In other words, \(({{\textbf {D}}}_{\varepsilon })\) always holds for all \(\varepsilon >0\) except when \({{\tilde{r}}}_{1}=\infty \).
The quantities \(P_{{{\tilde{r}}}_{1},{{\tilde{q}}}_{1}}, P_{\infty }\) turn out to quantify nicely some estimates.
Theorem 2.47
(Invertibility, inhomogeneous case) Assume that A has bounds \(\lambda ,\Lambda \) as in (2.4), (2.43) and that (D\(_{\varepsilon }\)) holds for \(\varepsilon >0\) small enough. There exists \(\kappa _{0}>0\) large enough so that \({\mathcal {H}}+\kappa \) is invertible from \({\mathcal {V}}\) onto \({\mathcal {V}}'\) for any \(\kappa \ge \kappa _{0}\). Here, \(\varepsilon ,\kappa _{0}\) and the lower bound depend on \(\lambda ,\Lambda , n, q_{1},r_{1}\) and \(\kappa _{0}\) depends additionally on \(P_{\infty }\).
Proof
As in the proof of Theorem 2.40 we may write with obvious notation
so that
and for the other term \(|\langle \langle \beta _{\infty }u,v \rangle \rangle | \) we have a simple bound by
Hence, with \(\delta >0\) and \(\varepsilon =\varepsilon _{0}\) chosen as before, we have
By Young’s inequality \(ab \le (1/4\gamma )a^2+ \gamma b^2\) with \(\gamma =\delta /4\), we see that
Hence, for, say, \(\kappa _{0} :=\frac{\delta }{4}+ P_{\infty }^2\big (\frac{ (1+\delta ^2)}{\delta }+ \sqrt{1+\delta ^2}\big )\) and \(\kappa \ge \kappa _{0}\), we get
and invertibility follows. \(\square \)
Remark 2.48
The proof shows that under the assumptions of Theorem 2.40, which corresponds to \(P_{\tilde{r}_{1},{{\tilde{q}}}_{1}}\le \varepsilon _{0}\) and \(P_{\infty }=0\), that \({\mathcal {H}}+\kappa :{\mathcal {V}}\rightarrow {\mathcal {V}}'\) is invertible for all \(\kappa >0\).
For causality, Theorem 2.41 becomes the following result.
Theorem 2.49
(Causality, inhomogeneous case) With the assumptions of Theorem 2.47 there exists \(\kappa _{0}> 0\) such that \({\mathcal {H}}+\kappa \) is causal for \(\kappa \ge \kappa _{0}\) in the following sense:
-
(i)
If u is a \(\Delta ^{r_{1},q_{1}}\)-solution of \(\partial _{t}u+{\mathcal {L}}u+\kappa u=-{\text {div}}F+g+h\) as in the modifications of Theorems 2.28 or 2.29 (here, \(h\in {\text {L}}^2_{t}{\text {L}}^2_{x}\), while \(g\in {\text {L}}^{r'}_{t}{\text {L}}^{q'}_{ x}\)) and F, g, h vanish on \((-\infty , s)\times {\mathbb {R}}^n\) for some \(s\in {\mathbb {R}}\), then \(u=0\) identically on \((-\infty ,s]\times {\mathbb {R}}^n\).
-
(ii)
If u is a \(\Delta ^{r_{1},q_{1}}\)-solution of \(\partial _{t}u+{\mathcal {L}}u+\kappa u=\delta _{s}\otimes \psi \) as in the modification of Theorem 2.30, then \(u=0\) identically on \((-\infty ,s)\times {\mathbb {R}}^n\).
Proof
It begins as the proof of Theorem 2.41. In the first case, as \(u\in {\text {C}}_{0}({\text {L}}^2_{x})\), we fix \(s=0\) to simplify the exposition, let \(S :=\sup _{t \le 0}\Vert u(t)\Vert _{{\text {L}}^2_{x}}^2\), which is attained at some \(\tau \), and obtain
where \(I :=\int _{-\infty }^\tau \Vert \nabla u(t)\Vert _{{\text {L}}^2_{x}}^2\, \text {d}t. \) The treatment of the term \(\beta _{0}\) is as before: we have
and Young’s inequality allows us to hide the contribution coming from S on the left-hand side up to loosing a little on \(-2\lambda I\) by choosing \(\varepsilon \) small enough. Next the contribution of \(\beta _{\infty }\) is
and Young’s inequality yields again a contribution of \(\int _{-\infty }^\tau \Vert u(t)\Vert _{{\text {L}}^2_{x}}^2\, \text {d}t\) that is compensated if \(\kappa \) is larger than a constant times \(P_{\infty }^2\), up to loosing again a little on \(-2\lambda I\). We obtain \(S\le 0\), and thus \(u(t)=0\) for \(t\le 0\).
In the second case, we know that \(u\in {\text {C}}_{0}(-\infty ,s; {\text {L}}^2_{x})\) with \({\text {L}}^2_{x}\) limit when \(t\rightarrow s^-\). In particular, we can argue with \(S= \sup _{t \le s}\Vert u(t)\Vert _{{\text {L}}^2_{x}}^2\), where u(s) means \(u(s^-)\), and the proof is the same. \(\square \)
Finally the result with lower bounds (Theorem 2.42) becomes the following and we skip the easy adaptation of the proof. Again, a lower bound on A is implicit in order to check the assumptions.
Theorem 2.50
(Invertibility through lower bounds, inhomogeneous case)
-
(i)
Assume that there exists \(c,c'>0\) such that
$$\begin{aligned} {\text {Re}}\langle \langle {\mathcal {L}}u,u \rangle \rangle \ge c \Vert \nabla u\Vert _{{\text {L}}^2_{t}{\text {L}}^2_{x}}^2- c'\Vert u\Vert _{{\text {L}}^2_{t}{\text {L}}^2_{x}}^2 \end{aligned}$$for all \(u\in {\mathcal {V}}\). Then \({\mathcal {H}}+\kappa \) is invertible for all \(\kappa \ge \kappa _{1}\) with \(\kappa _{1}(\Lambda , n, q_{1}, r_{1}, c, c')>0\) large enough.
-
(ii)
Assume that
$$\begin{aligned}{\text {Re}}\langle {\mathcal {L}}w,w \rangle \ge - c'\Vert w\Vert _{{\text {L}}^2_{x}}^2\end{aligned}$$almost everywhere for some \(c'>0\) and for all \(w\in {\text {H}}^1({\mathbb {R}}^n)\). Then \({\mathcal {H}}+ \kappa \) is causal in the sense of Theorem 2.49 for any \(\kappa \ge c'\).
Corollary 2.51
Under either smallness assumption (Theorem 2.49) or lower bounds (Theorem 2.50 (ii)) it follows for all \(\kappa \) large enough that \(G_{\kappa }(t,s)=0\) if \(t<s\) and \(G_{\kappa }(t,s)\rightarrow I\) strongly if \(t\rightarrow s^+\).
Proof
The \(\Delta ^{r_1,q_1}\)-solution u of \(\partial _{t}u+{\mathcal {L}}u+\kappa u=\delta _{s}\otimes \psi \) is given by \(G_{\kappa }(t,s)\psi \) for \(t\ne s\). Thus, \(G_{\kappa }(t,s)=0\) if \(t<s\) and \(\lim _{t\rightarrow s^-}G_{\kappa }(t,s)\psi =0\). Hence, \(\lim _{t\rightarrow s^+}G_{\kappa }(t,s)\psi =\psi \) follows from (i) in (the modification of) Theorem 2.33. \(\square \)
2.11 The Cauchy problem and the fundamental solution operator
We consider in fine the Cauchy problem on the strip \([0,T]\times {\mathbb {R}}^n\) with \(0<T<\infty \). (We shall say a word concerning \(T=\infty \) in Remark 2.55). It is of course sufficient to consider coefficients only on this strip and the foregoing results will allow us to work under the following set of assumptions.
-
(A1)
\({\mathcal {L}}\) is given as in (2.3) with coefficients \(A,{{\textbf {a}}}, {{\textbf {b}}}, a\) defined almost everywhere in \((0,T)\times {\mathbb {R}}^n\).
-
(A2)
A has bounds \(\lambda ,\Lambda \) as in (2.4), (2.43) for almost every \(t\in (0,T)\).
-
(A3)
The lower order coefficients satisfy \(({{\textbf {D}}}_{\varepsilon })\) on \((0,T)\times {\mathbb {R}}^n\) for all \(\varepsilon \) small enough with compatible pair \((\tilde{r}_{1},{{\tilde{q}}}_{1})\) as in Definition 2.8.
Recall that we take coefficients with \(|{{\textbf {a}}}|^2+|{{\textbf {b}}}|^2+|a|\in {\text {L}}^{{{\tilde{r}}}_{1}}_{t}{\text {L}}^{{{\tilde{q}}}_{1}}_{ x}+ {\text {L}}^\infty _{t}{\text {L}}^\infty _{x}\) in \((0,T)\times {\mathbb {R}}^n\). As in the case of \({\mathbb {R}}^{n+1}\), (A3) automatically holds except if \({{\tilde{r}}}_{1}=\infty \) (hence \(n\ge 3\) and \({{\tilde{q}}}_{1}=n/2\)), in which case we can proceed by imposing it or by assuming (uniform) t-continuity on [0, T] instead of mere boundedness, valued in \({\text {L}}^{n/2}_{x}\), compare with Remark 2.46. An alternative in the case \(({{\tilde{r}}}_1, {{\tilde{q}}}_1) = (\infty , n/2)\) is to replace (A3) by the following condition.
- (A3)’:
-
\({{\tilde{r}}}_{1}=\infty \) and there exist \(c>0\) and \(c'\ge 0\) such that
$$\begin{aligned} {\text {Re}}\langle {\mathcal {L}}v,v \rangle \ge c \Vert \nabla v\Vert _{{\text {L}}^2_{x}}^2- c'\Vert v\Vert _{{\text {L}}^2_{x}}^2 \end{aligned}$$for almost every \(t\in (0,T)\) and all \(v \in {\mathcal {V}}\).
Let \((r_{1},q_{1})\) be the admissible conjugate pair to \((\tilde{r}_{1},{{\tilde{q}}}_{1})\) defined in (2.7).
For \(\psi \in {\text {L}}^2_{x}\), \(F\in {\text {L}}^2(0,T; {\text {L}}^2_{x})\), \(g\in {\text {L}}^{r'}(0,T; {\text {L}}^{q'}_{x})\), where (r, q) is an arbitrary admissible pair as in Definition 2.2 or \((r,q)=(\infty ,2)\), and \(h\in {\text {L}}^2(0,T; {\text {L}}^2_{x})\), the Cauchy problem with initial condition \(\psi \) and forcing terms \(-{\text {div}}F+g+h\) consists of finding
solving
in the sense that the first equation is satisfied weakly against test functions \({{\tilde{\phi }}}\in {\mathcal {D}}((0,T)\times {\mathbb {R}}^n)\) as in (2.12) and that the second equation means \(u(t,\cdot ) \rightarrow \psi \) in \({\mathcal {D}}'({\mathbb {R}}^n)\) as \(t\rightarrow 0\). Weak solutions for the Cauchy problem (2.49) on \([0,T]\times {\mathbb {R}}^n\) are those solutions in the class \({\text {L}}^2(0,T; {\text {H}}^1({\mathbb {R}}^n)) \cap {\text {L}}^{\infty }(0,T; {\text {L}}^{2}({\mathbb {R}}^n))\). This space embeds into \( {\text {L}}^{r_{1}}(0,T; {\text {L}}^{q_{1}}({\mathbb {R}}^n))\), see Proposition 5.1 for a quick proof. So we have defined an a priori larger class of solutions. We next show that in fact a \({\dot{\Delta }}^{r_{1},q_{1}}\)-solution is continuous in time valued in \({\text {L}}^2_{x}\), so that in the end it is a weak solution. This continuity can be obtained as a regularity result in the inhomogeneous variational setting.
Lemma 2.52
Let \({\text {I}}=(0,T)\) and \( u\in {\text {L}}^2({\text {I}};{\text {H}}^1_{x})\) with \(\partial _{t}u= -{\text {div}}F+ g+h\), where \(F\in {\text {L}}^2({\text {I}};{\text {L}}^2_{x})\), \(g\in {\text {L}}^{r'}({\text {I}};{\text {L}}^{q'}_{x} )\) for (r, q) an admissible pair or \((r,q)=(\infty ,2)\), and \(h\in {\text {L}}^2({\text {I}};{\text {L}}^2_{x})\). Then \(u\in {\text {C}}([0,T]; {\text {L}}^2_{x})\) and we have the integral equalities (2.17) for \(0 \le \sigma < \tau \le T\).
Proof
According to the discussion (ii) in Sect. 2.10, the result holds when \({\text {I}}=(0,\infty )\). We proceed as in Corollary 2.24 by constructing an extension v of u to which this applies. The situation here is easier since u is locally integrable; still the language of distributions is convenient.
More precisely, for \(f\in {\mathcal {D}}(0,\infty )\) and \(\psi \in {\mathcal {D}}({\mathbb {R}}^n)\), we first note that \(t\mapsto \langle u(t),\psi \rangle \) is absolutely continuous on [0, T] with derivative equal to \(\langle F(t),\nabla \psi \rangle + \langle g(t),\psi \rangle + \langle h(t),\psi \rangle \) almost everywhere on (0, T). Hence, if \(\chi \) denotes a smooth function that is 1 for \(t\le T\) and 0 for \(t\ge 2T\), the expression
defines a distribution in \((0,\infty )\times {\mathbb {R}}^n\), which equals u when restricted to \((0,T)\times {\mathbb {R}}^n\). Calculations show that \(\partial _{t}v= (-{\text {div}}F_{o}+ g_{o}+h_{o})\chi + u(2T-\cdot )\chi '\) and \(\nabla v= (\nabla u)_{e}\chi \), where the subscripts \({\tiny o,e}\) denote odd and even extensions at \(t=T\), respectively. Hence, all the required assumptions can be verified to conclude that \(v\in {\text {C}}_{0}([0,\infty ), {\text {L}}^2_{x})\). \(\square \)
As usual one can add to g several other terms of the same type with different admissible pairs.
Corollary 2.53
Any \(\Delta ^{r_{1},q_{1}}_{0,T}\)-solution of the Cauchy problem (2.49) is a weak solution.
Our main result on the Cauchy problem is a synthesis of most of the theory that we developed so far.
Theorem 2.54
Assume (A1), (A2) and one of (A3) or (A3)’. There is a solution u of (2.49), unique in the class \( {\text {L}}^2(0,T; {\text {H}}^1_{x}) \cap {\text {L}}^{r_{1}}(0,T; {\text {L}}^{q_{1}}_{x})\), with the following properties.
-
(i)
u belongs to \({\text {C}}([0,T]; {\text {L}}^2_{x})\) and is the restriction to (0, T) of a function in \({\dot{{\text {H}}}}^{1/2}_{t}{\text {L}}^2_{x}\).
-
(ii)
Changing the origin of time, for \(0\le s\le t\le T\) there exists a unique \({\text {L}}^2_{x}\)-bounded operator \(\Gamma (t,s)\), called fundamental solution operator for the Cauchy problem on \((s,T)\times {\mathbb {R}}^n\) with no forcing terms and initial condition in \({\text {L}}^2_{x}\), that sends the initial condition at time s to the value of the unique solution at time t.
-
(iii)
For \(t\in [0,T]\) the solution u above is then given by
$$\begin{aligned} u(t)= & {} \Gamma (t,0)\psi - \int _{0}^t\Gamma (t,s){\text {div}}F(s)\, \text {d}s + \int _{0}^t\Gamma (t,s)g(s)\, \text {d}s\nonumber \\{} & {} + \int _{0}^t\Gamma (t,s)h(s)\, \text {d}s, \end{aligned}$$(2.50)where the first two integrals are weakly defined in \({\text {L}}^2_{x}\), while the last one converges strongly (i.e., in the Bochner sense).
-
(iv)
Define \({\mathcal {H}}\) on extending A by the identity and the lower order coefficients by 0 on \(({\mathbb {R}}\setminus (0,T))\times {\mathbb {R}}^n\).Footnote 3 The fundamental solution operators themselves are given by
$$\begin{aligned} \Gamma (t,s)= \text {e}^{\kappa (t-s)}G_{\kappa }(t,s), \quad { 0\le s \le t\le T,} \end{aligned}$$(2.51)for all \(\kappa \ge 0\) for which \({\mathcal {H}}+\kappa \) is invertible and causal and \(G_{\kappa }(t,s)\) is the Green operator obtained under this assumption.
Proof
We extend the forcing terms F, g, h by 0, keeping the same notation for the extensions and \({\mathcal {L}}\): they satisfy the same conditions on full space-time. We fix \(\kappa > 0\) for which \({\mathcal {H}}+\kappa \) is invertible and causal (Theorems 2.47 and 2.49 or 2.50).
First, we can use the inhomogeneous version of Theorems 2.28, 2.29 and 2.30 to build a (unique) \(\Delta ^{r_{1},q_{1}}\)-solution v to
and take \(u :=v\text {e}^{\kappa t}\) restricted to \([0,T]\times {\mathbb {R}}^n\). The assumption of causality implies indeed that \(u(t) \rightarrow \Gamma (t,0)\psi = \psi \) in \({\text {L}}^2_{x}\) as \(t \rightarrow 0\). Applying the inhomogeneous version of Theorem 2.38 to the Green operators \(G_{\kappa }(t,s)\) of \({\mathcal {H}}+\kappa \) gives us (2.50) with \(\Gamma (t,s)= \text {e}^{\kappa (t-s)}G_{\kappa }(t,s)\). We refer in particular to (v) in Sect. 2.10.
Next, we check uniqueness in the class \( {\text {L}}^2(0,T; {\text {H}}^1_{x}) \cap {\text {L}}^{r_{1}}(0,T; {\text {L}}^{q_{1}}_{x})\). Assume that u solves the Cauchy problem with \(\psi =0\), \(F=0\), \(g=0\), \(h=0\). By Corollary 2.53 we know that \(u\in {\text {C}}([0,T]; {\text {L}}^2_{x})\). With \(\kappa \) as above, \(v :=u\text {e}^{-\kappa t}\) solves the Cauchy problem with 0-data for \(\partial _{t}v+{\mathcal {L}}v+\kappa v=0\) in \((0,T)\times {\mathbb {R}}^n\). By restriction, as before, we can use the global parabolic operator also to build a solution \(u^T \in {\text {L}}^2(T,\infty ; {\text {H}}^1_{x}) \cap {\text {L}}^{ r_{1}}(T,\infty ; {\text {L}}^{q_{1}}_{x})\cap {\text {C}}_0([T,\infty ); {\text {L}}^2_{x})\) to the same equation with initial data \(u^T(T) = u(T)\) and in \((-\infty ,0]\times {\mathbb {R}}^n\) we set \(u^0(t) :=0\). By continuity valued in \({\text {L}}^{2}_{x}\), we can glue \(u^0, u, u^T\) together to a \(\Delta ^{r_{1},q_{1}}\)-solution w of \(\partial _{t}w+{\mathcal {L}}w+\kappa w=0 \) in \({\mathbb {R}}^{n+1}\), which vanishes identically by the inhomogeneous version of Theorem 2.27. Hence, we have \(u=0\).
The rest of the statement follows easily. By construction, u is the restriction of a function which belongs to \({\dot{{\mathcal {V}}}}\). Next, uniqueness implies that the formula (2.51) is valid for all \(\kappa > 0\) for which \({\mathcal {H}}+\kappa \) is invertible and causal.
It remains to include the case \(\kappa =0\) for (2.51) when \({\mathcal {H}}\) is invertible and causal. To this end, we can apply the homogeneous versions in Sects. 2.6, 2.8 and 2.9 with F, g, h being all zero. In that case, we consider a \({\dot{\Delta }}^{r_{1},q_{1}}\)-solution on \({\mathbb {R}}^{n+1}\), which produces a solution by restriction. This solution has a representation using the Green operators \(G_{0}(t,s)\). Already established uniqueness shows that \(\Gamma (t,s)=G_{0}(t,s)\). \(\square \)
Remark 2.55
In case that \(T=\infty \) and \(h=0\), inspection of the proof above with \(\kappa =0\) shows that if \({\mathcal {H}}\) is invertible and causal, then existence and uniqueness hold in the class \({\text {L}}^2(0,\infty ; {\dot{{\text {H}}}}^1({\mathbb {R}}^n)) \cap {\text {L}}^{ r_{1}}(0,\infty ; {\text {L}}^{ q_{1}}({\mathbb {R}}^n))\), with regularity in \({\text {C}}_{0}([0,\infty );{\text {L}}^2_{x})\).
Remark 2.56
Having forcing terms in \({\text {L}}^{ r'}(0,T; {\text {L}}^{ q'}({\mathbb {R}}^n))+{\text {L}}^{2}(0,T; {\text {L}}^{2}({\mathbb {R}}^n))\) allows us to cover for example supercritical forcing terms \(g\in {\text {L}}^{\rho '}(0,T; {\text {L}}^{\eta '}_{x})\), by which we mean \(2< \rho ,\eta < \infty \) with \(\frac{1}{\rho }+\frac{n}{2\eta } > \frac{n}{4}\) and, when \(n=1\), additionally \(\frac{1}{\rho }-\frac{1}{2\eta }{<} \frac{1}{4}\). Indeed, visualizing the exponents in a \((\frac{1}{\rho },\frac{1}{\eta })\)-plane immediately reveals that they can be decomposed as required.
Remark 2.57
If the coefficients for \({\mathcal {H}}\) are defined on \({\mathbb {R}}^{n+1}\), then by (2.51) we have
for all \(\kappa \ge \kappa _{0}\ge 0\) and \(t,s\in {\mathbb {R}}\), where \(\kappa _{0}\) is such that \({\mathcal {H}}+\kappa _{0}\) is invertible and causal (setting also \(G_{0}:=G\)). It is interesting to note that this relation between Green operators cannot be seen directly in \({\mathbb {R}}^{n+1}\) because the conjugation by the exponentials does not preserve the spaces of solutions. Another interesting consequence is that it implies exponential decay estimates for the operator norm:
recalling that \(t-s\ge 0\) for the Green operators to be non-zero.
Remark 2.58
If (A3) is used, then constants in the implicit estimates for u depend on the choice of \(\varepsilon _{0}\) for invertibility and causality, which was seen to depend on \(\lambda ,\Lambda , n,q_{1},r_{1}\), and \(P_{\infty }\) in the decomposition \(({{\textbf {D}}}_{\varepsilon _{{{\textbf {0}}}}})\). They do not depend on T unless \(P_{\infty }\) does.
Corollary 2.59
Under the same assumptions as in Theorem 2.54 we have for all \(t>s\) the equality
where \(^*\) is the complex adjoint and \({{\widetilde{\Gamma }}}(s,t)\) is the generalized fundamental solution of the adjoint problem.
Proof
We know that the Green operators \(G_{\kappa }(t,s)\) and \({{\widetilde{G}}}_{\kappa }(s,t)\) are adjoint operators. If we adapt the proof above to the adjoint backward operator \(-\partial _{t}+{\mathcal {L}}^*\), we produce solutions in \((0,T)\times {\mathbb {R}}^n\) on restricting the ones in \({\mathbb {R}}^{n+1}\) for \(-\partial _{t}+{\mathcal {L}}^*+\kappa \) multiplied by \(\text {e}^{\kappa (T-s)}\) (in the variable s). Changing the initial time T to t, this yields that the fundamental solution operator \({{\widetilde{\Gamma }}}(s,t)\) for the adjoint problem agrees with \( \text {e}^{\kappa (t-s)}{{\widetilde{G}}}_{\kappa }(s,t)= \Gamma (t,s)^*\), where the last equality follows from (2.51). \(\square \)
Corollary 2.60
The solution u of Theorem 2.54 agrees with the ones build in [28] and [4] under assumptions in these references.
Proof
By mixed embeddings (Proposition 5.1), the space for uniqueness in Theorem 2.54 contains the standard energy space \({\text {L}}^2(0,T; {\text {H}}^1_{x}) \cap {\text {L}}^{\infty }(0,T; {\text {L}}^{2}_{x})\). In Chapter 3 of [28], weak solutions in the latter class are constructed exactly under the same assumptions (A1), (A2) and subcritical (Remark 2.45) or critical (a.k.a compatible) conditions on the coefficients (which are even assumed real there). In [4] this is being done under the more restrictive conditions of subcritical and real coefficients with the structural conditions (A1) and (A2). \(\square \)
2.12 \({{\text {L}}^2}\) off-diagonal estimates
Aronson further proved pointwise Gaussian estimates of the generalized fundamental solution when the coefficients are real-valued [3, 4]. As already mentioned, assumptions on lower order coefficients in [4] amount to what we called subcritical compatibility (Remark 2.45), used in an essential way together with the fact that the coefficients are real, to obtain local boundedness of weak solutions. Already in the elliptic case with leading term the Laplacian on the unit ball, explicit examples show existence of unbounded weak solutions for some first order coefficients in \({\text {L}}^n\) or some zero order coefficients in \({\text {L}}^{n/2}\), see [27].
We know from Corollary 2.60 that our solutions agree with the ones of Aronson under his assumptions; in particular his generalized fundamental solution operator and ours are identical. Hence, pointwise bounds under (critical) compatibility assumptions are not to be expected. Still, under this assumption, we will be able to show \({\text {L}}^2\) off-diagonal estimates (or Gaffney estimates) for the fundamental solution operator, that is, decay of localized \({\text {L}}^2\) norms.
When there are no lower order terms, the method of Aronson has been streamlined with the exponential trick of Davies [14] for time independent A and this has been adapted by Fabes–Stroock [19] when A is time-dependent, see also Hofmann–Kim [22] for a nice presentation, using the Gronwall lemma as a starting point. The same ideas go through with bounded lower order coefficients but when they are allowed to be unbounded, it is not clear how to set up the arguments properly. In [9], a construction is proposed in absence of lower order terms, starting from the semigroup case. This approach is not possible when using mixed norms on lower order coefficients because there is no semigroup to begin with. Our approach allows us to overcome these difficulties by extending Davies’ ideas to the context of variational parabolic forms.
Theorem 2.61
Assume the conditions of Theorem 2.54. Then there are constants \(0<C,c_{0},\omega <\infty \) such that for all \(0\le s<t\le T\), all closed sets \(E,F \subset {\mathbb {R}}^n\) and all \(\psi \in {\text {L}}^2_{x}\) with support in F, we have
Let us comment on the three constants. If (A3) is used, then \(\omega =c_{0}P_{\infty }^2\) with \(P_{\infty }\) from the decomposition given by \(({{\textbf {D}}}_{\varepsilon _{{{\textbf {0}}}}})\) and \(C, c_{0}\) depend only on \(\lambda ,\Lambda , n, q_{1}, r_{1}\), where \(\varepsilon _{0}\) is such that the arguments for invertibility and causality apply. As in Remark 2.58, they may depend on T but only through \(P_{\infty }\). If (A3)’ is used, then \(C, \omega = c_{0}\) depend on \(\Lambda , n\) and \(c, c'\) in (A3)’.
Proof
We extend the coefficients to full space-time as in the proof of Theorem 2.54 and use the same notation. Henceforth, we work in \({\mathbb {R}}^{n+1}\) and prove (2.53) for all \( s<t\).
For a function \(h:{\mathbb {R}}^n\rightarrow [0,\infty [\) bounded and Lipschitz, consider the operator obtained in \({\mathbb {R}}^{n+1}\) on conjugating \({\mathcal {H}}\, (=\partial _{t}+{\mathcal {L}})\) with the multiplication by \(\text {e}^h\). A calculation (in the weak sense) shows that
where
and with \(A^t\) being the real transpose of A,
The coefficients \({{\textbf {a}}}_{h}\) and \( {{\textbf {b}}}_{h}\) are bounded by \(\Vert A\Vert _{\infty }\Vert \nabla h\Vert _{\infty }\). In \({a_{h}}\), the first term is bounded by \(\Vert A\Vert _{\infty }\Vert \nabla h\Vert _{\infty }^2\). To handle the second term, we distinguish the two assumptions.
Proof under (A3). The number \(\varepsilon _{0}\) is chosen in particular such that (2.48) for \({\mathcal {H}}+\kappa \) holds with \(\kappa \ge \kappa _{0}\) where \(\kappa _{0} =\delta /4+ c_{\delta }P_{\infty }^2\). Our first goal is to check that (2.48) for \({\mathcal {H}}+\kappa +\beta _{h}\) holds with \(\delta /4\) replaced by, say, \(\delta /8\) for large enough \(\kappa \) that will also depend on \(\Vert \nabla h\Vert ^2_{\infty }\). To this end, it will suffice to revisit the proof of that inequality after adding the contribution of the coefficients in (2.56).
Step 1: Proof of the lower bound for the perturbed \({\mathcal {H}}+\kappa +\beta _{h}\). We decompose \({{\textbf {a}}}-{{\textbf {b}}}=({{\textbf {a}}}_{0} -{{\textbf {b}}}_{0})+({{\textbf {a}}}_{\infty } -{{\textbf {b}}}_{\infty }) \) as in the assumption \(({{\textbf {D}}}_{\varepsilon })\) with \(\varepsilon =\varepsilon _{0}\). The term coming from \(({{\textbf {a}}}_{\infty }-{{\textbf {b}}}_{\infty })\cdot \nabla h\) brings a bounded contribution of size \(P_{\infty }\Vert \nabla h\Vert _{\infty }\). For the other term, we observe that \({{\textbf {a}}}_{0}-{{\textbf {b}}}_{0}\) belongs to \({\text {L}}^{2{{\tilde{r}}}_{1}}_{t}{\text {L}}^{2{{\tilde{q}}}_{1}}_{ x}\) with norm not exceeding \(2\varepsilon _{0}\) and \((2{{\tilde{r}}}_{1}, 2{{\tilde{q}}}_{1})\) is a subcritically compatible pair for coefficients of order 0. We decompose further this term as suggested in Remark 2.45. To this end, call \({\mathcal {L}}_{0}\) the elliptic operator with coefficients \(A, {{\textbf {a}}}_{0}, {{\textbf {b}}}_{0}, a_{0}\) and \({\mathcal {H}}_{0}\) the corresponding parabolic operator. Through the choice of \(\varepsilon _0\), we can make sure that (2.44) holds for \({\mathcal {H}}_{0}\). We also know that the multiplication by \(V\in {\text {L}}^{\tilde{r}_{1}}_{t}{\text {L}}^{{{\tilde{q}}}_{1}}_{ x}\) is a bounded operator \({\dot{{\mathcal {V}}}}\rightarrow {\dot{{\mathcal {V}}}}'\). Thus, we can choose \(\eta >0\) (depending on \(n, q_{1},r_{1}, \delta \)) so small that \(\Vert V\Vert _{{\text {L}}^{\tilde{r}_{1}}_{t}{\text {L}}^{{{\tilde{q}}}_{1}}_{ x}}\le \eta \) implies
For \(m>0\), the truncation \(V_{0} :=1_{|{{\textbf {a}}}_{0}-{{\textbf {b}}}_{0}|>m}({{\textbf {a}}}_{0}-{{\textbf {b}}}_{0})\cdot \nabla h\) satisfies
We choose m so that \(4\varepsilon _{0}^2m^{-1}\Vert \nabla h\Vert _{\infty }=\eta \). On the other hand, \(V_{\infty } :=1_{|{{\textbf {a}}}_{0}-{{\textbf {b}}}_{0}|\le m}({{\textbf {a}}}_{0}-{{\textbf {b}}}_{0})\cdot \nabla h\) satisfies
Setting \(\beta _{\infty }= {\mathcal {H}}-{\mathcal {H}}_{0}\) and \(\tilde{\beta }_{h}=\beta _{h}-V_{0}\), we have established the decomposition
with \({{\tilde{\beta }}}_{h}+\beta _{\infty }\) having first order coefficients bounded by \(\Vert \nabla h\Vert _{\infty }+P_{\infty }\) and zero order coefficients bounded by \(\Vert \nabla h\Vert _{\infty }^2+P_{\infty }^2\) up to multiplicative constants that depend only on \(\lambda , \Lambda , n, q_{1},r_{1}\). This was the key point.
Applying the same simple absorption argument as in Theorem 2.47 to this decomposition reveals that for some constant \(c_{0}\) with the same dependency and \(\kappa =1+ c_{0}(\Vert \nabla h\Vert _{\infty }^2+P_{\infty }^2)\), the operator in (2.57) is invertible from \({\mathcal {V}}\) onto \({\mathcal {V}}'\) with a lower bound \(\delta /8\) in (2.48).
Our next goal is to transfer such lower lower bounds to operator norms for the perturbed fundamental solution operator, following the dependency in h.
Step 2: Norm bounds for the perturbed fundamental solution operator. With the constraints on \(\kappa \) and h above, the norm of the inverse of \({\mathcal {H}}+\kappa +\beta _{h}\) depends on \(\lambda , \Lambda , n, q_{1},r_{1}\) but not on h. Altogether, it follows that the Green operators \(G_{h,\kappa }(t,s)\) associated to \( \text {e}^{h}{\mathcal {H}}\text {e}^{-h} + \kappa \) are uniformly bounded on \({\text {L}}^2_{x}\) with respect to (t, s) with a bound \(C_{0}\) depending only on \(\lambda , \Lambda , n, q_{1},r_{1}\).
Now, by construction we have \(G_{h,\kappa }(t,s)=\text {e}^h G_{\kappa }(t,s)\text {e}^{-h}\) and by Theorem 2.54 we have \(\Gamma (t,s)=\text {e}^{\kappa (t-s)}G_{\kappa }(t,s)\). Hence, \(\text {e}^h\Gamma (t,s)\text {e}^{-h}= \text {e}^{\kappa (t-s)} G_{h,\kappa }(t,s)\). This infers that for all \(t-s=1\) and \(\psi \in {\text {L}}^2_{x}\),
A scaling argument will now provide us with the right dependence of \(\omega \).
Fix \(s=0\) to simplify matters by time translation invariance of the assumptions. Set \(u(t,\cdot ):=\Gamma (t,0)\psi \). Recall that u solves \(\partial _{t}u+{\mathcal {L}}u= \delta _{0}\otimes \psi \), so that if \(R>0\), then \(u^R(t,x) :=u(R^2t,Rx)\) solves \(\partial _{t}u^R+{\mathcal {L}}^Ru^R= \delta _{0}\otimes \psi ^R\), with \(\psi ^R(x)=\psi (Rx)\) and \({\mathcal {L}}^R\) has coefficients \(A(R^2t,Rx)\), \(R\, {{\textbf {a}}}(R^2t,Rx)\), \(R\, {{\textbf {b}}}(R^2t,Rx)\), \(R^2a(R^2t,Rx)\). The quantity \(P_{{{\tilde{r}}}_{1},{{\tilde{q}}}_{1}}\) is scale invariant and therefore does not depend on R. The same applies to the ellipticity constants \(\lambda , \Lambda \), while \(P_{\infty }\) becomes \(P_{\infty }R\). Applying the above conclusion to the Green operator of \(\partial _{t}+{\mathcal {L}}^R\) at \(t=1\) with \(h^R(x)=h(Rx)\), and changing variables in space, yields
Altogether, this shows for all \(t>s\) and \(\psi \in {\text {L}}^2_{x}\),
It remains to optimize h appropriately.
Step 3: Choice of h. Fix E, F closed sets, let \(t>s\) and assume \(d(E,F)^2>t-s\), since otherwise there is nothing to prove. Let \(h(x) :=\inf ( \frac{d(E,F)d(x,F)}{2c_{0}(t-s)}, N)\) with \(N>\frac{d(E,F)^2}{2c_{0}(t-s)}\). We see that \(h\ge \frac{d(E,F)^2}{2c_{0}(t-s)}\) on E, \(h=0\) on F, and \(\Vert \nabla h\Vert _{\infty }=\frac{d(E,F)}{2c_{0}(t-s)}\). Thus, if \(\psi \) has support in F, we obtain (2.53) with \(C=C_{0}\text {e}\) and \(\omega =c_{0}P_{\infty }^2\).
Proof under (A3)’. We modify the argument, explaining how to adapt the proof of Theorem 2.50 (or Theorem 2.42 in the inhomogeneous setting, to be precise). As \({{\tilde{r}}}_{1}=\infty \), we have \(n\ge 3\) and \({{\tilde{q}}}_{1}=n/2\).
There are two key observations. First, if we add lower order terms with bounded coefficients to \({\mathcal {L}}\), then we still have the lower bound in (A3)’ up to taking c smaller and \(c'\) larger. Second, if \(V\in {\text {L}}^{\infty }_{t}{\text {L}}^{n/2}_{x}\), then
so that in particular if \(\eta = c(n)^{-1}c/2\) and \( \Vert V\Vert _{{\text {L}}^{\infty }_{t}{\text {L}}^{n/2}_{x}}\le \eta \), then we preserve the lower bound assumption of Theorem 2.42 on adding V.
In order to make use of these two observations, we recall that \({{\textbf {a}}}-{{\textbf {b}}}\in {\text {L}}^{\infty }_{t}{\text {L}}^{n}_{x}\) and decompose \(({{\textbf {a}}}-{{\textbf {b}}})\cdot \nabla h\) further in \({\text {L}}^{\infty }_{t}{\text {L}}^{n/2}_{x}+ {\text {L}}^{\infty }_{t}{\text {L}}^{\infty }_{x}\) as \(V_{0}+V_{\infty }\), where as usual we take \(V_{0} :=1_{|{{\textbf {a}}}-{{\textbf {b}}}|>m}({{\textbf {a}}}-{{\textbf {b}}})\cdot \nabla h\) and \(V_{\infty } :=1_{|{{\textbf {a}}}-{{\textbf {b}}}|\le m}({{\textbf {a}}}-{{\textbf {b}}})\cdot \nabla h\). We have
and we choose m so that this bound equals \(\eta \). Thus,
The decomposition replacing (2.57) is
where \({{\tilde{\beta }}}_{h}\) has first order coefficients bounded by \(C\Vert \nabla h\Vert _{\infty }\) and zeroth order coefficients bounded by \(C(1+\Vert \nabla h\Vert _{\infty }^2)\).
Applying the two introductory observations and choosing \(\kappa =c_{0}(1+\Vert \nabla h\Vert _{\infty }^2)\) for an appropriate constant \(c_{0}\), we see that the inverse of \({\mathcal {H}}+{{\tilde{\beta }}}_{h}+ V_{0}+\kappa \) has a norm that is bounded by a constant independent of h. The rest of the proof is as in the first case but the scaling argument is not needed: we first obtain
for all \(t>s\) and then then same choice of h as before leads to (2.53) with \(\omega =c_{0}\) and \(C=C_{0}\). \(\square \)
2.13 Pointwise Gaussian bounds
We prove that pointwise Gaussian bounds for the fundamental solution operator follow from an assumption of local boundedness on weak solutions of both the parabolic equation and its adjoint. To this end, we extend the argument presented in [22] without lower order coefficients. This argument adapts once we have (2.58) at hand. As said before, we do not know how to modify the argument in [22] for this inequality directly in the presence of lower order coefficients.
We recall that a weak solution of \(\partial _{t}u+{\mathcal {L}}u=0\) in an open set \({\text {I}}\times \Omega \) is a function u that is in the class \({\text {L}}^\infty ( {\text {I}}; ({\text {L}}^2(\Omega ))\) with \(\nabla u\) in \({\text {L}}^2({\text {I}}; ({\text {L}}^2(\Omega ))\) which satisfies the equation weakly against test functions \({{\tilde{\phi }}}\in {\mathcal {D}}({\text {I}}\times \Omega )\) as in (2.12). It is well-known that u is continuous in time locally in \({\text {L}}^2\), see also Lemma 2.52. The following definition introduces quantitative boundedness in the two variables.
For \((t,x)\in {\mathbb {R}}^{n+1}\) and \(r>0\), we let \(Q_{r}(t,x)=(t-r^2, t]\times B(x,r)\) and \(Q_{r}^*(t,x)=[t,t+r^2)\times B(x,r)\) be the usual forward and backward in time parabolic cylinders.
Definition 2.62
We say that \(\partial _{t}+{\mathcal {L}}\) and \(-\partial _{t}+{\mathcal {L}}^*\) have the local boundedness property if there are \(\rho \in (0,\infty ]\) and \(0<B<\infty \) such that for all \((t,x)\in {\mathbb {R}}^{n+1}\) and \(0<r<\rho \), any weak solution of \(\partial _{t}u+{\mathcal {L}}u=0\) and \(-\partial _{t}{{\tilde{u}}}+{\mathcal {L}}^*{{\tilde{u}}}=0\) on neighborhoods of \(Q_{2r}(t,x)\) and \(Q_{2r}^*(t,x)\), respectively, has local bounds of the form
Remark 2.63
If \(\rho =\infty \), the condition is scale invariant; here we will also encounter non-scale invariant situations, in which we need to consider \(\rho <\infty \).
Note that these conditions are usually presented by taking suprema on \(Q_{r}(t,x)\), \(Q_{r}^*(t,x)\) respectively, which means that one needs to know that solutions have pointwise values. Our weaker formulation suffices.
Theorem 2.64
Assume the conditions of Theorem 2.54 and that \(\partial _{t}+{\mathcal {L}}\) and \(-\partial _{t}+{\mathcal {L}}^*\) have the local boundedness property for some \(\rho \in (0,\infty ]\). Then, for all \(t>s\), the fundamental solution operator \(\Gamma (t,s)\) has a kernel \(\Gamma (t,x,s,y)\), called generalized fundamental solution, with almost everywhere pointwise Gaussian upper bound
whenever \(k\rho ^2\le t-s<(k+1)\rho ^2\) for some \(k\in {\mathbb {N}}\). (If \(\rho =\infty \), the only non-void case is \(k=0\).) Here,
where the constants \(0< C, \omega , c_{0}<\infty \) are the ones explicated in Theorem 2.61.
Proof
Under the hypotheses of Theorem 2.61 we have proved (2.58), which we rewrite for all \(t>s\), \(\psi \in {\text {L}}^2_{x}\) and real, Lipschitz and bounded h as
with \(\Gamma ^h(t,s):=\text {e}^h\Gamma (t,s)e^{-h}\) and \(\Vert \nabla h\Vert _{\infty }=\gamma \). By duality this inequality holds also for \({{\widetilde{\Gamma }}}^{h}(s,t)=\Gamma ^{-h} (t,s)^*\). Let
We may apply (2.59) to \(u^h\) and obtain for \(0<t-s<\rho ^2/2\) and \(x\in {\mathbb {R}}^n\) that for almost every \(z\in B(x,\sqrt{t-s}/2)\),
hence
Note that the right-hand side does not depend on the space variable. As \(\tau -s\le t-s\), this implies
Using (2.60) and (2.62) for the adjoint of \(\Gamma ^h(t,s)\) and duality, this yields
Let us momentarily assume \(0<t-s<\rho ^2\), that is \(k=0\). We shall remove this in the final step. By the Chapman–Kolmogorov identity of Theorem 2.33, which implies \(\Gamma (t,s)=\Gamma (t,r)\Gamma (r,s)\) with \(r=\frac{t+s}{2}\), we obtain
By the Dunford–Pettis theorem (Theorem 1.3 in [2]), this amounts to the fact that for all \(t>s\), \(\Gamma (t,s)= \text {e}^{-h}\Gamma ^h(t,s)\text {e}^h \) is an integral operator with measurable kernel that we denote by \(\Gamma (t,x,s,y)\), having an almost everywhere bound
Taking \(h=0\) already gives us a uniform almost everywhere bound
In order to prove (2.61), we fix x, y, t, s and assume \(\frac{|x-y|}{2\sqrt{2}\sqrt{t-s}}\ge 2\); otherwise we can simply use (2.67) since \(1 \le \text {e}^{\frac{2}{c_{0}}}\text {e}^{-\frac{ |x-y|^2}{16 c_{0}(t-s)}}\). We pick \(h(z)= \inf (\gamma |z-y|, N)\) with \(\gamma = \tfrac{|x-y|}{{4c_{0}(t-s)}}\) and \(N>\gamma |x-y|\). Thus, h is bounded and Lipschitz with \(\Vert \nabla h\Vert _{\infty }=\gamma \) and \(h(x)=\gamma |x-y|\), \(h(y)=0\). Observe that \(2\sqrt{2} \gamma \sqrt{t-s} \le \frac{h(x)}{2}\) and \(-\frac{h(x)}{2}+ {c_{0}\gamma ^2}(t-s)= - \frac{|x-y|^2}{16c_{0}(t-s)}\). Hence,
This concludes the argument when \(0<t-s<\rho ^2\).
We are of course done when \(\rho =\infty \). To conclude the proof when \(\rho <\infty \), we iteratively apply the Chapman–Kolmogorov formula for \(\Gamma (t,s)\) together with the upper bound just found and the convolution rule \(g_{\alpha }\star g_{\beta }=g_{\alpha +\beta }\), where \(g_{\alpha }(x)= (4\pi \alpha )^{-n/2} \text {e}^{-|x|^2/4\alpha }\) for \(\alpha ,\beta >0\). \(\square \)
Corollary 2.65
Under the same assumptions as in Theorem 2.64 we have for all \(t>s\) the equality
for almost every \(x,y\in {\mathbb {R}}^{n+1}\), where \(^*\) is the complex adjoint (here the conjugation as the kernels are complex-valued) and \({{\widetilde{\Gamma }}}(s,y,t,x)\) is the generalized fundamental solution of the adjoint problem.
Proof
We know that \(\Gamma (t,s)={{\widetilde{\Gamma }}}(s,t)^*\) and both have integral kernels. \(\square \)
Remark 2.66
Aronson’s prerequisite to obtaining Gaussian upper bounds for their generalized fundamental solution (which we now know agree with ours) is a condition on coefficients that insures the local boundedness property with the supremum, see Theorem B in [4]. Thus, Theorem 2.64 reproves Aronson’s upper bound in a constructive way through identification of the general fundamental solution operators with integral kernels.
Remark 2.67
The stability result in Proposition 2.1 of [22] for pure second-order \({\mathcal {L}}\) could be adapted but not with full lower order terms. Although formulated as a perturbation result for local bounds, it proves more, namely: if weak solutions of \(\partial _{t}- {\text {div}}A \nabla + {{\textbf {b}}}\cdot \nabla \) satisfy local Hölder bounds with proper scaling, one preserves this regularity up to changing the Hölder exponent, on perturbing of A in \({\text {L}}^\infty \) and \({{\textbf {b}}}\) in the compatible mixed Lebesgue space. It is not clear what happens when adding the other terms with \({{\textbf {a}}}\) or a.
2.14 Pure second order elliptic part
When the lower order coefficients are zero, that is, the elliptic part is the pure second order operator \({\mathcal {L}}_{0}:=-{\text {div}}A \nabla \), we see that there is no need to introduce the compatible pair \(({{\tilde{r}}}_{1},{{\tilde{q}}}_{1})\) to define \({\mathcal {H}}_{0}=\partial _{t}+{\mathcal {L}}_{0}:{\dot{{\mathcal {V}}}}\rightarrow {\dot{{\mathcal {V}}}}'\) in Proposition 2.11 and the information that \(\nabla u \in {\text {L}}^2_{t}{\text {L}}^2_{x}\) suffices. Thus, we can introduce the (larger) class of \({\text {L}}^2_{t}{\dot{{\text {H}}}}^{1}_{ x}\)-solutions of \(\partial _{t}u+{\mathcal {L}}_{0}u=f\) in \({\mathbb {R}}^{n+1}\), which we define as the class of distributions u with \(\nabla u \in {\text {L}}^2_{t}{\text {L}}^2_{x}\) such that \( \partial _{t}u+{\mathcal {L}}_{0} u= f\) in \({\mathcal {D}}'({\mathbb {R}}^{n+1})\).
Inspection of the arguments in Sect. 2.6 reveals that if \({\mathcal {H}}_{0}\) is invertible, then the statements extend by replacing systematically \({\mathcal {H}}\), \({\dot{\Delta }}^{r_{1},q_{1}}\) and \(\Vert u\Vert _{{\dot{\Delta }}^{r_{1}, q_{1}}}\) by \({\mathcal {H}}_{0}\), \({\text {L}}^2_{t}{\dot{{\text {H}}}}^{1}_{ x}\) and \(\Vert \nabla u\Vert _{{\text {L}}^2_{t}{\text {L}}^2_{x}}\), respectively. In particular, uniqueness up to a constant (assuming invertibility) is obtained in the larger class \({\text {L}}^2_{t}{\dot{{\text {H}}}}^{1}_{ x}\).
From there on, the theory develops analogously in this special case. The Cauchy problem for \(\partial _{t}+{\mathcal {L}}_{0}\) can be posed and solved uniquely in \({\text {L}}^2(0,T; {\text {H}}^1({\mathbb {R}}^n))\) when \(T<\infty \) for arbitrary data \(\psi , F,g,h\) in appropriate spaces, or in \({\text {L}}^2(0,\infty ; {\dot{{\text {H}}}}^1({\mathbb {R}}^n))\) when \(T=\infty \) and \(h=0\) (recovering and extending the result in [9]). The elimination of the constant comes from the initial data being in \({\text {L}}^2_{x}\). The \({\text {L}}^2\) off-diagonal decay was already known in this case (see the beginning of Sect. 2.12) but we still offer a different proof.
2.15 Lower order coefficients in Lorentz spaces
We have developed our variational approach under control of mixed Lebesgue norms on the lower order coefficients. We shall now explain why these conditions can be relaxed with hardly any effort, using the Lorentz spaces \({\text {L}}^{p,\infty }\). Recall that on a measure space \((M,\mu )\), a measurable function f belongs to \({\text {L}}^{p,q}\) in the case \(1\le p,q<\infty \) if
and in the case \(1\le p<\infty , q=\infty \) if
Here, \(f^*\) is the non-increasing rearrangement of f. It is known that \({\text {L}}^{p,p}={\text {L}}^p\) and that \(\Vert f\Vert _{{\text {L}}^{p,q}}\) is non-increasing as a function of q, so that \({\text {L}}^{p,q}\subset {\text {L}}^{p,p} \subset {\text {L}}^{p,r}\) if \(q\le p\le r\). Details are found in Chapter 5 of [32]. Mixed Lorentz spaces in (t, x) have been introduced by Fernandez [20], who also proved that they behave in the same way as Lebesgue spaces concerning duality and multiplication (Hölder’s inequality). Simple functions are dense in spaces for which all exponents are finite.
The extension mainly relies on the following lemma.
Lemma 2.68
Let \(({{\tilde{r}}}_{1}, {{\tilde{q}}}_{1})\) be a compatible pair for lower order coefficients with admissible conjugate \((r_{1}, q_{1})\). Then \({\dot{{\mathcal {V}}}}\hookrightarrow {\text {L}}^{r_{1},2}_{t} {\text {L}}^{r_{2},2}_{x}\) with continuous inclusion. Consequently, if
then \({\mathcal {H}}:{\dot{{\mathcal {V}}}}\rightarrow {\dot{{\mathcal {V}}}}'\) is well-defined and bounded and if
then \({\mathcal {H}}:{\mathcal {V}}\rightarrow {\mathcal {V}}'\) is well-defined and bounded.
Proof
Sobolev embeddings are equivalent to \({\text {L}}^p-{\text {L}}^q\) boundedness of Riesz potentials with \(p<q\). However, it was observed by O’Neil [30] that such Riesz potentials also have \({\text {L}}^{p,s}-{\text {L}}^{q,s}\) boundedness for the same p, q and all \(1\le s\le \infty \). In particular, they are \({\text {L}}^{p}-{\text {L}}^{q,p}\) bounded as \({\text {L}}^p={\text {L}}^{p,p}\). Thus, with the same relations between q, r and \(\theta \) as in Lemma 2.3 but with different constants,
and the continuous inclusion for \({\dot{{\mathcal {V}}}}\) follows from (2.2).
Now, we assume (2.69). A modification of Lemma 2.7, using Hölder’s inequality in Lorentz spaces to guarantee that a product of three functions in \({\text {L}}^{p_{i},s_{i}}\) belongs to \({\text {L}}^1\) if \(1 = \frac{1}{p_{1}} +\frac{1}{p_{2}}+\frac{1}{p_{3}}\) and \(1 = \frac{1}{s_{1}} +\frac{1}{s_{2}}+ \frac{1}{s_{3}}\), yields
With this at hand, the boundedness of \({\mathcal {H}}\) from \({\dot{{\mathcal {V}}}}\) to its dual follows exactly as in Proposition 2.11.
Likewise, if we assume (2.70), then we proceed with the modifications as in Sect. 2.10. \(\square \)
Assuming that (2.69) holds for the compatible pair \(({{\tilde{r}}}_{1}, {{\tilde{q}}}_{1})\), one can define \({\mathcal {H}}\) and develop the variational theory upon replacing in the definition of the space \({\dot{\Delta }}^{r_{1},q_{1}}\), where \((r_{1},q_{1})\) is the conjugate admissible pair, the mixed Lebesgue space \({\text {L}}^{r_{1}}_{t}{\text {L}}^{q_{1}}_{x}\) by the mixed Lorentz space \({\text {L}}^{r_{1},2}_{t}{\text {L}}^{q_{1},2}_{x}\). With this precaution and these changes, the estimates in Corollary 2.20 and the integral equalities in Lemma 2.22 hold. (When \((r,q)=(\infty ,2)\), there is no weakening of assumptions and we keep working with the space \( {\text {L}}^{1}_{t}{\text {L}}^{2}_{x}\).) We may proceed with the regularity Proposition 2.26, the uniqueness Theorem 2.27, the well-posedness Theorem 2.28 with \(g\in {\text {L}}^{r',2}_{t}{\text {L}}^{q',2}_{x}\) on the right-hand side, and so on up until Theorem 2.40. It is only for Theorem 2.41 that we need a stronger assumption on the coefficients to guarantee causality, as we have used inequalities in the spirit of Gagliardo-Nirenberg. It follows from Proposition 5.1 and Hölder inequalities that it is enough to impose
One can also develop the corresponding inhomogeneous theory with coefficients as in (2.70), working mainly under the Lorentz–Lorentz analogue of Assumption \(({{\textbf {D}}}_\varepsilon )\). While this amounts to the same symbolic changes from Lebesgue to Lorentz spaces in \(({{\textbf {D}}}_\varepsilon )\) itself, the succeeding Remark 2.46 has to be interpreted correctly: it says that by truncation a decomposition as in \(({{\textbf {D}}}_\varepsilon )\) for arbitrarily small \(\varepsilon >0\) can be achieved starting from \(|{{\textbf {a}}}|^2+|{{\textbf {b}}}|^2+|a|\in {\text {L}}^{{{\tilde{r}}}_{1}, {{\tilde{r}}}_{2}}_{t}{\text {L}}^{{{\tilde{q}}}_{1}, \tilde{q}_{2}}_{ x}+{\text {L}}^{\infty }_{t}{\text {L}}^{\infty }_{x}\) with \(1 \le {{\tilde{q}}}_{2}, {{\tilde{r}}}_{2} < \infty \), but not when one of \(\tilde{q}_{2}, {{\tilde{r}}}_{2}\) is infinite. Hence, the lower bounds assumption (A3)’ becomes more interesting here. In particular there is a statement corresponding to Theorem 2.54 in which mixed Lebesgue norms are replaced with mixed Lebesgue–Lorentz norms on the lower order coefficients with the same pairs \(({{\tilde{r}}}_{1}, \tilde{q}_{1})\) and
and in the equation the forcing term g can be taken in \( {\text {L}}^{r',2}(0,T; {\text {L}}^{q',2}_{ x})\) when (r, q) is admissible (but not when \((r,q)=(\infty ,2)\), where we take \(g\in {\text {L}}^{1}(0,T; {\text {L}}^{2}_{x})\) as before).
All the direct consequences of this result also extend: Corollary 2.59, Theorem 2.61 and Theorem 2.64. In the latter theorem it depends on whether the local boundedness assumption is true for the particular \({\mathcal {L}}\) and its adjoint. Note that neither [28] nor [4] consider coefficients in mixed Lebesgue–Lorentz spaces. Hence this extension is quite a new observation.
Let us give an example in the case \(({{\tilde{r}}}_{1},\tilde{q}_{1})=(\infty ,n/2)\), when \(n\ge 3\). Consider parabolic Schrödinger operators
with c a complex-valued measurable and bounded function. One cannot use the assumption \(({{\textbf {D}}}_{\varepsilon })\) here. But the classical Hardy inequality
which follows from Hardy’s one dimensional inequality [33, Appendix A] using polar coordinates, allows one to apply Theorem 2.42 when \({\text {ess inf}} {\text {Re}}c> -(\frac{n-2}{2})^2 =:c_{n}\). Thus, \({\mathcal {H}}\) is invertible and causal (for causality, \({\text {Re}}c \ge c_{n}\) works). One can therefore solve the Cauchy problem as above and obtain \({\text {L}}^2\) off-diagonal Gaussian decay of its fundamental solution operator. In [10], the slightly different but related question of existence of a distributional non-negative solution to the Cauchy problem for \(\partial _{t}-\Delta + c|x|^{-2}\) with non-negative initial \({\text {L}}^1\) or measure data and c a constant with \(c\in [c_{n}, 0]\) is considered.
2.16 Adding a skew-symmetric real \(\text {BMO}\) matrix to higher order coefficients
Motivated by fluid dynamics, it has become interesting to add to the usual elliptic matrix A a skew-symmetric term with boundedness replaced by a BMO condition. Indeed, formally, pointwise lower ellipticity of the matrix A does not change if one adds to it a real and skew-symmetric matrix D(t, x) as
and, if D(t, x) has finite \(\text {BMO}\) norm in the x-variable, uniformly for each t, then for \(u,v\in {\dot{{\mathcal {V}}}}\),
using the \(\text {BMO}_{x}-{\mathcal {H}}^1_{x}\) duality and compensated compactness [13]. Integrating this in time guarantees boundedness and ellipticity of the second order term in \({\mathcal {L}}\) if A is changed to \(A+D\) with \(\Vert D\Vert _{{\text {L}}^\infty _{t}\text {BMO}_{x}}<\infty \). We shall make this precise below.
All the results obtained up to this point extend with A replaced by \(A+D\) under this assumption on D. Indeed, the extension only affects the second order term, which has been treated via bounds for the pairing \(\langle A \nabla u,\nabla v \rangle \) at each occurrence rather than concrete bounds on A, with one sole exception that we address next.
The only subtle thing to handle is the proof of the \({\text {L}}^2\) off-diagonal estimates (2.53) as in Theorem 2.61 (with a less precise control on the constants \(C, \omega , c_{0}\)), the difficulty being that D re-appears in lower order coefficients when using Davies’ exponential trick in (2.54). We first give rigorous definitions of the bracket terms to justify computations.
We would like to set
but the inner term is usually not an honest Lebesgue integral for arbitrary \(u,v\in {\dot{{\mathcal {V}}}}\).
We introduce the set \({\mathcal {E}}\) of functions in \({\dot{{\mathcal {V}}}}\) that are in \({\mathcal {S}}({\mathbb {R}}^{n+1})\) with bounded support in the x-variable, which is dense in \({\dot{{\mathcal {V}}}}\) (resp. \({\mathcal {V}}\)). Indeed, we know that \({\mathcal {S}}({\mathbb {R}}^{n+1})\) is dense in \({\dot{{\mathcal {V}}}}\) and from there, we can use smooth truncations. Consider \(u,v\in {\mathcal {E}}\). Let Q be a cube containing their support. Set for \(i,j\in \{1,\ldots , n\}^2\),
For each t, this is a bounded function with support in Q and mean value zero. Hence, it is a constant multiple of an atom in \({\mathcal {H}}^1_{x}\), the real Hardy space on \({\mathbb {R}}^n\), and the \(\text {BMO}_{x}-{\mathcal {H}}^1_{x}\) duality is realized in this case as a Lebesgue integral
As we know from [13] that
we deduce
Using the skew-symmetry of D, that is, \(d_{i,j}=-d_{j,i}\), we can set
and this form extends boundedly to \({\dot{{\mathcal {V}}}}\times {\dot{{\mathcal {V}}}}\). We now explain the necessary modifications.
Proof of Theorem 2.61, BMO-case
To check the invertibility, it suffices as before to look for lower bounds of \(\text {e}^{h}({\mathcal {H}}+\kappa )\text {e}^{-h}u\). Thus, we study again \(\text {e}^{h}{\mathcal {H}}\text {e}^{-h}\) with h Lipschitz. We do not want to assume (qualitative) boundedness of h this time. Hence, we first restrict the operator to \({\mathcal {E}}\) but it extends to \({\mathcal {V}}\) through the right-hand side of (2.54). This allows us to take h an affine real-valued function given by \(h(x)=x\cdot \zeta + c\), with \(\zeta \in {\mathbb {R}}^n\) and \(c\in {\mathbb {R}}\). It will be important that the gradient of h is constant (as in [17, 31]). Thus, we compute \( \langle \langle \text {e}^{h}({\mathcal {H}}+\kappa )\text {e}^{-h}u,v \rangle \rangle \) with \(u,v\in {\mathcal {E}}\) and h affine.
Step 1: New error estimate. Compared to (2.54), we get an extra term coming from the presence of D. A calculation yields, with \(g_{i,j}\) defined in (2.72),
Next, we claim that for \(f,g \in H^1_{x}\) and each \(i\in \{1, \ldots , n\}\) the function \(\partial _{x_{i}}(fg)\) belongs to \({\mathcal {H}}^1_{x}\) with the estimate
For \(f=g\) this is Proposition 3.2 in [31] and the argument applies mutadis mutandis in the general case. Moreover, if f, g are smooth with bounded support, then \(\partial _{x_{i}}(fg)\) is a multiple of an atom in \({\mathcal {H}}^{1}_{x}\), so that for any \(b\in \text {BMO}_{x}\),
and the \(\text {BMO}_{x}-{\mathcal {H}}^1_{x}\) duality gives us a bound
Hence, for each fixed t, this applies to \(f=u(t), g=\overline{v}(t)\) and, using again the skew-symmetry of D, we arrive at
with
Using the above estimate and Young’s inequality, we see that for any \(\varepsilon >0\),
This is the required estimate for the additional error term in the presence of D.
Step 2: Off-diagonal estimate with affine perturbation. Now, it follows in the case (A3) that if \(({{\textbf {D}}}_{\varepsilon _{{{\textbf {0}}}}})\) holds for \(\varepsilon _{0}\) small enough, then \(\text {e}^{h}({\mathcal {H}}+\kappa )\text {e}^{-h}:{\mathcal {V}}\rightarrow {\mathcal {V}}'\) is invertible for \(\kappa \ge 1+c_{0}(| \zeta |^2+P_{\infty }^2)\). In the case of lower bounds assumptions for \({\mathcal {L}}\), this is for \(\kappa \ge c_{0}(1+| \zeta |^2)\). Of course, \(c_0\) now also depends on \(\Vert D\Vert _{{\text {L}}^\infty _{t} \text {BMO}_{x}}\). Moreover, in both cases, the operator norm of the inverse is bounded independently of \(|\zeta |\). In conclusion, we obtain an estimate of the form
for all \(t>s\) with positive constants \(C, \omega , c_{0}\).
Step 3: Proof of (2.53). Let us first treat the case that E, F are convex and compact sets with \(d(E,F)^2>4n(t-s)\). In this case, take \(e\in E, f\in F\) such that \(|e-f|=d(E,F)\) and set
Note that e is the orthogonal projection of f onto E and vice-versa. Hence,
for \(x\in E\) and \(h(y)\le 0\) for \(y\in F\), from which we obtain (2.53). For the general situation where E, F are arbitrary closed sets, we can assume \(d(E,F)^2>8 n (t-s)\); otherwise, we are done with the uniform \({\text {L}}^2_{x}\) bound for \(\Gamma (t,s)\). Let \(Q_{k} :=[0,\sqrt{t-s}]^n +\{\sqrt{t-s}\, k\}\), \(k\in {\mathbb {Z}}^n\). Cover E with the cubes \(Q_{k}\) that intersect E, and F with the cubes \(Q_{\ell }\) that intersect F. We have \(d(Q_{k},Q_{\ell })^2> 4 n (t-s)\). We apply the estimate just obtained for each pair \(Q_{k},Q_{\ell }\) and sum in order to conclude (of course the constants change), using that the cubes form a partition of \({\mathbb {R}}^n\) up to a null set and simple discrete convolution inequalities. \(\square \)
Remark 2.69
When A is also a real matrix, pointwise upper and lower bounds were obtained for the fundamental solution of the parabolic operator with pure second order term and matrix coefficient \(A+D\) in [31]. Here, we allow complex A and unbounded lower order terms and limit ourselves to an \({\text {L}}^2-{\text {L}}^2\) upper bound. Some similar estimates are obtained for time-independent matrix coefficients of the form \(A+D\) without lower order terms in [17]. In principle, we could re-discover pointwise upper bounds from (the extension of) Theorem 2.64, were we able to verify the local boundedness property without resorting to itself [31]. This is yet another example that illustrates how the order of classical arguments is reversed in our work.
2.17 Systems
The theory and its previous extensions do not change for systems of N equations, \(N\ge 2\). The results are the same with pointwise ellipticity in the x-variable replaced by ellipticity in the Gårding sense (uniformly in t): The matrix A(t) has entries being \(N\times N\) matrices of bounded measurable coefficients in (t, x) and
holds for all t. Indeed, we have never used pointwise bounds and ellipticity on A for means other than bounding \(\langle A \nabla u,\nabla v \rangle \) from above and below.
If one wants to add a matrix of \(\text {BMO}\)-type, it should be block diagonal, that is \(D=(\delta _{\alpha ,\beta }D^\alpha )_{1\le \alpha ,\beta \le N}\), where \(\delta _{\alpha ,\beta }\) is the Kronecker symbol, with each \(D^\alpha \) as in the previous section.
If the Gårding inequality comes with a negative \({\text {L}}^2\) norm on \( {{\textbf {u}}}(t)\), then one should apply the inhomogeneous theory. We leave details to the reader.
3 Higher order problems on full space
It is mainly a matter to fix algebraic notation as the analysis done for second order parabolic operators goes through almost verbatim for higher order problems on full space. We give details of the setup and sketch the main points, following faithfully what was done for second order problems. Given our omission of proofs, this section should be considered as an announcement of results, the verification of which is left to the interest readers. Results in this section also provide the generalization of the theory for second order elliptic parts when the compatible pairs are allowed to vary with the coefficients as mentionned earlier.
3.1 The elliptic operator
The elliptic part \({\mathcal {L}}\) is now 2mth order, \(m\ge 2\), given formally by
where the sum is taken over pairs \((\alpha ,\beta ) \) of multi-indices with \(0\le |\alpha |, |\beta | \le m\) and \(\partial ^\alpha \) are partial derivatives in the x-variable of order \(\alpha \). We have set \(|\alpha |=\alpha _{1}+\cdots +\alpha _{n}\) for \(\alpha =(\alpha _{1},\ldots , \alpha _{n})\).
3.2 Variational space
For the homogeneous theory, the space \({\dot{{\mathcal {V}}}}\) becomes the space of tempered distributions u having Fourier transforms \((|\xi |^{2m}+|\tau |)^{-1/2}g\) for some (unique) \(g\in {\text {L}}^2_{t}{\text {L}}^2_{x}\), equipped with the norm \(\Vert u\Vert _{{\dot{{\mathcal {V}}}}} :=(2\pi )^{-(n+1)/2} \Vert g\Vert _{{\text {L}}^2_{t}{\text {L}}^2_{x}}\). As in the case of order 2, this space realizes \({\text {L}}^2_{t}{\dot{{\text {H}}}}^m_{x}\cap {\dot{{\text {H}}}}^{1/2}_{t}{\text {L}}^2_{x}\) defined within tempered distributions modulo polynomials with norm
3.3 Embeddings
For an arbitrary collection \(({\textbf {r,q}})\) of pairs of exponents \((r^{\alpha ,\beta }, q^{\alpha ,\beta })\) in \([1,\infty ]^2\) indexed by multi-indices \((\alpha ,\beta ) \) with \(0\le |\alpha |, |\beta | \le m\), we set
For each \(\alpha \), there could be several mixed spaces involved to which \(\partial ^\alpha u\) belongs, parametrized by the multi-indices \(\beta \). If all pairs of exponents belong to \([1,\infty )^2\), then the dual space of \({\dot{\Delta }}^{{\textbf {r,q}}}\) in the duality extending the \({\text {L}}^2_{t}{\text {L}}^2_{x}\) inner product can be identified with
with the same interpretation as in the case \(m=1\) in Sect. 2.2 and \(({\textbf {r}}',{\textbf {q}}')\) is the collection of pairs of Hölder conjugates obtained from \(({\textbf {r,q}})\). When all pairs of exponents in \(({\textbf {r,q}})\) belong to \((1,\infty ]^2\), then the dual space of \(\Sigma ^{{\textbf {r}}',{\textbf {q}}'}\) can be identified with \({\dot{\Delta }}^{{\textbf {r,q}}}\) for the same duality. In particular, \({\dot{\Delta }}^{{\textbf {r,q}}}\) is reflexive when all pairs belong to \((1,\infty )^2\).
Sobolev embeddings for partial derivatives \(\partial ^\alpha \) in the spirit of Lemma 2.3 are as follows: If \(u\in {\dot{{\mathcal {V}}}}\) and \(0\le |\alpha |\le m\), then \(\partial ^\alpha u\in {\text {L}}^{r}_{t}{\text {L}}^{q}_{ x}\) with \(\Vert \partial ^\alpha u \Vert _{{\text {L}}^{r}_{t}{\text {L}}^{q}_{ x}} \lesssim \Vert u\Vert _{{\dot{{\mathcal {V}}}}}\) provided
We say that pairs (r, q) with the condition (3.3) are admissible for \(\partial ^\alpha \). When \(|\alpha |=m\), the only admissible pair for \(\partial ^\alpha \) is (2, 2). If \(|\alpha |<m\), then there is more flexibility. A collection \(({\textbf {r,q}})\) of pairs \((r^{\alpha ,\beta }, q^{\alpha ,\beta })\) indexed by multi-indices \((\alpha ,\beta )\) with \(0\le |\alpha |,|\beta | \le m\) is admissible (resp. super admissible) if each pair \((r^{\alpha ,\beta }, q^{\alpha ,\beta })\) is admissible for \(\partial ^\alpha \) (resp. admissible for \(\partial ^\alpha \) when \(\alpha \ne 0\) and admissible for \(\partial ^\alpha \) or equal to \((\infty , 2)\) when \(\alpha =0\)). In particular, the continuous inclusion \({\dot{{\mathcal {V}}}}\hookrightarrow {\dot{\Delta }}^{{\textbf {r,q}}}\) holds for all admissible collections.
3.4 Variational approach
When \(0\le |\alpha |,|\beta | \le m\), critical mixed Lebesgue spaces \({\text {L}}_{t}^{r({\alpha ,\beta })}{\text {L}}^{q({\alpha ,\beta })}_{ x}\) for the coefficients \(a_{\alpha ,\beta }\) are given by the relations
We say that \((r({\alpha ,\beta }), q({\alpha ,\beta }))\) is a compatible pair for \((\alpha ,\beta )\). If such a pair is given, any choice of admissible pairs \( (r^{\alpha ,\beta }, q^{\alpha ,\beta }) \) and \( (r_{\beta , \alpha }, q_{\beta , \alpha }) \) for \(\partial ^\alpha \) and \(\partial ^\beta \), respectively, yields
provided that
Note that this covers the higher order derivatives when \(|\alpha |=|\beta |=m\), where \(a_{\alpha ,\beta }\) are bounded and the admissible pairs for \(\partial ^\alpha \) and \(\partial ^\beta \) are (2, 2). If \(|\alpha |+|\beta |<2m\), then we have several choices.
We come to the definition of \({\mathcal {H}}\) on \({\dot{{\mathcal {V}}}}\). First, we fix once and for all a collection \(({{\tilde{{\textbf {r}}}}}_{{\textbf {1}}},{{\tilde{{\textbf {q}}}}}_{{\textbf {1}}})\) of compatible pairs \((r({\alpha ,\beta }), q({\alpha ,\beta }))\) for \((\alpha ,\beta )\) with \(0\le |\alpha |,|\beta | \le m\). We assume \(a_{\alpha ,\beta }\in {\text {L}}_{t}^{r({\alpha ,\beta })}{\text {L}}^{q({\alpha ,\beta })}_{ x}\) and setFootnote 4
Secondly, we need to work with two collections of admissible pairs, one for \(\partial ^\alpha \) denoted by \(({\textbf {r}}_{{\textbf {1}}},{\textbf {q}}_{{\textbf {1}}})= (r^{\alpha ,\beta }, q^{\alpha ,\beta })_{\alpha ,\beta } \), the other one for \(\partial ^\beta \) denoted by \(({\bar{{\textbf {r}}}}_{{\textbf {1}}},{\bar{{\textbf {q}}}}_{{\textbf {1}}})=(r_{\beta , \alpha }, q_{\beta , \alpha })\), both satisfying in addition (3.6).Footnote 5 We define accordingly the space \({\dot{\Delta }}^{{\textbf {r}}_{{\textbf {1}}},{\textbf {q}}_{{\textbf {1}}}}\) as above, and, taking into account the symmetric roles of multi-indices \(\alpha ,\beta \), we set
With this notation and using (3.5) and (3.6), we see that \({\mathcal {L}}\) in (3.1) satisfies
so that \({\mathcal {L}}\) acts boundedly from \({\dot{\Delta }}^{{{\bar{{\textbf {r}}}}_{{\textbf {1}}},{\bar{{\textbf {q}}}}_{{\textbf {1}}}}}\) into \(({\dot{\Delta }}^{{{\textbf {r}}_{{\textbf {1}}},{\textbf {q}}_{{\textbf {1}}}}})'\). From the continuous inclusions \({\dot{{\mathcal {V}}}}\hookrightarrow {\dot{\Delta }}^{{{\bar{{\textbf {r}}}}_{{\textbf {1}}},{\bar{{\textbf {q}}}}_{{\textbf {1}}}}}\) and \(({\dot{\Delta }}^{{{\textbf {r}}_{{\textbf {1}}},{\textbf {q}}_{{\textbf {1}}}}})' \hookrightarrow {\dot{{\mathcal {V}}}}'\) for admissible collections, we obtain that \({\mathcal {H}}=\partial _{t}+{\mathcal {L}}: {\dot{{\mathcal {V}}}}\rightarrow {\dot{{\mathcal {V}}}}'\) is well-defined and bounded.
3.5 Main regularity estimates
We can now state the main regularity lemma. We set \(\nabla ^mu=(\partial ^\alpha u)_{ |\alpha |= m}\) for simplicity.
Lemma 3.1
Let \(u\in {\mathcal {D}}'({\mathbb {R}}^{n+1})\). Assume \(\nabla ^m u\in {\text {L}}^2_{t}{\text {L}}^2_{x}\) and \(\partial _{t}u\in {{\dot{\Sigma }}}^{{{\textbf {r}}',{\textbf {q}}'}}\), where \(({\textbf {r,q}})\) is a super admissible collection. Then, there is a polynomial P in the x-variable with degree not exceeding \(m-1\), such that \(u-P\in {\text {C}}_{0}({\text {L}}^2_{x})\) and
with some constant C independent of u and P. Moreover, if the collection \(({\textbf {r,q}})\) is admissible, then \(u-P\in {\dot{{\mathcal {V}}}}\) with the same estimate on \(\Vert u-P\Vert _{{\dot{{\mathcal {V}}}}}\).
The proofs are similar to that of Sect. 2.4, replacing \(-\Delta \) by \((-\Delta )^m\): for example, one uses
as each \({\text {L}}_{t}^{(r^{\alpha ,\beta })'}\big (\partial ^\alpha {\text {L}}_{ x}^{(q^{\alpha ,\beta })'}\big )\) embeds into one \({\dot{{\text {H}}}}_{t}^{-\theta /2}{\dot{{\text {H}}}}_{ x}^{m(\theta -1)}\) for some \(\theta \in [0,1)\) when \((r^{\alpha ,\beta },q^{\alpha ,\beta })\) is an admissible pair for \(\partial ^\alpha \).
The integral equalities of Sect. 2.5 are also proved similarly.
3.6 The resulting theory
The invertibility of \({\mathcal {H}}\) is again enough to develop the uniqueness and existence of \({\dot{\Delta }}^{{{\bar{{\textbf {r}}}}_{{\textbf {1}}}, {\bar{{\textbf {q}}}}_{{\textbf {1}}}}}\)-solutions and to produce Green operators in order to obtain representations. For example, the uniqueness statement corresponding to Theorem 2.27 becomes that whenever \({\mathcal {H}}\) is invertible, then any \(u\in {\dot{\Delta }}^{{{\bar{{\textbf {r}}}}_{{\textbf {1}}}, {\bar{{\textbf {q}}}}_{{\textbf {1}}}}}\) such that \(\partial _{t}u+{\mathcal {L}}u=0\) vanishes.
The invertibility for \({\mathcal {H}}\) can be checked provided there is a Gårding inequality in the spirit of (2.43) for the leading coefficients, that is,
and for the lower order coefficients, smallness of \(P_{{\tilde{{\textbf {r}}}}_{{\textbf {1}}}, {{\tilde{{\textbf {q}}}}}_{{\textbf {1}}}}\) is needed. Alternatively, invertibility can also follow from lower bounds on \({\mathcal {L}}\) as in Theorem 2.42.
If (3.7) holds and the leading part of \({\mathcal {H}}\) is a pure 2m-order operator, then one can work with the uniqueness class of \({\text {L}}^2_{t}{\dot{{\text {H}}}}^{m}_{ x}\)-solutions, which is defined analogously to Sect. 2.14.
If the Gårding inequality comes with a negative \({\text {L}}^2_{t}{\text {L}}^2_{x}\) norm on u, or \(P_{{{\tilde{{\textbf {r}}}}}_{{\textbf {1}}}, {{\tilde{{\textbf {q}}}}}_{{\textbf {1}}}}\) is not small enough, or bounded coefficients are added to the lower order coefficients while \(P_{{{\tilde{{\textbf {r}}}}}_{{\textbf {1}}}, {{\tilde{{\textbf {q}}}}}_{{\textbf {1}}}}\) remains small, or again that a lower bound is assumed on \({\mathcal {L}}\), then one uses inhomogeneous spaces to prove invertibility of \({\mathcal {H}}+\kappa :{\mathcal {V}}\rightarrow {\mathcal {V}}'\) for large enough \(\kappa \).
Using the improvement of (3.5) with the mixed Lorentz spaces \({\text {L}}_{t}^{r({\alpha ,\beta }),\infty }{\text {L}}_{ x}^{q({\alpha ,\beta }),\infty }\) replacing the mixed Lebesgue spaces \({\text {L}}_{t}^{r({\alpha ,\beta })}{\text {L}}_{ x}^{q({\alpha ,\beta })}\) for the lower order coefficients is possible and \(P_{{\tilde{{\textbf {r}}}}_{{\textbf {1}}},{{\tilde{{\textbf {q}}}}}_{{\textbf {1}}}}\) is modified accordingly. This covers, for example, power weights \(c(t,x)|x|^{-n/q(\alpha ,\beta )}\) with c bounded above and below, when \(r(\alpha ,\beta )=\infty \). For forcing terms and solutions, the mixed Lorentz spaces \({\text {L}}_{t}^{r,2}{\text {L}}_{ x}^{q,2}\) may replace the mixed Lebesgue spaces \({\text {L}}_{t}^{r}{\text {L}}_{ x}^{q}\) with the same collections of pairs.
The proof of causality uses a variant of Gagliardo–Nirenberg inequalities and requires mixed Lebesgue–Lorentz norms. A quick proof of this variant can be found in Proposition 5.1.
The Cauchy problem can be stated and proved in a similar fashion. The fundamental solution operator can be identified with exponentially weighted Green operators as before. Under the same assumptions guaranteeing invertibility and causality of \({\mathcal {H}}+\kappa \), the fundamental solution operator enjoys \({\text {L}}^2\) off-diagonal estimates. Lipschitz bounded functions of the x-variable are replaced by the regular functions considered by Davies in [15] for the case of time-independent parabolic operators with bounded lower terms. This is more complicated here, because we take unbounded coefficients. But we can obtain lower bounds of perturbed operators \(\text {e}^h ({\mathcal {H}}+\kappa )\text {e}^{-h}\) using successive and tedious decompositions of the perturbed coefficients as in the condition \(({{\textbf {D}}}_{\varepsilon })\), where \(\kappa \) is chosen on the order of \(c+c\Vert \nabla h\Vert _{\infty }^{2m}\) and optimization in h gives exponential decay in \((d(E,F)^{2m}/|t-s|)^{1/(2m-1)}\).
Extensions to systems work without difficulty.
4 Second order problems with lateral boundary conditions
In this short section we describe an extension of our theory to second order parabolic problems on cylinders with lateral boundary conditions. As the previous section, this should be considered an announcement of results. Working out the details along our sketch and extending the results to systems is again left to interested readers. Adaptation to higher order problems is likely to hold but would require further work.
4.1 The geometric setup
We work on \({\mathbb {R}}\times \Omega \), where \(\Omega \) is an open set in \({\mathbb {R}}^n\), and encode lateral boundary conditions through the choice of a variational space V with
equipped with the Hilbertian norm
The cases \(V = {\text {W}}^{1,2}_{0}(\Omega )\) and \(V = {\text {W}}^{1,2}(\Omega )\) correspond to (pure) lateral Dirichlet and Neumann boundary conditions. Spaces in between can be used to model for instance a mix of the two.
The only geometric assumption that we make on \(\Omega \) are (fractional) Sobolev embeddings for V. We write \([\cdot \,,\cdot ]_\theta \) for the complex interpolation bracket, see for example Section 1.9 in [35].
Assumption \(({{\textbf {V}}})\). We assume that there exists an embedding dimension \(d \in [1,\infty )\) with the following property: For all \(\theta \in [0,1]\) and \(2 \le q < \infty \) such that \(\frac{1}{2} - \frac{1-\theta }{d} = \frac{1}{q}\), we have
with continuous inclusion.
Remark 4.1
If \(({{\textbf {V}}})\) holds for one choice of d, then it holds for all larger choices. Hence, it will be advantageous to take d as small as possible. The primary example we have in mind is when \(\theta = 0\) is allowed above (hence \(d >2\)) and therefore V itself satisfies the Sobolev embedding
In this case, the other embeddings required in \(({{\textbf {V}}})\) follow by complex interpolation. However, already for \(\Omega = {\mathbb {R}}^2\) the optimal choice is \(d=2\) and by fractional Sobolev embeddings we have indeed \(({{\textbf {V}}})\) with \(d=2\) and that (4.1) is satisfied when \(\theta \in (0,1]\), even though we do not have (4.2). In ambient dimension \(n=1\) and when \(\Omega \) is an interval, \(({{\textbf {V}}})\) holds with embedding dimension \(d=1\) no matter what the boundary conditions are and (4.1) is satisfied in the limited range \(\theta \in (\frac{1}{2}, 1]\) due to the constraint \(2\le q<\infty \).
Remark 4.2
Testing (4.1) with cut-off functions \(\psi \) for arbitrarily small balls contained in \(\Omega \), reveals that d cannot be smaller than the ambient dimension n. In principle, d can be larger than n. When \(V={\text {W}}^{1,2}_{0}(\Omega )\) or when \(\Omega \) is sufficiently regular, the value \(d=n\) is obtained. For a discussion of irregular sets that satisfy \(({{\textbf {V}}})\), we refer to the introduction of [12] or [1, Ch. 4] for the case \(d>n\) and to [18, Sec. 3] for mixed Dirichlet-Neumann boundary conditions.
4.2 Variational space
The variational space is now \({\mathcal {V}}:={\text {L}}^2_{t}V \cap {\text {H}}^{1/2}_{t}{\text {L}}^2_{x}\), equipped with the Hilbertian norm \(\Vert u\Vert _{{\mathcal {V}}}\) given by
where in this section we use the notation \({\text {L}}^p_{x} :={\text {L}}^p(\Omega )\). Let \(-\Delta _{V}\) be the positive self-adjoint operator built from the sesquilinear form \((\psi , {{\tilde{\psi }}}) \mapsto \langle \nabla \psi ,\nabla {{\tilde{\psi }}} \rangle \) on \(V\times V\). We let \(S=(1-\Delta _{V})^{1/2}\), so that by Kato’s second representation theorem [25] the domain of S is equal to V with \(\Vert S\psi \Vert _{{\text {L}}^2_{x}}= \Vert \psi \Vert _{V}\) for all \(\psi \in V\). It is also known that the domains of the powers \(S^\alpha \), \(\alpha \in {\mathbb {R}}\), interpolate by the complex method [8].
4.3 Embeddings
We begin by developing the theory along the lines of Sect. 2. As the reader may have already observed, we have used the full strength of distribution theory only in the t-variable, whereas in the x-variable distributions and test functions have mostly appeared for the sake of simple arguments but they could have been replaced by spectral theory for the Laplacian and functions in less regular spaces such as \({\dot{\Delta }}^{r,q}\). This is our general guideline.
Our first task is to identify the pairs (r, q) for which we have the embedding
where \(\Delta ^{r, q}\) is equipped with the norm \( \Vert u\Vert _{\Delta ^{r, q}}: =\Vert u\Vert _{{\text {L}}^2_{t}V} + \Vert u\Vert _{{\text {L}}^{r}_{t}{\text {L}}^{q}_{ x}}. \) We set \(\Sigma ^{r',q'}= {\text {L}}^2_{t}V'+ {\text {L}}^{r'}_{t}{\text {L}}^{q'}_{ x}\) with the usual infimum norm.
Lemma 4.3
Under Assumption (V), the embedding (4.3), and by duality \(\Sigma ^{r',q'} \hookrightarrow {\mathcal {V}}'\), hold if \(\frac{1}{r}+\frac{d}{2q}= \frac{d}{4}\) with \(2\le r,q < \infty \).
Proof
We modify the proof of Lemma 2.3. In order to prove (2.1) we have previously used the Fourier transform on \({\text {L}}^2({\mathbb {R}}^n)\) to obtain unitary equivalence of \(-\Delta \) to a multiplication operator \(m(\xi ) = |\xi |^2\). Here, we use the spectral theorem for \((1-\Delta _{V})\) and the same argument applies. As for (2.2), the required Sobolev inequality in the spatial variable is precisely our Assumption \(({{\textbf {V}}})\) and now d instead of n plays the role of the dimension. Hence, (4.3) holds under the given conditions on (r, q). \(\square \)
4.4 Variational approach
Pairs that satisfy the relation in Lemma 4.3 will be called admissible pairs (for the boundary value problems under assumption \(({{\textbf {V}}})\)). Once again, admissible pairs (r, q) are conjugates of pairs \(({{\tilde{r}}}, {{\tilde{q}}})\), called compatible pairs for lower order coefficients, which are defined by
The conjugation rule is \((r, q)=(2({{\tilde{r}}})',2({{\tilde{q}}})') \) as in (2.7). Fixing once and for all a compatible pair \(({{\tilde{r}}}_{1}, {{\tilde{q}}}_{1})\) for lower order coefficients, we define the parabolic operator \({\mathcal {H}}\) on \({\mathcal {V}}\) by the sesquilinear form
where as before, \(\beta \) includes the lower order terms, \(\langle \cdot \,,\cdot \rangle \) is now the inner product on \({\text {L}}^2_{x}={\text {L}}^2(\Omega )\) and \(\langle \langle \cdot \,,\cdot \rangle \rangle \) the sesquilinear duality extending the \({\text {L}}^2_{t}{\text {L}}^2_{x}\) inner product. As \({\mathcal {D}}({\mathbb {R}}; V)\) is a dense subspace of \({\mathcal {V}}\), we have
for all \(u\in {\mathcal {V}}\) and \(v\in {\mathcal {D}}({\mathbb {R}}; V)\), where \({\mathcal {L}}\) is defined by the integral above. Hölder’s inequality, which is dimensionless in terms of exponents, yields
with \(P_{{\tilde{r}}_{1},{{\tilde{q}}}_{1}}\) as in (2.6), so that
Hence, using \({\mathcal {H}}\) gives access to weak solutions in \({\text {L}}^2_{t}V\) of \(\partial _{t} u+{\mathcal {L}}u=w\) with lateral boundary conditions prescribed by V.
4.5 Main regularity estimates
Modifying the proofs of Sect. 2.4 on replacing \(-\Delta \) systematically by \((1-\Delta _{V})\), the main regularity lemma becomes the following statement.
Lemma 4.4
Let \(u\in {\mathcal {D}}'({\mathbb {R}}; V')\) with \(u\in {\text {L}}^2_{t}V\) and \(\partial _{t}u\in \Sigma ^{r', q'} \) for (r, q) an admissible pair or \((r,q)=(\infty ,2)\) under Assumption \(({\textrm{V}})\). Then \(u\in {\text {C}}_{0}({\text {L}}^2_{x})\) and for some constant \(C<\infty \) independent of u,
Moreover, if (r, q) is admissible, then \(u\in {\mathcal {V}}\) with the same estimate on \(\Vert u\Vert _{{\mathcal {V}}}\).
Proof
We indicate the main changes.
Modification of the uniqueness Lemma 2.14. This is now stated for \(u\in {\mathcal {D}}'({\mathbb {R}}; V')\) such that \(\partial _{t}u + (1-\Delta _{V}) u=0\) in \({\mathcal {D}}'({\mathbb {R}}; V')\): if \(u\in {\text {L}}^2_{t}V\), then \(u=0\). Indeed, we see that \(\partial _{t}u \in {\text {L}}^2_{t}V'\). By Lions’ embedding theorem we have \(u\in {\text {C}}_{0}({\text {L}}^2_{x})\), and testing the equation against u yields that \(\langle \langle (1-\Delta _{V})u,u \rangle \rangle =0\), which implies \(u=0\).
Modification of the embedding in Lemma 2.16. Here, we have to show that with \(\theta =1-\frac{2}{r}\) we have the continuous inclusion
where
In the definition of this space, \(S^{1-\theta }\) is extended dy duality to a map from \({\text {L}}^2\) into the dual of V with respect to the \({\text {L}}^2_x\) duality. This uses that V is the domain of S and that \(0 \le 1-\theta \le 1\). Hence, we are working with a subspace of \( {\mathcal {D}}'({\mathbb {R}}; V')\). The embedding itself is a repetition of the proof of Lemma 2.16 except that now we take \(G = {\mathcal {S}}({\mathbb {R}}; V)\) as dense subset. This is where we use assumption \(({{\textbf {V}}})\).
Modification of the stronger regularity statement in Lemma 2.17. We need a new dense subspace \(G_0\), which we can take as \(G_0 :={\mathcal {S}}_{00}({\mathbb {R}}; {\text {dom}}(\Delta _V^2))\) here.
Step 1 then goes through mutadis mutandis if we understand Fourier in the x-variable as a special case of the spectral theorem for the Laplacian, compare with the proof of Lemma 4.3.
Step 2 remains unchanged.
For Step 3, we obtain \(v'(t)+ (1-\Delta _{V})v(t)=w(t)\) in \({\text {L}}^2_x\) for all \(t \in {\mathbb {R}}\), whenever \(g\in G_{0}\). The equation can also be interpreted in \({\mathcal {D}}'({\mathbb {R}}; V')\) for the test functions \(\phi \in {\mathcal {D}}({\mathbb {R}}; V)\): this interpretation passes to the limit for \(g\in {\text {L}}^2_{t}{\text {L}}^2_{x}\), thanks to Step 1.
Lastly, Step 4, again for \(g\in G_{0}\), has been a Fourier transform argument and now its use in the x-variable should be replaced by the spectral theorem. From this perspective, the proof is the same as before.
End of proof. Modifications of Proposition 2.18 and of the corollaries that follow are proved similarly with constant \(c=0\). \(\square \)
4.6 The resulting theory
From this point on, the theory can be developed similar to the inhomogeneous setting of Sect. 2.10, assuming \(({{\textbf {V}}})\). The two exceptional topics are pointwise Gaussian upper bounds (Sect. 2.13) and \(\text {BMO}\)-coefficients in the principal part (Sect. 2.16), the extension of which will require finer geometrical properties of the underlying domain \(\Omega \) and should be considered open at this point.
The rest works out smoothly, as long as we assume uniform ellipticity in the sense of Gårding: There should exist \(\lambda >0\) and \(c_0 \in {\mathbb {R}}\) such that for almost every t and every \(w \in V\) we have
Then, we can work with lower order coefficients in Lebesgue-Lebesgue mixed spaces with the assumption \(({{\textbf {D}}}_{\varepsilon })\). This gives access to representation by Green operators for the inverse of \({\mathcal {H}}+\kappa \) for appropriate \(\kappa \ge 0\), causality and fundamental solution operators for the Cauchy problem. The proof of \({\text {L}}^2\) off-diagonal estimates can be adapted if V is invariant under multiplication with bounded Lipschitz functions. For example, the variational spaces for mixed Dirichlet-Neumann boundary conditions have this property [18, Lem. 4].
If (4.4) comes with \(c_{0}=0\) and the leading part of \({\mathcal {H}}\) is a pure second order operator, then one can also develop the theory in the class \({\text {L}}^2_{t}V\) similar to Sect. 2.14.
Finally, for the extension of the definition of \({\mathcal {H}}\) when coefficients belong to mixed Lorentz spaces, we can use the following self-improvement property to treat all compatible pairs \(({{\tilde{r}}}_{1},{{\tilde{q}}}_{1})\) with \({{\tilde{r}}}_{1}<\infty \).
Lemma 4.5
If Assumption (V) holds, then \({\text {L}}^q(\Omega )\) can be replaced with \({\text {L}}^{q,2}(\Omega )\) in (4.1) when \(\theta >0\).
Proof
Let us fix \(\theta \in (0,1]\) with corresponding Lebesgue exponent q. By open-endedness, we pick \(\vartheta \in (0,\theta )\) with corresponding larger exponent r. By (4.1) we have a continuous inclusion
For Hilbert spaces the complex method agrees with the \((\cdot \,, \cdot )_{\theta ,2}\)-real method [8, Sec. 6]. With \(\sigma :=\frac{1-\theta }{1-\vartheta } \in (0,1)\) we obtain the required continuous inclusion
The second equality is the reiteration theorem [35, Sec. 1.10.2] and the final equality follows from the real interpolation property for Lebesgue spaces [35, Sec. 1.18.6] and the relation \(\frac{1-\sigma }{2} + \frac{\sigma }{r} = \frac{1}{q}\). \(\square \)
With the previous lemma at hand, the extension of the definition of \({\mathcal {H}}\) with coefficients in mixed Lorentz spaces can be carried out as before for compatible pairs for the coefficients with \(\tilde{r}_{1}<\infty \). The case \(r_{1}=2\) for the conjugate admissible pair \((r_{1},q_{1})\) is not covered by this statement. Invertibility can be shown under Lorentz–Lorentz mixed norms for the lower order coefficients and causality follows under Lebesgue–Lorentz mixed spaces.
In order to include Lorentz spaces for compatible pairs with \(\tilde{r}_{1}=\infty \), which is probably the most interesting case in applications, the improvement in Lemma 4.5 for \(\theta = 0\) is needed, that is, \(d \ge 3\) and the embedding \(V \hookrightarrow {\text {L}}^{2d/(d-2),2}_{x}\) holds. One simple way to guarantee this embedding for \(d=n \ge 3\) is to assume that there is a bounded Sobolev extension operator \(V \rightarrow {\text {W}}^{1,2}({\mathbb {R}}^n)\) since then one can use the O’Neil’s Sobolev embedding on \({\mathbb {R}}^n\) and restrict back to \(\Omega \). Hence, this always works for pure lateral Dirichlet conditions (\(V = {\text {W}}^{1,2}_0(\Omega )\)), using the extension by zero. For the existence of an extension operator in the case of mixed lateral boundary conditions, the most general geometric assumptions as far we are aware can be found in [11].
5 Gagliardo–Nirenberg inequalities
We prove here a version of Gagliardo–Nirenberg inequalities including Lorentz norms. We work on \({\mathbb {R}}^n\).
Proposition 5.1
Let \(m\ge 1\) be an integer, \({\text {I}}\) be an interval and \(u\in {\text {L}}^\infty ({\text {I}}; {\text {L}}^2_{x})\) with \(\nabla ^m u \in {\text {L}}^2({\text {I}}; {\text {L}}^2_{x})\). Let \(\alpha \) be a multi-index such that \(0\le |\alpha |\le m\) and
Then
Proof
Let \(a=|\alpha |\). Define \(b\ge 0\) by \(b-\frac{n}{2}= a- \frac{n}{q}\). Then, for (almost) every \(t\in {\text {I}}\), we invoke the boundedness of \(\partial ^\alpha (-\Delta )^{-a /2}\) on Lorentz spaces and the O’Neil’s Sobolev embedding theorem [30],
We next use the classical interpolation inequality
Using \(u\in {\text {L}}^\infty ({\text {I}}; {\text {L}}^2_{x})\), we can conclude by integrating the rth power and working out the exponents. The interpolation inequality itself is easily seen by using the Fourier transform in \({\text {L}}^2_{x}\). Indeed, writing \(|\xi |^{2b} |\hat{u}(\xi )|^2 = (|\xi |^{2m} |\hat{u}(\xi )|^2)^{b/m}(|\hat{u}(\xi )|^2)^{(m-b)/m}\), it boils down to Hölder’s inequality with exponent \(\frac{m}{b} \in [1,\infty )\). This finishes the proof. \(\square \)
Data Availability
Data sharing not applicable to this article as no datasets were generated or analysed during the current study.
Notes
Here, we do not define all spaces precisely as we just want to explain the spirit of our results.
From now on, we choose not to indicate the target vector space in the notation.
There are many ways to do this extension, but this one does not increase the various constants on the coefficients and could be called canonical. Alternately, if the coefficients are already defined on full space-time, then one only needs their restrictions anyway.
With this parametrization, when \(m=1\), we have considered the compatible collection consisting of \((\tilde{r}_{1},{{\tilde{q}}}_{1})\) when \(|\alpha |=|\beta |=0\), \((2{{\tilde{r}}}_{1}, 2 {{\tilde{q}}}_{1})\) when \(|\alpha | +|\beta |=1\) and \((\infty ,\infty )\) when \( |\alpha |=|\beta |=1\).
With this parametrization, there is no notion of unambiguously defined conjugate collection associated to the compatible collection for the coefficients. Besides, there are many possible choices of collections \(({\textbf {r}}_{{\textbf {1}}} , {\textbf {q}}_{{\textbf {1}}})\) and \(({\bar{{\textbf {r}}}}_{{\textbf {1}}}, {\bar{{\textbf {q}}}}_{{\textbf {1}}})\) and we fix one once and for all.
References
Adams, R.A., Fournier, J.F.: Sobolev Spaces. Pure and Applied Mathematics (Amsterdam), vol. 140. Elsevier/Academic Press, Amsterdam (2003)
Arendt, W., Bukhvalov, A.V.: Integral representations of resolvents and semigroups. Forum Math. 6(1), 111–135 (1994)
Aronson, D.G.: Bounds for the fundamental solution of a parabolic equation. Bull. Am. Math. Soc. 73, 890–896 (1967)
Aronson, D.G.: Non-negative solutions of linear parabolic equations. Ann. Scuola Norm. Sup. Pisa (3) 22, 607–694 (1968)
Auscher, P., Bortz, S., Egert, M., Saari, O.: On regularity of weak solutions to linear parabolic systems with measurable coefficients. Math. Pures Appl. 121(9), 216–243 (2019)
Auscher, P., Egert, M.: On non-autonomous maximal regularity for elliptic operators in divergence form. Arch. Math. 107(3), 271–284 (2016)
Auscher, P., Egert, M., Nyström, K.: \(\text{ L}^2\) well-posedness of boundary value problems for parabolic systems with measurable coefficients. J. Eur. Math. Soc. 22(22), 2943–3058 (2020)
Auscher, P., McIntosh, A., Nahmod, N.: Holomorphic functional calculi of operators, quadratic estimates and interpolation. Indiana Univ. Math. J. 46(2), 375–403 (1997)
Auscher, P., Monniaux, S., Portal, P.: On existence and uniqueness for non-autonomous parabolic Cauchy problems with rough coefficients. Ann. Scu. Norm. Pisa 19(5), 387–471 (2019)
Baras, P., Goldstein, J.A.: The heat equation with a singular potential. Trans. Am. Math. Soc. 284(1), 121–139 (1984)
Bechtel, S., Brown, R.M., Haller, R., Tolksdorf, P.: Extendability of functions with partially vanishing trace. arXiv:1910.06009
Besov, O.V.: Sobolev’s embedding theorem for anisotropically irregular domains. Eurasian Math. J. 2(1), 32–51 (2011)
Coifman, R., Lions, P.-L., Meyer, Y., Semmes, S.: Compensated compactness and Hardy spaces. J. Math. Pures Appl. (9) 72(3), 247–286 (1993)
Davies, E.B.: Explicit constants for Gaussian upper bounds on heat kernels. Am. J. Math. 109(2), 319–333 (1987)
Davies, E.B.: Uniformly elliptic operators with measurable coefficients. J. Funct. Anal. 132(1), 141–169 (1995)
Dier, D., Zacher, R.: Non-autonomous maximal regularity in Hilbert spaces. J. Evol. Equ. 17(3), 883–907 (2017)
Escauriaza, L., Hofmann, S.: Kato square root problem with unbounded leading coefficients. Proc. Am. Math. Soc. 146(12), 5295–5310 (2018)
Egert, M.: On \(p\)-elliptic divergence form operators and holomorphic semigroups. J. Evol. Equ. 20(3), 705–724 (2020)
Fabes, E.B., Stroock, D.W.: A new proof of Moser’s parabolic Harnack inequality using the old ideas of Nash. Arch. Rational Mech. Anal. 96(4), 327–338 (1986)
Fernandez, D.L.: Lorentz spaces, with mixed norms. J. Funct. Anal. 25(2), 128–146 (1977)
Friedman, A.: Partial Differential Equations of Parabolic Type. Prentice-Hall Inc, Englewood Cliffs (1964)
Hofmann, S., Kim, S.: Gaussian estimates for fundamental solutions to certain parabolic systems. Publ. Mat. 48(2), 481–496 (2004)
Hofmann, S., Lewis, J.L.: The \(L^p\) Neumann problem for the heat equation in non-cylindrical domains. J. Funct. Anal. 220(1), 1–54 (2005)
Kaplan, S.: Abstract boundary value problems for linear parabolic equations. Ann. Scuola Norm. Sup. Pisa (3) 20, 395–419 (1966)
Kato, T.: Perturbation Theory for Linear Operators. Classics in Mathematics. Springer, Berlin (1995)
Kim, D., Ryu, S., Woo, K.: Parabolic equations with unbounded lower-order coefficients in Sobolev spaces with mixed norms. J. Evol. Equ. 22, 9 (2022)
Kim, S., Sakellaris, G.: Green’s function for second order elliptic equations with singular lower order coefficients. Commun. Partial Differ. Equ. 44(3), 228–270 (2019)
Ladyženskaja, O.A., Solonnikov, V.A., Ural’ceva, N.N.: Linear and quasilinear equations of parabolic type. Translated from the Russian by S. Smith. Translations of Mathematical Monographs, Vol. 23, American Mathematical Society, Providence, R.I. (1968)
Lions, J.-L.: Equations différentielles opérationnelles et problèmes aux limites. Die Grundlehren der mathematischen Wissenschaften, Band, vol. 111. Springer, Berlin (1961)
O’Neil, R.: Convolution operators and \(L(p, q)\) spaces. Duke Math. J. 30, 129–142 (1963)
Qian, Z., Xi, G.: Parabolic equations with singular divergence-free drift vector fields. J. Lond. Math. Soc. (2) 100(1), 17–40 (2019)
Stein, E., Weiss, G.: Introduction to Fourier analysis in Euclidean Spaces of Several Variables Princeton Mathematical Series, no. 32. Princeton University Press, Princeton (1971)
Stein, E.: Singular Integrals and Differentiability Properties of Functions. Princeton Mathematical Series, No. 30. Princeton University Press, Princeton (1970)
Schwartz, L.: Théorie des distributions à valeurs vectorielles I. Anna. Inst. Fourier Tome 7, 1–141 (1957)
Triebel, H.: Interpolation Theory, Function Spaces, Differential Operators. North-Holland Mathematical Library, vol. 18. North-Holland Publishing, Amsterdam (1978)
Vogt, H.: Equivalence of pointwise and global ellipticity estimates. Math. Nachr. 237(1), 125–128 (2002)
Funding
Open Access funding enabled and organized by Projekt DEAL.
Author information
Authors and Affiliations
Corresponding author
Additional information
Communicated by Andrea Mondino.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
The authors were supported by the ANR project RAGE ANR-18-CE40-0012. A CC-BY 4.0 https://creativecommons.org/licenses/by/4.0/ public copyright license has been applied by the authors to the present document and will be applied to all subsequent versions up to the Author Accepted Manuscript arising from this submission, in accordance with the grant’s open access conditions. The authors also acknowledge support and hospitality from the Hausdorff Research Institute for Mathematics funded by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) under Germany’s Excellence Strategy – EXC-2047/1 – 390685813, where part of this material was developed. The authors would like to thank Sylvie Monniaux for a very careful reading of earlier versions of their manuscript.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Auscher, P., Egert, M. A universal variational framework for parabolic equations and systems. Calc. Var. 62, 249 (2023). https://doi.org/10.1007/s00526-023-02577-5
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s00526-023-02577-5