Abstract
The existence of global nonnegative martingale solutions to cross-diffusion systems of Shigesada–Kawasaki–Teramoto type with multiplicative noise is proven. The model describes the stochastic segregation dynamics of an arbitrary number of population species in a bounded domain with no-flux boundary conditions. The diffusion matrix is generally neither symmetric nor positive semidefinite, which excludes standard methods for evolution equations. Instead, the existence proof is based on the entropy structure of the model, a novel regularization of the entropy variable, higher-order moment estimates, and fractional time regularity. The regularization technique is generic and is applied to the population system with self-diffusion in any space dimension and without self-diffusion in two space dimensions.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
Shigesada, Kawasaki, and Teramoto (SKT) suggested in their seminal paper [37] a deterministic cross-diffusion system for two competing species, which is able to describe the segregation of the populations. A random influence of the environment or the lack of knowledge of certain biological parameters motivate the introduction of noise terms, leading to the stochastic system for n species with the population density \(u_i\) of the ith species:
with initial and no-flux boundary conditions
and diffusion coefficients
where \({\mathcal {O}}\subset {{\mathbb {R}}}^d\) (\(d\ge 1\)) is a bounded domain, \(\nu \) is the exterior unit normal vector to \(\partial {\mathcal {O}}\), \((W_1,\ldots ,W_n)\) is an n-dimensional cylindrical Wiener process, and \(a_{ij}\ge 0\) for \(i=1,\ldots ,n\), \(j=0,\ldots ,n\) are parameters. The stochastic framework is detailed in Sect. 2.
The deterministic analog of (1)–(3) generalizes the two-species model of [37] to an arbitrary number of species. The deterministic model can be derived rigorously from nonlocal population systems [19, 35], stochastic interacting particle systems [8], and finite-state jump Markov models [2, 13]. The original system in [37] also contains a deterministic environmental potential and Lotka–Volterra terms, which are neglected here for simplicity.
We call \(a_{i0}\) the diffusion coefficients, \(a_{ii}\) the self-diffusion coefficients, and \(a_{ij}\) for \(i\ne j\) the cross-diffusion coefficients. We say that system (1)-(3) is with self-diffusion if \(a_{i0}\ge 0\), \(a_{ii} > 0\) for all \(i=1,\ldots ,n\), and without self-diffusion if \(a_{i0}>0\), \(a_{ii}=0\) for all \(i=1,\ldots ,n\).
The aim of this work is to prove the existence of global nonnegative martingale solutions to system (1)–(3) allowing for large cross-diffusion coefficients. The existence of a local pathwise mild solution to (1)–(3) with \(n=2\) was shown in [30, Theorem 4.3] under the assumption that the diffusion matrix is positive definite. Global martingale solutions to a SKT model with quadratic instead of linear coefficients \(A_{ij}(u)\) were found in [18]. Besides detailed balance, this result needs a moderate smallness condition on the cross-diffusion coefficients. We prove the existence of global martingale solutions to the SKT model for general coefficients satisfying detailed balance. This result seems to be new.
There are two major difficulties in the analysis of system (1). The first difficulty is the fact that the diffusion matrix associated to (1) is generally neither symmetric nor positive semidefinite. In particular, standard semigroup theory is not applicable. These issues have been overcome in [9, 10] in the deterministic case by revealing a formal gradient-flow or entropy structure. The task is to extend this idea to the stochastic setting.
In the deterministic case, usually an implicit Euler time discretization is used [24]. In the stochastic case, we need an explicit Euler scheme because of the stochastic Itô integral, but this excludes entropy estimates. An alternative is the Galerkin scheme, which reduces the infinite-dimensional stochastic system to a finite-dimensional one; see, e.g., the proof of [32, Theorem 4.2.4]. This is possible only if energy-type (\(L^2\)) estimates are available, i.e. if \(u_i\) can be used as a test function. In the present case, however, only entropy estimates are available with the test function \(\log u_i\), which is not an element of the Galerkin space.
In the following, we describe our strategy to overcome these difficulties. We say that system (1) has an entropy structure if there exists a function \(h:[0,\infty )^n\rightarrow [0,\infty )\), called an entropy density, such that the deterministic analog of (1) can be written in terms of the entropy variables (or chemical potentials) \(w_i=\partial h/\partial u_i\) as
where \(w=(w_1,\ldots ,w_n)\), \(u_i\) is interpreted as a function of w, and \(B(w)=A(u(w))h''(u(w))^{-1}\) with \(B=(B_{ij})\) is positive semidefinite. For the deterministic analog of (1), it was shown in [11] that the entropy density is given by
where the numbers \(\pi _i>0\) are assumed to satisfy \(\pi _ia_{ij}=\pi _ja_{ji}\) for all \(i,j=1,\ldots ,n\). This condition is the detailed-balance condition for the Markov chain associated to \((a_{ij})\), and \((\pi _1,\ldots ,\pi _n)\) is the corresponding reversible stationary measure [11]. Using \(w_i=\pi _i\log u_i\) in (4) as a test function and summing over \(i=1,\ldots ,n\), a formal computation shows that
A similar expression holds in the stochastic setting; see (29). It provides \(L^2\) estimates for \(\nabla \sqrt{u_i}\) if \(a_{i0}>0\) and for \(\nabla u_i\) if \(a_{ii}>0\). Moreover, having proved the existence of a solution w to an approximate version of (1) leads to the positivity of \(u_i(w)=\exp (w_i/\pi _i)\) (and nonnegativity after passing to the de-regularization limit).
To define the approximate scheme, our idea is to “regularize” the entropy variable w. Indeed, instead of the algebraic mapping \(w\mapsto u(w)\), we introduce the mapping \(Q_\varepsilon (w)=u(w) + \varepsilon L^*Lw\), where \(L:D(L)\rightarrow H\) with domain \(D(L)\subset H\) is a suitable operator and \(L^*\) its dual; see Sect. 3 for details. The operator L is chosen in such a way that all elements of D(L) are bounded functions, implying that u(w) is well defined. Introducing the regularization operator \(R_\varepsilon :D(L)'\rightarrow D(L)\) as the inverse of \(Q_\varepsilon :D(L)\rightarrow D(L)'\), the approximate scheme to (1) is defined, written in compact form, as
The existence of a local weak solution \(v^\varepsilon \) to (7) with suitable initial and boundary conditions is proved by applying the abstract result of [32, Theorem 4.2.4]; see Theorem 13. The entropy inequality for \(w^\varepsilon :=R_\varepsilon (v^\varepsilon )\) and \(u^\varepsilon :=u(w^\varepsilon )\),
up to some stopping time \(\tau _R>0\) allows us to extend the local solution to a global one (Proposition 16).
For the de-regularization limit \(\varepsilon \rightarrow 0\), we need suitable uniform bounds. The entropy inequality provides gradient bounds for \(u_i^\varepsilon \) in the case with self-diffusion and for \((u_i^\varepsilon )^{1/2}\) in the case without self-diffusion. Based on these estimates, we use the Gagliardo–Nirenberg inequality to prove uniform bounds for \(u_i^\varepsilon \) in \(L^q(0,T;L^q({\mathcal {O}}))\) with \(q\ge 2\). Such an estimate is crucial to define, for instance, the product \(u_i^\varepsilon u_j^\varepsilon \). Furthermore, we show a uniform estimate for \(u_i^\varepsilon \) in the Sobolev–Slobodeckij space \(W^{\alpha ,p}(0,T;D(L)')\) for some \(\alpha <1/2\) and \(p>2\) such that \(\alpha p>1\). These estimates are needed to prove the tightness of the laws of \((u^\varepsilon )\) in some sub-Polish space and to conclude strong convergence in \(L^2\) thanks to the Skorokhod–Jakubowski theorem.
For the uniform estimates, we need to distinguish the cases with and without self-diffusion. In the former case, we obtain an \(L^2(0,T;H^1({\mathcal {O}}))\) estimate for \(u_i^\varepsilon \), such that the product \(u_i^\varepsilon \nabla u_j^\varepsilon \) is integrable, and we can pass to the limit in the coefficients \(A_{ij}(u_i^\varepsilon )\). Without self-diffusion, we can only conclude that \((u_i^\varepsilon )\) is bounded in \(L^2(0,T;W^{1,1}({\mathcal {O}}))\), and products like \(u_i^\varepsilon \nabla u_j^\varepsilon \) may be not integrable. To overcome this issue, we use the fact that
and write (1) in a “very weak” formulation by applying the Laplace operator to the test function. Since the bound in \(L^2(0,T;W^{1,1}({\mathcal {O}}))\) implies a bound in \(L^2(0,T;L^2({\mathcal {O}}))\) bound in two space dimensions, products like \(u_i^\varepsilon u_j^\varepsilon \) are integrable. In the deterministic case, we can exploit the \(L^2\) bound for \(\nabla (u_i^\varepsilon u_j^\varepsilon )^{1/2}\) to find a bound for \(u_i^\varepsilon u_j^\varepsilon \) in \(L^1(0,T;L^1({\mathcal {O}}))\) in any space dimension, but the limit involves an identification that we could not extend to the martingale solution concept.
On an informal level, we may state our main result as follows. We refer to Sect. 2 for the precise formulation.
Theorem 1
(Informal statement) Let \(a_{ij}\ge 0\) satisfy the detailed-balance condition, let the stochastic diffusion \(\sigma _{ij}\) be Lipschitz continuous on the space of Hilbert–Schmidt operators, and let a certain interaction condition between the entropy and stochastic diffusion hold (see Assumption (A5) below). Then there exists a global nonnegative martingale solution to (1)–(3) in the case with self-diffusion in any space dimension and in the case without self-diffusion in at most two space dimensions.
We discuss examples for \(\sigma _{ij}(u)\) in Sect. 7. Here, we only remark that an admissible diffusion term is
where \(1/2\le \alpha \le 1\), \(\delta _{ij}\) is the Kronecker symbol, \(a_k\ge 0\) decays sufficiently fast, \((e_k)\) is a basis of the Hilbert space U with inner product \((\cdot ,\cdot )_{U}\).
We end this section by giving a brief overview of the state of the art for the deterministic SKT model. First existence results for the two-species model were proven under restrictive conditions on the parameters, for instance in one space dimension [26], for the triangular system with \(a_{21}=0\) [33], or for small cross-diffusion parameters, since in the latter situation the diffusion matrix becomes positive definite [17]. Amann [1] proved that a priori estimates in the \(W^{1,p}({\mathcal {O}})\) norm with \(p>d\) are sufficient to conclude the global existence of solutions to quasilinear parabolic systems, and he applied this result to the triangular SKT system. The first global existence proof without any restriction on the parameters \(a_{ij}\) (except nonnegativity) was achieved in [22] in one space dimension. This result was generalized to several space dimensions in [9, 10] and to the whole space problem in [21]. SKT-type systems with nonlinear coefficients \(A_{ij}(u)\), but still for two species, were analyzed in [15, 16]. Global existence results for SKT-type models with an arbitrary number of species and under a detailed-balance condition were first proved in [11] and later generalized in [31].
This paper is organized as follows. We present our notation and the main results in Sect. 2. The operators needed to define the approximative scheme are introduced in Sect. 3. In Sect. 4, the existence of solutions to a general approximative scheme is proved and the corresponding entropy inequality is derived. Theorems 4 and 5 are shown in Sects. 5 and 6, respectively. Section 7 is concerned with examples for \(\sigma _{ij}(u)\) satisfying our assumptions. Finally, the proofs of some auxiliary lemmas are presented in Appendix A, and Appendix B states a tightness criterion that (slightly) extends [5, Corollary 2.6] to the Banach space setting.
2 Notation and main result
2.1 Notation and stochastic framework
Let \({\mathcal {O}}\subset {{\mathbb {R}}}^d\) (\(d\ge 1\)) be a bounded domain. The Lebesgue and Sobolev spaces are denoted by \(L^p({\mathcal {O}})\) and \(W^{k,p}({\mathcal {O}})\), respectively, where \(p\in [1,\infty ]\), \(k\in {{\mathbb {N}}}\), and \(H^k({\mathcal {O}})=W^{k,2}({\mathcal {O}})\). For notational simplicity, we generally do not distinguish between \(W^{k,p}({\mathcal {O}})\) and \(W^{k,p}({\mathcal {O}};{{\mathbb {R}}}^n)\). We set \(H_N^m({\mathcal {O}}) = \{v\in H^m({\mathcal {O}}):\nabla v\cdot \nu =0\) on \(\partial {\mathcal {O}}\}\) for \(m\ge 2\). If \(u=(u_1,\ldots ,u_n)\in X\) is some vector-valued function in the normed space X, we write \(\Vert u\Vert _X^2=\sum _{i=1}^n\Vert u_i\Vert _X^2\). The inner product of a Hilbert space H is denoted by \((\cdot ,\cdot )_H\), and \(\langle \cdot ,\cdot \rangle _{V',V}\) is the dual product between the Banach space V and its dual \(V'\). If \(F:U\rightarrow V\) is a Fréchet differentiable function between Banach spaces U and V, we write \(\textrm{D}F[v]:U\rightarrow V\) for its Fréchet derivative, for any \(v\in U\).
Given two quadratic matrices \(A=(A_{ij})\), \(B=(B_{ij})\in {{\mathbb {R}}}^{n\times n}\), \(A:B=\sum _{i,j=1}^n A_{ij}B_{ij}\) is the Frobenius matrix product, \(\Vert A\Vert _F=(A:A)^{1/2}\) the Frobenius norm of A, and \({\text {tr}}A =\sum _{i=1}^n A_{ii}\) the trace of A. The constants \(C>0\) in this paper are generic and their values change from line to line.
Let \((\Omega ,{\mathcal {F}},{{\mathbb {P}}})\) be a probability space endowed with a complete right-continuous filtration \({{\mathbb {F}}}=({\mathcal {F}}_t)_{t\ge 0}\) and let H be a Hilbert space. Then \(L^0(\Omega ;H)\) consists of all measurable functions from \(\Omega \) to H, and \(L^2(\Omega ;H)\) consists of all H-valued random variables v such that \({{\mathbb {E}}}\Vert v\Vert _H^2=\int _\Omega \Vert v(\omega )\Vert _H^2{{\mathbb {P}}}(\textrm{d}\omega )<\infty \). Let U be a separable Hilbert space and \((e_k)_{k\in {{\mathbb {N}}}}\) be an orthonormal basis of U. The space of Hilbert–Schmidt operators from U to \(L^2({\mathcal {O}})\) is defined by
and it is endowed with the norm \(\Vert F\Vert _{{{\mathcal {L}}}_2(U;L^2({\mathcal {O}}))} = (\sum _{k=1}^\infty \Vert Fe_k\Vert _{L^2({\mathcal {O}})}^2)^{1/2}\).
Let \(W=(W_1,\ldots ,W_n)\) be an n-dimensional U-cylindrical Wiener process, taking values in the separable Hilbert space \(U_0\supset U\) and adapted to the filtration \({{\mathbb {F}}}\). We can write \(W_j=\sum _{k=1}^\infty e_k W_j^k\), where \((W_j^k)\) is a sequence of independent standard one-dimensional Brownian motions [12, Section 4.1.2]. Then \(W_j(\omega )\in C^0([0,\infty );U_0)\) for a.e. \(\omega \) [32, Section 2.5.1].
2.2 Assumptions
We impose the following assumptions:
-
(A1)
Domain: \({\mathcal {O}}\subset {{\mathbb {R}}}^d\) (\(d\ge 1\)) is a bounded domain with Lipschitz boundary. Let \(T>0\) and set \(Q_T={\mathcal {O}}\times (0,T)\).
-
(A2)
Initial datum: \(u^0=(u_1^0,\ldots ,u_n^0)\in L^\infty (\Omega ;L^2({\mathcal {O}};{{\mathbb {R}}}^n))\) is a \({\mathcal {F}}_0\)-measurable random variable satisfying \(u^0(x)\ge 0\) for a.e. \(x\in {\mathcal {O}}\) \({{\mathbb {P}}}\)-a.s.
-
(A3)
Diffusion matrix: \(a_{ij}\ge 0\) for \(i=1,\ldots ,n\), \(j=0,\ldots ,n\) and there exist \(\pi _1,\ldots ,\pi _n>0\) such that \(\pi _i a_{ij}=\pi _j a_{ji}\) for all \(i,j=1,\ldots ,n\) (detailed-balance condition).
-
(A4)
Multiplicative noise: \(\sigma =(\sigma _{ij})\) is an \(n\times n\) matrix, where \(\sigma _{ij}:L^2({\mathcal {O}};{{\mathbb {R}}}^n)\rightarrow {{\mathcal {L}}}_2(U;L^2({\mathcal {O}}))\) is \({\mathcal {B}}(L^2({\mathcal {O}};{{\mathbb {R}}}^n))/\) \({\mathcal {B}} ({{\mathcal {L}}}_2(U;L^2({\mathcal {O}})))\)-measurable and \({{\mathbb {F}}}\)-adapted. Furthermore, there exists \(C_\sigma >0\) such that for all u, \(v\in L^2({\mathcal {O}};{{\mathbb {R}}}^n)\),
$$\begin{aligned} \Vert \sigma (u)-\sigma (v)\Vert _{{{\mathcal {L}}}_2(U;L^2({\mathcal {O}}))}&\le C_\sigma \Vert u-v\Vert _{L^2({\mathcal {O}})}, \\ \Vert \sigma (v)\Vert _{{{\mathcal {L}}}_2(U;L^2({\mathcal {O}}))}&\le C_\sigma (1+\Vert v\Vert _{L^2({\mathcal {O}})}). \end{aligned}$$ -
(A5)
Interaction between entropy and noise: There exists \(C_h>0\) such that for all \(u\in L^\infty ({\mathcal {O}}\times (0,T))\),
$$\begin{aligned}&\bigg \{\int _0^t\sum _{k=1}^\infty \sum _{i,j=1}^n \bigg (\int _{\mathcal {O}}\frac{\partial h}{\partial u_i}(u(s)) \sigma _{ij}(u(s))e_k\textrm{d}x\bigg )^2\textrm{d}s\bigg \}^{1/2}\\&\quad \le C_h\bigg (1 + \int _0^t\int _{\mathcal {O}}h(u(s))\textrm{d}x\textrm{d}s\bigg ), \\&\int _0^t\sum _{k=1}^\infty \int _{\mathcal {O}}{\text {tr}}\big [ (\sigma (u)e_k)^T h''(u)\sigma (u)e_k\big ](s)\textrm{d}x\textrm{d}s \\&\quad \le C_h\bigg (1 + \int _0^t\int _{\mathcal {O}}h(u(s))\textrm{d}x\textrm{d}s\bigg ), \end{aligned}$$where h is the entropy density defined in (5).
Remark 2
(Discussion of the assumptions)
-
(A1)
The Lipschitz regularity of the boundary \(\partial {\mathcal {O}}\) is needed to apply the Sobolev and Gagliardo–Nirenberg inequalities.
-
(A2)
The regularity condition on \(u^0\) can be weakened to \(u^0\in L^p(\Omega ;L^2({\mathcal {O}};{{\mathbb {R}}}^n))\) for sufficiently large \(p\ge 2\) (only depending on the space dimension); it is used to derive the higher-order moment estimates.
-
(A3)
The detailed-balance condition is also needed in the deterministic case to reveal the entropy structure of the system; see [11].
-
(A4)
The Lipschitz continuity of the stochastic diffusion \(\sigma (u)\) is a standard condition for stochastic PDEs; see, e.g., [36].
-
(A5)
This is the most restrictive assumption. It compensates for the singularity of \((\partial h/\partial u_i)(u)=\pi _i\log u_i\) at \(u_i=0\). We show in Lemma 34 that
$$\begin{aligned} \sigma _{ij}(u)(\cdot ) = \frac{u_i\delta _{ij}}{1+u_i^{1/2+\eta }} \sum _{k=1}^\infty a_k(e_k,\cdot )_U \end{aligned}$$satisfies Assumption (A5), where \(\eta >0\) and \((a_k)\in \ell ^2({{\mathbb {R}}})\). Taking into account the gradient estimate from the entropy inequality (see 6), we can allow for more general stochastic diffusion terms like (9); see Lemma 35.
Remark 3
(Reaction terms) It is possible to include additional nonlinear, continuous reaction terms \(f:{{\mathbb {R}}}^{n}\rightarrow {{\mathbb {R}}}^{n}\) satisfying
A prominent choice are the so-called Lotka-Volterra source terms
where \(b_{ij}\ge 0\) for \(i=1,\dots ,n\), \(j=0,1,\dots ,n\). Considering the entropy density h given by (5), it is easy to see that this reaction term even improves the integrability of the solution, due to bounds for terms of the form \(u_{i}^{2}\log (u_{i})\), \(i=1,\dots ,n\).
2.3 Main results
Let \(T>0\), \(m\in {{\mathbb {N}}}\) with \(m>d/2+1\), and \(D(L)=H_N^m({\mathcal {O}})\).
Definition 1
(Martingale solution) A martingale solution to (1)–(3) is the triple \(({\widetilde{U}},{\widetilde{W}},{\widetilde{u}})\) such that \({\widetilde{U}} =({{\widetilde{\Omega }}},\widetilde{{\mathcal {F}}},{{\widetilde{{{\mathbb {P}}}}}},{{\widetilde{{{\mathbb {F}}}}}})\) is a stochastic basis with filtration \({{\widetilde{{{\mathbb {F}}}}}}=(\widetilde{{\mathcal {F}}}_t)_{t\ge 0}\), \({\widetilde{W}}\) is an n-dimensional cylindrical Wiener process, and \({\widetilde{u}}=({\widetilde{u}}_1,\ldots ,{\widetilde{u}}_n)\) is a continuous \(D(L)'\)-valued \({\widetilde{{{\mathbb {F}}}}}\)-adapted process such that \({\widetilde{u}}_i\ge 0\) a.e. in \({\mathcal {O}}\times (0,T)\) \({{\widetilde{{{\mathbb {P}}}}}}\)-a.s.,
the law of \({\widetilde{u}}_i(0)\) is the same as for \(u_i^0\), and for all \(\phi \in D(L)\), \(t\in (0,T)\), \(i=1,\ldots ,n\), \({{\widetilde{{{\mathbb {P}}}}}}\)-a.s.,
Our main results read as follows.
Theorem 4
(Existence for the SKT model with self-diffusion) Let Assumptions (A1)– (A5) be satisfied and let \(a_{ii}>0\) for \(i=1,\ldots ,n\). Then (1)–(3) has a global nonnegative martingale solution in the sense of Definition 1.
Theorem 5
(Existence for the SKT model without self-diffusion) Let Assumptions (A1)–(A5) be satisfied, let \(d\le 2\), and let \(a_{0i}>0\) for \(i=1,\ldots ,n\). We strengthen Assumption (A4) slightly by assuming that for all \(v\in L^2({\mathcal {O}};{{\mathbb {R}}}^n)\),
where \(\gamma <1\) if \(d=2\) and \(\gamma =1\) if \(d=1\). Then (1)–(3) has a global nonnegative martingale solution in the sense of Definition 1 with the exception that (10) and (11) are replaced by
and, for all \(\phi \in D(L)\cap W^{2,\infty }({\mathcal {O}})\),
The weak formulation for the SKT system without self-diffusion is weaker than that one with self-diffusion, since we have only the gradient regularity \(\nabla \widetilde{u}_i\in L^1({\mathcal {O}})\), and \(A_{ij}(\widetilde{u})\) may be nonintegrable. However, system (1) can be written in Laplacian form according to (8), which allows for the “very weak” formulation stated in Theorem 5. The condition on \(\gamma \) if \(d=2\) is needed to prove the fractional time regularity for the approximative solutions.
Remark 6
(Nonnegativity of the solution) The a.s. nonnegativity of the population densities is a consequence of the entropy structure, since the approximate densities \(u_i^\varepsilon \) satisfy \(u_i^\varepsilon = u_i(R_\varepsilon (v^\varepsilon )) = \exp (R_\varepsilon (v^\varepsilon )/\pi _i)>0\) a.e. in \(Q_T\). This may be surprising since we do not assume that the noise vanishes at zero, i.e. \(\sigma _{ij}(u)=0\) if \(u_i=0\). This condition is replaced by the weaker integrability condition for \(\sigma _{ij}(u)\log u_i\) in Assumption (A5). A similar, but pointwise condition was imposed in the deterministic case; see Hypothesis (H3) in [25, Section 4.4]. The examples in Sect. 7 satisfy \(\sigma _{ij}(u)=0\) if \(u_i=0\). \(\square \)
3 Operator setup
In this section, we introduce the operators needed to define the approximate scheme.
3.1 Definition of the connection operator L
We define an operator L that “connects” two Hilbert spaces V and H satisfying \(V\subset H\). This abstract operator allows us to define a regularization operator that “lifts” the dual space \(V'\) to V.
Proposition 7
(Operator L) Let V and H be separable Hilbert spaces such that the embedding \(V\hookrightarrow H\) is continuous and dense. Then there exists a bounded, self-adjoint, positive operator \(L:D(L)\rightarrow H\) with domain \(D(L)=V\). Moreover, it holds for L and its dual operator \(L^*:H\rightarrow V'\) (we identify H and its dual \(H'\)) that, for some \(0<c<1\),
We abuse slightly the notation by denoting both dual and adjoint operators by \(A^*\). The proof is similar to [27, Theorem 1.12]. For the convenience of the reader, we present the full proof.
Proof
We first construct some auxiliary operator by means of the Riesz representation theorem. Let \(w\in H\). The mapping \(V\rightarrow {{\mathbb {R}}}\), \(v\mapsto (v,w)_H\), is linear and bounded. Hence, there exists a unique element \(\widetilde{w}\in V\) such that \((v,\widetilde{w})_{V}=(v,w)_H\) for all \(v\in V\). This defines the linear operator \(G:H\rightarrow V\), \(G(w):=\widetilde{w}\), such that
The operator G is bounded and symmetric, since \(\Vert G(w)\Vert _V=\Vert \widetilde{w}\Vert _V = \Vert w\Vert _H\) and
This means that G is self-adjoint as an operator on H. Choosing \(v=w\in H\) in (13) gives \((G(v),v)_H=\Vert G(v)\Vert _V^2\ge 0\), i.e., G is positive. We claim that G is also one-to-one. Indeed, let \(G(w)=0\) for some \(w\in H\). Then \(0=(v,G(w))_V=(v,w)_H\) for all \(v\in V\) and, by the density of the embedding \(V\hookrightarrow H\), for all \(v\in H\). This implies that \(w=0\) and shows the claim.
The properties on G allow us to define \(\Lambda :=G^{-1}:D(\Lambda )\rightarrow H\), where \(D(\Lambda )={\text {ran}}(G)\subset V\) and \(D(\Lambda )\) denotes the domain of \(\Lambda \). By definition, this operator satisfies
Hence, for all \(v,w\in D(\Lambda )\), we have \((v,\Lambda (w))_H=(v,w)_V=(\Lambda (v),w)_H\), i.e., \(\Lambda \) is symmetric. Since \(G=G^*\), we have \(D(\Lambda ^*)={\text {ran}}(G^*) ={\text {ran}}(G)=D(\Lambda )\) and consequently, \(\Lambda \) is self-adjoint. Moreover, \(\Lambda \) is densely defined (since \(V\hookrightarrow H\) is dense). As a densely defined, self-adjoint operator, it is also closed. Finally, \(\Lambda \) is one-to-one and positive:
for all \(v\in D(\Lambda )\) and some \(C>0\) and consequently, \(\Vert \Lambda (v)\Vert _H\ge C^{-1}\Vert v\Vert _V\).
Therefore, we can define the square root of \(\Lambda \), \(\Lambda ^{1/2}:D(\Lambda ^{1/2})\rightarrow H\), which is densely defined and closed. Its domain can be obtained by closing \(D(\Lambda )\) with respect to
for \(v\in D(\Lambda ^{1/2})\). In particular, the graph norm \(\Vert \cdot \Vert _H + \Vert \Lambda ^{1/2}(\cdot )\Vert _H\) is equivalent to the norm in V. We claim that \(D(\Lambda ^{1/2})=V\). To prove this, let \(w\in V\) be orthogonal to \(D(\Lambda ^{1/2})\). Then \((w,v)_V=0\) for all \(v\in D(\Lambda ^{1/2})\) and, since \(D(\Lambda )\subset D(\Lambda ^{1/2})\), in particular for all \(v\in D(\Lambda )\). It follows that \(0=(w,v)_V=(w,\Lambda (v))_H\) for \(v\in D(\Lambda )\). Since \(\Lambda \) is the inverse of \(G:H\rightarrow V\), we have \({\text {ran}}(\Lambda )=H\), and it holds that \((w,\xi )_H=0\) for all \(\xi \in H\), implying that \(w=0\). This shows the claim.
Finally, we define \(L:=\Lambda ^{1/2}:D(L)=V\rightarrow H\), which is a positive and self-adjoint operator. Estimate (14) shows that \(\Vert L(v)\Vert _H = \Vert v\Vert _V\) for \(v\in V\). We deduce from the equivalence between the norm in V and the graph norm of L that, for some \(C>0\) and all \(v\in V\),
which proves the lower bound in (12). The dual operator \(L^*:H\rightarrow V'\) is bounded too, since it holds for all \(w\in H\) that
This ends the proof. \(\square \)
We apply Proposition 7 to \(V=H^m_N({\mathcal {O}})\) and \(H=L^2({\mathcal {O}})\), recalling that \(H^m_N({\mathcal {O}})=\{v\in H^m({\mathcal {O}}):\nabla v\cdot \nu =0\) on \(\partial {\mathcal {O}}\}\) and \(m>d/2+1\). Then, by Sobolev’s embedding, \(D(L)\hookrightarrow W^{1,\infty }({\mathcal {O}})\). Observe the following two properties that are used later:
The following lemma is used in the proof of Proposition 16 to apply Itô’s lemma.
Lemma 8
(Operator \(L^{-1}\)) Let \(L^{-1}:{\text {ran}}(L)\rightarrow D(L)\) be the inverse of L and let \(D(L^{-1}):=\overline{D(\Lambda )}\) be the closure of \(D(\Lambda )\) with respect to \(\Vert L^{-1}(\cdot )\Vert _H\). Then \(D(L)'\) is isometric to \(D(L^{-1})\). In particular, it holds that \((L^{-1}(v),L^{-1}(w))_{H} = (v,w)_{D(L)'}\) for all v, \(w\in D(L)'\).
Proof
The proof is essentially contained in [27, p. 136ff] and we only sketch it. Let \(F\in D(L^{-1})'\). Then \(|F(v)|\le C\Vert L^{-1}(v)\Vert _H\) for all \(v\in D(\Lambda )\) and, as a consequence, \(|F(Lu)|\le C\Vert u\Vert _H\) for \(u=L^{-1}(v)\in D(L)\). The density of \(L^{-1}(D(\Lambda ))\) in H guarantees the unique representation \(F(Lu)=(u,w)_H\) for some \(w\in H\), and we can represent F in the form \(F(v)=(L^{-1}v,w)_H=(v,L^{-1}w)_H\), where \(L^{-1}w\in D(L)\). This shows that every element of \(D(L^{-1})'\) can be identified with an element of D(L).
Conversely, if \(w\in D(L)\), we consider functionals of the type \(v\mapsto (v,w)_H\) for \(v\in D(\Lambda )\), which are bounded in \(\Vert L^{-1}(\cdot )\Vert _H\). These functionals can be extended by continuity to functionals F belonging to \(D(L^{-1})'\). The proof in [27, p. 137] shows that \(\Vert F\Vert _{D(L^{-1})'}=\Vert w\Vert _{D(L)}\). We conclude that \(D(L^{-1})'\) is isometric to D(L). Since Hilbert spaces are reflexive, \(D(L^{-1})\) is isometric to \(D(L)'\). \(\square \)
Lemma 9
(Operator u) The mapping \(u:=(h')^{-1}\) from D(L) to \(L^\infty ({\mathcal {O}})\) is Fréchet differentiable and, as a mapping from D(L) to \(D(L)'\), monotone.
Proof
Let \(w\in D(L)\hookrightarrow L^\infty ({\mathcal {O}})\) (here we use \(m>d/2\)). Then \(u(w)=(x\mapsto u(w(x))) \in L^\infty ({\mathcal {O}})\), showing that \(u:D(L)\rightarrow L^\infty ({\mathcal {O}})=(L^1({\mathcal {O}}))'\hookrightarrow D(L)'\) is well defined. It follows from the mean-value theorem that for all w, \(\xi \in D(L)\),
Since \(u''\) maps bounded sets to bounded sets, the integral is bounded. Thus, \(u:D(L)\rightarrow L^\infty ({\mathcal {O}})\) is Fréchet differentiable. For the monotonicity, we use the convexity of h and hence the monotonicity of \(h'\):
for all v, \(w\in D(L)\). This proves the lemma. \(\square \)
3.2 Definition of the regularization operator \(R_\varepsilon \)
First, we define another operator, denoted by \(Q_{\varepsilon }\), that maps D(L) to \(D(L)'\). Its inverse is the desired regularization operator.
Lemma 10
(Operator \(Q_\varepsilon \)) Let \(\varepsilon >0\) and define \(Q_\varepsilon :D(L)\rightarrow D(L)'\) by \(Q_\varepsilon (w)=u(w)+\varepsilon L^*Lw\), where \(w\in D(L)\). Then \(Q_\varepsilon \) is Fréchet differentiable, strongly monotone, coercive, and invertible. Its Fréchet derivative \(\textrm{D}Q_\varepsilon [w](\xi )=u'(w)\xi + \varepsilon L^*L\xi \) for w, \(\xi \in D(L)\) is continuous, strongly monotone, coercive, and invertible.
Proof
The mapping \(Q_\varepsilon \) is well defined since \(w\in D(L)\hookrightarrow L^\infty ({\mathcal {O}})\) implies that \(u(w)\in L^\infty ({\mathcal {O}})\) and hence, \(\Vert u(w)\Vert _{D(L)'}\le C\Vert u(w)\Vert _{L^1({\mathcal {O}})}\) is finite. We show that \(Q_\varepsilon \) is strongly monotone. For this, let v, \(w\in D(L)\) and compute
where we used the monotonicity of \(w\mapsto u(w)\) and the lower bound in (12). The coercivity of \(Q_\varepsilon \) is a consequence of the strong monotonicity:
for \(v\in D(L)\). Based on these properties, the invertibility of \(Q_\varepsilon \) now follows from Browder’s theorem [20, Theorem 6.1.21].
Next, we show the properties for \(\textrm{D}Q_\varepsilon \). The operator \(\textrm{D}Q_\varepsilon [w]:D(L)\rightarrow D(L)'\) is well defined for all \(w\in D(L)\), since
for all \(\xi \in D(L)\hookrightarrow L^\infty ({\mathcal {O}})\). The strong monotonicity of \(\textrm{D}Q_\varepsilon [w]\) for \(w\in D(L)\) follows from the positive semidefiniteness of \(u'(w)=(h'')^{-1}(u(w))\) and the lower bound in (12):
for \(\xi \), \(\eta \in D(L)\). The choice \(\eta =0\) yields immediately the coercivity of \(\textrm{D}Q_\varepsilon [w]\). The invertibility of \(\textrm{D}Q_\varepsilon [w]\) follows again from Browder’s theorem. \(\square \)
Lemma 10 shows that the inverse of \(Q_\varepsilon \) exists. We set \(R_\varepsilon :=Q_\varepsilon ^{-1}:D(L)'\rightarrow D(L)\), which is the desired regularization operator. It has the following properties.
Lemma 11
(Operator \(R_\varepsilon \)) The operator \(R_\varepsilon :D(L)'\rightarrow D(L)\) is Fréchet differentiable and strictly monotone. In particular, it is Lipschitz continuous with Lipschitz constant \(C/\varepsilon \), where \(C>0\) does not depend on \(\varepsilon \). The Fréchet derivative is also Lipschitz continuous with the same constant and satisfies
and it is Lipschitz continuous with constant \(C/\varepsilon \), satisfying \(\Vert \textrm{D}R_\varepsilon [v](\xi )\Vert _{D(L)}\le \varepsilon ^{-1}C\Vert \xi \Vert _{D(L)'}\) for v, \(\xi \in D(L)'\).
Proof
We show first the Lipschitz continuity of \(R_\varepsilon \). Let \(v_1\), \(v_2\in D(L)'\). Then there exist \(w_1\), \(w_2\in D(L)\) such that \(v_1=Q_\varepsilon (w_1)\), \(v_2=Q_\varepsilon (w_2)\). Hence, using (12) and (16),
proving that \(R_\varepsilon \) is Lipschitz continuous with Lipschitz constant \(C/\varepsilon \). The Fréchet differentiability is a consequence of the inverse function theorem and \(\textrm{D}R_\varepsilon [v] = (\textrm{D}Q_\varepsilon [R_\varepsilon (v)])^{-1}\) for \(v\in D(L)'\).
We verify the strict monotonicity of \(R_\varepsilon \). Let v, \(w\in D(L)'\) with \(v\ne w\). Because of the strong monotonicity of \(Q_\varepsilon \), we have
and the right-hand side vanishes only if \(v=w\), since \(R_\varepsilon \) is one-to-one.
Next, we show that \(\textrm{D}R_\varepsilon [v]\) is Lipschitz continuous. Let \(w_1\), \(w_2\in D(L)\). By Lemma 10, \(\textrm{D}Q_\varepsilon [w]\) is strongly monotone. Thus, for any \(w\in D(L)\),
Let \(v_1=\textrm{D}Q_\varepsilon [w](w_1)\) and \(v_2=\textrm{D}Q_\varepsilon [w](w_2)\). We infer that
showing the Lipschitz continuity of \((\textrm{D}Q_\varepsilon [w])^{-1}\) and \(\textrm{D}R_\varepsilon [v] = (\textrm{D}Q_\varepsilon [R_\varepsilon (v)])^{-1}\). Finally, choosing \(w=R_\varepsilon [v]\) and \(v_2=0\), \(\Vert \textrm{D}R_\varepsilon [v](v_1)\Vert _{D(L)}\le \varepsilon ^{-1}C\Vert v_1\Vert _{D(L)'}\). \(\square \)
4 Existence of approximate solutions
In the previous section, we have introduced the regularization operator \(R_\varepsilon :D(L)'\rightarrow D(L)\). The entropy variable w is replaced by the regularized variable \(R_\varepsilon (v)\) for \(v\in D(L)'\). Setting \(v = u(R_\varepsilon (v)) + \varepsilon L^*LR_\varepsilon (v)\), we consider the regularized problem
recalling that \(B(w)=A(u(w))h''(u(w))^{-1}\) for \(w\in {{\mathbb {R}}}^n\).
We clarify the notion of solution to problem (17)–(18). Let \(T>0\), let \(\tau \) be an \({{\mathbb {F}}}\)-adapted stopping time, and let v be a continuous, \(D(L)'\)-valued, \({{\mathbb {F}}}\)-adapted process. We call \({(v,\tau )}\) a local strong solution to (17) if
for a.e. \(\omega \in \Omega \) and for all \(t\in [0,T\wedge \tau )\),
It can be verified that \(R_\varepsilon \) is strongly measurable and, if v is progressively measurable, also progressively measurable. Furthermore, if w is progressively measurable then so does u(w), and if \(v\in C^0([0,T];D(L)')\), we have \(R_\varepsilon (v)\in C^0([0,T];D(L))\) and \(u(R_\varepsilon (v))\in L^\infty (Q_T)\). Finally, if \(v\in L^0(\Omega ;L^p(0,T;D(L)'))\) for \(1\le p\le \infty \), then \({\text {div}}(B(u(R_\varepsilon (v)))\nabla R_\varepsilon (v))\in L^0(\Omega ;L^p(0,T;D(L)')))\). Therefore, the integrals in (19) are well defined. The local strong solution is called a global strong solution if \({{\mathbb {P}}}(\tau =\infty )=1\). Given \(t>0\) and a process \(v\in L^2(\Omega ;C^0([0,t];D(L)'))\), we introduce the stopping time
The stopping time \(\tau _{R}\) is \({{\mathbb {P}}}\)-a.s. positive. Indeed, by Chebychev’s inequality, it holds for \(\delta >0\) that
Then, inserting (19) and using the properties of the operators introduced in Sect. 3, we can show that \({{\mathbb {P}}}(\tau _R>\delta )\ge 1-C(\delta )\), where \(C(\delta )\rightarrow 0\) as \(\delta \rightarrow 0\), which proves the claim.
We impose the following general assumptions.
-
(H1)
Entropy density: Let \({\mathcal {D}}\subset {{\mathbb {R}}}^n\) be a domain and let \(h\in C^2({\mathcal {D}};[0,\infty ))\) be such that \(h':{\mathcal {D}}\rightarrow {{\mathbb {R}}}^n\) and \(h''(u)\in {{\mathbb {R}}}^{n\times n}\) for \(u\in {\mathcal {D}}\) are invertible and there exists \(C>0\) such that \(|u|\le C(1+h(u))\) for all \(u\in {\mathcal {D}}\).
-
(H2)
Initial datum: \(u^0=(u_1^0,\ldots ,u_n^0)\in L^\infty (\Omega ;L^2({\mathcal {O}};{{\mathbb {R}}}^n))\) is \({\mathcal {F}}_0\)-measurable satisfying \(u^0(x)\in {\mathcal {D}}\) for a.e. \(x\in {\mathcal {O}}\) \({{\mathbb {P}}}\)-a.s.
-
(H3)
Diffusion matrix: \(A=(A_{ij})\in C^1({\overline{{\mathcal {O}}}};{{\mathbb {R}}}^{n\times n})\) grows at most linearly and the matrix \(h''(u)A(u)\) is positive semidefinite for all \(u\in {\mathcal {D}}\).
Remark 12
(Discussion of the assumptions) Hypothesis (H1) and the positive semidefiniteness condition of \(h''(u)A(u)\) in (H3) are necessary for the entropy structure of the general cross-diffusion system. The entropy density (5) with \({\mathcal {D}}=(0,\infty )^n\) satisfies Hypothesis (H1), and the diffusion matrix (3) fulfills (H3). The differentiability of A is needed to apply [32, Prop. 4.1.4] (stating that the assumptions of the abstract existence Theorem 4.2.2 are satisfied) and can be weakened to continuity, weak monotonicity, and coercivity conditions. The growth condition for A is technical; it guarantees that the integral formulation associated to (1) is well defined. Hypothesis (H2) guarantees that \(h(u^0)\) is well defined. \(\square \)
We consider general approximate stochastic cross-diffusion systems, since the existence result for (17) may be useful also for other stochastic cross-diffusion systems.
Theorem 13
(Existence of approximate solutions) Let Assumptions (A1)–(A2), (A4)–(A5), (H1)–(H3) be satisfied and let \(\varepsilon >0\), \(R>0\). Then problem (17)–(18) has a unique local solution \({(v^\varepsilon ,\tau _R)}\).
Proof
We want to apply Theorem 4.2.4 and Proposition 4.1.4 of [32]. To this end, we need to verify that the operator \(M:D(L)'\rightarrow D(L)'\), \(M(v):= {\text {div}}(B(R_\varepsilon (v))\nabla R_\varepsilon (v))\), is Fréchet differentiable and has at most linear growth, \(\textrm{D}M[v]-cI\) is negative semidefinite for all \(v\in D(L)'\) and some \(c>0\), and \(\sigma \) is Lipschitz continuous.
By the regularity of the matrix A and the entropy density h, the operator \(D(L)\rightarrow D(L)'\), \(w\mapsto {\text {div}}(B(w)\nabla w)\), is Fréchet differentiable. Then the Fréchet differentiability of \(R_\varepsilon \) (see Lemma 11) and the chain rule imply that the operator M is also Fréchet differentiable with derivative
where v, \(\xi \in D(L)'\). We claim that this derivative is locally bounded, i.e. if \(\Vert v\Vert _{D(L)'}\le K\) then \(\Vert \textrm{D}M[v](\xi )\Vert _{D(L)'}\le C(K)\Vert \xi \Vert _{D(L)'}\). For this, we deduce from the Lipschitz continuity of \(R_\varepsilon \) (Lemma 11) and the property \(u(R_\varepsilon (v))\in L^\infty ({\mathcal {O}})\) for \(v\in D(L)'\) that
where \(\textrm{D}B[R_\varepsilon (v)]\) is interpreted as a matrix. Recalling from Lemma 11 that
we obtain for \(\Vert v\Vert _{D(L)'}\le K\) and \(\xi \in D(L)'\):
This proves the claim. Thus, if \(\Vert v\Vert _{D(L)'}\le K\), there exists \(c>0\) such that
Moreover, by Lemma 11 again,
It follows from Assumption (A4) and Lemma 9 that for v, \({\bar{v}}\in D(L)'\) with \(\Vert v\Vert _{D(L)'}\le K\) and \(\Vert {\bar{v}}\Vert _{D(L)'}\le K\),
where C(K) also depends on the \(L^\infty ({\mathcal {O}})\) norms of \(u'(R_\varepsilon (v))\) and \(u'(R_\varepsilon ({\bar{v}}))\).
These estimates show that the assumptions of [32, Theorem 4.2.4] are satisfied in the ball \(\{v\in D(L)':\Vert v\Vert _{D(L)'}\le K\}\). An inspection of the proof of that theorem, which is based on the Galerkin method and Itô’s lemma, shows that local bounds are sufficient to conclude the existence of a local solution v up to the stopping time \(\tau _R\). The boundary conditions follow from \(R_\varepsilon (v)\in D(L)=H^m_N({\mathcal {O}})\) and the definition of the space \(H^m_N({\mathcal {O}})\). \(\square \)
For the entropy estimate we need two technical lemmas whose proofs are deferred to Appendix A.
Lemma 14
Let \(w\in D(L)\), \(a=(a_{ij})\in L^1({\mathcal {O}};{{\mathbb {R}}}^{n\times n})\), and \(b=(b_{ij})\in D(L)^{n\times n}\) satisfying \(\textrm{D}R_\varepsilon [w](a) = b\). Then
Lemma 15
Let \(v^0\in L^p(\Omega ;L^1({\mathcal {O}}))\) for some \(p\ge 1\) satisfies \({{\mathbb {E}}}\int _{\mathcal {O}}h(v^0)\textrm{d}x\le C\). Then
We turn to the entropy estimate.
Proposition 16
(Entropy inequality) Let \({(v^\varepsilon ,\tau _R)}\) be a local solution to (17)–(18) and set \(v^R(t)=v^\varepsilon (\omega ,t\wedge \tau _R(\omega ))\) for \(\omega \in \Omega \), \(t\in (0,\tau _R(\omega ))\). Then there exists a constant \(C(u^0,T)>0\), depending on \(u^0\) and T but not on \(\varepsilon \) and R, such that
where \(u^\varepsilon :=u(R_\varepsilon (v^R))\) and \(w^\varepsilon :=R_\varepsilon (v^R)\).
Proof
The result follows from Itô’s lemma using a regularized entropy. More precisely, we want to apply the Itô lemma in the version of [29, Theorem 3.1]. To this end, we verify the assumptions of that theorem. Basically, we need a twice differentiable function \({{\mathcal {H}}}\) on a Hilbert space H, whose derivatives satisfy some local growth conditions on H and V, where V is another Hilbert space such that the embedding \(V\hookrightarrow H\) is dense and continuous. We choose \(V=H=D(L)'\) and the regularized entropy
Recall that \(R_\varepsilon (v) = h'(u(R_\varepsilon (v)))\) for \(v\in D(L)'\), since \(u=u(w)\) is the inverse of \(h'\). Then, in view of the regularity assumptions for h and Lemma 11, \({{\mathcal {H}}}\) is Fréchet differentiable with derivative
where v, \(\xi \in D(L)'\). In other words, \(\textrm{D}{{\mathcal {H}}}[v]\) can be identified with \(R_\varepsilon (v)\in D(L)\). In a similar way, we can prove that \(\textrm{D}{{\mathcal {H}}}[v]\) is Fréchet differentiable with
We have, thanks to the Lipschitz continuity of \(R_\varepsilon \) and \(\textrm{D}R_\varepsilon [v]\) (see Lemma 11) for all v, \(\xi \in D(L)'\) with \(\Vert v\Vert _{D(L)'}\le K\) for some \(K>0\),
Finally, for any \(\eta \in D(L)'\), we need an estimate for the mapping \(D(L)'\rightarrow {{\mathbb {R}}}\), \(v\mapsto \textrm{D}{{\mathcal {H}}}[v](\eta )\). We have identified \(\textrm{D}{{\mathcal {H}}}[v]\) with \(R_\varepsilon (v)\in D(L)\), but we need an identification in \(D(L)'\). As in Lemma 8, the operator L can be constructed in such a way that the Riesz representative in \(D(L)'\) of a functional acting on \(D(L)'\) can be expressed via the application of \(L^*L\) to an element of D(L). Indeed, for \(F\in D(L)\) and \(\xi \in D(L)'\), we infer from Lemma 8 that
Hence, we can associate \(\textrm{D}{{\mathcal {H}}}[v]\) with \(L^*LR_\varepsilon (v) \in D(L)'\). Then, by the first estimate in (15) and the Lipschitz continuity of \(R_\varepsilon \),
giving the desired estimate for \(\textrm{D}{{\mathcal {H}}}[v]\) in \(D(L)'\). Thus, the assumptions of the Itô lemma, as stated in [29], are satisfied.
To simplify the notation, we set \(u^\varepsilon := u(R_\varepsilon (v^R))\) and \(w^\varepsilon := R_\varepsilon (v^R)\) in the following. By Itô’s lemma, using \(\textrm{D}{{\mathcal {H}}}[v^R]=h'(u^\varepsilon )\), \(\textrm{D}^2 {{\mathcal {H}}}[v^R]=\textrm{D}R_\varepsilon (v^R)\), we have
Lemma 15 shows that the first term on the right-hand side can be estimated from above by \(\int _{\mathcal {O}}h(u^0)\textrm{d}x\). Using \(w^\varepsilon = R_\varepsilon (v^R)=h'(u^\varepsilon )\) and integrating by parts, the second term on the right-hand side can be written as
The boundary integral vanishes because of the choice of the space \(D(L)=H^m_N({\mathcal {O}})\). The last inequality follows from Assumption (A3), which implies that \(B(w^\varepsilon ) = A(u(w^\varepsilon ))h''(u(w^\varepsilon ))^{-1}\) is positive semidefinite.. We reformulate the last term in (22) by applying Lemma 14 with \(a=\sigma (u^\varepsilon )e_k\) and \(b=\textrm{D}R_\varepsilon [v](\sigma (u^\varepsilon )e_k)\):
Taking the supremum in (22) over \((0,T_R)\), where \(T_R\le T\wedge \tau _R\), and the expectation yields
We apply the Burkholder–Davis–Gundy inequality [32, Theorem 6.1.2] to \(I_1\) and use Assumption (A5):
Also the remaining integral \(I_2\) can be bounded from above by Assumption (A5):
Therefore, (23) becomes
We apply Gronwall’s lemma to the function \(F(t)=\sup _{0<s<t}\int _{\mathcal {O}}h(u^\varepsilon (s))\textrm{d}x\) to find that
Using this bound in (24) then finishes the proof. \(\square \)
The entropy inequality allows us to extend the local solution to a global one.
Proposition 17
Let \({(v^\varepsilon ,\tau _R)}\) be a local solution to (19)–(20), constructed in Theorem 13. Then \(v^\varepsilon \) can be extended to a global solution to (19)–(20).
Proof
With the notation \(u^\varepsilon = u(R_\varepsilon (v^\varepsilon ))\) and \(w^\varepsilon =R_\varepsilon (v^\varepsilon )\), we observe that \(v^\varepsilon =Q_\varepsilon (R_\varepsilon (v^\varepsilon ))\) \(= u(R_\varepsilon (v^\varepsilon ))+\varepsilon L^*LR_\varepsilon (v^\varepsilon ) = u^\varepsilon + \varepsilon L^*Lw^\varepsilon \). Thus, we have for \(T_R\le T\wedge \tau _R\),
We know from Hypothesis (H1) that \(|u^\varepsilon |\le C(1+h(u^\varepsilon ))\). Therefore, taking into account the entropy inequality and the second inequality in (15),
This allows us to perform the limit \(R\rightarrow \infty \) and to conclude that we have indeed a solution \(v^\varepsilon \) in (0, T) for any \(T>0\). \(\square \)
5 Proof of Theorem 4
We prove the global existence of martingale solutions to the SKT model with self-diffusion.
5.1 Uniform estimates
Let \(v^\varepsilon \) be a global solution to (19)–(20) and set \(u^\varepsilon =u(R_\varepsilon (v^\varepsilon ))\). We assume that A(u) is given by (3) and that \(a_{ii}>0\) for \(i=1,\ldots ,n\). We start with some uniform estimates, which are a consequence of the entropy inequality in Proposition 16.
Lemma 18
(Uniform estimates) There exists a constant \(C(u^0,T)>0\) such that for all \(\varepsilon >0\) and \(i,j=1,\ldots ,n\) with \(i\ne j\),
Moreover, we have the estimate
Proof
Let \(v^\varepsilon \) be a global solution to (19)–(20). We observe that \(R_\varepsilon (v^\varepsilon ) = h'(u(R_\varepsilon (v^\varepsilon ))) = h'(u^\varepsilon )\) implies that \(\nabla R_\varepsilon (v^\varepsilon ) = h''(u^\varepsilon )\nabla u^\varepsilon \). It is shown in [11, Lemma 4] that for all \(z\in {{\mathbb {R}}}^n\) and \(u\in (0,\infty )^n\),
Using \(B(R_\varepsilon (v^\varepsilon ))=A(u^\varepsilon )h''(u^\varepsilon )^{-1}\) and the previous inequality with \(z=\nabla u^\varepsilon \), we find that
Therefore, the entropy inequality in Proposition 16 becomes
This is the stochastic analog of the entropy inequality (6). By Hypothesis (H1), we have \(|u|\le C(1+h(u))\) and consequently,
which proves (25). Estimate (26) then follows from the Poincaré–Wirtinger inequality.
It remains to show estimate (27). We deduce from the second inequality in (15) that
This shows that
ending the proof. \(\square \)
We also need higher-order moment estimates.
Lemma 19
(Higher-order moments I) Let \(p\ge 2\). There exists a constant \(C(p,u^0,T)\), which is independent of \(\varepsilon \), such that
Moreover, we have
Proof
Proceeding as in the proof of Proposition 16 and taking into account identity (22) and inequality (28), we obtain
recalling Definition 21 of \({{\mathcal {H}}}(v^\varepsilon )\). We raise this inequality to the pth power, take the expectation, apply the Burkholder–Davis–Gundy inequality (for the second term on the right-hand side), and use Assumption (A5) to find that
We neglect the expression \(\varepsilon \Vert LR_\varepsilon (v^\varepsilon (t))\Vert _{L^2({\mathcal {O}})}^2\) and apply Gronwall’s lemma. Then, taking into account the fact that the entropy dominates the \(L^1({\mathcal {O}})\) norm, thanks to Hypothesis (H1), and applying the Poincaré–Wirtinger inequality, we obtain estimates (30)–(32). Going back to (34), we infer that
Combining the previous estimates and arguing as in the proof of Lemma 18, we have
This ends the proof. \(\square \)
Using the Gagliardo–Nirenberg inequality, we can derive further estimates. We recall that \(Q_T={\mathcal {O}}\times (0,T)\).
Lemma 20
(Higher-order moments II) Let \(p\ge 2\). There exists a constant \(C(p,u^0,T)\) \(>0\), which is independent of \(\varepsilon \), such that
Proof
We apply the Gagliardo–Nirenberg inequality:
where \(r>1\) and \(\theta \in (0,1]\) are related by \(1/r=1-\theta (d+2)/(2d)\) and \(s=2/\theta \ge 2\). The right-hand side is bounded in view of estimates (30) and (31). Estimate (35) follows after choosing \(r=s\), implying that \(r=2+2/d\), and (36) follows from the choice \(s=2+4/d\), implying that \(r=2\). \(\square \)
Next, we show some bounds for the fractional time derivative of \(u^\varepsilon \). This result is used to establish the tightness of the laws of \((u^\varepsilon )\) in a sub-Polish space. Alternatively, the tightness property can be proved by verifying the Aldous condition; see, e.g., [18]. We recall the definition of the Sobolev–Slobodeckij spaces. Let X be a vector space and let \(p\ge 1\), \(\alpha \in (0,1)\). Then \(W^{\alpha ,p}(0,T;X)\) is the set of all functions \(v\in L^p(0,T;X)\) for which
With this norm, \(W^{\alpha ,p}(0,T;X)\) becomes a Banach space. We need the following technical lemma, which is proved in Appendix A.
Lemma 21
Let \(g\in L^1(0,T)\) and \(\delta <2\), \(\delta \ne 1\). Then
We obtain the following uniform bounds for \(u^\varepsilon \) and \(v^\varepsilon \) in Sobolev–Slobodeckij spaces.
Lemma 22
(Fractional time regularity) Let \(\alpha <1/2\). There exists a constant \(C(u^0,T)>0\) such that, for \(p:=(2d+4)/d >2\),
Since \(p>2\), we can choose \(\alpha <1/2\) such that \(\alpha p>1\). Then the continuous embedding \(W^{\alpha ,p}(0,T)\hookrightarrow C^{0,\beta }([0,T])\) for \(\beta =\alpha -1/p>0\) implies that
Proof
First, we derive the \(W^{\alpha ,p}\) estimate for \(v^\varepsilon \) and then we conclude the estimate for \(u^\varepsilon \) from the definition \(v^\varepsilon = u^\varepsilon + \varepsilon L^*LR_\varepsilon (v^\varepsilon )\) and Lemma 20. Equation (17) reads in terms of \(u^\varepsilon \) as
We know from (33) that \({{\mathbb {E}}}\Vert v^\varepsilon \Vert _{L^\infty (0,T;D(L)')}^p\) is bounded. Thus, to prove the bound for the second term in (38), it remains to estimate the following seminorm:
We need some preparations before we can estimate \(J_1\). We observe that
It follows from the embedding \(L^1({\mathcal {O}})\hookrightarrow D(L)'\) that
We use Hölder’s inequality and fix \(p=(2d+4)/d\) to obtain
In view of estimate (31) and (37), the right-hand side is finite if \(1+\alpha p-p/2<2\) or, equivalently, \(\alpha <(d+1)/(d+2)\), and this holds true since \(\alpha <1/2\). Applying Hölder’s inequality again, we have
Because of estimates (31), (36), and (37), the right-hand side of is finite if \(1+\alpha p-p/(d+2)<2\), which is equivalent to \(\alpha <1/2\).
To estimate \(J_2\), we use the embedding \(L^2({\mathcal {O}})\hookrightarrow D(L)'\), the Burkholder–Davis–Gundy inequality, the linear growth of \(\sigma \) from Assumption (A4), and the Hölder inequality:
By (36) and (37), the right-hand side is finite if \(1+\alpha p-(p-2)/2<2\), which is equivalent to \(\alpha <(3d+2)/(2d+4)\), and this is valid due to the condition \(\alpha <1/2\). We conclude that \((v^\varepsilon )\) is bounded in \(L^p(\Omega ;W^{\alpha ,p}(0,T;D(L)'))\) with \(p=(2d+4)/d\).
Next, we derive the uniform bounds for \(u^\varepsilon \). By definition of \(v^\varepsilon \) and the \(W^{\alpha ,p}\) seminorm,
It follows from (15) and the Lipschitz continuity of \(R_\varepsilon \) (Lemma 11) that
Then we find that
which finishes the proof. \(\square \)
5.2 Tightness of the laws of \((u^\varepsilon )\)
We show that the laws of \((u^\varepsilon )\) are tight in a certain sub-Polish space. For this, we introduce the following spaces:
-
\(C^0([0,T];D(L)')\) is the space of continuous functions \(u:[0,T]\rightarrow D(L)'\) with the topology \({\mathbb {T}}_1\) induced by the norm \(\Vert u\Vert _{C^0([0,T];D(L)')} =\sup _{0<t<T}\Vert u(t)\Vert _{D(L)'}\);
-
\(L_w^2(0,T;H^1({\mathcal {O}}))\) is the space \(L^2(0,T;H^1({\mathcal {O}}))\) with the weak topology \({\mathbb {T}}_2\).
We define the space
endowed with the topology \(\widetilde{{\mathbb {T}}}\) that is the maximum of the topologies \({\mathbb {T}}_1\) and \({\mathbb {T}}_2\). The space \(\widetilde{Z}_T\) is a sub-Polish space, since \(C^0([0,T];D(L)')\) is separable and metrizable and
where \((v_m)_m\) is a dense subset of \(L^2(0,T;H^1({\mathcal {O}}))\), is a countable family \((f_m)\) of point-separating functionals acting on \(L^2(0,T;H^1({\mathcal {O}}))\). In the following, we choose a number \(s^*\ge 1\) such that
Then the embedding \(H^1({\mathcal {O}})\hookrightarrow L^{s^*}({\mathcal {O}})\) is compact.
Lemma 23
The set of laws of \((u^\varepsilon )\) is tight in
with the topology \({\mathbb {T}}\) that is the maximum of \(\widetilde{{\mathbb {T}}}\) and the topology induced by the \(L^2(0,T;\) \(L^{s^*}({\mathcal {O}}))\) norm, where \(s^*\) is given by (40).
Proof
We apply Chebyshev’s inequality for the first moment and use estimate (39) with \(\beta =\alpha -1/p>0\), for any \(\eta >0\) and \(\delta >0\),
This means that for all \(\theta >0\) and all \(\eta >0\), there exists \(\delta >0\) such that
which is equivalent to the Aldous condition [5, Section 2.2]. Applying [38, Lemma 5, Theorem 3] with the spaces \(X=H^1({\mathcal {O}})\) and \(B=D(L)'\), we conclude that \((u^\varepsilon )\) is precompact in \(C^0([0,T];D(L)')\). Then, proceeding as in the proof of the basic criterion for tightness [34, Chapter II, Section 2.1], we see that the set of laws of \((u^\varepsilon )\) is tight in \(C^0([0,T];D(L)')\).
Next, by Chebyshev’s inequality again and estimate (26), for all \(K>0\),
This implies that for any \(\delta >0\), there exists \(K>0\) such that \({{\mathbb {P}}}(\Vert u^\varepsilon \Vert _{L^2(0,T;H^1({\mathcal {O}}))}\le K)\le 1-\delta \). Since closed balls with respect to the norm of \(L^2(0,T;H^1({\mathcal {O}}))\) are weakly compact, we infer that the set of laws of \((u^\varepsilon )\) is tight in \(L^2_w(0,T;H^1({\mathcal {O}}))\).
The tightness in \(L^2(0,T;L^{s^*}({\mathcal {O}}))\) follows from Lemma 37 in Appendix B with \(p=q=2\) and \(r=2+4/d\). \(\square \)
Lemma 24
The set of laws of \((\sqrt{\varepsilon }L^*LR_\varepsilon (v^\varepsilon ))\) is tight in
with the associated topology \({\mathbb {T}}_Y\).
Proof
We apply the Chebyshev inequality and use the inequality \(\Vert L^*LR_\varepsilon (v^\varepsilon )\Vert _{D(L)'}\le C\Vert LR_\varepsilon (v^\varepsilon )\Vert _{L^2({\mathcal {O}})}\) and estimate (27):
for any \(K>0\). Since closed balls in \(L^2(0,T;D(L)')\) are weakly compact, the set of laws of \((\sqrt{\varepsilon }L^*LR_\varepsilon (v^\varepsilon ))\) is tight in \(L_w^2(0,T;D(L)')\). The second claim follows from an analogous argument. \(\square \)
5.3 Convergence of \((u^\varepsilon )\)
Let \(\text{ P }(X)\) be the space of probability measures on X. We consider the space \(Z_T\times Y_T\times C^0([0,T];U_0)\), equipped with the probability measure \(\mu ^\varepsilon :=\mu _u^\varepsilon \times \mu _w^\varepsilon \times \mu _W^\varepsilon \), where
recalling the choice (40) of \(s^*\). The set of measures \((\mu ^\varepsilon )\) is tight, since the set of laws of \((u^\varepsilon )\) and \((\sqrt{\varepsilon }L^*LR_\varepsilon (v^\varepsilon ))\) are tight in \((Z_T,{\mathbb {T}})\) and \((Y_T,{\mathbb {T}}_Y)\), respectively. Moreover, \((\mu _W^\varepsilon )\) consists of one element only and is consequently weakly compact in \(C^0([0,T];U_0)\). By Prokhorov’s theorem, \((\mu _W^\varepsilon )\) is tight. Hence, \(Z_T\times Y_T\times C^0([0,T];U_0)\) satisfies the assumptions of the Skorokhod–Jakubowski theorem [6, Theorem C.1]. We infer that there exists a subsequence of \((u^\varepsilon ,\sqrt{\varepsilon }L^*LR_\varepsilon (v^\varepsilon ))\), which is not relabeled, a probability space \(({\widetilde{\Omega }},\widetilde{{\mathcal {F}}},{\widetilde{{{\mathbb {P}}}}})\) and, on this space, \((Z_T\times Y_T\times C^0([0,T];U_0))\)-valued random variables \((\widetilde{u},\widetilde{w},\widetilde{W})\) and \((\widetilde{u}^\varepsilon , \widetilde{w}^\varepsilon ,\widetilde{W}^\varepsilon )\) such that \((\widetilde{u}^\varepsilon , \widetilde{w}^\varepsilon ,\widetilde{W}^\varepsilon )\) has the same law as \((u^\varepsilon ,\sqrt{\varepsilon }L^*LR_\varepsilon (v^\varepsilon ),W)\) on \({{\mathcal {B}}}(Z_T\times Y_T\times C^0([0,T];U_0))\) and, as \(\varepsilon \rightarrow 0\),
By the definition of \(Z_T\) and \(Y_T\), this convergence means \({\widetilde{{{\mathbb {P}}}}}\)-a.s.,
We derive some regularity properties for the limit \(\widetilde{u}\). We note that \(\widetilde{u}\) is a \(Z_T\)-Borel random variable, since \({{\mathcal {B}}}(Z_T\times Y_T\times C^0([0,T];U_0))\) is a subset of \({{\mathcal {B}}}(Z_T)\times {{\mathcal {B}}}(Y_T)\times {{\mathcal {B}}}(C^0([0,T];U_0))\). We deduce from estimates (25) and (26) and the fact that \(u^\varepsilon \) and \(\widetilde{u}^\varepsilon \) have the same law that
We infer the existence of a further subsequence of \((\widetilde{u}^\varepsilon )\) (not relabeled) that is weakly converging in \(L^p({\widetilde{\Omega }};L^2(0,T;H^1({\mathcal {O}})))\) and weakly* converging in \(L^p({\widetilde{\Omega }};C^0([0,T];D(L)'))\) as \(\varepsilon \rightarrow 0\). Because \(\widetilde{u}^\varepsilon \rightarrow \widetilde{u}\) in \(Z_T\) \({\widetilde{{{\mathbb {P}}}}}\)-a.s., we conclude that the limit function satisfies
Let \({\widetilde{{{\mathbb {F}}}}}\) and \({\widetilde{{{\mathbb {F}}}}}^\varepsilon \) be the filtrations generated by \((\widetilde{u},\widetilde{w},\widetilde{W})\) and \((\widetilde{u}^\varepsilon ,\widetilde{w}^\varepsilon ,\widetilde{W})\), respectively. By following the arguments of the proof of [7, Proposition B4], we can verify that these new random variables induce actually stochastic processes. The progressive measurability of \(\widetilde{u}^\varepsilon \) is a consequence of [4, Appendix B]. Set \(\widetilde{W}_j^{\varepsilon ,k}(t):=(\widetilde{W}^\varepsilon (t),e_k)_U\). We claim that \(\widetilde{W}_j^{\varepsilon ,k}(t)\) for \(k\in {{\mathbb {N}}}\) are independent, standard \(\widetilde{{\mathcal {F}}}_t\)-Wiener processes. The adaptedness is a direct consequence of the definition; the independence of \(\widetilde{W}_j^{\varepsilon ,k}(t)\) and the independence of the increments \(\widetilde{W}^{\varepsilon ,k}(t)-\widetilde{W}^{\varepsilon ,k}(s)\) with respect to \(\widetilde{{\mathcal {F}}}_s\) are inherited from \((W(t),e_k)_U\). Passing to the limit \(\varepsilon \rightarrow 0\) in the characteristic function, by using dominated convergence, we find that \(\widetilde{W}(t)\) are \(\widetilde{{\mathcal {F}}}_t\)-martingales with the correct marginal distributions. We deduce from Lévy’s characterization theorem that \(\widetilde{W}(t)\) is indeed a cylindrical Wiener process.
By definition, \(u_i^\varepsilon =u_i(R_\varepsilon (v^\varepsilon ))=\exp (R_\varepsilon (v^\varepsilon ))\) is positive in \(Q_T\) a.s. We claim that also \(\widetilde{u}_i\) is nonnegative in \({\mathcal {O}}\) a.s.
Lemma 25
(Nonnegativity) It holds that \(\widetilde{u}_i\ge 0\) a.e. in \(Q_T\) \({\widetilde{{{\mathbb {P}}}}}\)-a.s. for all \(i=1,\ldots ,n\).
Proof
Let \(i\in \{1,\ldots ,n\}\). Since \(u_i^\varepsilon > 0\) in \(Q_T\) a.s., we have \({{\mathbb {E}}}\Vert (u_i^\varepsilon )^-\Vert _{L^2(0,T;L^2({\mathcal {O}}))}=0\), where \(z^-=\min \{0,z\}\). The function \(u_i^\varepsilon \) is \(Z_T\)-Borel measurable and so does its negative part. Therefore, using the equivalence of the laws of \(u_i^\varepsilon \) and \(\widetilde{u}_i^\varepsilon \) in \(Z_T\) and writing \(\mu _i^\varepsilon \) and \({\widetilde{\mu }}_i^\varepsilon \) for the laws of \(u_i^\varepsilon \) and \(\widetilde{u}_i^\varepsilon \), respectively, we obtain
This shows that \(\widetilde{u}_i^\varepsilon \ge 0\) a.e. in \(Q_T\) \({\widetilde{{{\mathbb {P}}}}}\)-a.s. The convergence (up to a subsequence) \(\widetilde{u}^\varepsilon \rightarrow \widetilde{u}\) a.e. in \(Q_T\) \({\widetilde{{{\mathbb {P}}}}}\)-a.s. then implies that \(\widetilde{u}_i\ge 0\) in \(Q_T\) \({\widetilde{{{\mathbb {P}}}}}\)-a.s. \(\square \)
The following lemma is needed to verify that \((\widetilde{u},\widetilde{W})\) is a martingale solution to (1)–(2).
Lemma 26
It holds for all \(t\in [0,T]\), \(i=1,\ldots ,n\), and all \(\phi _1\in L^2({\mathcal {O}})\) and all \(\phi _2\in D(L)\) that
Proof
The proof is a combination of the uniform bounds and Vitali’s convergence theorem. Convergences (41) and (42) have been shown in the proof of [18, Lemma 16], and (43) is a direct consequence of (38) and
Convergence (44) follows from \(\widetilde{w}_i^\varepsilon \rightharpoonup \widetilde{w}_i\) weakly* in \(L^\infty (0,T;D(L)')\). We establish (45):
By the Lipschitz continuity of A and the uniform bound for \(\nabla {\widetilde{u}}^\varepsilon \), we have \(I_1^\varepsilon \rightarrow 0\) as \(\varepsilon \rightarrow 0\) \({\widetilde{{{\mathbb {P}}}}}\)-a.s. At this point, we use the embedding \(D(L)\hookrightarrow W^{1,\infty }({\mathcal {O}})\). Also the second integral \(I_2^\varepsilon \) converges to zero, since \(A_{ij}({\widetilde{u}})\nabla \phi _2 \in L^2(0,T;L^2({\mathcal {O}}))\) and \(\nabla {\widetilde{u}}_j^\varepsilon \rightharpoonup \nabla {\widetilde{u}}_j\) weakly in \(L^2(0,T;L^2({\mathcal {O}}))\). This shows that \({{\widetilde{{{\mathbb {P}}}}}}\)-a.s.,
A straightforward estimation and bound (31) lead to
Hence, Vitali’s convergence theorem gives (45).
It remains to prove (46). By Assumption (A4), \({{\widetilde{{{\mathbb {P}}}}}}\)-a.s.,
This convergence and \({\widetilde{W}}^\varepsilon \rightarrow {\widetilde{W}}\) in \(C^0([0,T];U_0)\) imply that [14, Lemma 2.1]
By Assumption (A4) again,
We infer from Vitali’s convergence theorem that
The estimate
for all \(\phi _1\in L^2({\mathcal {O}})\) and the dominated convergence theorem yield (46). \(\square \)
To show that the limit is indeed a solution, we define, for \(t\in [0,T]\), \(i=1,\ldots ,n\), and \(\phi \in D(L)\),
The following corollary is a consequence of the previous lemma.
Corollary 27
It holds for any \(\phi _1\in L^2({\mathcal {O}})\) and \(\phi _2\in D(L)\) that
Since \(v^\varepsilon \) is a strong solution to (17), it satisfies for a.e. \(t\in [0,T]\) \({{\mathbb {P}}}\)-a.s., \(i=1,\ldots ,n\), and \(\phi \in D(L)\),
and in particular,
We deduce from the equivalence of the laws of \((u^\varepsilon ,\varepsilon L^*LR_\varepsilon (v^\varepsilon ),W)\) and \(({\widetilde{u}}^\varepsilon ,\sqrt{\varepsilon }{\widetilde{w}}^\varepsilon ,{\widetilde{W}})\) that
By Corollary 27, we can pass to the limit \(\varepsilon \rightarrow 0\) to obtain
This identity holds for all \(i=1,\ldots ,n\) and all \(\phi \in D(L)\). This shows that
We infer from the definition of \(\Lambda _i\) that
for a.e. \(t\in [0,T]\) and all \(\phi \in D(L)\). Set \({\widetilde{U}}=({{\widetilde{\Omega }}},\widetilde{{\mathcal {F}}},{{\widetilde{{{\mathbb {P}}}}}}, {{\widetilde{{{\mathbb {F}}}}}})\). Then \(({\widetilde{U}},{\widetilde{W}},{\widetilde{u}})\) is a martingale solution to (1)–(3).
6 Proof of Theorem 5
We turn to the existence proof of the SKT model without self-diffusion.
6.1 Uniform estimates
Let \(v^\varepsilon \) be a global solution to (19)–(20) and set \(u^\varepsilon =u(R_\varepsilon (v^\varepsilon ))\). We assume that A(u) is given by (3) and that \(a_{i0}>0\), \(a_{ii}=0\) for \(i=1,\ldots ,n\). The uniform estimates of Lemmas 18 and 19 are still valid. Since \(a_{ii}=0\), we obtain an \(H^1({\mathcal {O}})\) bound for \((u_i^\varepsilon )^{1/2}\) instead of \(u_i^\varepsilon \), which yields weaker bounds than those in Lemma 20.
Lemma 28
Let \(p\ge 2\) and set \(\rho _1:=(d+2)/(d+1)\). Then there exists a constant \(C(p,u^0,T)>0\), which is independent of \(\varepsilon \), such that
Proof
The identity \(\nabla u_i^{\varepsilon }=2(u_i^{\varepsilon })^{1/2}\nabla (u_i^{\varepsilon })^{1/2}\) and the Hölder inequality show that
Because of (30) and (31), the right-hand side is bounded. Using (30) again, we infer that (47) holds. Estimate (48) is obtained from the Gagliardo–Nirenberg inequality similarly as in the proof of Lemma 20:
where \(s=2/\theta \ge 2\) and \(1/r=1/2-\theta /d=1/2-2/(ds)\). Choosing \(r=(2d+4)/d\) gives \(s=r\), and \(r=4\) leads to \(s=8/d\); this proves estimates (48) and (49). Finally, (50) follows from Hölder’s inequality:
and taking the expectation and using (48) and (31) ends the proof. \(\square \)
The following lemma is needed to derive the fractional time estimate.
Lemma 29
Let \(p\ge 2\) and set \(\rho _2:=(2d+2)/(2d+1)\). Then it holds for any \(i,j=1,\ldots ,n\) with \(i\ne j\):
Proof
The Hölder inequality and (30) immediately yield
and we conclude from the Poincaré–Wirtinger inequality, estimate (32), and the previous estimate that
By the Gagliardo–Nirenberg inequality, with \(\theta =d/(d+1)\),
Taking the expectation and applying the Hölder inequality, we infer that
Finally, the identity \(\nabla (u_i^\varepsilon u_j^\varepsilon ) = 2(u_i^\varepsilon u_j^\varepsilon )^{1/2} \nabla (u_i^\varepsilon u_j^\varepsilon )^{1/2}\) and Hölder’s inequality lead to
The bounds (52)–(53) yield, after taking the expectation and applying Hölder’s inequality again, the conclusion (51). \(\square \)
We show now that the fractional time derivative of \(u^\varepsilon \) is uniformly bounded.
Lemma 30
(Fractional time regularity) Let \(d\le 2\). Then there exist \(0<\alpha <1\), \(p>1\), and \(\beta >0\) such that \(\alpha p>1\) and
Proof
We proceed similarly as in the proof of Lemma 22. First, we estimate the diffusion part, setting
Then, using \(D(L)\subset W^{1,\infty }({\mathcal {O}})\) (which holds due to the assumption \(m>d/2+1\)),
The embedding \(W^{1,p}(0,T;{{\mathbb {R}}})\hookrightarrow W^{\alpha ,p}(0,T;{{\mathbb {R}}})\) and estimates (51) and (50) show that for \(1\le p\le \rho _1 = (d+2)/(d+1)\),
Next, we consider the stochastic part, using the Burkholder–Davis–Gundy inequality, Hölder’s inequality, and the sublinear growth condition in the statement of the theorem:
The last step follows from estimate (49) (assuming that \(1\le \gamma p\le 4/d\)) and Lemma 21, since \(1+\alpha p-p/2+1<2\) if and only if \(\alpha <1/2\). We conclude that the second term of the right-hand side of
is uniformly bounded in \({{\mathbb {E}}}|\cdot |_{W^{\alpha ,p}(0,T;D(L)')}\) for \(\alpha <1\) and \(p\le (d+2)/(d+1)\), while the third term is uniformly bounded in that norm for \(\alpha <1/2\) and \(p\le 4/(\gamma d)\). In both cases, we can choose \(\alpha \) such that \(\alpha p>1\). At this point, we need the condition \(\gamma <1\) if \(d=2\). (The result holds for any space dimension if \(\gamma <2/d\).) Taking into account (33), \((v^\varepsilon )\) is bounded in \(W^{\alpha ,p}(0,T;D(L)')\). The embedding \(W^{\alpha ,p}(0,T;D(L)')\hookrightarrow C^{0,\beta }([0,T];D(L)')\) for \(\beta =\alpha -1/p>0\) implies that \((v^\varepsilon )\) is bounded in the latter space.
We turn to the estimate of \(u^\varepsilon \) in the \(W^{\alpha ,p}(0,T;D(L)')\) norm:
It remains to consider the last term. In view of estimate (15) and the Lipschitz continuity of \(R_\varepsilon \) with Lipschitz constant \(C/\varepsilon \), we obtain
Moreover, by (15) and the Lipschitz continuity of \(R_\varepsilon \) again,
where we used estimate (33). This finishes the proof. \(\square \)
6.2 Tightness of the laws of \((u^\varepsilon )\)
The tightness is shown in a different sub-Polish space than in Sect. 5.2:
endowed with the topology \(\widetilde{{\mathbb {T}}}\) that is the maximum of the topology of \(C^0([0,T];D(L)')\) and the weak topology of \(L_w^{\rho _1}(0,T;W^{1,\rho _1}({\mathcal {O}}))\), recalling that \(\rho _1{=}(d+2)/(d+1){>}1\).
Lemma 31
The family of laws of \((u^\varepsilon )\) is tight in
with the topology that is the maximum of \(\widetilde{{\mathbb {T}}}\) and the topology induced by the \(L^2(0,T;L^{2}({\mathcal {O}}))\) norm.
Proof
The tightness in \(L^2(0,T;L^q({\mathcal {O}}))\) for \(q<d/(d-1)=2\) is a consequence of the compact embedding \(W^{1,1}({\mathcal {O}})\hookrightarrow L^q({\mathcal {O}})\) as well as estimates (47) and (54). In fact, we can extend this result up to \(q=2\) because of the uniform bound of \(u_i^\varepsilon \log u_i^\varepsilon \) in \(L^\infty (0,T;L^1({\mathcal {O}}))\), which originates from the entropy estimate. Indeed, we just apply [3, Prop. 1], using additionally (26) with \(a_{i0}>0\). Then the tightness in \(L^2(0,T;L^2({\mathcal {O}}))\) follows from Lemma 37. Finally, the tightness in \(\widetilde{Z}_T\) is shown as in the proof of Lemma 23 in Appendix B. \(\square \)
In three space dimensions, we do not obtain tightness in \(L^2(0,T;L^2({\mathcal {O}}))\) but in the larger space \(L^{4/3}(0,T;L^2({\mathcal {O}}))\). This follows similarly as in the proof of Lemma 23 taking into account the compact embedding \(W^{1,\rho _1}({\mathcal {O}})\hookrightarrow L^2({\mathcal {O}})\), which holds as long as \(d\le 3\), as well as estimates (50) and (54). Unfortunately, this result seems to be not sufficient to identify the limit of the product \(\widetilde{u}_i^\varepsilon \widetilde{u}_j^\varepsilon \). Therefore, we restrict ourselves to the two-dimensional case.
The following result is shown exactly as in Lemma 24.
Lemma 32
The family of laws of \((\sqrt{\varepsilon }L^*LR_\varepsilon (v^\varepsilon ))\) is tight in \(Y_T=L_w^2(0,T;D(L)')\cap L^\infty _{w*}(0,T;D(L)')\).
Arguing as in Sect. 5.3, the Skorokhod–Jakubowski theorem implies the existence of a subsequence, a probability space \(({{\widetilde{\Omega }}},\widetilde{{\mathcal {F}}},{\widetilde{{{\mathbb {P}}}}})\), and, on this space, \((Z_T\times Y_T\times C^0([0,T];U_0))\)-valued random variables \((\widetilde{u}^\varepsilon ,\widetilde{w}^\varepsilon ,\widetilde{W}^\varepsilon )\) and \((\widetilde{u},\widetilde{w},\widetilde{W})\) such that \((\widetilde{u}^\varepsilon ,\widetilde{w}^\varepsilon ,\widetilde{W}^\varepsilon )\) has the same law as \((u^\varepsilon ,\sqrt{\varepsilon }L^*LR_\varepsilon (v^\varepsilon ),W)\) on \({\mathcal {B}}(Z_T\times Y_T\times C^0([0,T];U_0))\) and, as \(\varepsilon \rightarrow 0\) and \({\widetilde{{{\mathbb {P}}}}}\)-a.s.,
This convergence means that \({\widetilde{{{\mathbb {P}}}}}\)-a.s.,
The remainder of the proof is very similar to that one of Sect. 5.3, using slightly weaker convergence results. The most difficult part is the convergence of the nonlinear term \(\nabla (\widetilde{u}_i^\varepsilon \widetilde{u}_j^\varepsilon )\), since the previous convergences do not allow us to perform the limit \(\widetilde{u}_i^\varepsilon \nabla \widetilde{u}_j^\varepsilon \) because of \(\rho _1<2\). The idea is to consider the “very weak” formulation by performing the limit in \(\widetilde{u}_i^\varepsilon \widetilde{u}_j^\varepsilon \Delta \phi \) instead of \(\nabla (\widetilde{u}_i^\varepsilon \widetilde{u}_j^\varepsilon )\cdot \nabla \phi \) for suitable test functions \(\phi \). Indeed, let \(\phi \in L^\infty (0,T;C_0^\infty ({\mathcal {O}}))\). Since \(\widetilde{u}_i^\varepsilon \rightarrow \widetilde{u}\) strongly in \(L^2(0,T;L^2({\mathcal {O}}))\) \({\widetilde{{{\mathbb {P}}}}}\)-a.s., we have
It follows from the equivalence of the laws that
and we conclude from Vitali’s theorem that
By density, this convergence holds for all test functions \(\phi \in L^\infty (0,T;W^{2,\infty }({\mathcal {O}}))\) such that \(\nabla \phi \cdot \nu =0\) on \(\partial {\mathcal {O}}\). This ends the proof of Theorem 5.
Remark 33
(Three space dimensions) The three-dimensional case is delicate since \(u_i^\varepsilon \) lies in a space larger than \(L^2(Q_T)\). We may exploit the regularity (51) for \(\nabla (u_i^\varepsilon u_j^\varepsilon )\), but this leads only to the existence of random variables \({\widetilde{\eta }}_{ij}^\varepsilon \) and \({\widetilde{\eta }}_{ij}\) with \(i,j=1,\ldots ,n\) and \(i\ne j\) on the space \(X_T=L_w^{\rho _2}(0,T;L^{\rho _2}({\mathcal {O}}))\) such that \({\widetilde{\eta }}_{ij}^\varepsilon \) and \(u_i^\varepsilon u_j^\varepsilon \) have the same law on \({\mathcal {B}}(X_T)\) and, as \(\varepsilon \rightarrow 0\),
Similar arguments as before lead to the limit
but we cannot easily identify \({\widetilde{\eta }}_{ij}\) with \(\widetilde{u}_i\widetilde{u}_j\). \(\square \)
7 Discussion of the noise terms
We present some examples of admissible terms \(\sigma (u)\). Recall that \((e_k)_{k\in {{\mathbb {N}}}}\) is an orthonormal basis of U.
Lemma 34
The stochastic diffusion
satisfies Assumption (A5) for \(\eta >0\) and \((a_\ell )\in \ell ^2({{\mathbb {R}}})\).
Proof
With the entropy density h given by (5), we compute \((\partial h/\partial u_i)(u)=\pi _i\log u_i\) and \((\partial ^2\,h/\partial u_i\partial u_j)(u)=(\pi _i/u_i)\delta _{ij}\). Therefore, by Jensen’s inequality and the elementary inequalities \(|u_i\log u_i|\le C(1+u_i^{1+\eta })\) for any \(\eta >0\) and \(|u|\le C(1+h(u))\),
The second condition in Assumption (A5) becomes
Thus, Assumption (A5) is satisfied. \(\square \)
The proof shows that \(J_1\) can be estimated if \(s(u_i)^2\log (u_i)^2\) is bounded from above by \(C(1+h(u))\). This is the case if \(s(u_i)\) behaves like \(u_i^\alpha \) with \(\alpha <1/2\). Furthermore, \(J_2\) can be estimated if \(s(u_i)^2/u_i\) is bounded, which is possible if \(s(u_i)=u_i^\alpha \) with \(\alpha \ge 1/2\). Thus, to both satisfy the growth restriction and avoid the singularity at \(u_i=0\), we have chosen \(\sigma _{ij}\) as in Lemma 34. This example is rather artificial. To include more general choices, we generalize our approach. In fact, it is sufficient to estimate the integrals in inequality (23) in such a way that the entropy inequality of Proposition 16 holds. The idea is to exploit the gradient bound for \(u_i\) for the estimatation of \(J_1\) and \(J_2\).
Consider a trace-class, positive, and symmetric operator Q on \(L^2({\mathcal {O}})\) and the space \(U=Q^{1/2}(L^2({\mathcal {O}}))\), equipped with the norm \(\Vert Q^{1/2}(\cdot )\Vert _{L^2({\mathcal {O}})}\). We will work in the following with an U-cylindrical Wiener process \(W^Q\). This setting is equivalent to a spatially colored noise on \(L^2({\mathcal {O}})\) in the form of a Q-Wiener process (with \(Q\ne \textrm{Id}\)). The latter viewpoint provides, in our opinion, a more intuitive insight. In particular, the operator Q is constructed from the eigenfunctions and eigenvalues described below.
Let \((\eta _k)_{k\in {{\mathbb {N}}}}\) be a basis of \(L^2({\mathcal {O}})\), consisting of the normalized eigenfunctions of the Laplacian subject to Neumann boundary conditions with eigenvalues \(\lambda _k\ge 0\), and set \(a_k=(1+\lambda _k)^{-\rho }\) for some \(\rho >0\) such that \(\sum _{k=1}^\infty a_k^2\Vert \eta _k\Vert _{L^\infty ({\mathcal {O}})}^2<\infty \). Since \(\lambda _k\le Ck^{2/d}\) [28, Corollary 2] and \(\Vert \eta _k\Vert _{L^\infty ({\mathcal {O}})}\le Ck^{(d-1)/2}\) [23, Theorem 1], we may choose \(\rho >(d/2)^2\). Considering a sequence of independent Brownian motions \((W_1^k,\ldots ,W_n^k)_{k\in {{\mathbb {N}}}}\), we assume the noise to be of the form \(W^Q=(W_1^Q,\ldots ,W_n^Q)\), where
and \((e_k)_{k\in {{\mathbb {N}}}}=(a_k\eta _k)_{k\in {{\mathbb {N}}}}\) is a basis of \(U=Q^{1/2}(L^2({\mathcal {O}}))\).
Lemma 35
For the SKT model with self-diffusion, let \(\sigma _{ij}(u)=\delta _{ij}u_i^\alpha \) for \(1/2\le \alpha \le 1\), \(i,j=1,\ldots ,n\), interpreted as a map from \(L^2({\mathcal {O}})\) to \({{\mathcal {L}}}_2(H^\beta ({\mathcal {O}});L^2({\mathcal {O}}))\), where \(\beta >\rho \). Then the entropy inequality (29) holds, i.e., \(\sigma _{ij}\) is admissible for Theorem 4.
Proof
We can write inequality (23) for \(0<T<T_R\) as
recalling that \(w^\varepsilon =R_\varepsilon (v^\varepsilon )\) and \(u^\varepsilon =u(w^\varepsilon )\). We simplify \(J_3\) and \(J_4\), using the definition \(e_k=a_k\eta _k\):
The last inequality follows from our assumption on \((a_k)\). By (28), we can estimate the integrand of the third integral on the left-hand side of (55) according to
Hence, because of \(|u|\le C(1+h(u))\), we can formulate (55) as
It is sufficient to continue with the case \(\alpha =1\), since the proof for \(\alpha <1\) follows from the case \(\alpha =1\). Then, using \(|u^\varepsilon |\le C(1+h(u^\varepsilon ))\),
Now, we use the following lemma which is proved in Appendix A.
Lemma 36
Let \(d\ge 2\) and let \(v\in L^2(0,T;H^1({\mathcal {O}}))\) satisfy \(v\log v\in L^\infty (0,T;L^1({\mathcal {O}}))\). Then for any \(\delta >0\), there exists \(C(\delta )>0\) such that
It follows from (56) that, for any \(\delta >0\),
For sufficiently small \(\delta >0\), the last terms on the right-hand side can be absorbed by the corresponding terms on the left-hand side, leading to
Gronwall’s lemma ends the proof. \(\square \)
In the case without self-diffusion, we have an \(H^1({\mathcal {O}})\) estimate for \((u_i^\varepsilon )^{1/2}\) only, and it can be seen that stochastic diffusion terms of the type \(\delta _{ij}u_i^\alpha \) for \(\alpha >1/2\) are not admissible. However, we may choose \(\sigma _{ij}(u)e_k = \delta _{ij}u_i^\alpha (1+(u_i^\varepsilon )^\beta )^{-1}a_k\eta _k\) for \(1/2\le \alpha <1\) and \(\beta \ge \alpha /2\).
References
Amann, H.: Dynamic theory of quasilinear parabolic systems. III. Global existence. Math. Z. 202, 219–250 (1989)
Bansaye, V., Moussa, A., Muñoz-Hernández, F.: Stability of a cross-diffusion system and approximation by repulsive random walks: a duality approach. Submitted for publication (2021). arXiv:2109.07146
Braukhoff, M., Chen, X., Jüngel, A.: Corrigendum: cross diffusion preventing blow up in the two-dimensional Keller-Segel model. SIAM J. Math. Anal. 52, 2198–2200 (2020)
Brzeźniak, Z., Hausenblas, E., Motyl, E.: Uniqueness in law of the stochastic convolution process driven by Lévy noise. Electron. J. Probab. 18, 1–15 (2013)
Brzeźniak, Z., Motyl, E.: The existence of martingale solutions to the stochastic Boussinesq equations. Glob. Stoch. Anal. 1, 175–216 (2014)
Brzeźniak, Z., Ondreját, M.: Stochastic wave equations with values in Riemanninan manifolds. Stoch. Partial Differ. Equ. Appl. 25, 65–97 (2010)
Brzeźniak, Z., Ondreját, M.: Stochastic geometric wave equations with values in compact Riemannian homogeneous spaces. Ann. Prob. 41, 1938–1977 (2013)
Chen, L., Daus, E., Holzinger, A., Jüngel, A.: Rigorous derivation of population cross-diffusion systems from moderately interacting particle systems. J. Nonlinear Sci. 31(94), 38 (2021)
Chen, L., Jüngel, A.: Analysis of a multi-dimensional parabolic population model with strong cross-diffusion. SIAM J. Math. Anal. 36, 301–322 (2004)
Chen, L., Jüngel, A.: Analysis of a parabolic cross-diffusion population model without self-diffusion. J. Differ. Equ. 224, 39–59 (2006)
Chen, X., Daus, E., Jüngel, A.: Global existence analysis of cross-diffusion population systems for multiple species. Arch. Ration. Mech. Anal. 227, 715–747 (2018)
Da Prato, G., Zabczyk, J.: Stochastic Equations in Infinite Dimensions, 2nd edn. Cambridge University Press, Cambridge (2014)
Daus, E., Desvillettes, L., Dietert, H.: About the entropic structure of detailed balanced multi-species cross-diffusion equations. J. Differ. Equ. 266, 3861–3882 (2019)
Debussche, A., Glatt-Holtz, N., Temam, R.: Local martingale and pathwise solutions for an abstract fluids model. Phys. D 240, 1123–1144 (2011)
Desvillettes, L., Lepoutre, T., Moussa, A.: Entropy, duality, and cross diffusion. SIAM J. Math. Anal. 46, 820–853 (2014)
Desvillettes, L., Lepoutre, T., Moussa, A., Trescases, A.: On the entropic structure of reaction-cross diffusion systems. Commun. Partial Differ. Equ. 40, 1705–1747 (2015)
Deuring, P.: An initial-boundary value problem for a certain density-dependent diffusion system. Math. Z. 194, 375–396 (1987)
Dhariwal, G., Jüngel, A., Zamponi, N.: Global martingale solutions for a stochastic population cross-diffusion system. Stoch. Process. Appl. 129, 3792–3820 (2019)
Dietert, H., Moussa, A.: Persisting entropy structure for nonlocal cross-diffusion systems. To appear in Ann. Fac. Sci. Toulouse Math. (2023). arXiv:2101.02893
Drábek, P., Milota, J.: Methods of Nonlinear Analysis: Applications to Differential Equations, 2dn edn. Springer, Basel (2013)
Dreher, M.: Analysis of a population model with strong cross-diffusion in unbounded domains. Proc. R. Soc. Edinb. Sec. A 138, 769–786 (2008)
Galiano, G., Garźon, M., Jüngel, A.: Semi-discretization in time and numerical convergence of solutions of a nonlinear cross-diffusion population model. Numer. Math. 93, 655–673 (2003)
Grieser, D.: Uniform bounds for eigenfunctions of the Laplacian on manifolds with boundary. Commun. Partial Differ. Equ. 27, 1283–1299 (2002)
Jüngel, A.: The boundedness-by-entropy method for cross-diffusion systems. Nonlinearity 28, 1963–2001 (2015)
Jüngel, A.: Entropy Methods for Diffusive Partial Differential Equations. Springer, Berlin (2016)
Kim, J.: Smooth solutions to a quasi-linear system of diffusion equations for a certain population model. Nonlinear Anal. 8, 1121–1144 (1984)
Krein, S., Petunin, J.I., Semenov, E.: Interpolation of Linear Operators. Amer. Math. Soc., Providence (1982)
Kröger, P.: Upper bounds for the Neumann eigenvalues on a bounded domain in Euclidean spaces. J. Funct. Anal. 106, 353–357 (1992)
Krylov, N.: A relatively short proof of Itô’s formula for SPDEs and its applications. Stoch. Partial Differ. Equ. Anal. Comput. 1, 152–174 (2013)
Kuehn, C., Neamţu, A.: Pathwise mild solutions for quasilinear stochastic partial differential equations. J. Differ. Equ. 269, 2185–2227 (2020)
Lepoutre, T., Moussa, A.: Entropic structure and duality for multiple species cross-diffusion systems. Nonlinear Anal. 159, 298–315 (2017)
Liu, W., Röckner, M.: Stochastic Partial Differential Equations: An Introduction. Springer, Cham (2015)
Lou, Y., Ni, W.-M., Wu, Y.: On the global existence of a cross-diffusion system. Discrete Contin. Dyn. Syst. 4, 193–203 (1998)
Métivier, M.: Stochastic Partial Differential Equations in Infinite Dimensional Spaces. Scuola Normale Superiore, Pisa (1988)
Moussa, A.: From nonlocal to classical Shigesada-Kawasaki-Teramoto systems: triangular case with bounded coefficients. SIAM J. Math. Anal. 52, 42–64 (2020)
Prévôt, C., Röckner, M.: A Concise Course on Stochastic Partial Differential Equations. Lecture Notes Math., vol. 1905. Springer, Berlin (2007)
Shigesada, N., Kawasaki, K., Teramoto, E.: Spatial segregation of interacting species. J. Theor. Biol. 79, 83–99 (1979)
Simon, J.: Compact sets in the space \(L^p(0, T;B)\). Ann. Math. Pura. Appl. 146, 65–96 (1987)
Funding
Open access funding provided by Austrian Science Fund (FWF).
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
The authors thank Prof. Alexandra Neamţu (Konstanz, Germany) for her constructive help in the preparation of this manuscript. The second and third authors acknowledge partial support from the Austrian Science Fund (FWF), Grants I3401, P33010, W1245, and F65. The second author has been supported by the FWF, Grant Y1235 of the START program. This work has received funding from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme, ERC Advanced Grant No. 101018153.
Appendices
Appendix A. Proofs of some lemmas
1.1 A.1. Proof of Lemma 14
The operator equation \(\textrm{D}R_\varepsilon [w](a)=b\) can be written as \(a = \textrm{D}Q_\varepsilon [w](b) = u'(w)b + \varepsilon L^*Lb\). Hence,
The matrix \(u'(w)=(h'')^{-1}(u(w))\) is symmetric and positive semidefinite (since h is convex). Thus, the square root operator \(\sqrt{u'(w)}\) exists and is symmetric. This shows that
Furthermore, by the Cauchy–Schwarz inequality \({\text {tr}}[A^TB] \le {\text {tr}}[A^TA]{\text {tr}}[B^TB]\) and the property \({\text {tr}}[AB]={\text {tr}}[BA]\) for matrices A and B,
Inserting these relations into (57) leads to
and consequently,
Together with (58) we obtain the statement.
1.2 A.2 Proof of Lemma 15
It follows from the convexity of h that
Since \(R_\varepsilon \) and \(Q_\varepsilon \) are inverse to each other, we can replace \(v^0\) by \(Q_\varepsilon (R_\varepsilon (v^0)) = u(R_\varepsilon (v^0)) + \varepsilon L^*LR_\varepsilon (v^0)\):
We find after an integration that
which yields the statement.
1.3 A.3 Proof of Lemma 21
We show that
A change of the integration domain and an integration by parts lead to
observing that \(\lim _{t\rightarrow s}(t-s)^{1-\delta }\int _s^t g(r)\textrm{d}r=0\) for \(1-\delta >-1\), since the integrability of g implies that \(\lim _{t\rightarrow s}(t-s)^{-1}\int _s^t g(r)\textrm{d}r=g(s)\) for a.e. s. The result follows as the integrals on the right-hand side of (59) are finite.
1.4 A.4 Proof of Lemma 36
We use the interpolation inequality with \(1/2=\theta _1 + (1-\theta _1)/(2p)\) and some \(1<p<d/(d-2)\) (and \(p>1\) if \(d=2\)) as well as the Young inequality with \(\delta >0\):
The first term on the right-hand side is estimated in a similar way as before, where \(\eta >0\):
For the second term on the right-hand side of (60), we introduce the function \(g(v)=\max \{2,v\log v\}\) for \(v\ge 0\). Then \(g\in C^1([0,\infty ))\). (The function \(v\mapsto v\log v\) is not \(C^1\) at \(v=0\), therefore we need to truncate.) We use the Sobolev inequality:
where \(q=2dp/(d+2p)\). The condition \(p<d/(d-2)\) guarantees that \(q<2\), while \(d\ge 2\) yields \(q>1\); thus \(q\in (1,2)\). Applying the Gagliardo–Nirenberg inequality, combined with the Poincaré–Wirtinger inequality, with \(\theta _2=d(q-1)/(d(q-1)+q)\le 1\), and then the Young inequality, we find that
This yields
The last term is estimated as in (61). We consider the norm of \(\nabla g(v)=\text {1}_{\{v\log v>2\}}(1+\log v)\nabla v\). For this, we observe that \(\text {1}_{\{v\log v>2\}}\log v\le C(1+v^\gamma )\) for some \(0<\gamma <(2-q)/(2q)\) and use the Hölder inequality:
since the property \(2\gamma q/(2-q)<1\) gives \(v^{2\gamma q/(2-q)}\le C(1+v)\) for \(v\ge 0\). Consequently, by Young’s inequality,
and an integration over time gives
where we used \((2-q)/q<1\). Thus, (62) becomes
It remains to insert (61) and the previous estimate into (60) to conclude that
Choosing first \(\delta >0\) and then \(\eta >0\) sufficiently small finishes the proof.
Appendix B. Tightness criterion
Lemma 37
(Tightness criterion) Let \({\mathcal {O}}\subset {{\mathbb {R}}}^d\) (\(d\ge 1\)) be a bounded domain with Lipschitz boundary and let \(T>0\), \(p,q,r\ge 1\), \(\alpha \in (0,1)\) if \(r\ge p\) and \(\alpha \in (1/r-1/p,1)\) if \(r<p\). Let \(s\ge 1\) be such that the embedding \(W^{1,q}({\mathcal {O}})\hookrightarrow L^s({\mathcal {O}})\) is compact, and let Y be a Banach space such that the embedding \(L^s({\mathcal {O}})\hookrightarrow Y\) is continuous. Furthermore, let \((u_n)_{n\in {{\mathbb {N}}}}\) be a sequence of functions such that there exists \(C>0\) such that for all \(n\in {{\mathbb {N}}}\),
Then the laws of \((u_n)\) are tight in \(L^p(0,T;L^s({\mathcal {O}}))\) if \(q\le d\) and in \(L^p(0,T;C^0(\overline{{\mathcal {O}}}))\) if \(q>d\). If \(p=\infty \), the space \(L^p(0,T;\cdot )\) is replaced by \(C^0([0,T];\cdot )\).
Proof
By Theorem 3 and Lemma 5 of [38], the set
is relatively compact in \(L^p(0,T;L^s({\mathcal {O}}))\). We deduce from Chebyshev’s inequality that
The definition of tightness finishes the proof. \(\square \)
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Braukhoff, M., Huber, F. & Jüngel, A. Global martingale solutions for stochastic Shigesada–Kawasaki–Teramoto population models. Stoch PDE: Anal Comp 12, 525–575 (2024). https://doi.org/10.1007/s40072-023-00289-7
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s40072-023-00289-7
Keywords
- Population dynamics
- Cross diffusion
- Martingale solutions
- Multiplicative noise
- Entropy method
- Tightness of laws