Abstract
We consider a semilinear parabolic partial differential equation in \(\mathbf{R}_+\times [0,1]^d\), where \(d=1, 2\) or 3, with a highly oscillating random potential and either homogeneous Dirichlet or Neumann boundary condition. If the amplitude of the oscillations has the right size compared to its typical spatiotemporal scale, then the solution of our equation converges to the solution of a deterministic homogenised parabolic PDE, which is a form of law of large numbers. Our main interest is in the associated central limit theorem. Namely, we study the limit of a properly rescaled difference between the initial random solution and its LLN limit. In dimension \(d=1\), that rescaled difference converges as one might expect to a centred Ornstein–Uhlenbeck process. However, in dimension \(d=2\), the limit is a non-centred Gaussian process, while in dimension \(d=3\), before taking the CLT limit, we need to subtract at an intermediate scale the solution of a deterministic parabolic PDE, subject (in the case of Neumann boundary condition) to a non-homogeneous Neumann boundary condition. Our proofs make use of the theory of regularity structures, in particular of the very recently developed methodology allowing to treat parabolic PDEs with boundary conditions within that theory.
We’re sorry, something doesn't seem to be working properly.
Please try refreshing the page. If that doesn't work, please contact support so we can address the problem.
1 Introduction
Fix \(D = [0,1]^d\) with \(d \le 3\), and consider the family of functions \(u_\varepsilon :[0,T]\times D \rightarrow \mathbf{R}\) solving the PDE
endowed with either Dirichlet boundary conditions \(u_\varepsilon (t,x) = 0\) for \(x\in \partial D\) or Neumann boundary conditions \(\langle n(x), \nabla u_\varepsilon (t,x)\rangle = 0\), where n denotes the outward facing unit vector normal to the boundary of D. The driving noise \(\eta _\varepsilon \) appearing in this equation is given by
where \(\eta (t,x)\) is a stationary centred random field, which we do not assume Gaussian, but with relatively good mixing properties (see Assumption 2.1 for details) and moments of all orders after testing against a test function. Note that \(\eta _\varepsilon \) is scaled by \(\varepsilon ^{-1}\) rather than \(\varepsilon ^{-(d+2)/2}\), so the noise from [10, 14] (which were restricted to \(d=1\)) has been multiplied by \(\varepsilon ^{d/2}\).
In the case when G is linear and \(H=0\), this problem has been well studied. See for example [1] for the case where furthermore \(\eta \) is Gaussian and constant in time [note that in our case \(\mathfrak {m}=2\) so the exponent \(\alpha \) appearing there would equal 1 in our case, as used in (1.2)], [4] for a similar result in the non-Gaussian case.
Although we will allow \(\eta \) to be a generalised random field in \(d=1\), we assume throughout that there exist locally integrable functions \(\kappa _p :(\mathbf{R}^{d+1})^p \rightarrow \mathbf{R}\) that are continuous outside of the big diagonal \(\Delta _p = \{(z_1,\ldots ,z_p) \in (\mathbf{R}^{d+1})^p\,:\, \exists i\ne j\;\text {with}\; z_i = z_j\}\) and such that, for any \(\mathcal{C}_0^\infty \) test functions \(\phi _1,\ldots ,\phi _p\), the joint cumulant \(\kappa _p(\phi _1,\ldots ,\phi _p)\) of \(\eta (\phi _1),\ldots ,\eta (\phi _p)\) satisfies
(By stationarity, the functions \(\kappa _p\) only depend on the differences of their arguments.) Here and below, we always use the convention that z (resp. \(z_i\), \({\bar{z}}\), etc) denotes a space-time coordinate given by \(z=(t,x)\) (resp. \(z_i=(t_i,x_i)\), \({\bar{z}}=({\bar{t}},{\bar{x}})\), etc). We furthermore normalise our problem by assuming that the covariance of \(\eta \) integrates to 1 in the sense that
In particular, we assume that \(\kappa _2(0,\cdot )\) is absolutely integrable, but this will in any case follow from Assumption 2.1. Furthermore, define the constants
where P denotes the heat kernel, that is the fundamental solution to the heat equation on the whole space. Here and below, symbols drawn in red denote fixed constants, while symbols drawn in blue will later denote basis vectors of a suitable regularity structure associated to our problem. In dimension \(d=1\), our assumptions on \(\kappa _2\) will guarantee that the integral converges absolutely, while in dimensions 2 and 3 our assumptions on \(\kappa _2\) and \(\kappa _3\) will guarantee that all of these integrals converge absolutely.
The scaling \(\varepsilon ^{-1}\) chosen in \(\eta _\varepsilon \) is such that \(u_\varepsilon \) converges as \(\varepsilon \rightarrow 0\) to a limit \(u^{(0)}\), which is our first result. Indeed, writing
we have the following “law of large numbers”:
Theorem 1.1
Let \(u_\varepsilon \) be as above and let \(u^{(0)}\) be the (local) solution to the deterministic PDE
with the same initial condition \(u_0 \in \mathcal{C}^\alpha \) as (1.1) [for some arbitrary \(\alpha \in (0,1)\)] and with homogeneous Dirichlet (resp. Neumann) boundary condition. In the case of Dirichlet boundary conditions, we impose that \(u_0\) vanishes on the boundary.
Assume that the functions \(G,H:\mathbf{R}\rightarrow \mathbf{R}\) are of class \(\mathcal{C}^5\) and \(\mathcal{C}^4\) respectively, that the driving field \(\eta \) satisfies Assumption 2.1, and let furthermore \(T > 0\) be such that the (possible) explosion time for \(u^{(0)}\) is greater than T. Then, in probability and uniformly over \([0,T] \times D\), \(u_\varepsilon \) converges to \(u^{(0)}\) as \(\varepsilon \rightarrow 0\).
The proof of this result will be given in Section 3. Our main quantity of interest however are the fluctuations of \(u_\varepsilon \) around its limit \(u^{(0)}\). One interesting feature of this problem is that in order to see these fluctuations, it is not sufficient to recenter around \(u^{(0)}\). Instead, as soon as \(d \ge 2\), a suitable first-order correction \(u^{(1)}\) living at scale \(\varepsilon \) has to be subtracted first. Our precise “central limit theorem” then takes the following form:
Theorem 1.2
Let \(u_0\) be such that its extension to all of \(\mathbf{R}^d\) by reflectionsFootnote 1 is of class \(\mathcal{C}^3\), let \(u_\varepsilon \) and \(u^{(0)}\) be as above and assume G, H, T and \(\eta \) are as in Theorem 1.1. Let furthermore \(u^{(1)}= 0\) for \(d=1\) and, for \(d \in \{2,3\}\), let \(u^{(1)}\) be the solution to
with zero initial condition. (Note that is an \(\mathbf{R}^d\)-valued constant.) In the case of Neumann boundary condition, we furthermore impose that \(\langle \nabla u^{(1)}(t,x), n(x)\rangle = c(x) GG'(u^{(0)}(t,x))\) for \(x\in \partial D\), where c is an explicit function on the boundary of D which is constant on each of its faces [the precise values of c on each face will be given in (5.16)], while we impose homogeneous Dirichlet boundary conditions otherwise.
Then, in law and in \(\mathcal{C}^\alpha ([0,T] \times D)\) for any \(\alpha < 1-{d\over 2}\), one has
where v is the Gaussian process solving
endowed with homogeneous Dirichlet (resp. Neumann) boundary condition and 0 initial condition, and \(\xi \) denotes a standard space-time white noise.
Proof
Combining Proposition 4.18 with Proposition 4.16 and (4.1) shows that if we set \(v_\varepsilon = \varepsilon ^{-d/2}\big (u_\varepsilon - u^{(0)}- \varepsilon u^{(1)}\big )\) then we do indeed have \(\lim _{\varepsilon \rightarrow 0} v_\varepsilon = v\) (weakly in \(\mathcal{C}^{(d-2)/2-\kappa }\) on [0, T]). The limit v is identified as the solution to (1.8) by combining the second part of Proposition 4.16 with Lemma 4.10. \(\square \)
Remark 1.3
If all we were interested in is the law of large numbers, then the conditions of Assumption 2.1 on \(\eta \) could easily be weakened.
Remark 1.4
In the case of Neumann boundary conditions, it may appear paradoxal that, even though \(u_\varepsilon \), \(u^{(0)}\) and v all satisfy homogeneous boundary conditions, \(u^{(1)}\) does not! This phenomenon is very similar to the presence of the “boundary renormalisation” that can appear in the context of singular SPDEs [9]. There is no contradiction since the convergence \(v_\varepsilon \rightarrow v\) takes place in a very weak topology in which the notion of “normal derivative at the boundary” is meaningless in a pointwise sense. (A very simple example displaying a similar phenomenon is \(n^{-1/2}\sin (nx)\), whose derivative at the origin diverges like \(\sqrt{n}\) while that of its limit vanishes.)
Remark 1.5
Regarding the precise meaning of the equation fulfilled by \(u^{(1)}\) in the case of Neumann boundary condition, denote by \(\delta _{\partial D}\) the distribution on \(\mathbf{R}\times \partial D\) given by
where the integration over the faces of \(\partial D\) are performed against the two-dimensional Lebesgue measure. With this notation, the solution to any equation of the form
(and in particular the equation determining \(u^{(1)}\)) is defined as the solution to the integral equation
where \(P_{\mathrm{Neu}}\) denotes the homogeneous Neumann heat kernel, with the convention that \(g(t,x) = 0\) for \(t \le 0\), and where \(\mathbf {1}_+^D(t,x) = \mathbf {1}_{\{t \ge 0\}}\mathbf {1}_{\{x \in D\}}\). Here, we used the notation \(\int \phi (z)\,\eta (\,{\mathrm{d}}z)\) for the usual pairing between a distribution \(\eta \) and a suitable test function \(\phi \). To see that solutions to (1.10) and (1.9) do indeed coincide if f and g are sufficiently regular for the solution to be differentiable up to the boundary, it suffices to note that the mild formulation is equivalent to the weak formulation, see for example [20], with the term \(g \delta _{\partial D}\) appearing as the boundary term when integrating by parts.
Remark 1.6
In dimension \(d=1\), the term \(u^{(1)}\) in (1.7) is of course redundant. In dimension \(d=2\), it is still the case that \(\varepsilon ^{-d/2}(u_\varepsilon - u^{(0)})\) converges to a limit, but this limit is not centred anymore. In higher dimensions, additional corrections appear. We expect to have a result of the form
where \(u^{(0)}\) is as above and the \({\bar{u}}^{(k)}\) satisfy an equation of the type
for some inhomogeneity \(\Psi _k\) depending on the \( u^{(\ell )}\) for \(\ell < k\) and some of their derivatives. Since v has vanishing expectation, we expect to also have
so that the \(u^{(k)}\) provide an expansion of \(\mathbf{E}u_\varepsilon \) in powers of \(\varepsilon \). Note however that the techniques used in this paper do not provide moment bounds on the solution, so that even in \(d \le 3\) this would require some additional work.
Remark 1.7
The form of the terms appearing in the successive correctors as well as the constants multiplying them can in principle be derived from [2, Eq. 2.12] which describes the form of the counterterms \(\Upsilon \) associated to a given tree \(\tau \). The recursion given there suggests a correspondence with incoming edges corresponding to functional derivatives with respect to u (so and ) and powers of X corresponding to formal directional derivatives, so . This shows a priori that the counterterm multiplying for example must be of the form \(G'(u)G(u)\), etc. The numerical constants multiplying these terms do however differ from those appearing in [2] since their meaning is slightly different: in [2] we use counterterms to “recenter” the original equation in order to obtain a finite limit while here we leave the original equation untouched and compute its “centering”. If we leave aside the behaviour at the boundary, this in principle allows to guess the general form of the equations for the \(u^{(k)}\) appearing in (1.11) for any dimension.
The most surprising part of Theorem 1.2 is surely the fact that in the case when \(u_\varepsilon \) has homogeneous Neumann boundary conditions, even though v and \(u^{(0)}\) both also have homogeneous boundary conditions, \(u^{(1)}\) does not, which seems to contradict (1.7). This is of course not a contradiction but merely suggests that if we write \(v_\varepsilon \) for the expression appearing under the limit in (1.7), then \(v_\varepsilon \) exhibits a kind of boundary layer. Note also that the statement that “v satisfies homogeneous boundary conditions” only makes sense in terms of the integral equation that it solves since v itself is not differentiable at the boundary. (It is not even a function!)
Before we proceed, let us give a heuristic explanation for the appearance of this boundary layer. Consider the simplest case \(H=0\), \(G(u) = u\) and \(u_0 > 0\), in which case we can consider the Hopf-Cole transform \(h_\varepsilon = \log u_\varepsilon \), yielding
To leading order, one would expect the right hand side to behave like \(|\nabla h_\varepsilon |^2 \simeq \mathbf{E}|\nabla Z_\varepsilon |^2\), where \(Z_\varepsilon \) solves \(\partial _t Z_\varepsilon = \Delta Z_\varepsilon + \eta _\varepsilon \) endowed with homogeneous Neumann boundary conditions. It turns out that, in the interior of the domain, one has
which allows one to “guess” the correct limit \(u^{(0)}\). On the boundary however \(\nabla Z_\varepsilon = 0\), so that one expects to be of order \({\mathcal {O}}(1)\) in a layer of width \({\mathcal {O}}(\varepsilon )\) around \(\partial D\). When going to the next scale, this results in a boundary correction of order \({\mathcal {O}}(\varepsilon ^{-1})\) in this boundary layer, which precisely scales like a surface measure on the boundary. Remark 1.5 shows that the net effect of this correction is to modify the boundary condition.
The remainder of the article is structured as follows. First, in Section 2, we formulate our main assumption on the driving noise \(\eta \) and we show that this assumption is “reasonable” by exhibiting an explicit class of examples for which it is satisfied. In Section 3, we then show that the law of large numbers holds. Although this could probably be shown by “classical” means without too much effort, we will use the theory of regularity structures because it shortens the argument and allows us to introduce some results and notation that will be of use later on. In Section 4, we then show that the central limit theorem holds. The main tool in this proof is the convergence of a certain “model” for an appropriate regularity structure as well as refinements of the type of boundary estimates first considered in [9]. The convergence of the model is given in Section 5. “Appendix B” is devoted to the proof of a result showing that the operations of “convolution by a singular kernel” and “multiplication by a smooth function” almost commute, modulo a much smoother remainder, a fact that will undoubtedly sound familiar to anyone acquainted with microlocal analysis. “Appendix C” contains a version of the reconstruction theorem that is purpose-built to allow us to deal with modelled distribution that have very singular boundary behaviour and goes beyond the version obtained in [9]. This appendix was written in collaboration with Máté Gerencsér.
2 Assumptions on the Noise
In this section, we formulate our precise assumptions on the driving noise and we show that they are satisfied for example by a mollified Poisson process. In a nutshell, we want to assume that correlations are bounded by \(\Vert z-{\bar{z}}\Vert ^{-2{\underline{c}}}\) at small scales and \(\Vert z-{\bar{z}}\Vert ^{-2{\overline{c}}}\) at large scales with \({\underline{c}} = {1\over 2}-\delta \) and \({\overline{c}} = {d+2\over 2}+\delta \) for some \(\delta \in (0,{1\over 2})\). However, we also want to encode the fact that higher order cumulants behave “better” than what is obtained from simply using the Cauchy–Schwartz inequality. Note that our assumptions are trivially satisfied by any continuous Gaussian process with correlations that decay at least like \(\Vert z-{\bar{z}}\Vert ^{-2{\overline{c}}}\). Here and below, \(\Vert \cdot \Vert \) will always denote the parabolic distance between space-time points. It will be convenient (in particular in “Appendix A”) to make sure that \(\Vert \cdot \Vert \) is smooth away from the origin, so we set for example \(\Vert z\Vert ^4 = \Vert (t,x)\Vert ^4 = |x|^4 + |t|^2\).
2.1 Coalescence Trees
In order to formulate this precisely, we need a simplified version of the construction of [15, Appendix A]. Given any configuration \((z_1,\ldots ,z_p)\) of p points in \(\mathbf{R}^{d+1}\) with all distances distinct, we associate to it a binary tree T in the following way. Consider Kruskal’s algorithm [17] for constructing the minimal spanning tree of the complete graph with vertices \(\{z_1,\ldots ,z_p\}\) and edge-weights given by their (parabolic) distances. One way of formalising this is the following. Consider the set \(\mathcal{P}_p\) of partitions of \(\Omega _p = \{1,\ldots ,p\}\). We define a distance \(d_z\) between subsets of \(\Omega _p\) as the Hausdorff distance induced by \(\{z_1,\ldots ,z_p\}\), namely
We then define a map \(K:\mathcal{P}_p \rightarrow \mathcal{P}_p\) in the following way. If \(\pi = \{\Omega _p\}\), then \(K(\pi ) = \pi \). Otherwise, let \(A \ne B \in \pi \) be such that \(d_z(A,B) \le d_z(C,D)\) for all \(C,D \in \pi \). Thanks to our assumption on the \(z_i\), this pair is necessarily unique. We then set
that is \(K(\pi )\) is obtained by coalescing the two sets A and B in the partition \(\pi \). The vertices of T are then given by \(V_T = \bigcup _{n \ge 0} K^n(\{\{1\},\ldots ,\{p\}\})\), that is \(V_T\) consists of all the blocks of those partitions. The set \(V_T\) comes with a natural partial order given by inclusion: \(A \le B\) if and only if \(A \supset B\). The (directed) edge set \(E_T \in V_T\times V_T\) of T is then given by the Hasse diagram of \((V_T,\le )\): \((A,B) \in E_T\) if and only if \(A < B\) and there is no \(C \in V_T\) such that \(A< C < B\). It is easy to verify that T is a binary tree and that its leaves are precisely given by the singletons. It will be convenient to also add to \(V_T\) a “point at infinity” which is connected to \(\Omega _p\) by an edge and to view as the minimal element of \(V_T\).
We write for the interior nodes. Each interior node \(A \in \mathring{V}_T\) has exactly two children \(A_1\) and \(A_2\) such that \((A,A_i) \in E_T\) for \(i = 1,2\). We then define an integer labelling \({\mathbf {n}}:\mathring{V}_T \rightarrow \mathbf{Z}\) by \({\mathbf {n}}(A) = -\lceil \log _2 d_z(A_1,A_2) \rceil \). We will always view \({\mathbf {n}}\) as a function on all of \(V_T\) with values in \(\mathbf{Z}\cup \{\pm \infty \}\) by setting and \({\mathbf {n}}(\{i\}) = +\infty \) for \(i=1,\ldots ,p\). Note now that if A, B and C are three disjoint sets, then
As a consequence, the map \({\mathbf {n}}\) is monotone increasing on \(V_T\). Furthermore, as in [15, Eq. A.15], there exist constants c, C depending only on p such that
for all i, j.
Given a configuration of points \(z = (z_1,\ldots ,z_p) \in (\mathbf{R}^{d+1})^p\), we now write \({\mathbf {t}}_z = (T,{\mathbf {n}})\) for the corresponding data constructed as above. We furthermore define a function \(\rho :\mathbf{R}_+\rightarrow \mathbf{R}_+\) by
(Beware that \(\rho \) is an upper bound for the square root of the covariance between two points.) We then assume that the following bound holds:
Assumption 2.1
With the notations as above, for any \(p \ge 2\) and any \(\{k_i\}_{i=1}^p \subset \mathbf{Z}_+^{d+1}\), the pth joint cumulant for \(\eta \) satisfies the bound
uniformly over all \(z \in (\mathbf{R}^{d+1})^p\). (Recall that \(\Omega _p\) is the root of the tree T.) Here and below, the length of the multiindex k should be interpreted in the parabolic sense, namely \(|k| = 2k_0 + \sum _{i=1}^d k_i\).
In dimensions \(d \in \{2,3\}\), we furthermore assume that \(\eta :\Omega \times \mathbf{R}^{d+1} \rightarrow \mathbf{R}\) is a measurable function with \(\mathbf{E}|\eta (0)|^p <\infty \) for \(p = (d+2)/{{\underline{c}}}\).
Remark 2.2
The additional condition that \(\eta \) takes values in \(L^p\) for sufficiently high p is mainly technical and could probably be dropped with some additional effort. It will be used to bound \({\hat{R}}_\varepsilon ^{(d)}\) in the proof of Proposition 4.16. The exponent \((d+2)/{\underline{c}}\) is consistent with the condition on the correlation function in the sense that this is the lowest value of p for which \(L^p_{\mathrm {loc}} \subset \mathcal{C}^{-{\underline{c}}}\).
Note also that the cumulants \(\kappa _p^{(\varepsilon )}\) of the rescaled process \(\eta _\varepsilon \) satisfy
where \(S_\varepsilon (t,x) = (t/\varepsilon ^2,x/\varepsilon )\).
2.2 Justification
We claim that the assumption on the noise is rather weak on the ground that many natural constructions yield stationary random processes that satisfy it. We provide details for the following example:
Proposition 2.3
Let \(\theta :\mathbf{R}^{d+1} \rightarrow \mathbf{R}\) be smooth away from 0 and such that for all \(k\in \mathbf{Z}_+^d\), \(|D^k\theta (z)| \lesssim \Vert z\Vert ^{-2{\overline{c}}-|k|}\) for \(\Vert z\Vert > 1\) and \(|D^k\theta (z)| \lesssim \Vert z\Vert ^{-{\underline{c}}-|k|}\) for \(\Vert z\Vert \le 1\). Let \(\mu \) be a Poisson point measure over \(\mathbf{R}^{d+1}\) with intensity 1 and set \(\eta = \mu \star \theta \), then \(\eta \) satisfies the above assumption.
Before proving this proposition, let us first establish a property of joint cumulants of integrals of deterministic functions with respect to a Poisson point measure.
Lemma 2.4
Let \(p\ge 1\) and let \(f_1,\ldots ,f_p\) be elements of \(L^1(\mathbf{R}^{d+1})\cap L^p(\mathbf{R}^{d+1})\), and again \(\mu \) be a Poisson point measure over \(\mathbf{R}^{d+1}\) with intensity 1. Then the joint cumulant \(\kappa _p(\mu (f_1),\ldots ,\mu (f_p))\) of the random variables \(\mu (f_1),\ldots ,\mu (f_p)\) satisfies
Proof
It is sufficient to prove the result in case there exist disjoint Borel subsets \(A_1,\ldots ,A_k\) of \(\mathbf{R}^{d+1}\) with finite Lebesgue measure such that for \(1\le i\le p\),
In this case, however, the result follows readily from the fact that the joint cumulant is p-linear, and that the joint cumulant of a collection of random variables which can be split into two mutually independent subcollections vanishes, see for example property (iii) in [19, p. 32]. \(\quad \square \)
Proof of Proposition 2.3
It follows from Lemma 2.4 that
so it remains to obtain a bound on this integral. We now consider \(z_1,\ldots , z_p\) to be fixed and we shall make use of the labelled tree \((T,{\mathbf {n}})\) built from these points as above. We are first going to treat the simpler case with all \(k_i\) vanishing and then show how the argument can be modified to deal with the general case.
Case 1. The case with all \(k_i=0\). For every edge \(e = ({\underline{e}},{\bar{e}}) \in E_T\) and every \(n \in \mathbf{Z}\) with \({\mathbf {n}}({\underline{e}})< n < {\mathbf {n}}({\bar{e}})\), we define the domain
where
It is possible to convince oneself that, provided that the constant c appearing in the definition of \(D_{(e,n)}\) is sufficiently large, one has
As a consequence, the integral appearing in (2.3) is bounded by some constant times
We first use the fact that \({\bar{\rho }}\) is decreasing to conclude that, for \({\mathbf {n}}({\underline{e}})< n < {\mathbf {n}}({\bar{e}})\), one has the bound
This can be seen as follows. Write \(\{v_1,\ldots ,v_k\}\) for the (possibly empty) set of nodes in \(\mathring{V}_T\) lying on the shortest path joining \({\bar{e}}\) to (not including \({\bar{e}}\) and themselves). We then have, for every \(j=1,\ldots ,k\),
since the number of factors appearing in each term is the same. Similarly, we have
hence (2.5). Since \({\bar{\rho }} \le \rho \wedge \rho ^2\), it follows from (2.5) that
It remains to observe that \(\sum _{n \in \mathbf{Z}} 2^{-(d+2)n} {\bar{\rho }}(2^{-n}) = \sum _{n \in \mathbf{Z}} 2^{({\underline{c}}-d-2)n} \wedge 2^{(2{\overline{c}} - d-2)n} < \infty \), so that (2.4) is indeed bounded by the required expression.
Case 2. Note that we actually showed that the expression (2.3) with \(\theta \) replaced by \({\bar{\rho }}\) is bounded by the right hand side of (2.1) with \(k_i = 0\). To obtain the general case, it therefore suffices to show that
Let \(\chi \in C^\infty (\mathbf{R}^{d+2})\) be such that
For \(1\le j\le p\), we define \(\chi _j(z)=\chi \big (2^{{\mathbf {n}}(j^\uparrow )}(z-z_j)\big )\), and \(\chi _0(z)=1-\sum _{j=1}^p\chi _j(z)\). It is clear that
We note that for z in the support of \(\chi _0\), for \(1\le i\le p\), \(2\Vert z_i-z\Vert \ge 2^{-{\mathbf {n}}(i^\uparrow )}\),
thus yielding (2.6) as required.
To bound the final term, we note that its integrand can be written as a finite sum of terms of the form
where \(k, k_{j,i}\in \mathbf{Z}_+^{d+1}\) and \(k+\sum _{i\not =j}k_{j,i}=k_j\). Each of these terms is bounded above by the indicator function of the support of \(\chi _j\) times
Since for z in the support of \(\chi _j\) and \(i\not =j\), \(2\Vert z-z_j\Vert \le \Vert z_i-z_j\Vert \), so that
one has \(2\Vert z-z_i\Vert \ge \Vert z_j-z_i\Vert \ge 2^{-{\mathbf {n}}(i^\uparrow )}\wedge 2^{-{\mathbf {n}}(j^\uparrow )}\). Combining all of these bounds, we finally obtain
at which point we apply again (2.6) to obtain the required bound. \(\square \)
3 Law of Large Numbers
The aim of this section is to use a simplified variant of the arguments in [14] to show that Theorem 1.1 holds.Footnote 2 Although it would probably not be much more involved to obtain this proof by usual techniques, we give a proof using regularity structures. The main reason is that this allows us to introduce in a simpler setting a number of notions and notations that will be useful in the proof of our main result later on.
Before we turn to the proof proper, let us comment on the way in which we deal with the Neumann boundary conditions. Writing \(P_{\mathrm{Neu}}\) for the Neumann heat kernel and using the notation \(z=(t,x)\) (and similarly for \(z'\)), we rewrite (1.1) as an integral equation
where \(D_t = [0,t]\times D\) and \(P_t^{\mathrm{Neu}}(x,x') = P_{\mathrm{Neu}}((t,x),(0,x'))\). We also fix an arbitrary time horizon \(T \le 1\) which is not a restriction since the argument can be iterated.
Following [9], we then construct two functions K on \(\mathbf{R}^{d+1}\) and \(K_\partial \) on \(\mathbf{R}^{d+1}\times \mathbf{R}^{d+1}\) such that K is compactly supported, \(K_\partial \) is supported on a strip of finite width around the diagonal, and the identity
holds for \(z,z' \in [0,1] \times D\), where we set
See “Appendix A” for more details on the construction of these kernels and a proof that this can be done in a way that is compatible with the results of [9, 11] that we will use in our argument.
Remark 3.1
We make no claim on the values of K and \(K_\partial \) for arguments outside of \([0,1] \times D\). This is because these will always be integrated against functions that are supported on \([0,1] \times D\) and only the values of the result inside the domain will matter.
We choose K in such a way that it coincides with the whole space heat kernel P on the (parabolic) ball of radius 1 and is compactly supported in the ball of radius 2. We furthermore choose K in such a way that it annihilates polynomials of degree up to 3, is invariant under the transformation \((t,x) \mapsto (t,-x)\), and is such that the sum of its reflections agrees with the Neumann heat kernel on \([0,1] \times D\). (See “Appendix A” for more details.) For example, we can choose K as in [14]. The kernel \(K_\partial \) is a correction term that encodes the effect of the boundary condition. Regarding our regularity structure, we then proceed as if there was no boundary condition whatsoever: we construct models defined on the whole space that are translation invariant and we use convolution with K as our integration operator. We then define an operator \(\mathcal{P}_{\mathrm{Neu}}\) on modelled distributions by setting
and \(\mathcal{K}\) is built in exactly the same way as in [11, Sec. 4]. Note that \({\tilde{\mathcal{K}}}_\partial \) encodes the effect of the boundary condition. (There is a completely analogous definition in the case of Dirichlet boundary conditions.)
Here, \(\mathcal{L}_\gamma :\mathcal{C}^\gamma \rightarrow \mathcal{D}^\gamma \) denotes the “Taylor lift” given by
where \(z = (t,x)\) and k denotes a multiindex in \(\mathbf{N}^{1+d}\).
We now have the preliminaries in place to turn to the proof of Theorem 1.1.
Proof of Theorem 1.1
We use a strategy similar to that in [14, 15], combining this with results from [9] to deal with the boundary conditions. We refer to [7, 8, 12] for introductions to the theory of regularity structures, as well as to [11] for details. In our present context, we use the regularity structure obtained by extending the usual polynomial structure with parabolic scaling with a symbol of degree \(-1-\kappa \) representing the driving noise \(\eta _\varepsilon \), as well as an abstract integration operator of order 2 representing convolution with the (singular part of) heat kernel. As usual, we will often use graphical representations for the basis vectors in our regularity structure(s), and we decree that is our symbolic representation for . (The reason for introducing the “accent” representing the index “1” will become clear later on where more general notations of this type are needed.) Although our goal is to consider (1.1) on the bounded domain \(D\subset \mathbf{R}^d\), we construct the models for our regularity structure on the whole of \(\mathbf{R}\times \mathbf{R}^d\).
With notations almost identical to those in [14] and the formula (3.19) there, it would then be natural to consider a fixed point problem of the type
where \(\mathbf {1}_+^D\) denotes the indicator function of the space-time domain \(\mathbf{R}_+\times D\). Leaving considerations regarding the precise spaces of modelled distributions in which this equation makes sense aside for the moment, it is straightforward to see as in [11] that if we solve (3.4) for the renormalised lift of \(\eta _\varepsilon \), that is the admissible model such that
then the function \(u_\varepsilon = \mathcal{R}_\varepsilon U\) actually solves (1.1).
Indeed, iterating (3.4), we see that any solution U to such a fixed point problem is necessarily of the form
for some continuous functions u and \(\nabla u\). (This is purely notational, \(\nabla u\) is not the gradient of u, but can be interpreted as a kind of “renormalised gradient”.) In particular, the factor multiplying \(\mathcal{P}_{\mathrm{Neu}}\mathbf {1}_+^D\) in the right hand side of (3.4) is given by
where we projected onto terms of negative (or vanishing) degree. At this point, it then suffices to note that the application of the reconstruction operator to L yields
as required.
The problem with the argument outlined above is that since , the behaviour of the modelled distribution \(\mathbf {1}_+^D L\) near \(\partial D\) is such that the reconstruction operator is not a priori well-defined on it, see [9, Secs 4.1 & 4.2]. This is for precisely the same reason why the restriction of a generic distribution \(\zeta \in \mathcal{C}^\alpha \) to a “nice” domain D is only well-defined if \(\alpha > -1\). (For \(\alpha \le -1\) there are non-zero distributions with support contained in \(\partial D\).)
Before we tackle this problem, recall the definition of the spaces \(\mathcal{D}^{\gamma ,\eta }\) as in [11, Sec. 6] (the hyperplane P being given by the time slice \(t=0\)) as well as the spaces \(\mathcal{D}^{\gamma ,w}\) as in [9, Sec. 4] (in which case \(P_0\) is again the time 0 slice while \(P_1 = \mathbf{R}\times \partial D\)). These two spaces are distinguished by the fact that \(\eta \) is a real exponent while w denotes a triple of exponents describing the singular behaviour near \(t=0\), \(\partial D\) and the intersection of both regions respectively. It will be convenient to use the notation \((\alpha )_3\) with \(\alpha \in \mathbf{R}\) as a shorthand for the triple \((\alpha ,\alpha ,\alpha )\).
The idea then is the following. First, we introduce a new symbol , also of degree \(-1-\kappa \), but representing the function \(\eta _\varepsilon \mathbf {1}_{\mathbf{R}\times D}\) instead of representing \(\eta _\varepsilon \) and we add to our regularity structure the symbols and . Write then V for the sector spanned by , , and , \({\hat{V}}\) for the sector spanned by , , and , and \(\iota :V \rightarrow {\hat{V}}\) for the linear map with and similarly for the remaining basis vectors. We will furthermore only ever consider models \({\varvec{\Pi }}\) with the property that
for all \(\tau \in V\). Since \(\iota \) commutes with the structure group and preserves degrees, it follows that \(F \mapsto \iota F\) is continuous from \(\mathcal{D}^{\gamma ,w}(V)\) to \(\mathcal{D}^{\gamma ,w}({\hat{V}})\) for all choices of exponents \(\gamma \) and w and, for \(\gamma > 0\), the local reconstruction operator \({\tilde{\mathcal{R}}}\) (which yields a distribution on \(\mathbf{R}\times (\mathbf{R}^d {\setminus } \partial D)\) and is always well-defined) satisfies
The reason for the introduction of these extra symbols is that we would like to interpret (3.4) as a fixed point problem in the space \(\mathcal{D}^{1+2\kappa ,(2\kappa )_3}\) with values in \(V_0\) (for small enough \(\kappa \)), where \(V_0\) is spanned by , , and . The problem now is that, for \(F \in \mathcal{D}^{1+2\kappa ,(2\kappa )_3}\), we have , but we lose some regularity at the boundary when multiplying by the indicator function of our domain (see [9]), so that we only have . Since the boundary index is now below \(-1\), it follows that the reconstruction operator of [9] is not well-defined on . By Theorem C.5, it is however perfectly well-defined on since only the temporal singularity index is below \(-1\) (but above \(-2\)). Furthermore, one has the identity for test functions whose support does not intersect the boundary of D.
Recall now that, given a modelled distribution F and a distribution \(\zeta \) agreeing with \({\tilde{\mathcal{R}}} F\) outside the boundary of D, [9, Lem 4.12] defines a modelled distribution \(\mathcal{K}^\zeta F\) with improved regularity and such that \(\mathcal{R}\mathcal{K}^\zeta F = K \star \zeta \). Furthermore, the map \((\zeta ,F) \mapsto \mathcal{K}^\zeta F\) is Lipschitz continuous in the natural topologies. This allows us to define an operator \(\mathcal{P}_D:\mathcal{D}^{\kappa ,(\kappa -1)_3}(V) \rightarrow \mathcal{D}^{\kappa +2,(1-\kappa )_3}(V)\) by setting
Our discussion suggests that, instead of (3.4), we should consider the fixed point problem
which admits unique local solutions in \(\mathcal{D}^{1+2\kappa ,(2\kappa )_3}(V)\) by [9], which are continuous with respect to admissible models on the full regularity structure satisfying furthermore the consistency condition (3.7). Here, we need to choose \(\kappa \) small enough to guarantee that \(P_{\mathrm{Neu}}u_0\) does indeed belong to the space \(\mathcal{C}^{\kappa +2,(1-\kappa )_3}\), which is possible thanks to our assumption that \(u_0\) itself is Hölder continuous for some positive exponent.
Retracing the discussion given at the beginning of the proof, but now with the renormalised model \({\hat{{\varvec{\Pi }}}}_\varepsilon \) such that, in addition to (3.5), one has \({\hat{{\varvec{\Pi }}}}_\varepsilon \iota \tau = \mathbf {1}_{\mathbf{R}\times D} {\hat{{\varvec{\Pi }}}}_\varepsilon \tau \) for \(\tau \in V\), we conclude that for \(\varepsilon > 0\), solutions to (3.8) coincide with those of (1.1). We now refer to Theorem 4.8 which shows that the sequence of models \({\hat{{\varvec{\Pi }}}}_\varepsilon \) converges, as \(\varepsilon \rightarrow 0\), to a limiting model \({\hat{{\varvec{\Pi }}}}\) such that , extended canonically to the whole regularity structure. It follows immediately that the solution \({\bar{U}}\) to (3.4) with the model \({\hat{{\varvec{\Pi }}}}\) is such that \(u^{(0)}= \mathcal{R}{\bar{U}}\) does indeed solve (1.5) as claimed. \(\square \)
Remark 3.2
Note that (3.7) does not force us to set , but we could have added a sufficiently regular distribution supported on \(\partial D\). This however would break the identity
on \([0,1]\times D\) and would therefore modify the boundary condition of the resulting solution.
4 Central Limit Theorem
We now turn to the proof of the main result of this article, Theorem 1.2. We will mainly focus on the case of Neumann boundary conditions in dimension \(d=3\), which is the most interesting (and technically most difficult) case. We set
With this notation, we then have in the case \(d\in \{2,3\}\)
where, setting
we have the explicit expression for the remainder term
Furthermore, due to the non-vanishing boundary condition of \(u^{(1)}\) in the Neumann case, \(v_\varepsilon \) is then endowed with the inhomogeneous boundary condition
We now also incorporate the first part of the second line into the remainder, so that we can write
where the remainder term \({\hat{R}}_\varepsilon ^{(d)}\) is given by
Note that here we have H appearing in (4.3) rather than \(H_\eta \) and that the two are related by (1.4). In dimension \(d=1\), we set \(u^{(1)}= 0\) so that \({\hat{R}}_\varepsilon ^{(d)}= R_\varepsilon ^{(d)}\) and \(v_\varepsilon = (u_\varepsilon - u^{(0)})/\sqrt{\varepsilon }\), and we obtain in the same way the slightly simpler expression
(The reason why the term containing \(\Psi \) does not appear in this expression is because this was generated by \(\partial _t u^{(1)}\) which vanishes by definition in dimension one.) The exponent \(\alpha \) appearing in this expression is of course arbitrary, but allowing to tune it will be convenient when expressing this as a fixed point problem.
4.1 Decomposition of the Solution
In order to show that \(v_\varepsilon \) converges to a limit, it will be convenient to break it into a sum of three terms. The first term will be a straightforward approximation to the stochastic heat equation with noise strength \(G(u^{(0)})\) and homogeneous boundary condition. The second term will converge to 0, but incorporates the diverging boundary condition, which is used to compensate a resonance appearing in its right hand side. The final term will be a remainder that is sufficiently regular to be dealt with by the techniques of [9]. For this, we write
and, with the convention that G and its derivatives are always evaluated at \(u^{(0)}\), we set
endowed with the boundary conditions on \(\partial D\)
as well as vanishing initial conditions. The reason why \(v^{(1)}_\varepsilon \) will actually converge to 0 despite the diverging boundary condition when \(d \ge 2\) is the following. Consider the function defined by
Then, we will see in (4.24) that the behaviour of \(v^{(0)}_\varepsilon \) is locally very well described by that of
where as usual. This implies that the behaviour of the term \(G'(u^{(0)}) v^{(0)}_\varepsilon \eta _\varepsilon \) appearing in the right hand side of the equation for \(v^{(1)}_\varepsilon \) is well described locally by that of
Where denotes multiplication by \(\eta _\varepsilon \) as previously, while denotes multiplication by \(\eta _\varepsilon \mathbf {1}_{\mathbf{R}\times D}\). (This will be formalised later on.) We will see that, up to vanishingly small errors, , while for a suitable constant c (in fact a different constant for each face of \(\partial D\) in general), so that the first term in (4.9) is cancelled up to small errors by the term appearing in the equation for \(v^{(1)}_\varepsilon \), while the second term in (4.9) is cancelled by the term \(-\varepsilon ^{1-{d\over 2}} c GG' \delta _{\partial D}\) created by the boundary condition.
Note that this argument does not see much of a difference between the Neumann and Dirichlet cases. Indeed, if we want the right hand side of the equation for \(v^{(1)}_\varepsilon \) to converge to a limiting distribution for the latter, we also need to add a diverging (when \(d=3\)) term proportional to \(\delta _{\partial D}\). The difference is that \(K_{\mathrm{Dir}}(z,z')\) vanishes for \(z' \in \partial D\), so that this term has no influence on \(v^{(1)}_\varepsilon \) in the Dirichlet case.
The idea now is to proceed as follows:
-
In a first step, we describe in Section 4.2 a regularity structure that is sufficiently rich to allow us to give precise control on the behaviour of \(v^{(0)}_\varepsilon \), \(v^{(1)}_\varepsilon \) and \(v_\varepsilon \). As already alluded to, this will in particular include symbols representing non-stationary space-time stochastic processes, but we will try to keep these to an absolute minimum. We also describe there the renormalisation procedure which allows us to construct a suitable (random) model.
-
We then make precise the description (4.8) for \(v^{(0)}_\varepsilon \) by expressing it as a modelled distribution in this regularity structure, which in particular contains the two symbols and . The main conclusion in (4.24) will be that the presence of allows us to express \(v^{(0)}_\varepsilon \) as a modelled distribution with a good behaviour near \(\partial D\). (At this stage, we could of course also use one symbol only and define our model using the Neumann heat kernel only, but the decompositions (4.8) and (4.9) are convenient for the remainder of the argument and to be able to reuse existing results.)
-
In a second step, we show that if we set
then \({\hat{{\varvec{\Pi }}}}_\varepsilon \) converges to a limiting model as \(\varepsilon \rightarrow 0\), which furthermore has good restriction properties to D, uniformly in \(\varepsilon \). Furthermore, since is only singular near the boundary of D, we can describe \(v^{(0)}_\varepsilon \) by another modelled distribution with worse behaviour near the boundary, but which only uses “translation invariant symbols” in its description, see Lemma 4.10, which allows us to give a description of \(v^{(1)}_\varepsilon \) in terms of such symbols in Lemma 4.12.
-
We set up a sufficiently large regularity structure so that we can formulate a fixed point problem for the remainder \({\bar{v}}_\varepsilon \) and control its behaviour as \(\varepsilon \rightarrow 0\), see Propositions 4.16 and 4.18. Combining this with the convergence of the corresponding renormalised model which is performed in Theorem 4.8 but relies crucially on the next section, we are finally able to conclude.
4.2 Definition of the Ambient Regularity Structure
We start by defining a regularity structure that is sufficiently large to allow us to perform the steps mentioned above and in particular to formulate (4.2) as a fixed point problem for a modelled distribution V. This fixed point problem will be chosen precisely in such a way that if we take as our model the renormalised lift of the noise \(\eta _\varepsilon \), then the corresponding counterterms are precisely such that if V solves the fixed point problem, then \(\mathcal{R}V\) solves (4.2).
Let be new symbols representing \(\xi _\varepsilon \varepsilon ^{\alpha (i,j)}\) with
(Beware that j is simply a superscript in , not a power.) In the graphical notation analogous to [14], we will use “accents” to denote the upper and lower indices on , so for example , , etc. We will also sometimes write instead of . The degree of these symbols is given by
The main reason why we never consider these “noises” as having degree larger than \({\delta - 1\over 2}\) is that we want to view them as “noise types” and so the structure group should act trivially on them. A byproduct of this choice is that it allows us to deal with driving noises \(\eta \) that are themselves unbounded.
We now build a regularity structure extending the one built in Section 3, in which (4.35) can be formalised, by applying the framework of [5, Sec. 5]. We take as our basic “noise types” the noises with \(0 \le j \le i \le 2\), as well as an additional noise type which will be used to represent the noise \(\xi _\varepsilon \), restricted to \(\mathbf{R}\times D\). We also introduce two “edge types” representing convolution by a suitable cutoff \(\mathcal{K}\) of the standard heat kernel in the whole space and representing the integral operator with kernel \(K_{\mathrm{Neu}}\) as in (3.1) (see “Appendix A” for a precise definition), both having degree 2. It will be convenient to also introduce edge types and with \(i=1,\ldots ,d\) of degree 3 representing the integral operators with kernel \(K_i(x,y) = (y_i-x_i)K(y-x)\) and \(K_{{\mathrm{Neu}},i}(x,y) = (y_i-x_i) K_{\mathrm{Neu}}(y-x)\) respectively. Finally, we introduce a “virtual” edge types of degree 2 which will allow us to produce a rule (in the technical sense of [5, Sec. 5]) generating the relevant trees containing non-translation invariant symbols and , but without cluttering our regularity structure with unneeded symbols.
The rule used to generate the regularity structure is then given by and
Recall that, given a collection \({\mathfrak {L}}\) of “edge types” (in our case, these are the “integration” types , , etc as well as the “noise” types and which are also interpreted as edges for the purpose of this discussion), a “rule” is a map \(R :{\mathfrak {L}}\rightarrow \mathcal{P}({\hat{\mathcal{P}}}({\mathfrak {L}})) {\setminus } \{\emptyset \}\), where \(\mathcal{P}(A)\) denotes the powerset of a set A, \({\hat{\mathcal{P}}}({\mathfrak {L}})\) denotes the set of non-empty multisets with elements in \({\mathfrak {L}}\).Footnote 3 In other words, an element \(R({\mathfrak {t}})\) is a collection of “node types”, with each node type being a collection of edge types, with repetitions allowed. In (4.12), we use the identification between a multiset and a formal monomial, that is denotes the multiset with one copy of and two copies of .
The basis vectors for our regularity structure are rooted trees \((V, E, \rho )\) with edges \(e \in E\) labelled by \({\mathfrak {L}}\) and nodes \(v \in V\) labelled by \(\mathbf{N}^d\), denoting polynomial factors, with the convention that \(0 \in \mathbf{N}\). By convention, edges are oriented away from the root. Any node \(v\in V\) then has a “type” \(\mathcal{N}(v) \in {\hat{\mathcal{P}}}({\mathfrak {L}})\) given by the collection of the types of the outgoing edges adjacent to v. If \(v \in V {\setminus } \{\rho \}\), there is a unique edge coming into v and we write \({\mathfrak {t}}(v) \in {\mathfrak {L}}\) for the type of the incoming edge. A tree is said to “conform” to a rule R if, whenever \(v \in V {\setminus } \{\rho \}\), one has \(\mathcal{N}(v) \in R({\mathfrak {t}}(v))\), while \(\mathcal{N}(\rho ) \in \bigcup _{{\mathfrak {t}}\in {\mathfrak {L}}} R({\mathfrak {t}})\).
Remark 4.1
Note that never appears inside any \(R({\mathfrak {t}})\), so that a tree conforming to the rule R is not allowed to have any edge of type . The only reason for its presence is to allow the root of a conforming tree to be of type , since otherwise the tree which will be used later on would not be conforming to our rule. Actually, the symbol , which was introduced in a completely ad hoc manner so far, will then be interpreted as
We assign degrees to the components appearing here by (4.11) as well as
It is straightforward to verify that this rule is subcritical and complete. We henceforth denote by \((\mathcal {T},\mathcal {G})\) the (reduced in the terminology of [5, Sec. 6.4]) regularity structure generated by the rule R.
4.3 Description of the Models
Throughout this article, we will use the notation \({\varvec{\Pi }}\) (possibly with additional decorations) for a continuous linear map \({\varvec{\Pi }}:\mathcal{T}\rightarrow \mathcal{D}'(\mathbf{R}^{d+1})\) such that there exists a (necessarily unique) admissible model \((\Pi ,\Gamma )\) related to \({\varvec{\Pi }}\) by [11, Sec. 8.3]. Here, we associate the kernel K to and , \(K_i\) to , and \(K_{\mathrm{Neu}}\), \(K_{{\mathrm{Neu}},i}\) to , respectively.
We henceforth make a slight abuse of terminology and call \({\varvec{\Pi }}\) itself a model. The following notion of an “admissible” model is a slight strengthening of the usual one to our context which essentially states that our “square” symbols are the restrictions of the “round” symbols to \(\mathbf{R}\times D\).
Definition 4.2
A model \({\varvec{\Pi }}\) for \((\mathcal {T},\mathcal {G})\) is admissible if it is admissible in the sense of [11, Def. 8.29] for the kernels listed above and furthermore, for any \(\tau \in \mathcal {T}\) and i, j such that , one has
for test functions \(\phi \) with \({\mathrm{supp}}\,\phi \subset \mathbf{R}\times D\) and for test functions with \({\mathrm{supp}}\,\phi \subset \mathbf{R}\times D^c\).
Remark 4.3
Note furthermore that all basis vectors \({\hat{\tau }} \in T\) with \(\deg {\hat{\tau }} < -1\) are of the form (or similar with replaced by ) for some i, j and \(\tau \) as in Definition 4.2. As a consequence, admissible models in our sense admit a canonical decomposition \({\varvec{\Pi }}= {\varvec{\Pi }}^+ + {\varvec{\Pi }}^-\) as in Assumption C.1 by setting . This is a crucial fact which allows us to use Theorem C.5.
We also define the following notion:
Definition 4.4
Consider the rule given by and
We write for the sector spanned by trees conforming to and we call the translation invariant sector.
Given any admissible model for \((\mathcal {T},\mathcal {G})\), we then write \(\mathcal{K}\), \(\mathcal{K}_i\), \(\mathcal{K}_{\mathrm{Neu}}\), \(\mathcal{K}_{{\mathrm{Neu}},i}\) for the corresponding integration operators as defined in [11, Sec. 5]. We now provide a complete description of the renormalised models \({\hat{{\varvec{\Pi }}}}_\varepsilon \) we will use for this regularity structure. From now on, whenever we do not explicitly mention a model on \((\mathcal{T},\mathcal{G})\), we assume that we talk about the specific (random) model \({\hat{{\varvec{\Pi }}}}_\varepsilon \), and not a general admissible model. In particular, all the variants of the spaces \(\mathcal{D}^\gamma \) used below will be the spaces associated to this model. In order to specify the \(\varepsilon \)-dependence, we will write \(\mathcal{R}_\varepsilon \) for the reconstruction operator associated to the model \({\hat{{\varvec{\Pi }}}}_\varepsilon \). Theorem 4.8 shows that \({\hat{{\varvec{\Pi }}}}_\varepsilon \) converges to some limiting model \({\varvec{\Pi }}\) as \(\varepsilon \rightarrow 0\), giving rise to a reconstruction operator \(\mathcal{R}_0\). Whenever we simply write \(\mathcal{R}\), it denotes the reconstruction operator for a generic admissible model.
Writing \(\mathbf {1}^D\) for the indicator function of \(\mathbf{R}\times D\), we first define a family of non-renormalised (random) models \({\varvec{\Pi }}_\varepsilon \) as the canonical lift for the “noise” given by
with \(\alpha (i,j)\) as in (4.10). We then define an intermediate model \({\tilde{{\varvec{\Pi }}}}_\varepsilon \) related to \({\varvec{\Pi }}_\varepsilon \) by
as well as . We furthermore impose admissibility which forces us to set . One can verify that the only remaining basis vector of \(\mathcal{T}\) of negative degree and not belonging to the translation invariant sector is . However, this element will never be needed for our considerations, so we do not need to specify the action of \({\tilde{{\varvec{\Pi }}}}_\varepsilon \) on it.
We now construct a renormalised model \({\hat{{\varvec{\Pi }}}}_\varepsilon \) from \({\tilde{{\varvec{\Pi }}}}_\varepsilon \) by applying a slight modification of the BPHZ renormalisation procedure [5, 6] to the translation invariant sector (which can be viewed as a regularity structure in its own right, generated by the rule ). Writing \(\mathcal{T}_-\) for the subspace of consisting of symbols of strictly negative degree, we will define \({\hat{{\varvec{\Pi }}}}_\varepsilon \) by an expression of the form
where \(\Delta ^-\) is a certain linear operator from \(\mathcal{T}\) into \({\text {Alg}}\mathcal{T}_-\otimes \mathcal{T}\), with \({\text {Alg}}\mathcal{T}_-\) the free unital algebra generated by \(\mathcal{T}_-\), and \(g_\varepsilon \) is a character of \({\text {Alg}}\mathcal{T}_-\), which is canonically identified with an element of the dual space \(\mathcal{T}_-^*\). In the BPHZ renormalisation procedure, one should choose \(g_\varepsilon \) of the form
where \(\mathcal{A}:{\text {Alg}}\mathcal{T}_- \rightarrow {\text {Alg}}\mathcal{T}\) is the “twisted antipode” [5] and \(\mathbf{E}{\tilde{{\varvec{\Pi }}}}_\varepsilon \) is the character of \({\text {Alg}}\mathcal{T}\) determined by \((\mathbf{E}{\tilde{{\varvec{\Pi }}}}_\varepsilon )(\tau ) = \mathbf{E}({\tilde{{\varvec{\Pi }}}}_\varepsilon \tau )(\phi )\), for any fixed test function \(\phi \) with \(\int z^k \phi (z)\,{\hbox {d}}z = \delta _{k,0}\) for |k| small enough. (In our case \(|k| \le 1\) suffices.)
Recall that \(\Delta ^-\) is an “extraction/contraction” operator which iterates over all possible ways of extracting divergent subsymbols of its argument, so for example
The twisted antipode behaves in a somewhat similar fashion, in this case
Remark 4.5
Inspection of the rule and our degree assignment shows that the basis vectors of \(\mathcal{T}_{-}\) are given by
It will be convenient to have an alternative degree assignment \({\overline{\text {deg}}}\) on which better reflects the self-similarity properties of our objects given by setting
and then extending it as usual. Instead of choosing the character \(g_\varepsilon \) as in the BPHZ specification (4.14), it will be convenient to choose it in a way such that, for some constants \(C(\tau )\) that are independent of \(\varepsilon \), one has
For \({\overline{\text {deg}}}\tau = 0\), we choose \(g_\varepsilon (\tau ) = -C(\tau )\) whenever the symbol \(\tau \) contains an “accent”, that is one of the noises with \(i+j > 0\). The only symbols of negative degree without accents that appear in the translation invariant sector of our regularity structure and that contain at least two copies of the noise are and , which are of vanishing degree \({\overline{\text {deg}}}\) in dimensions 2 and 3 respectively, and will be considered separately below.
The constants \(C(\tau )\) themselves are chosen to coincide with the ones appearing in Theorems 1.1 and 1.2 with the convention that for any symbol \(\tau \) the constant \(C(\tau )\) is also written as the same symbol \(\tau \), but drawn in red and with its “accents” stripped. For example, we set
We also set \(C(\tau ) = 0\) whenever \(\tau \) contains only one instance of the noise, namely we set
[In general, we should also set \(C(\tau ) = 0\) if \(\tau \) is of the form for some \(\tau '\), but the only symbol of negative degree of this type appearing in this work is which is already covered by (4.16).] Other constants that will be relevant for our analysis (in dimension \(d=3\)) are given by as well as
The convergence of integrals corresponding to , and in dimension 3 can easily be verified by using our assumption on the cumulants and the self-similarity of the heat kernel. The convergence of the integral for is more subtle since . As a consequence, although \(\kappa _2(z,z')\) decays fast enough when \(\Vert z-z'\Vert \) is large, the function \(z \mapsto x P^2(z)\) is homogeneous of (parabolic) degree \(-5\) and is therefore not absolutely integrable at large scales. However, since it is odd under \((t,x) \mapsto (t,-x)\), additional cancellations occur and the integral should be interpreted as
which does converge absolutely, so we set in dimension 3.
In dimension 2, , but and the expectation of diverges logarithmically and the expression given above fails to converge. We then have no choice but to set
where \(K_\varepsilon (t,x) = \varepsilon ^2 K(\varepsilon ^2 t, \varepsilon x)\). We then have the following preliminary result:
Proposition 4.6
The model \({\hat{{\varvec{\Pi }}}}_\varepsilon \) restricted to the translation invariant sector converges as \(\varepsilon \rightarrow 0\) to the BPHZ model \({\hat{{\varvec{\Pi }}}}\) such that and for \(i+j > 0\).
Proof
Consider the “BPHZ model” \({\varvec{\Pi }}_\varepsilon ^{{\textsc {bphz}}}\) on given by
with \(g_\varepsilon ^{{\textsc {bphz}}}\) defined as in (4.14). Thanks to Proposition 5.5, our noises are such that the norm of [6, Def. A.18] is finite, uniformly in \(\varepsilon \), for the cumulant homogeneity described in (5.3).
It therefore follows from [6, Thm 2.33] that \({\varvec{\Pi }}_\varepsilon ^{{\textsc {bphz}}}\) converges to \({\hat{{\varvec{\Pi }}}}\). Since furthermore the action of the character group of \(\mathcal{T}_-\) on the space of admissible models is continuous [5], it suffices to show that one can write
for some \(\delta g_\varepsilon \in \mathcal{T}_-^*\) with \(\lim _{\varepsilon \rightarrow 0} \delta g_\varepsilon = 0\). For this, the following result is useful:
Lemma 4.7
Let \(g,{\bar{g}} \in \mathcal{T}_-^*\) such that furthermore \(g(\tau ) = {\bar{g}}(\tau ) = 0\) for every \(\tau \) of the type (4.16). Then, one has the identity
for all \(\tau \in \mathcal{T}\).
Proof
where \(\Delta ^{-}:{\text {Alg}}\mathcal{T}_- \rightarrow {\text {Alg}}\mathcal{T}_- \otimes {\text {Alg}}\mathcal{T}_-\) is an extraction/contraction operator defined just like above, but extended multiplicatively to \({\text {Alg}}\mathcal{T}_-\) and such that only those terms are kept that actually belong to \({\text {Alg}}\mathcal{T}_- \otimes {\text {Alg}}\mathcal{T}_-\) (that is every factor needs to be of negative degree on both sides of the tensor product). Inspection of the list (4.15) reveals that in our case, the only situation in which we have a “subsymbol” of negative degree appearing in any of our symbol in such a way that the contracted symbol is still of negative degree is when the subsymbol contains only one noise. We conclude that
and the claim follows. \(\quad \square \)
We conclude from Lemma 4.7 that (4.18) holds with \(\delta g_\varepsilon = g_\varepsilon - g_\varepsilon ^{{\textsc {bphz}}}\), so that it remains to show that \(\lim _{\varepsilon \rightarrow 0} \delta g_\varepsilon (\tau ) = 0\) for every \(\tau \in \mathcal{T}_-\). For elements \(\tau \) of the form \(\tau = X^k \Xi _i^j\) we have \(g_\varepsilon (\tau ) = g_\varepsilon ^{{\textsc {bphz}}}(\tau ) = 0\). For all other elements \(\tau \) with \({\overline{\text {deg}}}\tau \le 0\), a simple scaling argument shows that \(g_\varepsilon ^{{\textsc {bphz}}}(\tau )\) is given by the exact same formula as \(g_\varepsilon (\tau )\), except that all instances of the heat kernel P are replaced by \(K_\varepsilon \), where
Note that \(K_\varepsilon \) coincides with P in a parabolic ball of radius \({\mathcal {O}}(1/\varepsilon )\) around the origin and vanishes outside of another ball of radius \({\mathcal {O}}(1/\varepsilon )\).
This in particular shows that
Since \(\kappa _2(0,z)\) decreases like \(\Vert z\Vert ^{-2{\bar{c}}} = \Vert z\Vert ^{-(d+2+2\delta )}\) for large z and P decreases like \(\Vert z\Vert ^{-d}\), it follows that
which of courses converges to 0. The symbols \(\tau \) differing from only by the placement of their accents then also converge since \(\delta g_\varepsilon (\tau )\) is given by the same expression as (4.19), except for being multiplied by a higher power of \(\varepsilon \).
Turning now to (which only appears when \(d \in \{2,3\}\)), it follows from [15, Lem. 6.8] that is a sum of terms of the form
where \(|G(z)| \lesssim (1+\Vert z\Vert )^{2-2{\bar{c}}} = (1+\Vert z\Vert )^{-(d+2\delta )}\). It follows that as desired, and is controlled in the same way by a higher power. In dimension 3, and it was shown in (4.17) that converges absolutely, which immediately implies that .
To deal with the symbol (again with \(d \in \{2,3\}\)), we first note that Assumption 2.1 implies the bound
We also note that for any \(\kappa \in [0,d]\) one has the bound
This allows us to make use of [13, Thm 4.3]. Since we apply the bound (4.20) with \(\kappa = \delta + {d-2\over 2}\) which, in the notation of [13], yields a bound of the type
for some \(\varepsilon \)-dependent kernel assignment \(({\bar{K}},{\bar{R}}) \in \mathcal{K}_0^- \times \mathcal{K}_0^+\) with bounds that are independent of \(\varepsilon \) and the Feynman diagram
Here, the first coordinate of the label for each edge denotes its small-scale degree assignment while the second coordinate denotes its large-scale degree assignment. It is straightforward to verify that the small-scale degree assignment for this diagram satisfies the assumption of [13, Prop. 2.3], so that it does not require renormalisation. Furthermore, the large-scale degree assignment is seen to satisfy the assumption of [13, Thm 4.3], which guarantees that the integral converges absolutely and is bounded independently of \(\varepsilon \), so that .
The symbols and (in dimension 3) which have vanishing degree \({\overline{\text {deg}}}\), can be dealt with using the same technique, leading to the bound by using the Feynman diagrams
for bounding and
for bounding . All three are easily seen to satisfy both the small-scale and large-scale integrability conditions.
The remaining three symbols in the list (4.15) are all such that , so we need to show that . This can in principle be shown again by using the bounds from [13]. A “cheaper” way of showing that is to note that in all three cases we can make use of a combination of Proposition 5.5 (used in the same way as in the proof of Proposition 5.9) and (4.14) to conclude that one can build a regularity structure \({\hat{\mathcal{T}}}\) extending (by adding additional “noises” representing \(\eta _\varepsilon ^{(\alpha )}\) for suitable choices of \(\alpha \)) such that, for every , one can find \(\kappa > 0\) and a symbol such that and such that
for some suitable fixed test function \(\phi \). Since we know from [6, Thm 2.33] that the BPHZ renormalised model \({\varvec{\Pi }}_\varepsilon ^{{\textsc {bphz}}}\) converges (so in particular remains uniformly bounded), we conclude that as required. \(\quad \square \)
Theorem 4.8
The random models \({\hat{{\varvec{\Pi }}}}_\varepsilon \) converge weakly to a limiting admissible model \({\hat{{\varvec{\Pi }}}}\) which, on the translation invariant sector, is given by the BPHZ lift of
For the remaining symbols, it is given by the unique admissible model such that
as well as \({\hat{{\varvec{\Pi }}}} \tau = 0\) for any symbol \(\tau \) containing the noise .
Remark 4.9
Note that only symbols with \(i \ge j\) appear in our regularity structure.
Proof
Convergence on the translation invariant sector was already shown in Proposition 4.6, so it only remains to consider the non-translation invariant symbols of negative degree. In dimension 3, these are , , , , , , , and . (There is also the symbol , but applying the model to it yields the exact same distribution as when applying it to .)
The convergence on the remainder of the regularity structure is shown in the next section, but we collect the various parts of the proof here. Convergence of , and to \(\xi \mathbf {1}^D\) and 0 in \(\mathcal{C}^{-\frac{5}{2}-\kappa }\), \(\mathcal{C}^{-1-\kappa }\) and \(\mathcal{C}^{-{3\over 2}-\kappa }\) respectively follows from Corollary 5.13 and Corollary 5.6.
Convergence of essentially follows from [6, Thm 2.31], noting that the bound for does not require any derivative of the test function in this case, so that we immediately obtain the required bound by noting that . The reason why this is so is that the only point in the proof where derivatives of the test function could potentially appear is in the bound [6, Eq. A.29] in the proof of [6, Thm A.32]. By the definition of \(R(\mathbf {S})\), these derivatives can only hit a test function in a situation where \(\tau \) contains a connected subtree containing its root and of degree less than \(-{d+2\over 2}\). This is not the case for .
Convergence of , and also follows in the same way. Regarding , we note that by (4.23) and we already obtained convergence of . Convergence of is the content of Theorem 5.20. The convergence of follows from Corollary A.3, combined with the usual Schauder estimates for integration against K. \(\square \)
Most of Section 5 is devoted to filling in the missing parts in the proof of Theorem 4.8, namely the proofs of Theorem 5.11 and Theorem 5.20.
4.4 Description of \(v^{(0)}_\varepsilon \)
We now have the notation in place to formalise the discussion given above regarding the local behaviour of \(v^{(0)}_\varepsilon \) and \(v^{(1)}_\varepsilon \). Recall the definition (4.5) of \(v^{(0)}_\varepsilon \) which we rewrite in integral form as
Depending on context, we will model \(v^{(0)}_\varepsilon \) by three different modelled distributions \(V_\varepsilon ^{(0)}\), \({\tilde{V}}_\varepsilon ^{(0)}\) and \({\hat{V}}_\varepsilon ^{(0)}\).
Regarding \(V^{(0)}_\varepsilon \), we use Proposition 6.1, which guarantees that one can find \(\Phi _\varepsilon \in \mathcal{C}^{{3\over 2}-\kappa ,-{1\over 2}-\kappa }\) such that, settingFootnote 4
where the integration operators \(\mathcal{K}_{\mathrm{Neu}}\), and \(\mathcal{K}_{{\mathrm{Neu}},i}\) should be interpreted as the natural extensions of the corresponding integration operators described in Proposition 6.1, one has
provided that we consider admissible models \({\hat{{\varvec{\Pi }}}}_\varepsilon \) with . Note that \(V^{(0)}_\varepsilon \) depends on \(\varepsilon \) via the choice of model \({\hat{{\varvec{\Pi }}}}_\varepsilon \). Furthermore, Theorem 4.8 and Lemma 5.14 guarantee that the bounds on \(V^{(0)}_\varepsilon \) are uniform over \(\varepsilon \) and that one has \(V^{(0)}_\varepsilon \rightarrow V^{(0)}\) in \(\mathcal{D}^{{3\over 2}-\kappa ,-{1\over 2}-\kappa }\) with respect to the renormalised models \({\hat{{\varvec{\Pi }}}}_\varepsilon \) and the limiting model \({\hat{{\varvec{\Pi }}}}\).
It is natural at this stage, as already mentioned earlier, to define the symbol as the element of \(\mathcal{T}\) of degree \(-{1\over 2}-\kappa \) given by
which is indeed consistent with (4.7). With this notation, Propositions A.2 and A.4 combined with (4.22) guarantee the existence of a function \({\tilde{\Phi }}_\varepsilon \in \mathcal{C}^{{3\over 2}-\kappa ,w}\) with \(w = \bigl (-{1\over 2}-\kappa ,{1\over 2}-\kappa ,-{1\over 2}-\kappa \bigr )\) in the sense of [9, Def. 3.2] such that if we set \({\tilde{V}}^{(0)}_\varepsilon = {\tilde{V}}^{(0,1)}_\varepsilon + {\tilde{V}}^{(0,2)}_\varepsilon \) with
then we have \({\tilde{V}}^{(0,1)}_\varepsilon \in \mathcal{D}^{\frac{3}{2}-\kappa }\), \({\tilde{V}}^{(0,2)}_\varepsilon \in \mathcal{D}^{\frac{3}{2}-\kappa ,w}\) and \(\bigl (\mathcal{R}_\varepsilon {\tilde{V}}^{(0)}_\varepsilon \bigr )(\phi ) = \bigl (\mathcal{R}_\varepsilon V^{(0)}_\varepsilon \bigr )(\phi )\) for all test functions \(\phi \) supported in \(\mathbf{R}_+ \times D\). As before, all these objects converge in the limit \(\varepsilon \rightarrow 0\) provided that the underlying models converge. To see this, we note that we can choose
and then apply Proposition A.4 to bound the first term and Proposition A.2 to bound the remaining two terms.
On the other hand, we define \({\hat{V}}^{(0)}_\varepsilon \) by setting
where we use the convention that \(u^{(0)}\) is extended outside of \(\mathbf{R}_+ \times D\) in any way that makes it globally \(\mathcal{C}^{3}\). (For positive times, this is possible since the extension of \(u_0\) to the whole space by suitable reflections is of class \(\mathcal{C}^{3}\) by our assumptions. For negative times, this is possible by Whitney’s extension theorem [21].) Here we made a slight abuse of notation: the operator \(\mathcal{K}\) should be interpreted in the sense of [9, Sec. 4.5] with , which converges as \(\varepsilon \rightarrow 0\) by an argument very similar to that of Proposition 6.1, combined with the fact that \(G(u^{(0)})\) is \(\mathcal{C}^3\).
Note that we have
The modelled distribution \({\hat{V}}^{(0)}_\varepsilon \) exhibits rather singular behaviour near the boundary of the domain, but by Proposition A.2 it converges as \(\varepsilon \rightarrow 0\) in \(\mathcal{D}^{1,w}\) with \(w = \big (-{1\over 2} - \kappa \big )_3\) [we use again the notation \((\eta )_3 = (\eta ,\eta ,\eta )\)]. It has the advantage however of not involving the integration map and the restricted noise , so it is purely described in terms of the “translation invariant” part of the regularity structure, that is the sector generated by and . We summarise the above discussion with the following statement.
Lemma 4.10
We have \({\hat{V}}^{(0)}_\varepsilon \in \mathcal{D}^{1,(-\frac{1}{2}-\kappa )_3}_{-\frac{1}{2}-\kappa }\), \(\mathcal{R}_\varepsilon {\hat{V}}^{(0)}_\varepsilon = v^{(0)}_\varepsilon \) on \(\mathbf{R}_+ \times D\), and \({\hat{V}}^{(0)}_\varepsilon \) is of the form
for some continuous function \(\Phi ^{(0)}_\varepsilon \). Furthermore, \(\lim _{\varepsilon \rightarrow 0} {\hat{V}}^{(0)}_\varepsilon = {\hat{V}}^{(0)} \in \mathcal{D}^{1,(-\frac{1}{2}-\kappa )_3}\) with \(\mathcal{R}_0 {\hat{V}}^{(0)} = P_{\mathrm{Neu}}\bigl (\mathbf {1}_+^D G(u^{(0)})\xi \bigr )\).
4.5 Description of \(v^{(1)}_\varepsilon \)
Recall that \(v^{(1)}_\varepsilon \) was defined in (4.6) as the solution to an inhomogeneous linear equation with inhomogeneous boundary conditions (et least in the Neumann case). Regarding \(v^{(1)}_\varepsilon \), we would like to describe it by a modelled distribution \(V^{(1)}_\varepsilon \) given by
with \(\mathcal{P}_{\mathrm{Neu}}\) as in (3.2) and where \(v^{(1,\star )}_\varepsilon \) would be given by the solution to
endowed with homogeneous Neumann boundary conditions. The reason why the identity \(\mathcal{R}_\varepsilon V^{(1)}_\varepsilon = v^{(1)}_\varepsilon \) holds (on \(\mathbf{R}_+\times D\) as usual) is as follows. By (4.24),
It then follows from the definition (4.13) of the renormalised model and the fact that \(\mathcal{R}_\varepsilon ({\tilde{V}}_\varepsilon ^{(0)}) = v^{(0)}_\varepsilon \) that, applying the reconstruction operator to this expression yields
We then note that the two terms appearing on the second line, when multiplied by another factor of \(G(u^{(0)})\), are exactly the additional two terms appearing on the first line of (4.6), while the singular term involving Dirac masses on the boundary of D, when hit by \(K_{\mathrm{Neu}}\), is responsible for the non-homogeneous boundary conditions.
The problem with such a definition is that it yields a description of \(V^{(1)}_\varepsilon \) in terms of symbols involving , while the general convergence results of [6] require translation invariance of the noise objects, which is not the case here. If we were to try to improve the situation by replacing by in (4.25), then we immediately run into the problem that the behaviour of this modelled distribution on the boundary of D is too singular for the general results of [9] to apply. This is not just a technicality: this singular behaviour is precisely what is responsible for the additional boundary renormalisation!
Instead, we define \(V^{(1)}_\varepsilon \) as a sum of terms that is “equivalent” to the definition (4.25) in the sense that they reconstruct the same function, but such that each of the terms can be controlled in a slightly different, situation-specific, way.
We first deal with the boundary correction by setting
Since with \(w = (-\frac{3}{2}-2\kappa ,-\frac{1}{2}-2\kappa ,-\frac{3}{2}-2\kappa )\) and since it belongs to a sector of regularity \(-\frac{3}{2}-2\kappa \), its reconstruction belongs to \(\mathcal{C}^{-\frac{3}{2}-2\kappa }\). It then follows from Proposition A.2 that \(V^{(1,0)}_\varepsilon \in \mathcal{D}^{2,(0)_3}\).
We now break up \({\tilde{V}}_\varepsilon ^{(0)}\) in (4.25) as in (4.24) and deal with the first term. By Proposition 6.3, we can find \(V^{(1,1)}_\varepsilon \in \mathcal{D}^{2-2\kappa ,{\bar{w}}}\) with \({\bar{w}} = \big ({1\over 2}-2\kappa ,{1\over 2}-2\kappa ,0\big )\) of the form
with \(\Phi _\varepsilon ^{(1,1)}\) taking values in the classical Taylor polynomials, and such that
The second term is dealt with similarly. As a consequence of Proposition 6.4 with \(g_1 = G'G(u^{(0)})\), \(g_{2,i} = G'(u^{(0)})^2 \partial _i u^{(1)}\) and \(g_3 = G'(u^{(0)}) {\tilde{\Phi }}_\varepsilon \) (with \({\tilde{\Phi }}_\varepsilon \) as in (4.25)), we can find \(V^{(1,2)}_\varepsilon \in \mathcal{D}^{2-2\kappa ,{\bar{w}}}\) such that
and such that furthermore \(V^{(1,2)}_\varepsilon \) takes values in the translation invariant sector and is of the form
for some \(\Phi ^{(1,2)}_\varepsilon \) taking values in the Taylor polynomials. In order to define \(V^{(1,3)}_\varepsilon \), we make use of the following lemma:
Lemma 4.11
Let \(\phi _\varepsilon \) be such that on \(\mathbf{R}_+ \times D\) one has the identity
and one has \(\phi _\varepsilon (t,x) = 0\) for \(t < 0\) or \(x \not \in D\). Then, for any \(\alpha \in [0,1)\), one has the bound \(\mathbf{E}\Vert \phi _\varepsilon \Vert _{\alpha +\frac{1}{2}-\kappa ,w} \lesssim \varepsilon ^{-\alpha }\) with \(w = \bigl (\alpha -\frac{1}{2}-\kappa \bigr )_3\).
Proof
We decompose \(\phi _\varepsilon \) as
and we treat the three terms separately. The first two terms are estimated by combining Proposition A.2 with Corollary 5.10.
The bound on the last term follows from combining Proposition 5.9 with Corollary B.5. To apply the latter, we set \(\theta = \kappa \) (small enough) and \(\chi = \alpha - {5\over 2}-\kappa \), which yields a bound in \(\mathcal{C}^{\kappa +2}\) on
Since \(K_i\) gains three derivatives, the term \(K_i \bigl (\mathbf {1}_+^D \xi _\varepsilon \bigr )\) itself satisfies the required bound and we are done. \(\square \)
Recalling \(W_{1,\varepsilon }\) as defined in (4.26), it follows from Lemma 4.11 combined with the definition of the renormalised model that we can rewrite it as
As a consequence of Proposition 5.9 (with \(\alpha = \kappa \)), combined with Lemma 4.11 (with \(\alpha = \frac{1}{2}\)), we conclude that one has
Indeed, the reconstruction theorem [9, Thm 4.9] and the multiplication rules [9, Lem. 4.3] imply that if \(g \in \mathcal{C}^{\gamma ,(\eta )_3}\) for \(\eta \le 0\) and \(\gamma > 0\), and \(\zeta \in \mathcal{C}^{\beta }\) with \(\beta \le 0\) and \(\gamma + \beta > 0\) then, provided that \(\eta + \beta > -1\), one has \(g \zeta \in \mathcal{C}^{\eta + \beta }\). (View \(\zeta \) as the constant function in a regularity structure containing only one symbol of degree \(\beta \) and apply the reconstruction theorem to \(g\zeta \).)
Since \(-\frac{1}{2} - 3\kappa > -1\), we can multiply such a distribution by the indicator function of \(\mathbf{R}_+ \times D\). It follows that, setting
we have \(\lim _{\varepsilon \rightarrow 0} V^{(1,3)}_\varepsilon = 0\) in \(\mathcal{D}^{\frac{3}{2}-3\kappa }\) and furthermore \(\mathcal{R}_\varepsilon V^{(1,3)}_\varepsilon = v^{(1,\star )}_\varepsilon \). Combining these definitions, we set
Summarising this discussion, one has the following result:
Lemma 4.12
We have \({\hat{V}}^{(1)}_\varepsilon \in \mathcal{D}^{\frac{3}{2}-3\kappa ,(0)_3}\), \(\mathcal{R}_\varepsilon {\hat{V}}^{(1)}_\varepsilon = v^{(1)}_\varepsilon \) and \({\hat{V}}^{(1)}_\varepsilon \) is of the form
for some \(\Phi ^{(1)}_\varepsilon \) taking values in the Taylor polynomials, and where \(\Phi ^{(0)}_\varepsilon \) is as in Lemma 4.10. Furthermore, \(\lim _{\varepsilon \rightarrow 0} {\hat{V}}^{(1)}_\varepsilon = {\hat{V}}^{(1)} \in \mathcal{D}^{\frac{3}{2}-3\kappa ,(0)_3}\) with \(\mathcal{R}_0 {\hat{V}}^{(1)} = 0\).
Proof
Collecting (4.28), (4.29), (4.30) and (4.33), we see that (4.34) holds, but with \(\Phi ^{(0)}_\varepsilon \) replaced by . These two expression are seen to coincide on \(\mathbf{R}_+ \times D\) by comparing Lemma 4.10 with (4.24).
The only statement we haven’t shown yet is that \(\mathcal{R}_0 {\hat{V}}^{(1)} = 0\). Since we already know by (4.32) that \(W_{1,\varepsilon }\) converges to 0 and since \(\mathcal{R}_\varepsilon {\hat{V}}^{(1)}_\varepsilon = \mathcal{R}_\varepsilon V^{(1)}_\varepsilon \), it remains by (4.25) to show that . This in turn is immediate from (4.27) when combined with Theorem 4.8 which guarantees that the limiting model vanishes on all accented symbols. \(\square \)
4.6 Formulation of the Fixed Point Problem
Introduce now a modelled distribution \({\overline{V}}_{\varepsilon }\) and, using the shorthand \(\hat{V}_\varepsilon = \hat{V}_\varepsilon ^{(0)} + \hat{V}_\varepsilon ^{(1)} + {\overline{V}}_{\varepsilon }\), consider the fixed point problem
where \({\tilde{R}}_\varepsilon ^{(d)} = 0\) for \(d = 1\) and
Remark 4.13
We will set this up as a fixed point problem in the space \(\mathcal{D}^{\frac{3}{2}-3\kappa ,(0)_3}\). Since and (in \(d \in \{2,3\}\) for the latter), this forces us to rely on Theorem C.5 for the reconstruction of the right hand side of (4.35) and to combine this with [9, Lem. 4.12] to provide an interpretation for the integration operator \(\mathcal{K}\) appearing in the definition (3.2) of \(\mathcal{P}_{\mathrm{Neu}}\).
Remark 4.14
Recall that the definition (4.3) of the remainder \({\hat{R}}_\varepsilon ^{(d)}\) involves an arbitrary exponent \(\alpha \). We henceforth fix a choice \(\alpha = \alpha (d)\) depending on the dimension, namely
All further statements about \({\hat{R}}_\varepsilon ^{(d)}\) hold for this particular choice.
We claim that with this definition and provided that we consider the renormalised model constructed in Section 4.3, (4.35) admits a unique solution in \(\mathcal{D}^{\frac{3}{2}-3\kappa ,(0)_3}\) and, provided that we set \(\varsigma = \varepsilon ^\alpha \xi _\varepsilon \) and \({\bar{\varsigma }} = \varepsilon ^{-1-\kappa } \sigma _\varepsilon \), one has \(v_\varepsilon = \mathcal{R}_\varepsilon \hat{V}_\varepsilon \). The reason for the appearance of \({\tilde{R}}_\varepsilon ^{(d)}\) is to cancel out some additional unwanted terms arising from the renormalisation procedure. Before this, we formulate a technical lemma, where we write \(\Vert \theta \Vert _\alpha \) for the \(\mathcal{C}^\alpha \) norm of the function / distribution \(\theta \) on \(D_T = [0,T] \times D\) with T as in Theorem 1.2.
Lemma 4.15
Let \(w, {\bar{w}}\) with \(\Vert w\Vert _{L^\infty } + \Vert {\bar{w}}\Vert _{L^\infty } \le \varepsilon ^{-d/2}\) on the domain \(D_T\) and let \(\kappa \in (0,{1\over 4})\). Writing \(X = \Vert \varsigma \Vert _{-\frac{1}{2} + 2\kappa } + 1\), one has the bounds
for some proportionality constants depending only on \(u^{(0)}\), G and H. In dimensions 2 and 3, we set \(X = \Vert \varsigma \Vert _{L^p} + 1\) (for any fixed \(p \in [1,\infty ]\)) and we have the bounds
with \(\beta (2) = {1\over 4} - {\kappa \over 3}\) and \(\beta (3) = {7\over 12} - {\kappa \over 3}\) for \(\kappa \) sufficiently small.
Proof
The case of dimensions 2 and 3 is straightforward to verify since all bounds are uniform. In dimension 1, the first term of (4.3) is easy to bound. To bound the second term, we use the fact that composition with a smooth function is a (locally) Lipschitz continuous operation in \(\mathcal{C}^{\frac{1}{2}-\kappa }\), combined with the fact that the product is continuous as a bilinear map from \(\mathcal{C}^{\frac{1}{2}-\kappa } \times \mathcal{C}^{-{1\over 2}+2\kappa }\) into \(\mathcal{C}^{-{1\over 2}+2\kappa }\), see [3] or [8, Thm 13.16]. \(\square \)
Proposition 4.16
Fix an initial condition \(u_0\), a final time \(T < 1\) and nonlinearities G and H, all as in Theorem 1.2, as well as the random model \({\hat{{\varvec{\Pi }}}}_\varepsilon \) as defined in Section 4.3. Choose \(\varsigma \in L^p\) with \(p = (d+2)/{\underline{c}}\) (for \(d \in \{2,3\}\)) or \(\varsigma \in \mathcal{C}^{2\kappa -{1\over 2}}\) (for \(d=1\)), as well as \({\bar{\varsigma }} \in \mathcal{C}^{-{1\over 2}-2\kappa }\) (for \(d \in \{2,3\}\)). Then, the the fixed point problem (4.35) admits a unique local solution \({\overline{V}}_{\varepsilon }\) in \(\mathcal{D}^{\frac{3}{2}-3\kappa ,(0)_3}\). Furthermore, bounds on the solution are uniform over \(\varepsilon \in [0,1]\) and over \(\varsigma \), \({\bar{\varsigma }}\) in bounded balls in their respective spaces.
Furthermore, for \(\varepsilon = 0\) and \({\hat{{\varvec{\Pi }}}}\) as in Theorem 4.8, the solution \({\overline{V}}\) is such that \({\bar{v}} = \mathcal{R}_0 {\overline{V}}\) solves
with homogeneous boundary conditions, where \({\hat{V}}^{(0)}\) is as in Lemma 4.10. In particular, \(\lim _{\varepsilon \rightarrow 0} \hat{V}_\varepsilon = {\hat{V}}\) is such that \(\mathcal{R}_0 {\hat{V}}\) coincides with the process v defined in (1.8).
Remark 4.17
In the case \(d=1\), there is no condition on \({\bar{\varsigma }}\) since the fixed point problem does not depend on it.
Proof
We first consider the case \(d=3\). Note first that, for any \(\mathcal{C}^4\) function \({\tilde{G}}\), we have \(\mathcal{L}({\tilde{G}}(u^{(0)},u^{(1)})) \in \mathcal{D}^{2,(0)_3}\). Since \({\hat{V}}^{(0)}_\varepsilon \in \mathcal{D}^{1,(-{1\over 2}-\kappa )_3}\) by Lemma 4.10 and \({\hat{V}}^{(1)}_\varepsilon \in \mathcal{D}^{{3\over 2}-3\kappa ,(0)_3}\) by Lemma 4.12, it then follows from [9, Lem. 4.3] that, for \({\overline{V}}_{\varepsilon }\in \mathcal{D}^{{3\over 2}-3\kappa ,(0)_3}\), all the terms appearing after \(\mathcal{P}_{\mathrm{Neu}}\mathbf {1}_+^D\) in the right hand side of (4.35) belong to \(\mathcal{D}^{{1\over 2}-4\kappa ,(-{3\over 2}-2\kappa )_3}\), provided that \(\kappa \) is sufficiently small. In particular, the operator \(\mathcal{K}\) (defined as described in Remark 4.13) maps this continuously into \(\mathcal{D}^{2,(0)_3}\), with arbitrarily small norm for small time intervals.
Furthermore, the reconstruction operator of Theorem C.5 continuously maps the space \(\mathcal{D}^{{1\over 2}-4\kappa ,(-{3\over 2}-2\kappa )_3}\) into \(\mathcal{C}^{-{3\over 2}-2\kappa }\), which is then mapped continuously into \(\mathcal{C}^{2,(0)_3}\) by \(K_\partial \) by Proposition A.2, and therefore into \(\mathcal{D}^{2,(0)_3}\) by the Taylor lift \(\mathcal{L}\), again with arbitrarily small norm for small time intervals as a consequence of the bound (A.3) which also holds for \(K_\partial \).
Note now that by Corollary 5.6, we have \(\mathbf{E}\Vert \xi _\varepsilon \Vert _{\kappa -2}\lesssim \varepsilon ^{-{1\over 2}-\kappa }\). As a consequence of (4.21), we conclude from this that \(\mathbf{E}|v^{(0)}_\varepsilon |_{L^\infty } \lesssim \varepsilon ^{-{1\over 2}-\kappa }\). Since \(\beta > {1\over 2}\), it follows from Lemma 4.15 that, for \(p = (d+2)/{\underline{c}}\),
uniformly over bounded sets for the model \({\hat{{\varvec{\Pi }}}}_\varepsilon \) and over bounded sets for \({\overline{V}}_{\varepsilon }+{\hat{V}}^{(1)}_\varepsilon \) in \(\mathcal{D}^{{3\over 2}-3\kappa ,(0)_3}\). Since \(u^{(0)}\) and \(u^{(1)}\) are bounded in \(\mathcal{C}^1\), it is immediate from (4.36) that one has a bound of the type
In particular, the argument of \(P_{\mathrm{Neu}}\) appearing in the last term on the right hand side of (4.35) is mapped continuously by \(P_{\mathrm{Neu}}\) into \(\mathcal{C}^{{3\over 2}-3\kappa ,(0)_3}\), again with arbitrarily small norm when considering a short enough time interval. Furthermore, all of these expressions are locally Lipschitz continuous (with similar bounds) as a function of \({\overline{V}}_{\varepsilon }\) in \(\mathcal{D}^{{3\over 2}-3\kappa ,(0)_3}\) and of the model \({\hat{{\varvec{\Pi }}}}_\varepsilon \), uniformly over \(\varepsilon \in [0,1]\) which yields the first claim over a short enough time (but bounded from below independently of \(\varepsilon \)) interval. This can be maximally extended as usual, and the claim follows from the fact that we know a priori that solutions to (4.38) do not explode.
The second claim is straightforward by simply setting \(\varepsilon = 0\) and applying the reconstruction operator to both sides of (4.35). The case of \(d=2\) is virtually identical, noting in particular that even though diverges in this case, it only does so logarithmically and is therefore compensated by the factor \(\varepsilon ^\kappa \) in (4.36). We leave the verification of the case \(d=1\) to the reader. \(\quad \square \)
Proposition 4.18
Let \(\varsigma _\varepsilon = \varepsilon ^\alpha \xi _\varepsilon \), \({\bar{\varsigma }}_\varepsilon = \varepsilon ^{2-d-\kappa } \sigma _\varepsilon \), and define \({\hat{{\varvec{\Pi }}}}_\varepsilon \) as in Section 4.3. Then, the assumptions of Proposition 4.16 are satisfied and we have \(\varsigma _\varepsilon , {\bar{\varsigma }}_\varepsilon \rightarrow 0\) in their respective spaces. Furthermore, for any \(\varepsilon > 0\), the modelled distribution \(\hat{V}_\varepsilon \) constructed in Proposition 4.16 is such that \(\mathcal{R}_\varepsilon \hat{V}_\varepsilon \) coincides with the process \(v_\varepsilon \) defined in (4.1).
Proof
We first show that the assumptions of Proposition 4.16 are satisfied. The fact that the random models \({\hat{{\varvec{\Pi }}}}_\varepsilon \) are uniformly bounded (in probability) as \(\varepsilon \rightarrow 0\) and converge in probability to \({\hat{{\varvec{\Pi }}}}\) is the content of Theorem 4.8. By the second part of Assumption 2.1 combined with stationarity, we furthermore see that
when \(d \in \{2,3\}\). For \(d=1\), we have
which converges to 0 in probability by Corollary 5.6. We also conclude from Corollary 5.6 and our definitions that, for \(d \in \{2,3\}\), \(\Vert {\bar{\varsigma }}_\varepsilon \Vert _{-{1\over 2}-2\kappa } = \Vert \eta _\varepsilon ^{({d\over 2}-1+\kappa )}\Vert _{-{1\over 2}-2\kappa } \rightarrow 0\) in probability.
It remains to show that solutions coincide with \(v_\varepsilon \). This is a special case of the general result obtained in [2] and could in principle also be obtained in a way similar to [14]. We present a short derivation here in order to remain reasonably self-contained.
The powercounting of the various symbols appearing in our structure depends on the dimension, so we first restrict ourselves to the case \(d = 3\), which is the one with the largest number of terms of negative degree appearing. Combining (4.35) with Lemmas 4.10 and 4.12, we conclude that if we take for \({\overline{V}}_{\varepsilon }\) any solution to (4.35), there exist functions v and \(\nabla v\) such that, for \(\Phi _\varepsilon ^{(0)}\) and \(\Phi _\varepsilon ^{(1)}\) as in Lemma 4.12, the following identities hold on \(\mathbf{R}_+\times D\):
Developing the argument of \(\mathcal{P}_{\mathrm{Neu}}\mathbf {1}_+^D\) in (4.35) up to order 0, we conclude that it is given by
At this point, we apply the results of [2]. Comparing [2, Eq. 2.20] with [2, Def. 3.20] and [2, Thm. 3.25], we see that each term appearing on the right hand side generates a counterterm for the renormalised equation. Each of these terms is of the form for some function \({\hat{F}}\) and some symbol . The counterterm generated by any such term is then obtained precisely by simply replacing by the corresponding renormalisation constant and by interpreting the first two arguments of \({\hat{F}}\) as the value and gradient of the actual solution (after reconstruction).
Remark 4.19
One may worry that we are not quite in the framework of [2] because of the special treatment of \({\hat{V}}_\varepsilon ^{(0)}\) and \({\hat{V}}_\varepsilon ^{(1)}\). This however is due to purely analytical reasons that only affect the boundary behaviour. The computation of the renormalisation terms on the other hand is a purely algebraic affair which is not affected by this. The boundary conditions of \(v_\varepsilon \) however are affected by our decomposition and need to be determined separately.
It follows that, in dimension 3, the solution \(v_\varepsilon = \mathcal{R}_\varepsilon {\hat{V}}_\varepsilon \) to the fixed point problem with the renormalised model satisfies in \(\mathbf{R}_+ \times D\) the PDE
Furthermore, both \(\mathcal{R}_\varepsilon {\hat{V}}_\varepsilon \) and \(v^{(0)}_\varepsilon = \mathcal{R}_\varepsilon {\hat{V}}_\varepsilon ^{(0)}\) have homogeneous boundary conditions, so that the boundary conditions of \(v_\varepsilon \) coincide with those of \(v^{(1)}_\varepsilon = \mathcal{R}_\varepsilon {\hat{V}}_\varepsilon ^{(1)}\).
By (4.36) and since we chose \({\bar{\varsigma }} = \varepsilon ^{-1-\kappa } \sigma _\varepsilon \), there is a cancellation between \({\tilde{R}}_\varepsilon ({\bar{\varsigma }})\) and some of the other terms appearing in this equation. Since furthermore , we obtain
which, when combining with the definition of \(\Psi \) given in (1.6) and the fact that \(\varsigma = \varepsilon ^\alpha \xi _\varepsilon \), precisely coincides with (4.2). Since its initial condition and boundary condition coincide as well, this completes the proof of the claim.
In dimension 2, a similar argument (but taking less terms into account) yields
Again, the term \({\tilde{R}}_\varepsilon ^{(2)}\) precisely cancels the term proportional to , so that this again coincides with (4.2). In dimension 1, an even simpler argument shows that
which again coincides with (4.2), noting that in this case one has \(u^{(1)}= 0\). \(\square \)
5 Convergence of Models
In order to show convergence of the models, we apply the general result of [6, Thm 2.31]. This result shows that if one considers the “BPHZ lifts” of a sequence of smooth and stationary stochastic processes \(\xi _n\) as given in [5, Thm 6.17] then, provided that one has uniform bounds of a suitable “norm” of \(\xi _n\) and under a few relatively weak additional algebraic assumptions, the resulting sequence of models converges to a limit, provided that \(\xi _n \rightarrow \xi \) weakly in probability.
5.1 Cumulant Homogeneity Assignments
In this section, we define
and we often use \(z = (t,x)\) for space-time coordinates. The exponents \(\alpha \) will always be chosen in \(\big ({\underline{c}},{d+2\over 2}\big ]\). Our aim is to obtain a suitable bound independent of \(\varepsilon \) for joint cumulants of the form
Given a finite collection of at least two space-time points \(z = \{z_a\}_{a \in A}\), we again consider the corresponding labelled binary tree \({\mathbf {t}}_z = (T,{\mathbf {n}})\) as in Section 2.1, with the leaves of T identified with the index set A. Recall that the nodes \(V_T\) of T are given by subsets of A, with inner nodes \(\mathring{V}_T\) given by subsets with at least two elements and the root of T given by A itself.
Recall from [6, Def. A.14] the following definition:
Definition 5.1
A “consistent cumulant homogeneity” consists, for each finite index set A, each binary tree T over A as above, and each choice of indices \(\alpha :A \rightarrow [{\underline{c}}, {d+2\over 2}]\), a function \({\mathfrak {c}}_T^{(\alpha )}:\mathring{V}_T\rightarrow \mathbf{R}_+\) satisfying furthermore
-
For every \(B\subset A\), \(\sum _{v\in \mathring{V}_T: v \cap B \ne \emptyset } {\mathfrak {c}}_T^{(\alpha )}(v) \ge \sum _{a\in B} \alpha _a\).
-
For every \(u \in \mathring{V}_T\), \(\sum _{v\in \mathring{V}_T: v \subset u} {\mathfrak {c}}_T^{(\alpha )}(v) \le \sum _{a \in u} \alpha _a\).
-
If \(|A| \ge 3\), then for every \(u \in \mathring{V}_T\) with \(|u| \le 3\), one has \(\sum _{v\in \mathring{V}_T: v \subset u}{\mathfrak {c}}_T^{(\alpha )}(v) < (d+2)(|u|-1)\).
Remark 5.2
Applying the first two conditions with \(B = A\) and \(u = A\) respectively implies in particular that \(\sum _{v\in \mathring{V}_T} {\mathfrak {c}}_T^{(\alpha )}(v) = \sum _{a\in A} \alpha _a\).
We will now display a consistent cumulant homogeneity such that, for every finite set A, space-time points \(\{z_a\}_{a\in A}\), and choices \(\alpha :A\rightarrow [{\underline{c}}, {d+2\over 2}]\), We have the bound
We claim that one possible choice is obtained by setting
where
Proposition 5.3
The choice (5.3) is a consistent cumulant homogeneity.
Proof
The first two conditions of Definition 5.1 follow immediately from the structure of the formula (5.3), in particular the facts that \(2^{- {\mathfrak {d}}(u,a)}\) is positive, vanishes for \(a \not \in u\), and is such that \(\sum _{u \in \mathring{V}_T} 2^{- {\mathfrak {d}}(u,a)} = 1\).
Regarding the last condition, the case \(|u| = 3\) follows from the fact that the second condition holds and \(\alpha _a \le {d+2\over 2}\). The case \(|u|=2\) follows from the condition \(|A| \ge 3\) which guarantees that the corresponding sum is bounded by \({d+2\over 2}\) since \({\mathfrak {d}}(u,a) = 1\) in this case. \(\square \)
Lemma 5.4
Setting \({\underline{\alpha }} = \inf _{a\in A}\alpha _a\) and \({\overline{\alpha }} = \sup _{a\in A}\alpha _a\), one has \({\mathfrak {c}}_T^{(\alpha )}(u) \in [{\underline{\alpha }},{\overline{\alpha }}]\) for \(u \in \mathring{V}_T {\setminus } \{A\}\) and \({\mathfrak {c}}_T^{(\alpha )}(A) \in [2{\underline{\alpha }},2{\overline{\alpha }}]\).
Proof
By convexity of (5.3), it suffices to consider the case where \(\alpha _a = 1\) for all a. We proceed by induction on the size of A. When \(|A| = 2\), one has \(\mathring{V}_T = \{A\}\), so that \({\mathfrak {d}}(A,a) = 0\) and therefore \({\mathfrak {c}}_T^{(\alpha )}(A) = 2\) as claimed.
Assume now that \(|A| > 2\) and fix a binary tree T over A. Write \(A_1\) and \(A_2\) for the children of A in \(\mathring{V}_T\), so that \(A = A_1 \sqcup A_2\). We distinguish two cases. In the first case, \(|A_1| \wedge |A_2| \ge 2\), so that the tree T can naturally be thought of as two trees \(T_1\), \(T_2\) over \(A_1\), \(A_2\) joined by their roots. By (5.4) (in particular the fact that there is an additional \(-1\) at the root), we then have
while \({\mathfrak {c}}_T^{(\alpha )}(u) = {\mathfrak {c}}_{T_i}^{(\alpha )}(u)\) for all other \(u \in \mathring{V}_T\), with \(i \in \{1,2\}\) depending on whether \(u \subset A_1\) or \(u \subset A_2\). We conclude by using the induction hypothesis, which implies that \({\mathfrak {c}}_{T_1}^{(\alpha )}(A_1) = {\mathfrak {c}}_{T_2}^{(\alpha )}(A_2) = 2\).
In the second case, we have \(|A_1| = 1\) and \(|A_2| \ge 2\) (or vice-versa), the case \(|A|=2\) having already been dealt with. In this case, the tree T consists of a subtree \(T_2\) over \(A_2\) as before, with an additional root vertex A and single extra leaf. In this case, we have
whence we conclude as before. \(\square \)
Proposition 5.5
Under Assumption 2.1, the bound (5.2) holds for the choice (5.3).
Proof
It follows from Assumption 2.1 and (5.1) that
where
Let now \({\hat{c}}:\mathring{V}_T \rightarrow [{\underline{c}}, {\overline{c}}]\) be any map such that \({\hat{c}}(A) + \sum _{u \in \mathring{V}_T} {\hat{c}}(u) = \sum _{a\in A} \alpha _a\), and rewrite (5.5) as
We now note that (5.6) implies that
for every \({\hat{c}} \in [{\underline{c}}, {\overline{c}}]\), uniformly over \(n \in \mathbf{Z}\) and \(\varepsilon \in (0,1]\). Inserting this into (5.7) immediately yields that, uniformly in \(\varepsilon \), one has
Since the map \({\mathfrak {c}}_T^{(\alpha )}\) is of the desired type by Lemma 5.4 [modulo the additional factor 2 at the root which is taken care of explicitly in (5.8)], the claim follows. \(\square \)
Corollary 5.6
For any \(\alpha \in \big ({\underline{c}},{d+2\over 2}\big )\), one has \(\eta _\varepsilon ^{(\alpha )} \rightarrow 0\) in probability in \(\mathcal{C}^\beta \) for every \(\beta < -\alpha \). In particular, \(\eta _\varepsilon \rightarrow 0\) in probability in \(\mathcal{C}^\beta \) for every \(\beta < -1\) and \(\zeta _\varepsilon \rightarrow 0\) in probability in \(\mathcal{C}^\beta \) for every \(\beta < {d-2\over 2}\). The same holds for \(\eta _\varepsilon ^{(\alpha )} \mathbf {1}_A\) for any fixed Borel set A.
Proof
The first statement is an immediate consequence of [6, Thm 2.31]. The fact that we can multiply \(\eta _\varepsilon ^{(\alpha )}\) by an arbitrary indicator function follows from the fact that these bounds do not involve the derivative of the test function in this case. \(\square \)
5.2 Power-Counting Conditions
By (5.3)–(5.4), the quantity defined in [6, Def. A.24] can be estimated by
This is because for \(D \in \mathring{V}_T\) with \(D \ne A\), one always has \({\mathfrak {d}}(D,a) \le |D|-1\). Furthermore, by [6, Rem. 2.28], one has for all typed sets A and D. These observations allow us to conclude the following:
Lemma 5.7
All the decorated trees generated by the rule of Definition 4.4 are \({\mathfrak {c}}\)-super regular in the terminology of [6, Def. A.27].
Remark 5.8
Since our scaling and degree assignment are fixed throughout and since we consider all cumulants, that is we choose \({\mathfrak {L}}_{{\text {Cum}}}\) to contain all possible cumulants, we omit these from our notation.
Proof
We reduce ourselves to considering symbols of negative degree since the claim then follows automatically for the remaining ones. These symbols are listed in (4.15). Note also that the definition of \({\mathfrak {c}}\)-super regularity is non-trivial only for trees that contain at least three noises, so that it suffices to consider the symbols
By (5.9), it is sufficient to verify that for every subtree \(\tau \) containing \(k\ge 2\) instances of a noise one has the bound
where \(\Xi _i\) denotes the ith noise appearing in \(\tau \). This can be seen simply by inspection of the list (5.10). \(\square \)
5.3 Special Bounds
For some of the symbols in our regularity structure, we will bounds that are stronger than what is suggested by the degrees of the symbols in question. In this statement, \(\phi \) denotes an arbitrary measurable function with
Proposition 5.9
Let \(\delta >0\) be as in Section 2. For \(\alpha \in [-\delta ,1]\) and \(\beta = -\frac{1}{2}-\alpha \), we have the bounds
where we write \(X \lesssim Y\) as a shorthand for the existence, for every \(p \ge 1\), of a constant C such that \(\mathbf{E}|X|^p \le C Y^p\), uniformly over all \(\lambda \le 1\), \(\varepsilon \le 1\) and \(\phi \) as in (5.11).
Proof
Recall that, by Proposition 5.5, we can view any \(\eta _\varepsilon ^{(\theta )}\) as in (5.1) for \(\theta \in [{1\over 2}-\delta , {5\over 2}]\) as a “noise” of regularity \(-\theta \) whose “norm”, as measured by [6, Def. A.18] with respect to the cumulant homogeneity just described remains uniformly bounded as \(\varepsilon \rightarrow 0\).
In particular, we can write
and we can apply [6, Thm 2.31], showing that the BPHZ renormalisation of this term satisfies the required bounds. Recall that doesn’t quite agree with the BPHZ renormalisation, but the error between the two is given by with \(\delta g_\varepsilon \) as in (4.19), which is easily seen to satisfy the required bounds.
The other two terms can be dealt with similarly by writing
thus concluding the proof. \(\square \)
Corollary 5.10
For every \(\kappa > 0\), every \(\alpha \in [0,1)\), and every Borel set \(A \subset \mathbf{R}^{d+1}\), one has the bound \(\big (\mathbf{E}\Vert \mathbf {1}_A \xi _\varepsilon \Vert _{\mathcal{C}^w}^p\big )^{1/p} \lesssim \varepsilon ^{-\alpha }\) with \(w = \alpha - {5\over 2} - \kappa \).
Proof
This follows from the third bound of Proposition 5.9 by Kolmogorov’s criterion, using the fact that and that we can move the multiplication by \(\mathbf {1}_A\) onto the test function. \(\square \)
5.4 Tightness and Convergence for the Noise
In this section, we show that the convergence announced in Theorem 4.8 holds. As usual, convergence is obtained by first showing tightness and then identification of the limiting distribution. More precisely, we prove the following:
Theorem 5.11
One has \(\xi _\varepsilon \rightarrow \xi \) weakly in \(\mathcal{C}^\alpha \) for every \(\alpha < -{d+2\over 2}\).
It turns out that this statement is a relatively straightforward consequence of the following proposition. Writing \(\kappa _p(X)\) for the pth cumulant of a real-valued random variable X, one has the following:
Proposition 5.12
For \(p \ge 2\), we have the bound
uniformly over all \(\lambda \in \mathbf{R}_+\) and all \(\phi \) as in (5.11).
Before we turn to the proof of Proposition 5.12, let us show how to deduce Theorem 5.11 from it. First, we have the following corollary:
Corollary 5.13
For any \(\kappa \le {1\over 2}\) and \(p \ge 2\), one has the bounds
uniformly over \(\lambda \le 1\), \(\varepsilon \le 1\) and \(\phi \) as in (5.11).
Proof
By simple rescaling, it is straightforward to see that the required bound is equivalent to the bound
uniformly over \({\bar{\lambda }} \le 1/\varepsilon \) and \(\varepsilon \le 1\).
For even integers \(p \ge 2\), Proposition 5.12 implies that \(|\kappa _{p}\big (\langle \eta ,\phi _0^\lambda \rangle \big )| \lesssim \lambda ^{-{d+2\over 2} p} \wedge \lambda ^{-{\underline{c}} p}\), so that.
It remains to observe that (5.13) implies that \(\mathbf{E}|\langle \eta ,\phi _0^{{\bar{\lambda }}}\rangle |^{p} \lesssim {{\bar{\lambda }}}^{-c p}\), uniformly over all \({\bar{\lambda }} > 0\), for any \(c \in [{\underline{c}}, {d+2\over 2}]\). Since (5.12) is of this form and our assumptions guarantee that the values of c appearing there fall into the correct interval, this concludes the proof. \(\square \)
Proof of Theorem 5.11
Using [11, Eq. 10.4] (which is nothing but an analogue of Kolmogorov’s continuity criterion) and the compactness of the embeddings \(\mathcal{C}^\alpha \subset \mathcal{C}^\beta \) for \(\alpha > \beta \) (over any bounded domain), it follows from Corollary 5.13 that the laws \(\xi _\varepsilon \) are tight in \(\mathcal{C}^\alpha \) for every \(\alpha < -{d+2\over 2}\).
It remains to show that every limit \(\xi \) of a convergent subsequence of \(\xi _\varepsilon \) is space-time white noise. It thus suffices to show that, for every \(\phi \in \mathcal{C}_0^\infty \) with \(\int \phi ^2(z)\,{\hbox {d}}z = 1\), \(\xi (\phi )\) is centred normal with variance 1. Since all moments of \(\langle \xi _\varepsilon ,\phi \rangle \) remain bounded as \(\varepsilon \rightarrow 0\) by Corollary 5.13, one has
For \(p \ge 3\), this vanishes, thus showing that \(\xi (\phi )\) is Gaussian. It clearly has zero mean since this is already the case for \(\langle \xi _\varepsilon ,\phi \rangle \). Its variance is given by
Note now that, for every \(\varepsilon > 0\), every \(\delta \in [0,1]\) and any \(z, {\bar{z}} \in \mathbf{R}^{d+1}\), one has the bound
Recalling that \(\int \phi ^2 =1\) and that \(\int \kappa _2(0,z)\,{\hbox {d}}z = 1\) by (1.3), we thus obtain
Here, we used the fact that the integral of \(\phi _0^{1/\varepsilon }\) is independent of \(\varepsilon \), as well as our assumptions (1.3) and (2.1) on the covariance function of \(\eta \). In particular, we choose \(\delta \) as in the definition of \({\overline{c}}\), see Section 2, whence \(2{\overline{c}} - \delta > d+2\) which guarantees integrability at infinity. (Integrability at 0 is guaranteed by \(2{\underline{c}} - \delta = d - \delta < d+2\).) \(\square \)
Proof of Proposition 5.12
We expand the expression for the cumulant as
In order to bound this integral, we use a simplified version of the type of multiscale analysis used in [15, 16]. Let us recall how this works.
We now write \({\mathbf {t}}= (T,{\mathbf {n}})\) for a generic binary tree, together with a scale assignment as above (that is we enforce the fact that \({\mathbf {n}}\) is monotone) and \({\mathbf {T}}\) for the set of all such \({\mathbf {t}}\). Given \({\mathbf {t}}\in {\mathbf {T}}\), we write \(V_{\mathbf {t}}\)/\(E_{\mathbf {t}}\) for the vertex/edge set of the corresponding tree and \({\mathbf {n}}_{\mathbf {t}}\) for the scale assignment. In particular, \({\mathbf {n}}_{\mathbf {t}}(\Omega _p)\) denotes the scale assignment for the root, which controls the diameter of the set \(\{z_1,\ldots ,z_p\}\) in \(\mathbf{R}^{d+1}\). The number p will always be considered fixed, so we do not include it explicitly in our notation. Denoting by \(\mathcal{T}\) the map \(\mathcal{T}:(z_1,\ldots ,z_p) \mapsto {\mathbf {t}}\in {\mathbf {T}}\) defined at the start of Section 2, we set \(D_{\mathbf {t}}= \mathcal{T}^{-1}({\mathbf {t}})\) for the set of all configurations of points \(z \in (\mathbf{R}^{d+1})^p\) giving rise to a given combinatorial data. Then, it was shown in [15, Lem. A.13] that, for every bounded Borel set \(U \in \mathbf{R}^{d+1}\) one has the bound
where \(|\cdot |\) denotes Lebesgue measure. Furthermore, by construction, \(\bigcup _{{\mathbf {t}}\in {\mathbf {T}}} D_{\mathbf {t}}\) is of full Lebesgue measure.
Without loss of generality, we can restrict ourselves to the case where the support of \(\phi \) has diameter bounded by 1 in the parabolic distance. With all of this notation at hand, we then bound (5.14) by
We simplify this expression as follows. First, we note that
as soon as \(2^{- {\mathbf {n}}(\Omega _p)} \ge \lambda \), since the support of \(\phi _0^\lambda \) is bounded by \(\lambda \), so that we can restrict the sum above to those \({\mathbf {t}}\) satisfying \(2^{- {\mathbf {n}}(\Omega _p)} \le \lambda \). Furthermore, one has
Combining this with Assumption 2.1, we conclude that
We treat separately the cases \(\lambda \le 1\) and \(\lambda \ge 1\). In the former case, we can apply [15, Lem. A.10] with the distinguished vertex \(\nu _\star \) appearing there equal to the root \(\Omega _p\). The first condition appearing there is then satisfied by the fact that \(2{\underline{c}} - (d+2) < 0\) by assumption, while the second condition is empty and therefore trivially satisfied. Note that the shape T of the tree is fixed in [15, Lem. A.10], while we also sum over all possible shapes, but since there are finitely many of them for any fixed p, this just yields an additional prefactor. We thus obtain the bound
Since \(\mathring{V}_{\mathbf {t}}\) contains exactly \(p-1\) elements (the tree T is binary and has p leaves), we finally obtain
as required.
The case \(\lambda > 1\) is split into two subcases. If \(p \ge 3\), we use the fact that \(|\kappa _p(z)| \lesssim {\bar{\kappa }}_{p,{\overline{c}}}(z)\) uniformly over all z, so that, using [15, Lem. A.10] as above, we have
Note that, in order to be able to apply that result, we need to verify that, for every subtree \({\hat{{\mathbf {t}}}}\) of \({\mathbf {t}}\) spanned by some subset of its leaves satisfies \(\sum _{A \in \mathring{V}_{{\hat{{\mathbf {t}}}}}} \alpha (A) < 0\), where \(\alpha (A) = 2{\overline{c}}-(d+2)\) if \(A = \Omega _p\) and \(\alpha (A) = {\overline{c}}-(d+2)\) otherwise. This is of course trivially satisfied as soon as \({\hat{{\mathbf {t}}}} \ne {\mathbf {t}}\) since \({\overline{c}} < d+2\) by assumption. The exponent at the root however is given by \(2{\overline{c}}-(d+2)\), which is positive, but since \(p \ge 3\), the sum of all exponents is given by \(2{\overline{c}}-(d+2) + (p-2)({\overline{c}} - (d+2)) = p{\overline{c}} - (d+2)(p-1)\), which is indeed negative for \(p \ge 3\) and \(\delta < {d+2\over 6}\),which we assume w.l.o.g..
It remains to consider the case \(p = 2\) and \(\lambda > 1\). In this case, the above computation reduces to
as required, thus concluding the proof. \(\square \)
We also use the following convergence results which do not strictly speaking follow from Theorem 5.11 since multiplication by an indicator function (even that of a hypercube) is not a continuous operation on \(\mathcal{C}^{\beta }\) for \(\beta < 0\):
Lemma 5.14
Let \(\xi _{\varepsilon ,D} = \xi _\varepsilon \mathbf {1}_{\mathbf{R}\times D}\) and \(\xi _{\varepsilon ,D}^+ = \xi _\varepsilon \mathbf {1}_{\mathbf{R}_+ \times D}\). Then, \(\xi _{\varepsilon }\), \(\xi _{\varepsilon ,D}\), and \(\xi _{\varepsilon ,D}^+\) jointly weakly converge to limits \(\xi \), \(\xi _D\) and \(\xi _D^+\).
Proof
The proof is identical to that of Theorem 5.11, using the fact that all bounds we used are uniform over test functions as in (5.11), so that multiplying them by the indicator function of some domain changes nothing. \(\square \)
5.5 Boundary Term
Recall that we have set
where \(\partial _{i,0} D = \{x \in D \,:\, x_i = 0\}\) and \(\partial _{i,1} D = \{x \in D \,:\, x_i = 1\}\) and the constants are given by
where the function \(Q_{i,j}\) is defined as follows. For \(i=1,\ldots ,d\), write \(\iota _i:\mathbf{R}\times \mathbf{R}^{d} \rightarrow \mathbf{R}^{d+1}\) for the map given by
where \(x^{(i)} \in \mathbf{R}^{d-1}\) denotes the vector obtained from x by deleting the ith coordinate. With this notation, we then set
Remark 5.15
We will show in Lemma 5.21 that both the integrands in the definition of \(Q_{i,j}\) and the functions \(Q_{i,j}\) themselves are integrable, so that these expressions are all finite.
Remark 5.16
The formula given above is valid for the case of Neumann boundary conditions. In the case of Dirichlet boundary conditions, a similar formula holds, but the precise values of the constants do not matter since they do not affect the solutions.
Note that although \(P\circ \iota _i\ne P\), it does not depend on i, while \(\kappa _2\circ \iota _i\) does depend on i in general, since we do not assume that the driving noise is isotropic. We henceforth write \({\hat{P}} = P\circ \iota _i\). One of the main results of this section is that the renormalised model on vanishes in a suitable sense as \(\varepsilon \rightarrow 0\). We first provide a bound on its expectation, which requires the bulk of the work. For the formulation of this result, we write \(\mathcal{B}_0^1\) for the set of all test functions \(\phi \in \mathcal{C}^1_0\) with support contained in the parabolic ball of radius 1 and such that \(\max \{\Vert \phi \Vert _\infty , \Vert D\phi \Vert _\infty \} \le 1\).
Proposition 5.17
With defined as in (5.15), one has for \(d \in \{2,3\}\) and any \(\kappa > 0\) small enough,
uniformly over \(\varepsilon , \lambda \le 1\), \(z \in \mathbf{R}^{d+1}\), and \(\phi \in \mathcal{B}_0^1\).
Before we turn to the proof, we introduce a number of notations and preliminary bounds. Write \(\mathcal{G}\) for the reflection group generated by “elementary” reflections across the 2d planes containing the faces of D. The group \(\mathcal{G}\) is naturally identified with \(\mathbf{Z}^d\) (as a set, not as a group!) since for each \(k \in \mathbf{Z}^d\) there exists exactly one element \(R_k \in \mathcal{G}\) mapping \(k+D\) into D. We also write \(\mathcal{G}\ni R \mapsto (-1)^R \in \{-1,1\}\) for the group homomorphism mapping the elementary reflections to \(-1\). We will write \(\Lambda :\mathbf{R}^{d+1} \rightarrow \mathbf{R}\times D\) for the map such that \(\Lambda {\upharpoonright }\mathbf{R}\times (k+D) = {\mathrm {id}}\times \mathcal{R}_k\) and \(S :\mathbf{R}^{d+1} \rightarrow \{-1,1\}\) by \(S {\upharpoonright }\mathbf{R}\times (k+D) = (-1)^{R_k}\).
With these notations, it follows from Proposition A.1 that the truncated Neumann and Dirichlet heat kernels are such that for \(z \in \mathbf{R}\times D\) one has the identities
(Note that S is ill-defined on the measure zero set consisting of the reflection planes, but since it always appears in an integral this does not matter.)
Lemma 5.18
Let \(U \subset \mathbf{R}^{d+1}\) and let \(\Phi :U \rightarrow \mathbf{R}^{d+1}\) be a diffeomorphism between U and its image such that \(\Vert D\Phi ^{-1}\Vert \) is bounded uniformly over \(\Phi (U)\). Assume furthermore that z is such that \(\Vert z-z'\Vert \ge \lambda \) for all \(z' \in U\). Then,
for all \(\kappa \in [0,{d\over 2}]\).
Proof
We write \(U = U_1 \sqcup U_2\) where
Since \(\Vert z-z'\Vert \ge \lambda \) for \(z' \in U\) by assumption, we have the bound \(K(z-z') \lesssim \lambda ^{-d}\). On \(U_1\), it follows from the definition that \(K(z-z') \lesssim \Vert z- \Phi (z')\Vert ^{-d}\). As a consequence, we obtain the bound
as claimed. On \(U_2\) on the other hand, we use the fact that \(\rho \) is a decreasing function, so that
which is then bounded exactly as above. \(\square \)
Lemma 5.19
For any fixed \(c > 0\), one has
uniformly over \(\varepsilon \in (0,1]\).
Proof
We can assume that \(\varepsilon < c\), so that
by Assumption 2.1, and the bound follows at once. \(\square \)
Proof of Proposition 5.17
We now consider the Neumann case, the Dirichlet case follows from a virtually identical calculation. We start by bounding the expectation of . By the reflection principle (5.18), the correction due to the Neumann boundary conditions is given by
It then follows from the definitions that
Writing \(\lambda = \lambda _\psi \) for the diameter of \({\mathrm{supp}}\,\psi \), we aim to give a bound of the form
for any \(\kappa > 0\) small enough, uniformly over \(\varepsilon , \lambda \le 1\). We restrict ourselves to the case \(\lambda _\psi \le {1\over 8}\) and we set
We furthermore assume for the moment that
and that, whenever \(W_\psi \cap \partial D = \emptyset \), one has \(W_\psi \subset D\). We will see later that it is always possible to reduce oneself to this case. Finally, for \(x \in \mathbf{R}^{d}\), we write \(|x|_0\) for the number of vanishing coordinates of x and we set \(d_\psi = \sup \{|x|_0\,:\, (t,x) \in W_\psi \}\). We treat the different cases separately.
The case \(d_\psi = 0\). In this case \(W_\psi \cap \partial D = \emptyset \), so that \({\mathrm{supp}}\,\psi \cap \partial D = \emptyset \) and in particular . We then have
We break the inner integral into a finite sum of integrals over \(\mathbf{R}\times (k+D)\), since K has compact support and \(z \in \mathbf{R}\times D\). Since we can restrict z to the support of \(\psi \), we have \(|x-x'| \ge \lambda \) for all \(x' \in k+D\) with \(k \ne 0\) by the definition of \(W_\psi \). We can then apply Lemma 5.18 with \(U = \mathbf{R}\times (k+D)\) and \(\Phi = \Lambda \) for the first term, while \(\Phi = {\mathrm {id}}\) for the second term. It follows that is bounded by
which is indeed bounded by the right hand side of (5.20).
The case \(d_\psi = 1\). In this case, the support of \(\psi \) is located near one of the faces of D (say \(\partial _{i,0}D\)), but its distance to the other faces is at least \(\lambda \). Write \({\mathfrak R}\) for the element of G which corresponds to reflection around the plane containing \(\partial _{i,0}D\) and \(\pi _i :\mathbf{R}^{d+1} \rightarrow \mathbf{R}^{d+1}\) for the orthogonal projection onto that plane. We also write \(E_i = \{(t,x) \,:\, x_i < 0\}\).
We first note that
where |R| is bounded by (5.23). Indeed, the integrands in (5.19) and (5.24) vanish on D and coincide on \({\mathfrak R}D\). Their integral over the complement of these two regions is then bounded exactly as before by applying Lemma 5.18 to finitely many translates of D. By Lemma 5.19, we can furthermore replace K by P so that
where \(|{\tilde{R}}|\) is bounded by (5.23). The reason for this is that \(P-K\) is supported outside of an annulus of radius 1 and \(\psi \) is supported at a distance of at most 1/4 from the reflection plane of \({\mathfrak R}\), and one has \(\Vert z - {\mathfrak R}z'\Vert \ge 1/2\) for all \(z,z'\) with \(z \in {\mathrm{supp}}\,\psi \) and \(\Vert z - z'\Vert \ge 1\).
On the other hand, we claim that
To see that this is the case, we first perform the change of variables \(z = \iota _i(s,v)\) and similarly for \(z'\), and we note that \(\pi _i \iota _i (s,v) = \iota _i(0,v)\), so that the right hand side is given by
Performing the substitution \(s' \mapsto -\beta \) and comparing to the definition of \(Q_{i,0}\), we conclude that
By Lemma 5.21, the function \(Q_{i,0}\) satisfies the bounds
Note now that the integral over u is restricted to the projection of the support of \(\psi \) which is of volume at most \(\lambda ^{d+1}\) (since u consists of \(d-1\) spatial variables and one time variable). Bounding \(\psi \) by its supremum, it follows that
On the other hand, we can bound \(|(\psi \circ \iota _i)(s,u)-\psi \circ \iota _i)(0,u)|\) by \(|s|\,\Vert D_x\psi \Vert _\infty \), which similarly yields
Combining these and choosing \(\psi = \phi _z^\lambda \) so that \(\Vert \psi \Vert _\infty \lesssim \lambda ^{-d-2}\) and \(\Vert D_x\psi \Vert _\infty \lesssim \lambda ^{-d-3}\) yields a bound of the order \(\varepsilon ^{-{d\over 2}}(\varepsilon \lambda ^{-1} \wedge \varepsilon ^2 \lambda ^{-2})\), which does imply (5.17) since \(-{d\over 2} - \kappa \in (-2,-1)\).
The case \(d_\psi = 2\). We claim that this case can be obtained as a consequence of the bounds for the cases \(d_\psi = 0\) and \(d_\psi = 1\). We consider the case of dimension \(d=2\) (but the computation below is done in a way that keeps track of dimension and applies to \(d=3\) as well) so that by (5.21)
We then fix a smooth function \(\chi :\mathbf{R}\rightarrow \mathbf{R}_+\) such that \({\mathrm{supp}}\,\chi \subset [0,2]\) and such that \(\sum _{k \in \mathbf{Z}} \chi _k(x)=1\), where \(\chi _k(x) = \chi (x-k)\). For some fixed constant \(c > 1\) and integers n, \(k_i\) and \(\ell \), we then set
By choosing c sufficiently large, we can guarantee that, for every n, k and \(\ell \), the function \(\chi _{n,k,\ell }\) is such that \(d_{\chi _{n,k,\ell }} \in \{0,1\}\). Furthermore, there are only finitely many values of k (independently of n and \(\ell \)) for which \(\chi _{n,k,\ell } \ne 0\). This is because \(\chi _{n,k,\ell }(S_{2^{n}}z)\) is independent of n. Fix now a test function of the form \(\psi = \phi _{z_0}^\lambda \) and write
By construction, one has \(\psi _{n,k,\ell } = 0\) for n such that \(2^{-n} \ge 2\lambda \), so that
Applying the bounds we already obtained for \(d_\psi \in \{0,1\}\), we conclude that
For any given n, the number of values for k and \(\ell \) leading to non-vanishing \(\lambda _{\psi _{n,k,\ell }}\) is of the order of \((\lambda 2^n)^d\), so that we eventually obtain the bound
as claimed. The case of dimension \(d=3\) is identical, the only difference being that \(\ell \) now has two components. Note that this calculation breaks in \(d=4\) where the sum over n diverges. This suggests that in this case one would have to add to an additional correction term that charges faces of codimension 2.
The case \(d_\psi = 3\). This is relevant only for \(d=3\), we shall however keep track of d in our calculation to illustrate how this would behave in higher dimensions. We then proceed in the same way as for the case \(d_\psi =2\), making use this time of the fact that we already have the required bound for all test functions with \(d_\psi < 3\). This time, we have
and we set in a fashion similar to above,
This time, for any given n, the number of values for k and \(\ell \) leading to non-vanishing \(\lambda _{\psi _{n,k,\ell }}\) is of the order of \((\lambda 2^n)^{d-1}\), which then yields similarly to above
Note that this time the series actually converges for all \(d < 6\).
To conclude, we justify the assumption (5.21) and the fact that, for \(W_\psi \cap \partial D = \emptyset \), one has \(W_\psi \subset D\). Indeed, by our assumption on \(\lambda _\psi \), it is always possible to enforce this by applying a finite number of reflections around the planes \(\{x\,:\, x_i = 1/2\}\). If \({\mathrm{supp}}\,\psi \) intersects \(\partial _{i,1} D\) for example, then we reflect around \(x_i = {1\over 2}\) to have \({\mathrm{supp}}\,\psi \) intersect \(\partial _{i,0} D\) instead. The only effect of this reflection is that, in order to obtain the same answer, we only need to reflect the noise \(\eta \) around that plane. The effect of this operation on its covariance function is to change the sign of the ith spatial coordinate of its argument, which is why how we obtain \(Q_{i,1}\) rather than \(Q_{i,0}\) in (5.26). \(\square \)
We now have the main ingredients in place to prove the main result of this section.
Theorem 5.20
With defined as in (5.15), one has for \(d \in \{2,3\}\) and any \(\kappa > 0\) small enough,
uniformly over \(\varepsilon , \lambda \le 1\), \(z \in \mathbf{R}^{d+1}\), and \(\phi \in \mathcal{B}_0^1\).
Proof
Writing \(\psi = \phi _z^\lambda \), the triangle inequality yields
and we already obtained the required bound on the second term in Proposition 5.17, so we focus on the first one. Furthermore, differs from by a deterministic quantity, so that we only need to bound
By (4.7) combined with (5.18), this random variable equals
This time, we will not need to exploit the cancellation between these two terms on \(\mathbf{R}\times D\), so we simply bound both terms separately. The second term equals , which is bounded by the right hand side of (5.27) by Theorem 4.8.
For the first term, we use the fact that K is compactly supported, so that it can be bounded by a finite sum of terms of the type
with \(R \in \mathcal{G}\).
The expectation of the pth power of this expression is given by a multiple integral with the integrand given by a sum of terms, each of which is a product of heat kernels and of cumulants. At this stage, we note that the bound in [6] does not exploit any further cancellations, so we can put absolute values everywhere, bound \(K(z-Rz')\) by \(\Vert z-z'\Vert ^{-d}\), and use the bounds from that paper. \(\square \)
5.6 Bounds on the Function Q
It remains to prove
Lemma 5.21
Under Assumption 2.1,
and similarly for \(Q_{i,1}\).
Proof
We fix i and we simply write Q instead of \(Q_{i,0}\). In all the estimates below we will use repeatedly the inequality \(|(\kappa _2\circ \iota _i)(s-\beta ,v)|\vee |(\kappa _2\circ \iota _i)(s+\beta ,v)|\lesssim t^{-\underline{c}}\wedge t^{-\overline{c}}\).
We first estimate \(\sup _{s\in \mathbf{R}_+}\left| Q\left( s\right) \right| \). Here \(\beta \in \mathbf{R}_+\), \(v=(t,x)\), with \(x\in \mathbf{R}^{d-1}\).
and the bound does not depend upon s. Here, to go from the second to the third line, we set \(a = s+\beta \) and we performed the substitution \(t \mapsto a^2 t/4\).
It remains to estimate \(\int _{\mathbf{R}_+} |Q\left( s\right) |(1+s) \,{\mathrm{d}}s\). Again \(v=(t,x)\), with \(x\in \mathbf{R}^{d-1}\).
thus concluding the proof. \(\square \)
6 Auxiliary Results
In this section, we collect a number of results that are more or less straightforward consequences of known results, specialised to our setting. Throughout this section, we assume that we are working with the regularity structure defined in Section 4.2 and that \(\xi _\varepsilon \) is defined as in (4.1) and satisfies Assumption 2.1.
We will write \(\mathcal{C}^\gamma \) as a shorthand for \(\mathcal{D}^\gamma ({\bar{T}})\), where \({\bar{T}}\) is the sector spanned by the Taylor polynomials, and similarly for \(\mathcal{C}^{\gamma ,\eta }\), etc. Note that for \(\gamma \not \in \mathbf{N}\) this is consistent with the usual definition of \(\mathcal{C}^\gamma \).
Proposition 6.1
Let \(\zeta ^+, \zeta \in \mathcal{C}^{-{5\over 2}-\kappa }\) be such that \(\zeta ^+(\phi ) = 0\) for \(\phi \) supported in \(\{t < 0\}\) and \(\zeta ^+(\phi ) = \zeta (\phi )\) for \(\phi \) supported in \(\{t > 0\}\) and let \({\varvec{\Pi }}\) be an admissible model with . Write for \(\mathcal{K}^{{\hat{\zeta }}}\) defined as in [9, Sec. 4.5]. Define , and analogously and, given \(\Phi \), let \(V^{(0)}\) be given by (4.22) for some \(u^{(0)}\in \mathcal{C}^{3-\kappa }\).
Then, setting \({\bar{\gamma }} = {3\over 2}-\kappa \) and \({\bar{\eta }} = -{1\over 2}-\kappa \), there exists a choice \(\Phi \in \mathcal{C}^{{\bar{\gamma }},{\bar{\eta }}}\) such that \(\mathcal{R}V^{(0)} = K_{\mathrm{Neu}}\bigl (G(u^{(0)})\zeta ^+\bigr )\) and \(V^{(0)}\) belongs to \(\mathcal{D}^{{\bar{\gamma }},{\bar{\eta }}}\). In particular, if \(\zeta ^+ \in \mathcal{C}^\alpha \) for some \(\alpha > -1\) is supported in \(\mathbf{R}_+ \times D\) then, for \(t \in [0,1]\), \(\mathcal{R}V^{(0)}\) coincides with the solution of \(\partial _t v = \Delta v + G(u^{(0)}) \zeta ^+\) with vanishing initial condition, endowed with Neumann (respect. Dirichlet) boundary conditions.
If furthermore \(\zeta _n \rightarrow \zeta \) in \(\mathcal{C}^{-{5\over 2}-\kappa }\) and \({\varvec{\Pi }}_n \rightarrow {\varvec{\Pi }}\) as admissible models, then one has \(|\!|\!|V^{(0)}_n ; V^{(0)}|\!|\!|_{{\bar{\gamma }},{\bar{\eta }}} \rightarrow 0\).
Remark 6.2
In principle, the model \({\varvec{\Pi }}\) does contain non-trivial information through its action on . This is because the kernel \(K_\partial \) is not 2-regularising (condition 5.5 in [11, Ass. 5.1] fails to be satisfied), so that the extension theorem [11, Thm 5.14] cannot be applied here.
Proof
We aim to apply Corollary B.6. Let \(\gamma = {1\over 2}-2\kappa \), \(\eta = -{5\over 2} -\kappa \) (so that in particular \(\gamma - \eta = 3-\kappa \)), and let \(\mathcal{B}= \mathcal{C}^{\gamma -\eta }\) (on which \(\mathcal{C}^{\gamma -\eta }\) then acts canonically by multiplication) with the injection . (This is actually independent of the model \({\varvec{\Pi }}\).) Given \(g \in \mathcal{B}\), we set
which is consistent with the reconstruction operator by our assumption on \(\zeta ^+\). We are therefore in the setting of Corollary B.6 provided that we set \(K_0 = K_{\mathrm{Neu}}\).
This guarantees that we can find \(\Phi \in \mathcal{C}^{\gamma +2,\eta +2}\) with the desired properties. The continuity as a function of \(\zeta ^+\) and the model \({\varvec{\Pi }}\) follows from the corresponding continuity statement in Corollary B.6. \(\square \)
Proposition 6.3
For every \(g \in \mathcal{C}^{2-\kappa }\) one can find \(\Phi \) taking values in the Taylor polynomials such that, setting
one has \(V \in \mathcal{D}^{2-2\kappa ,{\bar{w}}}\) with \({\bar{w}} = \big ({1\over 2}-2\kappa ,{1\over 2}-2\kappa ,0\big )\), and .
Proof
We make use of [9, Lem. 4.12] and Corollary B.6. For this, similarly to above, we set \(\mathcal{B}= \mathcal{C}^{2-\kappa }\) and, for \(g \in \mathcal{B}\), we set
Note that as a consequence of Proposition A.2 one has for any \({\tilde{\gamma }} > 0\) and for \({\tilde{w}} = \big (-{1\over 2}-\kappa \big )_3\), where \((\eta )_3 {\mathop {=}\limits ^{\mathrm{def}}}(\eta ,\eta ,\eta )\). Since , it follows from [9, Lem. 4.3] that \(\iota g \in \mathcal{D}^{\gamma , w}\) for \(\gamma = 1-2\kappa \) and for \(w = \bigl (-{3\over 2}-2\kappa \bigr )_3\). It then follows from [9, Lem. 4.12] that one can find a modelled distribution \(V \in \mathcal{D}^{2-2\kappa ,{\bar{w}}}\) with \({\bar{w}} = \big ({1\over 2}-2\kappa ,{1\over 2}-2\kappa ,0\big )\) of the form
with \({\hat{\Phi }}\) taking values in the Taylor polynomials and such that
thus concluding the proof. \(\square \)
Proposition 6.4
Equip the regularity structure \((\mathcal{T},\mathcal{G})\) with an admissible model \({\varvec{\Pi }}\) in the sense of Definition 4.2 and let G be a modelled distribution of the form
for some functions \(g_1, g_2 \in \mathcal{C}^{2-\kappa }\) and \(g_3 \in \mathcal{C}^{{3\over 2}-\kappa ,w}\) with \(w = \bigl (-{1\over 2}-\kappa ,{1\over 2}-\kappa ,{1\over 2}-\kappa \bigr )\). Then, there exists a unique modelled distribution which we call such that on test functions whose support does not intersect \(\mathbf{R}\times \partial D\) and such that
locally uniformly over \(x \in \mathbf{R}\times D\) and uniformly over \(\lambda \in (0,1]\).
Furthermore, there exists \(V \in \mathcal{D}^{2-2\kappa ,{\bar{w}}}\) with \({\bar{w}} = \big ({1\over 2}-2\kappa ,{1\over 2}-2\kappa ,0\big )\) taking values in the translation invariant sector, and such that . The function V is of the form
with \(\Phi \) taking values in the Taylor polynomials, and the map \((g_1,g_2,g_3,{\varvec{\Pi }}) \mapsto V\) is uniformly Lipschitz continuous on bounded sets.
Proof
We use again the same strategy of proof as in Proposition 6.3, but this time we take as our space \(\mathcal{B}\) the space of triples \(g = (g_1,g_{2,i},g_3)\) as in the statement of the proposition and we set
where \( \mathcal{R}\) is the reconstruction operator given by Theorem C.5. This time, we have and , so that it follows from [9, Lem. 4.3] that for \(\gamma = {1\over 2}-2\kappa \) and \(w = \bigl (-{3\over 2}-2\kappa ,-\frac{1}{2}-2\kappa ,-\frac{3}{2}-2\kappa \bigr )\).
This shows that Theorem C.5 can indeed be applied to this situation since furthermore our admissible models are such that as soon as \(\phi \) is supported outside of \(\mathbf{R}\times D\). The remainder of the proof then follows from an application of [9, Lem. 4.12] in the same way as in the proof of Proposition 6.3. \(\square \)
Remark 6.5
Note that the results of [9] do not apply here since and are both strictly below \(-1\). Our saving grace is that the coefficients are sufficiently well-behaved near the boundary of the domain.
Notes
Here we perform the reflections consistent with the reflection principle for our choice of boundary conditions.
Simplified except for the treatment of the boundary conditions, which leads to non-trivial complications.
We ignore the possibility of having “derivatives” on our edges as in [5], as these do not occur in this article.
with the convention \(\mathcal{L}= \mathcal{L}_1\) for \(\mathcal{L}_1\) the Taylor lift (3.3)
References
Bal, G.: Homogenization with large spatial random potential. Multiscale Model. Simul. 8(4), 1484–1510, 2010. https://doi.org/10.1137/090754066
Bruned, Y., Chandra, A., Chevyrev, I., Hairer, M.: Renormalising SPDEs in regularity structures. ArXiv e-prints. Journal EMS, to appear, 2017. arXiv:1711.10239
Bahouri, H., Chemin, J.-Y., Danchin, R.: Fourier Analysis and Nonlinear Partial Differential Equations, Volume 343 of Grundlehren der Mathematischen Wissenschaften [Fundamental Principles of Mathematical Sciences]. Springer, Heidelberg, 2011, xvi+523
Bal, G., Gu, Y.: Fluctuations of parabolic equations with large random potentials. Stoch. Partial Differ. Equ. Anal. Comput. 3(1), 1–51, 2015. https://doi.org/10.1007/s40072-014-0040-8
Bruned, Y., Hairer, M., Zambotti, L.: Algebraic renormalisation of regularity structures. Invent. Math. 215(3), 1039–1156, 2019. https://doi.org/10.1007/s00222-018-0841-x. arXiv:1610.08468
Chandra, A., Hairer, M.: An analytic BPHZ theorem for regularity structures, 2016. ArXiv e-prints. arXiv:1612.08138v5.
Chandra, A., Weber, H.: Stochastic PDEs, regularity structures, and interacting particle systems. Ann. Fac. Sci. Toulouse Math. (6) 26(4), 847–909, 2017. https://doi.org/10.5802/afst.1555. arXiv:1508.03616
Friz, P.K., Hairer, M.: A course on rough paths. Universitext. Springer, Cham, 2014, xiv+251. With an introduction to regularity structures. https://doi.org/10.1007/978-3-319-08332-2
Gerencsér, M., Hairer, M.: Singular SPDEs in domains with boundaries. Probab. Theory Relat. Fields 173(3–4), 697–758, 2019. https://doi.org/10.1007/s00440-018-0841-1. arXiv:1702.06522
Gu, Y., Tsai, L.-C.: Another look into the Wong–Zakai theorem for stochastic heat equation. Ann. Appl. Probab. 29(5), 3037–3061, 2019. https://doi.org/10.1214/19-AAP1474
Hairer, M.: A theory of regularity structures. Invent. Math. 198(2), 269–504, 2014. https://doi.org/10.1007/s00222-014-0505-4. arXiv:1303.5113
Hairer, M.: Regularity structures and the dynamical \(\Phi ^4_3\) model. In: Current Developments in Mathematics 2014. Int. Press, Somerville, 1–49, 2016. arXiv:1508.05261.
Hairer, M.: An analyst’s take on the BPHZ theorem. In Computation and Combinatorics in Dynamics, Stochastics and Control, Volume 13 of Abel Symposium. Springer, Cham, 429–476, 2018. arXiv:1704.08634.
Hairer, M., Pardoux, E.: A Wong–Zakai theorem for stochastic PDEs. J. Math. Soc. Jpn. 67(4), 1551–1604, 2015. https://doi.org/10.2969/jmsj/06741551. arXiv:1409.3138
Hairer, M., Quastel, J.: A class of growth models rescaling to KPZ. Forum Math. Pi 6, e3, 112 (2018). https://doi.org/10.1017/fmp.2018.2. arXiv:1512.07845.
Hairer, M., Shen, H.: The dynamical sine-Gordon model. Commun. Math. Phys. 341(3), 933–989, 2016. https://doi.org/10.1007/s00220-015-2525-3. arXiv:1409.5724
Kruskal Jr., J.B.: On the shortest spanning subtree of a graph and the traveling salesman problem. Proc. Am. Math. Soc. 7, 48–50, 1956. https://doi.org/10.1090/S0002-9939-1956-0078686-7
Martin, J.: Refinements of the Solution Theory for Singular SPDEs. Ph.D. thesis, Humboldt-Universität zu Berlin, 2018. https://d-nb.info/1182541526/34
Peccati, G., Taqqu, M.S.: Wiener chaos: moments, cumulants and diagrams, vol. 1 of Bocconi & Springer Series. Springer, Milan; Bocconi University Press, Milan, 2011, xiv+274. A survey with computer implementation, Supplementary material available online. https://doi.org/10.1007/978-88-470-1679-8
Walsh, J. B.: An introduction to stochastic partial differential equations. In École d’été de probabilités de Saint-Flour, XIV—1984, Volume 1180 of Lecture Notes in Mathematics. Springer, Berlin, 265–439, 1986. https://doi.org/10.1007/BFb0074920
Whitney, H.: Analytic extensions of differentiable functions defined in closed sets. Trans. Am. Math. Soc. 36(1), 63–89, 1934. 10.2307/1989708
Acknowledgements
The authors gratefully acknowledge financial support from the Leverhulme Trust through a leadership award, from the European Research Council through a consolidator grant, Project 615897, and from the Royal Society through a research professorship. We are also grateful to Máté Gerencsér for allowing us to use his ideas for the results of “Appendix C” and to Ajay Chandra for discussions on the application of the results of [6].
Author information
Authors and Affiliations
Corresponding author
Additional information
Communicated by F. Otto.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendices
Appendix A: Extension of the Kernel
We fix a function \(K :\mathbf{R}^{d+1} \rightarrow \mathbf{R}\) which is smooth, non-anticipative (that is which is supported on positive times), even in the spatial variable, agrees with the heat kernel on \([0,1] \times [-1,1]^d\), and is supported on \([0,2] \times [-2,2]^d\). As usual, we write \(K = \sum _{n \ge 0} {\tilde{K}}_n\) with \({\tilde{K}}_n\) satisfying the conditions of Definition B.1 with \(\beta = 2\). We furthermore write
In order to implement integration against the heat kernel with Neumann boundary conditions, we set
(recall that the reflection group \(\mathcal{G}\) was defined in Section 4.1) where \(\psi :\mathbf{R}_+ \rightarrow \mathbf{R}_+\) is a smooth function such that \(\phi (r) = 1\) for \(r \le 2\) and \(\phi (r) = 0\) for \(r \ge 3\). The kernel \(K_{\mathrm{Dir}}\) is defined similarly.
Furthermore, it is obvious that Assumption B.1 is satisfied with \(\beta = 2\), since this is the case for K itself. Note that this would not be true if it weren’t for the presence of the cutoff \(\phi \) in (A.1) since the kernel without cutoff has singularities at \(z = R(z')\) for all reflections \(R \in \mathcal{G}\).
Proposition A.1
One has \(K_{\mathrm{Neu}}(z,z') = \sum _{R \in \mathcal{G}} K(z-R(z'))\) for \(z,z' \in ([0,1]\times D)^2\).
Proof
Recall that \({\tilde{K}}_n\) is supported in the ball of radius \(2^{-n}\) and note that \(\Vert z-R(z')\Vert \ge \Vert z-z'\Vert \) whenever both z and \(z'\) belong to \(\mathbf{R}\times D\). As a consequence, for every \(z,z' \in ([0,1]\times D)^2\), every \(n\ge 0\) and every \(R \in \mathcal{G}\), one has either \({\tilde{K}}_n (z-R(z')) = 0\) or \(\phi (2^n \Vert z-z'\Vert ) = 1\), so that one can replace \(\phi \) by 1 in (A.1). \(\square \)
In order to state the main result of this appendix, we write \({\tilde{D}} = \mathbf{R}_+ \times D\) as a shorthand.
Proposition A.2
Let K, \(K_\partial \), \(K_i\) and \(K_{\partial ,i}\) be as above and let \(\zeta , \zeta ^c \in \mathcal{C}^{\alpha }\) with \(\alpha \le -2\) be such that \(\zeta \) is supported on \({\tilde{D}}\) and \(\zeta ^c\) is supported on its complement. Then, restricted to \({\tilde{D}}\), both \(K \zeta ^c\) and \(K_{\partial } \zeta \) belong to \(\mathcal{C}^{\gamma ,w}\) for \(w = \bigl (\alpha +2\bigr )_3\) and any \(\gamma > 0\). Furthermore, when restricted to \({\tilde{D}}^c\), \(K \zeta \) belongs to \(\mathcal{C}^{\gamma ,w}\).
The same statements hold when \(\alpha \le -3\) with K and \(K_\partial \) replaced by \(K_i\) and \(K_{\partial ,i}\) respectively and \(w = \bigl (\alpha +3\bigr )_3\).
Proof
Consider the case \(z \not \in {\tilde{D}}\), so that
We now note that \(\langle \zeta , D^k {\tilde{K}}_n(z-\cdot )\rangle = 0\) whenever \(2^{-n} \le d(z, {\tilde{D}})\), while in general
by definition of \(\mathcal{C}^\alpha \). It immediately follows that one has
so that one has indeed \(K \zeta \in \mathcal{C}^{\gamma ,(\alpha +2)_3}\) for every \(\gamma > 0\). The case of \(K \zeta ^c\), \(K_i \zeta \) and \(K_i \zeta ^c\) is dealt with in exactly the same way.
Consider now the case \(z\in {\tilde{D}}\) and define
We then have the identity
It follows again from the definition of \(\mathcal{C}^\alpha \) that the bound (A.2) holds with \({\tilde{K}}_n(z-\cdot )\) replaced by \({\tilde{K}}_n^R(z,\cdot )\).
We also note that \({\tilde{K}}_n^R(z,\cdot ) = 0\) unless there exists a point \(z'\) such that one has on one hand \(\Vert z-z'\Vert \lesssim 2^{-n}\) and on the other hand \(\Vert z-R(z')\Vert \lesssim 2^{-n}\). In particular, there exists a constant C such that, if \(d(z, \partial {\tilde{D}}) \ge C 2^{-n}\), one has \({\tilde{K}}_n^R(z,\cdot ) = 0\) unless \(R = {\mathrm {id}}\) (which is excluded from the sum (A.4)). If on the other hand \(d(z, \partial {\tilde{D}}) \le C 2^{-n}\), then \({\tilde{K}}_n^R(z,\cdot )\) is only non-zero for at most eight different reflections R. Combining these observations shows as before that the bound (A.3) holds with \(K\zeta \) replaced by \(K_\partial \zeta \). The proof for \(K_\partial \) replaced by \(K_{\partial ,i}\) is virtually identical. \(\square \)
Corollary A.3
For \(\alpha \in (-3,-2]\) and \(\zeta \in \mathcal{C}^\alpha \) supported on \(\mathbf{R}_+ \times D\), one has \(K_{\partial } \zeta \in \mathcal{C}^{\alpha + 2}\).
Proof
This follows immediately from Proposition A.2, combined with the fact that \(\mathcal{C}^{\gamma ,(\eta )_3} \subset \mathcal{C}^{\eta }\) whenever \(\gamma \ge \eta > -1\). \(\square \)
Proposition A.4
Let \(\zeta \in \mathcal{C}^{\alpha }\) with \(\alpha \le -2\) be supported on \(\mathbf{R}_- \times \mathbf{R}^d\). Then, restricted to \(\mathbf{R}_+ \times \mathbf{R}^d\), \(K_{\mathrm{Neu}}\zeta \) belongs to \(\mathcal{C}^{\gamma ,w}\) for \(w = \bigl (\alpha +2,\gamma , \alpha +2\bigr )\) and any \(\gamma > 0\).
Proof
With the same notations as above, we have for \(z = (t,x)\),
As before, the summand vanishes as soon as \(2^{-n} \lesssim \sqrt{|t|}\) and, for all \(z \in \mathbf{R}_+ \times \mathbf{R}^d\) and \(n\ge 0\), only at most a fixed number of test functions \({\tilde{K}}_n^R(z,\cdot )\) are non-vanishing, so that we obtain for \(t > 0\) the bound
This then implies the required bound by the definition of the spaces \(\mathcal{C}^{\gamma ,w}\). \(\square \)
Appendix B: Integration and Multiplication by Smooth Functions Almost Commute
We will use a form of the multilevel Schauder theorem of [11, Sec. 5] with slightly weaker assumptions on the kernel K. In this section, we fix a space-time scaling \({\mathfrak {s}}\) as in [11]. (In our case this would be the parabolic scaling \({\mathfrak {s}}= (2,1,\ldots ,1)\).)
Definition B.1
A function \(K:\mathbf{R}^d \times \mathbf{R}^d \rightarrow \mathbf{R}\) is said to be \(\beta \)-regularising if it can be decomposed as
where the functions \({\tilde{K}}_n\) have the following properties:
-
For all \(n \ge 0\), the map \({\tilde{K}}_n\) is supported in the set \(\{(x,y)\,:\, \Vert x-y\Vert _{\mathfrak {s}}\le 2^{-n}\}\).
-
For any two multiindices k and \(\ell \), there exists a constant C such that the bound
$$\begin{aligned} \bigl |D_x^k D_y^\ell {\tilde{K}}_n(x,y)\bigr | \le C 2^{(|{\mathfrak {s}}| - \beta + |\ell |_{\mathfrak {s}}+ |k|_{\mathfrak {s}})n}, \end{aligned}$$(B.2)holds uniformly over all \(n \ge 0\) and all \(x,y \in \mathbf{R}^d\).
Remark B.2
This is identical to [11, Ass. 5.1], but with the last condition absent. It turns out that the only place where the third condition of [11, Ass. 5.1] is ever used in [11, Sec. 5] is in the proof of [11, Lemma 5.19], which in turn is only used for the proof of the extension theorem, [11, Thm 5.14].
We assume that we are given a regularity structure of the type studied in [5, 11], endowed with a family of integration maps for multiindices \(k \in \mathbf{N}^d\). We also assume that we are given a model \((\Pi ,\Gamma )\) which is admissible for the collection of kernels \(K_k\) such that
We will furthermore assume that our regularity structure contains the polynomial structure on \(\mathbf{R}^d\) (for some fixed scaling) and that admissible models satisfy the usual identity for every \(\tau \in \mathcal{T}\). Assuming that is of order \(\beta \), we assume that is of order \(\beta + |k|_{\mathfrak {s}}\), which is compatible with (B.3) in the sense that if \(K_0\) is \(\beta \)-regularising in the sense of Definition B.1, then \(K_k\) is \((\beta + |k|_{\mathfrak {s}})\)-regularising.
Lemma B.3
Let \(K_0\) be a \(\beta \)-regularising kernel for some \(\beta > 0\) and let \((\Pi ,\Gamma )\) be an admissible model for the collection of kernels \(K_k\) given in (B.3). Then, one has the identities
Proof
We first consider the identity for \(\Pi _x\). We can assume that \(k=0\) since the general case then follows at once by simply setting . We also assume without loss of generality that \(\tau \) is homogeneous of degree \(\alpha \), so that
where the sum is constrained by \(|k|_{\mathfrak {s}}< \alpha + \beta + |\ell |_{\mathfrak {s}}\). On the other hand, one has
where the sum this time is constrained by \(|k|_{\mathfrak {s}}< \alpha + \beta + |\ell -m|_{\mathfrak {s}}\). The first term in this expression clearly equals the first term of (B.5) by the binomial theorem, so we only need to consider the second term. The integrand can be written as
where p is constrained by \(p \le k \wedge (\ell -m)\). Setting \(q = k-p\) and \(r = k+m\), this can be written as
where the sum is constrained by the fact that all three variables are positive mutiindices and furthermore
It follows that for any fixed values of q and r the above sum vanishes, except when \(r-q = 0\), so that it equals
constrained by \(|q|_{\mathfrak {s}}< \alpha + \beta + |\ell |_{\mathfrak {s}}\), which is precisely the integrand appearing in the second term of (B.5).
The second identity immediately follows from the first one. Indeed, both sides take values in the Taylor polynomials by the definition of an admissible model. Furthermore, the first identity implies that
As a consequence of combining this with the first identity of (B.4), both sides of the second identity of (B.4) are equal after applying \(\Pi _x\) to them. Since furthermore both sides belong to the space of Taylor polynomials on which \(\Pi _x\) is injective, the claim follows. \(\square \)
Corollary B.4
In the context of Lemma B.3, write \(\mathcal{K}_k\) for the integration operator associated to as in [11] and write \(\mathcal{L}_\gamma \) for the Taylor lift \(\mathcal{C}^\gamma \rightarrow \mathcal{D}^\gamma \). Then, for every \(F \in \mathcal{D}_\chi ^\gamma \) with \(\chi \le 0 < \gamma \) and \(g \in \mathcal{C}^{\theta -\chi }\) with \(\theta \in (0,\gamma ]\) (and such that \(\theta + \beta \not \in \mathbf{Z}\)), one can find a function \(\phi \in \mathcal{C}^{\theta + \beta }\) such that, setting
one has \(G \in \mathcal{D}^{\theta + \beta }\) and \(\mathcal{R}G = K (g\mathcal{R}F)\).
Proof
A straightforward calculation virtually identical to (B.6) shows that, as a consequence of the first identity of Lemma B.3, one has
For \(F\in \mathcal{D}^\gamma _\chi \), write , so that \(\mathcal{N}_k (F)\) takes values in the Taylor polynomials of degree at most \(\gamma +\beta \), and set
We then set
Combining this with (B.8), it follows that, setting \({\bar{G}} = \mathcal{K}_0(\mathcal{L}_{\theta -\chi }(g)F)\), one has
Combining this with the second identity of Lemma B.3 and writing \({\bar{\mathcal{Q}}}\) for the projection onto the Taylor polynomials, we conclude that
Since furthermore \({\bar{G}}\) and each of the terms \(\mathcal{L}_{\theta -\chi -|\ell |_{\mathfrak {s}}}(D^\ell g) \,\mathcal{K}_{\ell } \bigl (F\bigr )\) belong to \(\mathcal{D}^{\theta + \beta }\), we conclude that one necessarily has \( G \in \mathcal{D}^{\theta + \beta }\). This in turn implies that \(\Phi \in \mathcal{D}^{\theta + \beta }\), so that it is the lift of a function \(\phi \in \mathcal{C}^{\theta +\beta }\). By (B.9), we furthermore have \(\mathcal{R}G = \mathcal{R}{\bar{G}} = K (g\mathcal{R}F)\), thus concluding the proof. \(\square \)
One simple but very useful corollary of this result can be formulated as follows:
Corollary B.5
Let \(\zeta \in \mathcal{C}^\chi \) with \(\chi \le 0\), let K and \(K_k\) be as above, and let \(g \in \mathcal{C}^{\theta -\chi }\) with \(\theta > 0\) and \(\theta + \beta \not \in \mathbf{Z}\). Then,
Proof
It suffices to consider the case of a regularity structure with symbol (plus Taylor polynomials and their products with ) and model mapping to \(\zeta \). We then apply the reconstruction operator to both sides of (B.7) with . \(\square \)
In our context, we will need an analogous result, but for the spaces \(\mathcal{D}^{\gamma ,\eta }\) of [11, Sec. 6]. Furthermore, we will need to be able to cover situations in which [11, Prop. 6.16] does not apply because we consider elements taking values in a sector of regularity below \(-2\), so that the reconstruction theorem [11, Prop. 6.9] fails. We therefore make use instead of [9, Lem. 4.12] which, given \(F\in \mathcal{D}^{\gamma ,\eta }(V)\) with V of regularity \(\alpha \ge \eta \), allows us to specify a distribution \(\zeta \in \mathcal{C}^\eta \) such that \((\mathcal{R}F)(\phi ) = \zeta (\phi )\) for every test function \(\phi \) whose support does not intersect the plane \(P = \{(t,x)\,:\, t = 0\}\). We then have the following result.
Corollary B.6
Let \(\gamma > 0\), let V be a sector of regularity \(\alpha \le 0\), and let \(w = (\eta ,\sigma ,\mu )\) with \(\eta , \sigma , \mu \le \alpha \) and \(\eta \le \sigma \wedge \mu \), \(\eta + \beta > -2\), \(\sigma +\beta > -1\). For each admissible model \({\varvec{\Pi }}\), let \(\mathcal{B}\) be a Banach space equipped with a bounded map \(\iota :\mathcal{B}\rightarrow \mathcal{D}^{\gamma ,w}(V)\). (Since the latter depends on \({\varvec{\Pi }}\) in general, this can also be the case for \(\mathcal{B}\) and/or \(\iota \). In this case, we assume that \(\iota \) is bounded independently of the underlying model.) Let furthermore \({\hat{\mathcal{R}}} :\mathcal{B}\rightarrow \mathcal{C}^{\eta }\) be a continuous linear operator such that \(({\hat{\mathcal{R}}} F)(\phi ) = (\mathcal{R}\iota F)(\phi )\) for every \(F \in \mathcal{B}\) and every test function \(\phi \) whose support does not intersect the two boundaries.
We furthermore assume that we have a continuous bilinear map
such that \(\iota (gF) = \mathcal{L}_{\gamma -\eta }(g)\iota F\) and such that
where the right hand side is meaningful thanks to the fact that \(\gamma > 0\). Then, with \({\bar{\gamma }}\) and \({\bar{w}}\) as in [9, Lem. 4.12], one can find \(\phi \in \mathcal{C}^{{\bar{\gamma }},{\bar{w}}}\) such that the modelled distribution G given by (B.7) (with suitably defined \(\mathcal{K}_\ell \), see the proof) belongs to \(\mathcal{D}^{{\bar{\gamma }},{\bar{w}}}\) and satisfies \(\mathcal{R}G = K_0 (g{\hat{\mathcal{R}}} F)\).
If furthermore one has a sequence of models \({\varvec{\Pi }}_n \rightarrow {\varvec{\Pi }}\) with associated subspaces \(\mathcal{B}_n \subset \mathcal{D}^{\gamma ,w}(V)\) and reconstruction operators \({\hat{\mathcal{R}}}_n\), as well as a sequence \(F_n \in \mathcal{B}_n\) such that \(|\!|\!|F_n; F|\!|\!|_{\gamma ,w} \rightarrow 0\) and \(\Vert {\hat{\mathcal{R}}}_n F_n - {\hat{\mathcal{R}}} F\Vert _{\eta } \rightarrow 0\), then the sequence of modelled distributions \(G_n\) constructed in the first part of the statement converges to G in the sense that \(|\!|\!|G_n; G|\!|\!|_{{\bar{\gamma }},{\bar{w}}} \rightarrow 0\).
Proof
The proof is virtually identical to that of Corollary B.4. The only difference is that we use [9, Lem. 4.12] to define maps \(\mathcal{K}_k :\mathcal{B}\rightarrow \mathcal{D}^{{\bar{\gamma }}_k,{\bar{w}}_k}\) (with \({\bar{\gamma }}_k\) and \({\bar{w}}_k\) defined like \({\bar{\gamma }}\) and \({\bar{w}}\), but with \(\beta \) replaced by \(\beta + |k|_{\mathfrak {s}}\)) such that \(\mathcal{R}\mathcal{K}_k F = K_k {\hat{\mathcal{R}}} F\). (Since \(\eta +\beta > -2\) by assumption, the assumptions of that lemma are satisfied and \(\mathcal{R}\mathcal{K}_k F\) is always well-defined as a distribution on the whole space-time.) \(\quad \square \)
Appendix C: Reconstruction Theorem
In this section, we present a version of the reconstruction theorem that allows to bypass to some extent the condition \(\nu > -1\) appearing in [9]. This appendix was written in collaboration with Máté Gerencsér. In this section we assume that \(P_0 = \{0\}\times \mathbf{R}^d\), \(P_1 = \mathbf{R}\times \partial D\), and \(P = P_0 \cup P_1\), and we write \(|x|_{P_i}\) for the parabolic distance between x and \(P_i\). Generic points x, y, etc are space-time points.
Throughout this section, we fix a regularity structure (T, G) containing the polynomial structure and such that the product with elements of the polynomial structure \({\bar{T}}\) is well-defined in T. We also only consider models such that \(\Gamma _{x,y}\) acts on \({\bar{T}}\) by translations by \(y-x\). Let us recall from [11, Sec 8] that, given any regularity structure (T, G), a model \((\Pi ,\Gamma )\) can alternatively be described by a linear map \({\varvec{\Pi }}:T\rightarrow \mathcal{D}'\), together with a continuous map \(F:\mathbf{R}^{d}\rightarrow G\) such that, setting \(\Pi _x={\varvec{\Pi }}F_x\), \(\Gamma _{xy}=(F_x)^{-1}F_y\), the analytic bounds for models are satisfied. The main assumption we impose on our models in the present setting is the following:
Assumption C.1
Setting \(T_{<} = \bigoplus _{\alpha \le 1} T_\alpha \), there exist linear maps \({\varvec{\Pi }}^+, {\varvec{\Pi }}^-:T_{<} \rightarrow \mathcal{D}'\) with \({\varvec{\Pi }}^+ + {\varvec{\Pi }}^- = {\varvec{\Pi }}{\upharpoonright }T_{<}\) and such that
-
\(\big ({\varvec{\Pi }}^+\tau \big )(\psi )=0\) for all \(\tau \in T_<\) and all \(\psi \) supported in \(\mathbf{R}\times D^c\) and \(\big ({\varvec{\Pi }}^- \tau \big )(\psi )=0\) for all \(\psi \) supported in \(\mathbf{R}\times D\);
-
Setting \(\Pi ^+_x = {\varvec{\Pi }}^+F_x\), \(\Pi ^-_x:={\varvec{\Pi }}^-F_x\) the pairs \((\Pi ^+,\Gamma )\) and \((\Pi ^-,\Gamma )\) are models on \((T_<,G)\) in the sense of [11]. (But they are not admissible in general!)
We will always write \({\underline{\alpha }}\) for the lowest degree appearing in our ambient regularity structure T. We also fix \(\gamma > 0\) as well as exponents \(\eta \) and \(\sigma \) on which we make the following assumption:
Assumption C.2
The exponents satisfy the condition
We also use the shorthand \(w = (\eta ,\sigma ,\eta )\) similarly to [9] (except that we make the simplifying assumption that the “corner exponent” coincides with \(\eta \) which is not essential but simplifies our argument). One crucial ingredient for our result is the following:
Lemma C.3
For every \(f \in \mathcal{D}^{\gamma ,w}\), there exist \(f_\pm \in \mathcal{D}^{\sigma ,\eta }\) such that \(f_+(x) = f(x)\) for \(x \in \mathbf{R}\times D\) and \(f_-(x) = f(x)\) for \(x \in \mathbf{R}\times D^c\).
Proof
It follows from the definition of the spaces \(\mathcal{D}^{\gamma ,w}\) that the restriction of \(\mathcal{Q}_{<\sigma }f\) to either \(\mathbf{R}\times {\bar{D}}\) or \(\mathbf{R}\times \overline{D^c}\) belongs to \(\mathcal{D}^{\sigma ,\eta }\). In particular, components of f of degree below \(\sigma \) can be extended continuously to \((\mathbf{R}{\setminus } \{0\}) \times D\). (Note that \(\sigma < 0\) though!) The claim then follows from an adaptation of Whitney’s extension theorem to the setting of regularity structures, see for example [18, Thm 5.3.16]. \(\square \)
Remark C.4
In general there is no reason for \(f_+\) and \(f_-\) to coincide on \(\mathbf{R}\times \partial D\).
We define spaces \(\mathcal{C}^{\alpha ,\eta }\) with \(\eta \le \alpha < 0\) as consisting of those distributions \(\zeta \in \mathcal{C}^{\eta }(\mathbf{R}^{1+d}) \cap \mathcal{C}^{\alpha }(\mathbf{R}^{1+d} {\setminus } P_0)\) such that
uniformly over x in compacts away from \(P_0\), \(2\lambda \in (0, 2 \wedge |x|_{P_0}]\), and test functions \(\psi \in \mathcal{B}\). Here and below we write \(\mathcal{B}\) for the set of functions supported in the centred (parabolic) ball of radius 1 and with r derivatives bounded by 1, where r is some fixed sufficiently large value. Note that if \(\eta > -2\), \(\zeta \in \mathcal{C}^{\alpha ,\eta }\) is uniquely determined by its action on test functions supported outside \(P_0\), see [11].
Theorem C.5
Under Assumptions C.1 and C.2, there then exists a unique continuous linear operator \(\mathcal{R}:\mathcal{D}^{\gamma ,w}_P\rightarrow \mathcal{C}^{{\underline{\alpha }},\eta }\) such that \(\mathcal{R}f(\psi )={\tilde{\mathcal{R}}} f(\psi )\) for all \(\psi \) supported in \(\mathbf{R}^d{\setminus } P\), and such that one has the bound
uniformly over \(x\in P_1{\setminus } P_0\) (in compacts), over \(\lambda \in (0,1]\) such that \(2\lambda \leqq |x|_{P_0}\), and over \(\psi \in \mathcal{B}\).
Remark C.6
We did not specify the continuity of the map \(\mathcal{R}\) with respect to different models. We will continue to omit continuity statements in the sequel, on one hand for the sake of easing the presentation, and on the other hand due to the fact that since all the operations discussed here are linear, the “linearisation trick” of [14, Prop 3.11] automatically implies all the required continuity properties.
Proof
The proof is very similarly to that of [9, Thm 4.10], but due to the central role of the statement in our proof, we provide some detail. The uniqueness part is quite straightforward: take two \(\mathcal{C}^{{\underline{\alpha }},\eta }\) distributions \(\xi _1,\xi _2\) that have the properties claimed for \(\mathcal{R}f\) in the theorem. Their difference then vanishes away from P, and thanks to the bound (C.2), must belong to \(\mathcal{C}^{{\underline{\alpha }},\eta }\). Since \(\eta > -2\) however, such a distribution necessarily vanishes.
To construct \(\mathcal{R}\), we use essentially the same construction as in the proof of [11, Prop. 6.9]. Similarly to the construction of the functions \(\phi _{x,n}\) performed there, we can find, for every \(n \in \mathbf{N}\), a countable index set \(\Xi _n\) and functions \(\phi _{x,n}\) with \(n \in \mathbf{N}\) and \(x \in \Xi _n\) with the following properties. There exist constants \(c_i > 0\) such that:
-
(i)
For every \(n \in \mathbf{N}\) and \(x \in \Xi _n\) there exists \(\psi \in \mathcal{B}\), \(y \in \mathbf{R}^{d+1}\) with \(|y|_{P_1} = 2^{-n}\) such that, setting \(\lambda = 2^{-n-1}\), one has \(\phi _{x,n} = c_1 \lambda ^{d+2} \psi _{y}^\lambda \).
-
(ii)
For every ball B of (parabolic) radius \(\lambda \), there exist at most \(c_2\lambda ^{d+1}\) elements \(x \in \Xi _n\) with \({\mathrm{supp}}\,\phi _{x,n} \cap B \ne \emptyset \).
-
(iii)
For every \(y \in \mathbf{R}^{d+1}\) with \(0 < |y|_{P_1} \le 1\), one has \(\sum _{n \in \mathbf{N}}\sum _{x \in \Xi _n} \phi _{x,n}(y) = 1\).
(Note that the sum appearing in the last claim always converges since, by the first two properties, it is guaranteed to only contain finitely many terms.)
We now write \(\mathcal{R}_\pm \) for the reconstruction operators for \(\mathcal{D}^\gamma \) spaces with \(\gamma < 0\) associated to the models \(\Pi ^\pm \) as in the second part of [11, Thm 3.10], we fix \(y \in \mathbf{R}^{d+1} {\setminus } P_0\), \(\lambda \le 1 \wedge |y|_{P_0}/C\) for some large enough (but fixed) constant C, and \(\psi \in \mathcal{B}\), and we define
If one also has \(\lambda \le |y|_{P_1}/2\), then the second sum only contains finitely many terms and one has \((\mathcal{R}f)(\psi ) = ({\tilde{\mathcal{R}}} f)(\psi )\). Since in this case it follows from [9, Def. 3.1] combined with [11, Lem. 6.7] that one has the bound
it remains to consider the convergence of the second term. Since we can restrict ourselves to the case \(|y|_{P_1} \le 2\lambda \), we can assume without loss of generality that \(y \in P_1 {\setminus } P_0\). In particular, the terms in the sum vanish unless \(2^{-n} \lesssim \lambda \).
We also note that for these terms, one can write
for some z, some \({\tilde{\psi }} \in \mathcal{B}\), and \({\bar{\lambda }} = 2^{-n-1}\). Provided that C is sufficiently large, properties (i) and (ii) guarantee furthermore that z is such that \({\bar{\lambda }} \simeq |z|_{P_1}\le |z|_{P_0}\). It follows from this that, when restricted to the support of \({\tilde{\psi }}_{z}^{{\bar{\lambda }}}\), one has \(\Vert \mathcal{Q}_{<\delta } f\Vert _{\mathcal{D}^\delta } \lesssim |z|_{P_0}^{\eta -\sigma } {\bar{\lambda }}^{\sigma -\delta }\), provided that \(\delta \in [\sigma ,\gamma ]\), so that [11, Lem. 6.7] yields again
Finally, since the support of \({\tilde{\psi }}\) is either fully contained in \(\mathbf{R}\times D\) or fully contained in \(\mathbf{R}\times D^c\), it follows that
and we conclude that
Since there are at most \(2^{n(d+1)}\) such terms and since \(\sigma > -1\) by assumption, both the convergence of the sum and the required bound (C.2) follow. \(\square \)
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Hairer, M., Pardoux, É. Fluctuations Around a Homogenised Semilinear Random PDE. Arch Rational Mech Anal 239, 151–217 (2021). https://doi.org/10.1007/s00205-020-01574-8
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00205-020-01574-8