Abstract
We give a novel characterization of the centered model in regularity structures which persists for rough drivers even as a mollification fades away. We present our result for a class of quasilinear equations driven by noise, however we believe that the method is robust and applies to a much broader class of subcritical equations. Furthermore, we prove that a convergent sequence of noise ensembles, satisfying uniformly a spectral gap assumption, implies the corresponding convergence of the associated models. Combined with the characterization, this establishes a universality-type result.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
We continue the program initiated in [23] and consider quasilinear parabolic equations of the form
with a rough random forcing \(\xi \). We are interested in the regime where the nonlinear term \(a(u)\Delta u\) is singular but renormalizable. The nonlinearity a is assumed to be scalar-valued, smooth and such that the equation is uniformly parabolic.
In [21], the authors suggested an alternative point of view on Hairer’s regularity structures [12], the main differences being a different index set than trees and working with objects that are dual to Hairer’s. We comment throughout the text on differences and similarities between the two approaches, see in particular Remarks 1.8 and 1.9. However, the main philosophy of splitting the problem into a probabilistic construction of the model and a deterministic solution theory stays the same. Using the notion of modelled distributions, an a-priori estimate was established in [21] by assuming existence of a model that is suitably (stochastically) estimated. The main ingredients of a regularity structure and the notion of model in this setting were introduced there.
In [18], this regularity structure for quasilinear equations was systematically obtained and algebraically characterized. It was shown that, analogous to Hairer’s original approach, a Hopf-algebraic structure lies behind the structure group. Moreover, working with a dual perspective allowed for the insight that this Hopf algebra arises from a natural Lie algebra: the generators of actions on nonlinearities and solutions. While this provides a deeper insight into regularity structures, it is logically not necessary for [19, 21] or this work.
In [19], the construction and stochastic estimates of the model were provided, which serves as an input for [21]. Although the main result can be compared (in much less generality, of course) to the one in [5], the technique to obtain this result is drastically different. It is based on Malliavin calculus and a spectral gap assumption on the noise ensemble, which can be seen as a substitute for Gaussian calculus. Recently, this technique has been picked up and applied to the tree setting in great generality [13], which shows the robustness of the method. One further difference compared to the works in the framework of [12], is that in [19] the renormalization procedure is top-down rather than bottom-up. By this we mean that, guided by symmetries, we postulate a counterterm in the equation with as little degree of freedom as possible. The counterterm is then translated to the model, and the remaining degree of freedom is fixedFootnote 1 by the BPHZ-choice of renormalization. This strategy has the advantage that symmetries are built in, which comes at the price that it is a-priori not clear whether the equation can be renormalized with such a restricted counterterm.
Although the Malliavin derivative may be seen as a mere technical tool in [19], it actually allows for a conceptual gain: it allows for a characterization of models which persists for rough noise, as we shall establish in this work. This is in contrast to the existing literature on regularity structures, where the model is only tangible for smooth approximations, but not in the limit as the smoothness fades away. We believe that this characterization will prove useful in establishing qualitative properties of solutions of singular SPDEs, that are not present on the level of approximations. We think in particular of lattice approximations, as e.g. obtained in [20, 24] for two dimensional Ising-like models and in [10] for three dimensional ones, which properly rescaled and with suitable interaction converge to the dynamical \(\Phi ^{2n}_2\) resp. \(\Phi ^4_3\) model; for further results on discrete approximations we refer to [7] and references therein. A first simple application is given in Corollary 1.5, where we show scale invariance of the model. We also think that the method is robust and can be applied to other equations; in Remark 1.6 we provide two examples, a multiplicative stochastic heat equation and a version of the stochastic thin-film equation, and we outline what has to be modified for those equations. We also expect that weak universality results can be obtained, which we aim to address in future work.
In the following we briefly summarize the main results obtained in this work.
1.1 Main results
-
Characterization. In Definition 1.1 we make precise what we mean by a model. With help of the Malliavin derivative, the definition is weak enough to allow for rough noise, but strong enough for a unique characterization. In Theorem 1.3 we establish existence and uniqueness of the model associated to (1.1) under a spectral gap assumption on the driving noise.
-
Convergence. If a sequence of noise ensembles converges in law (resp. in probability resp. in \(L^p\)), and satisfies uniformly a spectral gap inequality, then the corresponding models obtained via Theorem 1.3 converge in law (resp. in probability resp. in \(L^p\)), see Theorem 1.4 for a precise statement.
-
Universality. The limit obtained in the previous item is independent of the approximating sequence. This is an immediate consequence of combining Theorem 1.3 and Theorem 1.4, and establishes universality in the class of (not necessarily Gaussian) noise ensembles satisfying uniformly a spectral gap inequality. We want to emphasize that such situations (in particular for non-Gaussian approximations) naturally appear in applications, e.g. in thin ferromagnetic films whose idealized physical model is triggered by white noise, but the actual physical approximation is not Gaussian due to the polycrystallinity of the material [14, 15].
-
Lifting symmetries. The characterizing Definition 1.1 allows to propagate symmetries of the noise ensemble to the associated model, cf. Corollary 1.5. This is of particular interest for symmetries that are not preserved under approximation, e.g. scale invariance for smooth noise.
Let us now comment on related work. A characterization of models in the rough setting as obtained in this work has not been carried out in the existing literature on regularity structures, although a characterization by fixing the expectation, the BPHZ-model, is available, see [2, Theorem 6.18]. However, this BPHZ characterization is restricted to the setting of smooth random models, and depends on a (non-canonical!) truncation of the solution operator. In view of the recent work [13] that exploits the Malliavin derivative in the tree setting, it is conceivable that the characterization of the present work can also be achieved in the tree setting, possibly by appealing to the dictionary recently developed in [3]. Let us point out that convergence of models was already obtained in [12] for specific equations, and in [5] in great generality. However, the convergence and universality obtained there is within the class of noise ensembles satisfying uniform cumulant bounds. A similar convergence and universality result was obtained in [6] via the renormalization group flow approach, again in the realm of cumulant bounds. Let us also mention the work [8], which establishes a weak universality result for a class of reaction diffusion equations (driven by Gaussian noise) by making use of the Malliavin derivative in the framework of paracontrolled distributions [9]; for further results on weak universality we refer to references therein. A first universality result in the context of a spectral gap assumption was obtained in [15] for a mildly singular SPDE; it relies on a characterization of the limiting model by suitable commutator estimates instead of the Malliavin derivative. Although such commutator estimates could also serve for a characterization of the model associated to (1.1), we believe that the approach via the Malliavin derivative pursued here is more suitable for extensions to other equations.
Notation
We denote a generic space-time point by \(x=(x_0,\dots ,x_d)\in \mathbb {R}^{1+d}\), where we think of 0 as the time-like coordinate and \(1,\dots ,d\) for \(d\ge 1\) as the space-like coordinates. If not further specified, we take suprema over space-time points over all of \(\mathbb {R}^{1+d}\). A typical element of \(\mathbb {N}_0\) is denoted by k, where we will often write \(k\ge 0\) instead of \(k\in \mathbb {N}_0\). Similarly, \(\textbf{n}\) denotes a typical element of \(\mathbb {N}_0^{1+d}\), where we shall write \(\textbf{n}\ne \textbf{0}\) instead of \(\textbf{n}\in \mathbb {N}_0^{1+d}\setminus \{\textbf{0}\}\). Partial derivatives are denoted by \(\partial ^\textbf{n}=\partial _0^{n_0}\cdots \partial _d^{n_d}\), and for the spatial Laplacian we write \(\Delta =\partial _1^2+\dots +\partial _d^2\). In line with the parabolic operator \((\partial _0-\Delta )\) we measure length with respect to
so that the effective dimension of \(\mathbb {R}^{1+d}\) equipped with this distance is given by \(D=2+d\). By \(\Vert \cdot \Vert _{\dot{H}^s}\) we denote an anisotropic version of the homogeneous Sobolev norm of order \(s\in \mathbb {R}\):
We will write (1.12)\(_\beta \) when we refer to the corresponding statement for a specific multi-index \(\beta \), and sometimes write (1.24)\(_\beta ^\gamma \) when we want to specify also a second multi-index. By \(\lesssim \) like in (1.45) we mean \(\le C\) for a constant \(C<\infty \), where the constant may depend on the integrability exponent \(p<\infty \) and the multi-index \(\beta \), but is uniform in \(x,y\in \mathbb {R}^{1+d}\) and the mollification length scale \(t>0\) (see (1.34) for the definition of the scaled test function \(\psi _t\)).
1.2 The centered model
In this section we recall in a rather heuristic discussion the notion of centered model from [19], which should serve as a motivation for Definition 1.1. Most of it is a repetition of [19, Section 2.2] and [22, Section 5], however we prefer to provide a self contained introduction. The reader familiar with this setting may directly jump to Sect. 1.3. For more details and rigorous arguments we refer the reader to the aforementioned references or [11], and for an introduction and additional motivation we recommend to have a look at [22]. We provide a brief comparison to the model in [12] in Remark 1.8.
We follow a top-down renormalization ansatz, meaning that we replace \(\xi \) by a smooth version \(\xi _\tau \), and guided by symmetries we postulate a counterterm h in (1.1):
The aim is to control the solution manifold in the limit of vanishing smoothness \(\xi _\tau \rightarrow \xi \), by fine-tuning the counterterm h in a \(\tau \) dependent way that preserves as much symmetries of the original equation (1.1) as possible. We are interested in ensembles \(\xi \) (and \(\xi _\tau \)) that are stationary in space-time and reflection invariant in space, see Assumption 1.2 i) and ii), which suggests a counterterm of the form
for some deterministic functional c on the space of nonlinearities a. For such a modified equation, it is expected that for fixed a, the space of solutions u (up to constants, and satisfying the equation modulo space-time analytic functions) is parametrized by space-time analytic functions p satisfying \(p(x)=0\) for a distinguished point \(x\in \mathbb {R}^{1+d}\). The model components \(\Pi _{x\beta }\) are then introduced by the formal power series ansatz
where we sum over multi-indices \(\beta :\mathbb {N}_0 \, \dot{\cup } \, (\mathbb {N}_0^{1+d}{\setminus }\{\textbf{0}\})\rightarrow \mathbb {N}_0\). Introducing for \(k\ge 0\) and \(\textbf{n}\ne \textbf{0}\) the coordinates
on the space of nonlinearities a and space-time analytic functions p with \(p(0)=0\), and for multi-indices \(\beta \) as above the monomials
the previous power series ansatz can be more compactly written as
From this ansatz one can deduce for multi-indices \(\beta \ne 0\) satisfying \(\beta (k)=0\) for all \(k\ge 0\) that
where \(e_\textbf{n}\) denotes the multi-index that maps \(k\ge 0\) to 0 and \(\textbf{m}\) to \(\delta _\textbf{n}^\textbf{m}\). For all other multi-indices, one can together with the equation for u read off an equation for the coefficients \(\Pi _{x\beta }\)
which is an infinite hierarchy of linear PDEs as we shall see below. First, let us comment on the individual components of this PDE. By \(\Pi _x\) we denote the formal power series whose coefficients are the space-time functions \(\Pi _{x\beta }\), i.e. \(\Pi _x(y) = \sum _\beta \Pi _{x\beta }(y) \, \textsf{z}^\beta \in \mathbb {R}[[\textsf{z}_k,\textsf{z}_\textbf{n}]]\). Products have to be understood in the algebra \(\mathbb {R}[[\textsf{z}_k,\textsf{z}_\textbf{n}]]\), where 1 denotes the neutral element. Similar to \(\Pi _x\), the counterterm c can – at least formally – be expressed as \(c[a]=\sum _\beta c_\beta \, \textsf{z}^\beta [a]\); we thus write \(c=\sum _\beta c_\beta \, \textsf{z}^\beta \in \mathbb {R}[[\textsf{z}_k]]\). By \(D^{(\textbf{0})}\) we denote the derivation on \(\mathbb {R}[[\textsf{z}_k,\textsf{z}_\textbf{n}]]\) defined by
which is made such that for \(v\in \mathbb {R}\) and polynomial \(c\in \mathbb {R}[\textsf{z}_k]\) we have
see the more general (1.16). Together with \(a(v)=\sum _{k\ge 0} \textsf{z}_k[a] v^k\) which is a consequence of the definition of \(\textsf{z}_k\), this should serve as enough motivation for how (1.5b) arises from the equation for u. Let us point out that all seemingly infinite sums over \(k,l\ge 0\) in the previous expressions are in fact effectively finite sums, meaning that they are finite sums on the level of every \(\beta \)-component.
We turn to the notion of homogeneity, which arises from a formal scaling argument. Denoting by S the parabolic rescaling \(x\mapsto (s^2x_0,sx_1,\dots ,sx_d)\) for \(s>0\), we define \(\hat{u}=s^{-\alpha }u(S\cdot )\), \(\hat{a}=a(s^\alpha \cdot )\) and \(\hat{\xi }=s^{2-\alpha }\xi (S\cdot )\). It is then straightforward to check that if \((u,a,\xi )\) is a solution to (1.1), then \((\hat{u},\hat{a},\hat{\xi })\) is again a solution to (1.1). Postulating that the renormalized equation inherits this invariance, and rescaling p consistently with u by \(\hat{p}=s^{-\alpha }p(S\cdot )\), one can read off the power series ansatz (1.3), that
where the homogeneity \(|\beta |\) is given by
Here, one should think of \(1+\sum k\beta (k)-\sum \beta (\textbf{n})\) as the number of instances of \(\xi \) contained in \(\Pi _{x\beta }\) (in form of a multilinear expression), while \(\sum |\textbf{n}|\beta (\textbf{n})\) is the (parabolic) degree of polynomials contained in \(\Pi _{x\beta }\).
In view of this, it is natural to consider only components \(\Pi _{x\beta }\) that contain at least one instance of \(\xi \), together with the monomials from (1.4), which motivates the following definition. We call a multi-index \(\beta \) populated, if
where we call the latter purely polynomial, and \([\beta ]\) is defined by
We therefore consider \(\Pi _x\) as a \(\textsf{T}^*\)-valued map, where \(\textsf{T}^*\subset \mathbb {R}[[\textsf{z}_k,\textsf{z}_\textbf{n}]]\) is defined by
For later use we also define the purely polynomial part \(\bar{\textsf{T}}^*\) of \(\textsf{T}^*\) by
and its natural complement \({\tilde{\textsf{T}}}^*\) in \(\textsf{T}^*\) such that
Similar to \(\Pi _x\), we want to consider \(\Pi ^-_x\) as taking values in \(\textsf{T}^*\), where we observe the following: on the one hand \(\pi ,\pi '\in \textsf{T}^*\) does notFootnote 2 imply \(\textsf{z}_k\pi ^k\pi '\in \textsf{T}^*\), however if the product is contained in \(\textsf{T}^*\), then due to the presence of \(\textsf{z}_k\) it automatically belongs to \({\tilde{\textsf{T}}}^*\); on the other hand, one can check that \(\pi \in \textsf{T}^*\) and \(c\in \mathbb {R}[[\textsf{z}_k]]\subset {\tilde{\textsf{T}}}^*\) does imply \(\pi ^l (D^{(\textbf{0})})^l c\in {\tilde{\textsf{T}}}^*\). For that reason, we introduce the projection P from \(\mathbb {R}[[\textsf{z}_k,\textsf{z}_\textbf{n}]]\) to \({\tilde{\textsf{T}}}^*\) in the definition of \(\Pi ^-_x\) to obtain the \({\tilde{\textsf{T}}}^*\)-valued map
Let us also mention that similar to (1.7), the invariance of (1.1) under \(\hat{u}=u(\cdot +z)\), \(\hat{a}=a\) and \(\hat{\xi }=\xi (\cdot +z)\) for \(z\in \mathbb {R}^{1+d}\) propagates via (1.3) to \(\Pi _x\), yielding
Thinking of a noise \(\xi \) whose law is invariant under translation \(\xi \mapsto \xi (\cdot +z)\) and the rescaling \(\xi \mapsto s^{2-\alpha }\xi (S\cdot )\), we expect by (1.11) that the law of \(|y-x|^{-|\beta |} \Pi _{x\beta }(y)\) coincides with the law of \(|y-x|^{-|\beta |} \Pi _{0\beta }(y-x)\), which by (1.7) coincides with the law of \(\Pi _{0\beta }\big (\tfrac{y-x}{|y-x|}\big )\). Since \(\Pi _{x\beta }\) is expected to be a (Hölder-) continuous function and the range of \((y-x)/|y-x|\) is compact, this indicates that
We turn to the reexpansion maps \(\Gamma ^*\), where we will mainly follow [22]. Recall that the distinguished point x that we fixed for the power series ansatz led to \(\Pi _x\). Of course, we could as well have chosen any other point y, leading to \(\Pi _y\). It is then clear from
for some polynomials \(p_x,p_y\) vanishing at the origin, that there is an algebraic relation between \(\Pi _x\) and \(\Pi _y\). Indeed, we recall from [18, Proposition 5.1] that if we define \(\Gamma ^*_{xy}\) by
and imposing that \(\Gamma ^*_{xy}\) is an algebra endomorphismFootnote 3 of \(\mathbb {R}[[\textsf{z}_k,\textsf{z}_\textbf{n}]]\), then formally
for functionals \(\pi \) of pairs (a, p). Hence we see from subtracting the two equations in (1.13) and the ansatz (1.3), that for a suitable choice of \(\pi ^{(\textbf{n})}_{xy}\) we have
Note that \(\Gamma ^*_{xy}\) being an algebra endomorphism is compatible with (1.14), and because of that (1.14) may be equivalently expressed as
We want to point out that it is not always possible to extend a linear map from the coordinates \(\textsf{z}_k,\textsf{z}_\textbf{n}\) in a multiplicative way to the space of formal power series \(\mathbb {R}[[\textsf{z}_k,\textsf{z}_\textbf{n}]]\) – as opposed to extending from \(\textsf{z}_k,\textsf{z}_\textbf{n}\) to the space of polynomials \(\mathbb {R}[\textsf{z}_k,\textsf{z}_\textbf{n}]\). That this is possible in our specific setting relies on the restriction of \(\pi ^{(\textbf{n})}_{xy}\) to
which is in line with the required recentering, and which we shall assume; a proof of this fact can be found in [22, Lemma 3]. It is also worth mentioning that \(\Gamma ^*_{xy}\) preserves \(\textsf{T}^*\) and \({\tilde{\textsf{T}}}^*\) in the sense of
which is a consequence of \(\Pi _x(y), \pi ^{(\textbf{n})}_{xy}\in \textsf{T}^*\), see [19, (2.59)].
Let us also mention that the recentering (1.17) of \(\Pi _x\) is inherited by \(\Pi ^-_x\) through (1.10). Due to the presence of the projection P in \(\Pi ^-_x\), it takes the form of
cf. [19, (2.63)]. Here, \(\textrm{id}-P\) takes the role of projecting to \(\bar{\textsf{T}}^*\), i.e. \((\textrm{id}-P)\Pi _y = \sum _{\textbf{n}\ne \textbf{0}} \textsf{z}_\textbf{n}(\cdot -y)^\textbf{n}\), which can be used to seeFootnote 4 that the \(\beta \)-component of the second right-hand side term of (1.21) is a polynomial of degree \(\le |\beta |-2\), i.e.
Furthermore, the scaling and translation invariances (1.7) and (1.11) together with the recentering (1.17) suggest for scaling and translation invariant noise that the law of \((\Gamma ^*_{xy})_\beta ^\gamma \) coincides with the one of \((\Gamma ^*_{x+z \, y+z})_\beta ^\gamma \), and the law of \(s^{|\beta |-|\gamma |} (\Gamma ^*_{xy})_\beta ^\gamma \) coincides with the one of \((\Gamma ^*_{SxSy})_\beta ^\gamma \). Here, \((\Gamma ^*)_\beta ^\gamma \) denote the matrix coefficients of \(\Gamma ^*\) w.r.t. the monomials \(\{\textsf{z}^\beta \}_\beta \) in the sense of
Since \(\Gamma ^*_{SxSy}\) converges to \(\Gamma ^*_{00}\) as \(s\rightarrow 0\), and we expect \(\Gamma ^*_{00}\) to be the identity, this indicates that \(\Gamma ^*_{xy}\) is strictly triangular in the sense of
By a similar argument that led to the estimate (1.12) for \(\Pi \), we expect
1.3 Seeking for a stable characterization
At this point we want to recapitulate, and clarify what of the above can potentially survive the limit of vanishing smoothness \(\xi _\tau \rightarrow \xi \). For smooth noise \(\xi _\tau \) we obtained \(\Pi _x\), of which the purely polynomial components are given by (1.4), and the remaining populated and not purely polynomial components are determined.Footnote 5 by (1.5). Moreover, we have introduced the reexpansion maps \(\Gamma ^*_{xy}\) that recenter \(\Pi \) and \(\Pi ^-\) according to (1.17) and (1.21).
Clearly, (1.4), (1.5a) and (1.17) do make sense for rough noise \(\xi \) as well, where in line with Assumption 1.2 iii) we think of \(\xi \) as having \((\alpha -2)\)-Hölder continuous realizations for \(\alpha \in (0,1)\), and expect correspondingly \(\Pi _{x\beta }\) and \(\Pi ^-_{x\beta }\) to be \(\alpha \) respectively \((\alpha -2)\)-Hölder continuous. Also (1.21) does not cause any problems in the rough setting: although \(\Pi ^-_{x\beta }\) is expected to be a distribution in this case, the first right-hand side term \(\Gamma ^*_{xy}\Pi ^-_y\) of (1.21) is component wise a well-defined linear combination of distributions, while the second right-hand side term of (1.21) contains only products of smooth objects due to the presence of the projection \(\textrm{id}-P\). However, even if \(\Pi ^-_{x\beta }\) can be given a sense for rough noise, there is no way that its individual constituents in (1.5b) survive a limiting procedure, except for \(\beta =0\) where
Nevertheless, we claim that the transformation behaviour (1.21) of \(\Pi ^-_x\) is enough to provide a stable characterization, at least for multi-indices \(\beta \) with \(|\beta |>2\). Indeed, observe that (1.12) implies
where \(\varphi _y^\lambda \) is a Schwartz function, recentered at \(y\in \mathbb {R}^{1+d}\) and rescaled by \(\lambda >0\). We will thus informally write \(\Pi ^-_{y\beta }(y) = 0\) for \(|\beta |>2\). With this, we observe that \((\Gamma ^*_{xy}\Pi ^-_y)_\beta (y) = ((\Gamma ^*_{xy}-\textrm{id})\Pi ^-_y)_\beta (y)\), where by the triangularity (1.23) the right-hand side involves only \(\Pi ^-_{y\gamma }\) for \(|\gamma |<|\beta |\). Provided also the second right-hand side term of (1.21) involves only components \(\Pi _{\gamma }\) for \(\gamma \) “smaller” than \(\beta \), we see that (1.21) determines \(\Pi _{x\beta }(y)\) in a recursive way. This argument will be made rigorous in Step 2 of the proof of Proposition 2.1.
For multi-indices \(\beta \) with \(|\beta |<2\), the above argument fails since one cannot pass from \(\Gamma ^*_{xy}\) to \(\Gamma ^*_{xy}-\textrm{id}\) and benefit from strict triangularity anymore. This is where we will profit from the Malliavin derivative as we shall argue now. We will introduce the Malliavin derivative rigorously in Sect. 1.4, for the moment we think of \(\delta \) as being the operation of taking the derivative of an object like \(\Pi _{x\beta }(y)\) as a functional of \(\xi \), in the direction of an infinitesimal perturbation \(\delta \xi \). Applying \(\delta \) to \(\Pi ^-_{x\beta }(y)=(\Gamma ^*_{xy}\Pi ^-_y)_\beta (y)\), which is a consequence of (1.22), we obtain by Leibniz’ rule
The first right-hand side term involves only \(\Pi ^-_{y\gamma }\) for \(|\gamma |<|\beta |\), since the triangularity (1.23) is inherited by \(\delta \Gamma ^*_{xy}\), and therefore allows for a recursive argument. To investigate the second right-hand side term, we first note that \(\Pi _y(y)=0\) as a consequence of the ansatz (1.3), which together with (1.5b) yields
where we have used that c is deterministic and hence \(\delta c=0\), and we have dropped the projection P since we will use this identity only for \(\gamma \)-components with \(|\gamma |<2\) where P is inactive anyway. Applying \(\Gamma ^*_{xy}\) to this identity, we obtain since \(\Gamma ^*_{xy}\) is multiplicative, satisfies \(\Gamma ^*_{xy}\textsf{z}_0=\sum _k \textsf{z}_k \Pi _x^k(y)\) as a consequence of (1.14), and from (1.17) which implies by Leibniz’ rule \(\delta \Pi _x = \delta \Gamma ^*_{xy}\Pi _y+\Gamma ^*_{xy}\delta \Pi _y+\delta \Pi _x(y)\), that
This right-hand side has again the potential for a recursive argument, however it contains the problematic product of \(\Pi _x^k\) and \(\Delta (\delta \Pi _x-\delta \Pi _x(y)-\delta \Gamma ^*_{xy}\Pi _y)\). Here we note that applying \(\delta \) to a multilinear expression like \(\Pi _x\) amounts to replacing one instance of \(\xi \) by \(\delta \xi \). Although \(\delta \xi \) gains D/2 orders of regularity compared to \(\xi \), see (1.31) and (1.32), \(\delta \Pi \) does not benefit from this gain of regularity since there might be other instances of \(\xi \) left. However, one can expect that \(\delta \Pi \) is modelled to order \(\alpha +D/2\) in terms of \(\Pi \). For the above product to be given a sense in the rough setting, we would then need that \(\alpha + (\alpha +D/2-2)>0\), which is the reason for the restriction \(\alpha >1-D/4\) in Assumption 1.2. Unfortunately, \(\delta \Gamma ^*\) is not the right object to express this gain in modelledness by D/2: since \(\alpha +D/2>1\), every \(\beta \)-component of \(\delta \Gamma ^*_{xy}\Pi _y\) would have to contain the first order monomials \(\Pi _{y e_\textbf{n}}=(\cdot -y)^\textbf{n}\) for \(|\textbf{n}|=1\), which contradicts the triangularity of \(\delta \Gamma ^*_{xy}\) with respect to the homogeneity \(|\cdot |\) for multi-indices \(|\beta |<1\).
We therefore aim to replace \(\delta \Gamma ^*_{xy}\) by a map that allows for this gain of modelledness, while keeping the necessary algebraic properties of \(\delta \Gamma ^*_{xy}\) that led to
The latter can be achieved for any \(\textrm{d}\Gamma ^*_{xy}\in \textrm{End}(\mathbb {R}[[\textsf{z}_k,\textsf{z}_\textbf{n}]])\) that coincides with \(\delta \Gamma ^*_{xy}\) on \(\mathbb {R}[[\textsf{z}_k]]\), and that satisfies for \(\pi ,\pi '\in \mathbb {R}[[\textsf{z}_k,\textsf{z}_\textbf{n}]]\)
Indeed, applying \(\textrm{d}\Gamma ^*_{xy}\) to \(\Pi ^-_y(y) = \textsf{z}_0\Delta \Pi _y(y) +\xi _\tau (y) 1 - c\), we obtain
By (1.14), to which we apply \(\delta \), and by (1.17) which implies \(\Delta \Gamma ^*_{xy}\Pi _y=\Delta \Pi _x\), the first right-hand side term coincides with \(\sum _k \textsf{z}_k \delta (\Pi _x^k(y)) \Delta \Pi _x(y)\). Since \(c\in \mathbb {R}[[\textsf{z}_k]]\subset {\tilde{\textsf{T}}}^*\), the same argumentation leads to \(\textrm{d}\Gamma ^*_{xy} c = \sum _{l} \tfrac{1}{l!} \delta (\Pi _x^l(y)) (D^{(\textbf{0})})^l c\). Again by (1.14), we see that the second right-hand side term coincides with \(\sum _{k} \textsf{z}_k \Pi _x^k(y) \Delta \textrm{d}\Gamma ^*_{xy}\Pi _y(y)\), which altogether yields the desired
which coincides with [19, (4.50)].
By (1.14), we also see that \(\delta \Gamma ^*_{xy} = \delta \Pi _x(y)\Gamma ^*_{xy}D^{(\textbf{0})}\) on \(\mathbb {R}[[\textsf{z}_k]]\), hence such a map \(\textrm{d}\Gamma ^*_{xy}\) has to coincide with \(\delta \Pi _x(y)\Gamma ^*_{xy}D^{(\textbf{0})} + \sum _{\textbf{n}\ne \textbf{0}} \textrm{d}\pi ^{(\textbf{n})}_{xy} \Gamma ^*_{xy} \partial _{\textsf{z}_\textbf{n}}\) for some \(\textrm{d}\pi ^{(\textbf{n})}_{xy}\in \mathbb {R}[[\textsf{z}_k,\textsf{z}_\textbf{n}]]\). To ease the notation we shall write \(\sum _\textbf{n}\textrm{d}\pi ^{(\textbf{n})}_{xy} \Gamma ^*_{xy} D^{(\textbf{n})}\) with \(\textrm{d}\pi ^{(\textbf{0})}_{xy}=\delta \Pi _x(y)\) and
We will only ever apply such a map \(\textrm{d}\Gamma ^*_{xy}\) to elements in \(\textsf{T}^*\) that are populated on multi-indices \(|\beta |<2\), hence we can drop the summands with \(|\textbf{n}|\ge 2\). To obtain equally nice mapping properties as for \(\Gamma ^*_{xy}\) in (1.20), we will restrict \(\textrm{d}\pi ^{(\textbf{n})}_{xy}\) for \(\textbf{n}\ne \textbf{0}\) from \(\mathbb {R}[[\textsf{z}_k,\textsf{z}_\textbf{n}]]\) to \({\tilde{\textsf{T}}}^*\), which yields together with \(D^{(\textbf{n})}\textsf{T}^*\subset {\tilde{\textsf{T}}}^*\) and (1.20)
Summarizing this discussion, (1.25) determines \(\delta \Pi ^-_x(y)\), and hence \(\Pi ^-_x(y)\) up to its expectation by an application of the spectral gap inequality (1.42), provided we can choose \(\textrm{d}\pi ^{(\textbf{n})}_{xy}\in {\tilde{\textsf{T}}}^*\) for \(|\textbf{n}|=1\) such that
where
Moreover, we have to come up with an ordering \(\prec \) on multi-indices with respect to which \(\textrm{d}\Gamma ^*_{xy}\) is strictly triangular, which is necessarily different from the homogeneity \(|\cdot |\) as we saw above. We want to emphasize that (1.25) and (1.28) have the potential to make sense even in the rough setting: neither singular products nor diverging constants appear. The remaining difficulty to overcome is to find a suitable probabilistic and weak formulation that avoids point evaluations of distributions. This will be achieved in Definition 1.1 in (1.40) and (1.41).
Finally, let us point out the following geometric interpretation recently given in [4] (which is set up for a semilinear equation with additive noise, however the following applies to the quasilinear case as well). There, for fixed nonlinearity a, the map \(p\mapsto \Pi _x[a,p]\) is informally interpreted as an inverse chart on the solution manifold of (1.1), and \(\Gamma _{yx}\) as a transition function between two inverse charts \(\Pi _x\) and \(\Pi _y\). Furthermore, \(\textrm{d}\Gamma ^*_{xy}\Pi _y\) maps the parameter-space p to the tangent space of the solution manifold of (1.1). Hence (1.28) expresses that \(\delta \Pi _x\) is approximately a parameterization of this tangent space. Informally, our main result thus characterizes the inverse charts \(\Pi _x\) via the transition maps \(\Gamma _{yx}\) and via the tangent space through (1.25) and (1.28). This is stable for rough noise as opposed to the description of the inverse charts directly via (1.5).
1.4 Assumption and main results
In this section we make the heuristic discussion from the previous Sects. 1.2 and 1.3 precise. We start by introducing the Malliavin derivative. Consider cylindrical functionals of the form
for some \(f\in C^\infty (\mathbb {R}^{1+d})\) and Schwartz functions \(\zeta _1,\dots ,\zeta _N\in \mathcal {S}(\mathbb {R}^{1+d})\), where \((\cdot ,\cdot )\) denotes the pairing between a tempered distribution and a Schwartz function. For such cylindrical functionals we may define
and for suitable \(\delta \xi :\mathbb {R}^{1+d}\rightarrow \mathbb {R}\)
we shall typically suppress the dependence on \(\xi \) and \(\delta \xi \) in the notation and simply write \(\delta F\). We denote by \(\Vert \cdot \Vert _*\) the Sobolev norm \(\Vert \cdot \Vert _{\dot{H}^{2-\alpha -D/2}}\) defined in (1.2), and the Malliavin-Sobolev space \(\mathbb {H}\) is given by the completion of cylindrical functionals with respect to the stochastic norm
By \(C(\mathbb {H})\) we denote the linear space of continuous functions from \(\mathbb {R}^{1+d}\) to \(\mathbb {H}\), endowed with the topology given by the family of semi-norms \(\sup _{y\in K} \Vert F(y)\Vert _{\mathbb {H}}\) for \(K\subset \mathbb {R}^{1+d}\) compact, that grow at most polynomially at infinity, i.e. there exists \(n\in \mathbb {N}\) such that
Next, we introduce the semigroup convolution, which we shall use as a convenient tool for weak formulations. We define the Schwartz function \(\psi _t\in \mathcal {S}(\mathbb {R}^{1+d})\) through
We will then typically write \(f_t\) as shorthand for \(f*\psi _t\). Note the semigroup property
and the scaling identity
that we shall use frequently. From this scaling identity, since \(\psi _{t=1}\) is a Schwartz function, we derive the moment bound
for all \(x,y\in \mathbb {R}^{1+d}\), \(t>0\), \(\theta >-D\) and \(\textbf{n}\).
Equipped with this, we have all the tools to formulate Definition 1.1, which makes the heuristic discussion from Sect. 1.3 precise.
Definition 1.1
Given an ensemble \(\mathbb {E}\) on the space of tempered distributions \(\mathcal {S}'(\mathbb {R}^{1+d})\), on which we denote realizations by \(\xi \), we call the family of random fields \((\Pi ,\Gamma ^*)=\{\Pi _{x\beta }(y),(\Gamma ^*_{xy})_\beta ^\gamma \}_{x,y,\beta ,\gamma }\) a model for (1.1) if the following holds almost surely:
-
The realizations \(\xi \) are related to the model by
$$\begin{aligned} (\partial _0-\Delta )\Pi _{x\,\beta =0}=\xi -\mathbb {E}\xi \, . \end{aligned}$$(1.38) -
\(\Pi _x\) takes values in \(\textsf{T}^*\), the purely polynomial part of \(\Pi \) is given by (1.4), its expectation is shift invariant in the sense of
$$\begin{aligned} \mathbb {E}\Pi _{x\beta }(y)=\mathbb {E}\Pi _{x+z\,\beta }(y+z) \quad \text {for all }x,y,z\in \mathbb {R}^{1+d} \, , \end{aligned}$$(1.39)and it satisfies for all \(p<\infty \) and all multi-indices \(\beta \) the stochastic estimate (1.12).
-
\(\Gamma ^*_{xy}\) is an algebra endomorphism of \(\mathbb {R}[[\textsf{z}_k,\textsf{z}_\textbf{n}]]\) which satisfies (1.15) and (1.18), the former for some \(\pi ^{(\textbf{n})}_{xy}\in \textsf{T}^*\) subject to (1.19). Furthermore, \(\Gamma ^*\) is related to \(\Pi \) and \(\Pi ^-:=(\partial _0-\Delta )P\Pi \) via the recentering (1.17) and (1.21), and it satisfies for all \(p<\infty \) and all populated \(\beta ,\gamma \) the stochastic estimate (1.24).
-
For singular multi-indices \(|\beta |<2\) we have \(\Pi _{x\beta }\in C(\mathbb {H})\). Moreover, for \(\delta \xi \in \mathcal {S}\), \(|\textbf{n}|=1\) and \(x\ne y\) there exist random \(\textrm{d}\pi ^{(\textbf{n})}_{xy}\in Q{\tilde{\textsf{T}}}^*\), where Q denotes the projection to multi-indices satisfying \(|\beta |<2\), such that \(\textrm{d}\Gamma ^*_{xy}\) defined by (1.29) with \(\textrm{d}\pi ^{(\textbf{0})}_{xy}=Q\delta \Pi _x(y)\) satisfies
$$\begin{aligned} \lim _{z\rightarrow y}|y-z|^{-1} \, \mathbb {E} \big |\big (\delta \Pi _x-\delta \Pi _x(y) -\textrm{d}\Gamma ^*_{xy} Q \Pi _y\big )_\beta (z) \big | = 0 \, , \end{aligned}$$(1.40)and
$$\begin{aligned} \lim _{t\rightarrow 0} \mathbb {E}\big | \big (\delta \Pi ^-_x\hspace{-.3ex}-\hspace{-.3ex}\textrm{d}\Gamma ^*_{xy} Q \Pi ^-_{y} \hspace{-.3ex}-\hspace{-.3ex}\sum _{k\ge 0}\textsf{z}_k\Pi _x^k(y) \Delta (\delta \Pi _x\hspace{-.3ex}-\hspace{-.3ex}\textrm{d}\Gamma ^*_{xy} Q \Pi _y) \hspace{-.3ex}-\hspace{-.3ex}\delta \xi \textsf{1}\big )_{\beta t} (y) \big | =0 \, . \end{aligned}$$(1.41)Note that taking the Malliavin derivative of \(\Pi ^-_x\) and \(\Delta \Pi _x\) is well-defined, since \(\Pi _{x\beta }\in C(\mathbb {H})\) implies \(\Pi ^-_{x\beta \,t},\, \Delta \Pi _{x\beta \,t} \in C(\mathbb {H})\), see [19, Lemma A.2].
To guarantee the existence of such a model, we make in line with [19] the following assumption on the underlying noise ensemble.
Assumption 1.2
The ensemble \(\mathbb {E}\) is a probability measure on the space of tempered distributions \(\mathcal {S}'(\mathbb {R}^{1+d})\), that w.l.o.g. is centered, and
-
i)
is invariant under space-time shifts \(x\mapsto x+z\) for \(z\in \mathbb {R}^{1+d}\),
-
ii)
is invariant under spatial reflections \(x\mapsto (x_0,x_1,\dots ,x_{i-1},-x_i,x_{i+1},\dots ,x_d)\) for \(i=1,\dots ,d\),
-
iii)
satisfies a spectral gap inequality: for \(\alpha \in (\max \{0,1-D/4\},1)\setminus {{\mathbb {Q}}}\) there exists \(C>0\) such that for all integrable cylindrical functionals F holds
$$\begin{aligned} \mathbb {E}(F-\mathbb {E}F)^2\le C\,\mathbb {E}\big \Vert \tfrac{\partial F}{\partial \xi }\big \Vert _*^2\, . \end{aligned}$$(1.42)In addition, we assume that the operator (1.30) is closable w.r.t. the topologies of \(\mathbb {E}^\frac{1}{2}|\cdot |^2\) and \(\mathbb {E}^\frac{1}{2}\Vert \cdot \Vert _*^2\).
Let us briefly comment on the restriction of \(\alpha \) in the spectral gap assumption. Note that the choice of \(\Vert \cdot \Vert _*\) is made such that realizations \(\xi \) will have annealed Hölder regularity \(\alpha -2\). Hence the restriction \(\alpha >0\) is dictated by subcriticality. The restriction \(\alpha >1-D/4\) is necessary in reconstruction, see (3.4), and is only present in a single spatial dimension \(d=1\). Equation (1.1) is not singular in the regime \(\alpha >1\), thus no renormalization is required and simpler tools can be applied; see [17] where the framework of the current work is applied to this simpler regime. Finally, we consider \(\alpha \not \in {{\mathbb {Q}}}\) to avoid integer exponents in Liouville-type arguments, which is crucial for our uniqueness result. This last restriction is much less dramatic when one is interested in model components \(\Pi _\beta \) up to a certain threshold, e.g. in the regime \(\alpha \in (1/4,1/2)\) all statements remain true for all model components \(\Pi _\beta \) with \(|\beta |<2\), provided we avoid that \(\alpha ,2\alpha ,\dots ,7\alpha \) are integers.
Under Assumption 1.2, the main result is as follows.
Theorem 1.3
(Existence and Uniqueness) Assume that the ensemble \(\mathbb {E}\) satisfies Assumption 1.2. Then there exists a unique model \((\Pi ,\Gamma ^*)\) for (1.1).
We prove the uniqueness part of Theorem 1.3 in Sect. 2, and the existence part in Sect. 3. We use this existence and uniqueness for the following convergence result, the proof of which is given in Sect. 4.
Theorem 1.4
(Convergence) Assume that a sequence of ensembles \((\mathbb {E}_n)_{n\in \mathbb {N}}\) satisfies Assumption 1.2, uniformly in n, and that \(\mathbb {E}_n\) converges weakly to an ensemble \(\mathbb {E}\) as \(n\rightarrow \infty \).
Then \(\mathbb {E}\) satisfies Assumption 1.2, and the associated models \((\Pi ^{(n)},\Gamma ^{*(n)})\) converge componentwise in law with respect to the distance induced by
to the model \((\Pi ,\Gamma ^*)\) associated to \(\mathbb {E}\), for any \(\kappa >0\).
Furthermore, the convergence holds in probability (resp. in \(L^p\) for all \(p<\infty \)), provided the underlying sequence of random variables \((\xi _n)_{n\in \mathbb {N}}\) converges in probability (resp. in \(L^p\) for all \(p<\infty \)) to a random variable \(\xi \), and the corresponding laws satisfy Assumption 1.2 uniformly in n.
As a simple application of the uniqueness of models, we obtain the following covariance properties of the model.
Corollary 1.5
Assume that \((\Pi ,\Gamma ^*)\) is a model for (1.1). If the ensemble \(\mathbb {E}\) is invariant under the transformation T, where T denotes either the space time shift \(x\mapsto x+z\) for some \(z\in \mathbb {R}^{1+d}\), or the spatial orthogonal transformation \(x\mapsto (x_0,\bar{O}(x_1,\dots ,x_d))\) for some orthogonal \(\bar{O}\in \mathbb {R}^{d\times d}\), then almost surely
Furthermore, if the ensemble \(\mathbb {E}\) is invariant under \(\xi \mapsto s^{2-\alpha }\xi (S\cdot )\), where S denotes the anisotropic rescaling \(x\mapsto (s^2x_0,sx_1,\dots ,sx_d)\) for \(s>0\), then almost surely
Let us emphasize that Corollary 1.5 makes a couple of arguments from the heuristic discussion in Sect. 1.2 rigorous, in particular (1.7) and (1.11). Moreover, it allows to appeal to invariances of the noise that are not preserved under approximation, where we think in particular of scaling invariance. We also want to point out that the listed transformations should rather be seen as exemplary, and many other transformations lead to analogous covariant transformations of the model.
Proof of Corollary 1.5
We will only give the argument for the scaling covariance. The statements for shift- and reflection covariance and for covariance under orthogonal transformations follow similarly. Let \((\Pi ,\Gamma ^*)\) be the model associated to \(\mathbb {E}\), and define \({\widetilde{\Pi }}_{x\beta }(y):=s^{-|\beta |}\Pi _{Sx\,\beta }(Sy)\) and \(({\widetilde{\Gamma }}^*_{xy})_\beta ^\gamma = s^{-|\beta |+|\gamma |}(\Gamma ^*_{Sx\,Sy})_\beta ^\gamma \). By the uniqueness part of Theorem 1.3, it is enough to show that \(({\widetilde{\Pi }},{\widetilde{\Gamma }}^*)\) satisfies Definition 1.1 with \({\widetilde{\xi }}\) given by \(s^{2-\alpha }\xi (S\cdot )\), where we note that the corresponding \({\widetilde{\mathbb {E}}}\) coincides with \(\mathbb {E}\) by assumption. The first item (1.38) follows from
For purely polynomial multi-indices we observe \({\widetilde{\Pi }}_{xe_\textbf{n}}(y) = s^{-|\textbf{n}|} (Sy-Sx)^\textbf{n}= (y-x)^\textbf{n}\), and the shift invariance (1.39) of the expectation of \({\widetilde{\Pi }}\) follows immediately from the one of \(\Pi \). For the estimate (1.12) we note that
Clearly, \({\widetilde{\Gamma }}^*_{xy}\) is still an algebra endomorphism, and (1.15) and (1.18) follow from the corresponding properties of \(\Gamma ^*_{xy}\). The transformation (1.17) of \({\widetilde{\Pi }}\) follows by
from the corresponding transformation of \(\Pi \). Similarly, one can see the transformation (1.21) of \({\widetilde{\Pi }}^-\) and the stochastic estimate (1.24). Malliavin differentiability of \({\widetilde{\Pi }}_{x\beta }(y)\) is inherited from \(\Pi \), and we define \(\textrm{d}{\widetilde{\Gamma }}^*_{xy}:=s^{-|\beta |+|\gamma |}\textrm{d}\Gamma ^*_{Sx\,Sy}\). With this, we obtain
which for fixed \(s>0\) converges to 0 as \(z\rightarrow y\). For the last item (1.41), it is enough to note that for any distribution f we have by duality and the scaling property (1.36) that \((f(S\cdot ))_t(y) = f_{st}(Sy)\). \(\square \)
Remark 1.6
(Other equations) We note that in Definition 1.1, only the recentering (1.21) of \(\Pi ^-\), the order of vanishing in (1.40), and (1.41) are equation specific. Working for instance with a multiplicative stochastic heat equation,
we derive from the same principles as in Sect. 1.2 a counterterm of the form \(c[a(\cdot +u)]\), and the corresponding \(\Pi _x^-\) is given by
Accordingly, (1.21) and (1.41) have to be replaced by the simpler
and
and (1.40) stays the same. For a version of the stochastic thin-film equation,
where the noise \(\xi \) takes now values in \(\mathbb {R}^d\), is stationary, invariant under \(\xi \mapsto \bar{O}^T\xi (O\cdot )\) with \(Ox=(x_0,\bar{O}(x_1,\dots ,x_d))\) for orthogonal \(\bar{O}\in \mathbb {R}^{d\times d}\), and a, b are scalar-valued nonlinearities, we derive from similar principles a counterterm of the form \(\nabla \cdot (c[a(\cdot +u), b(\cdot +u)] \nabla u)\) for a scalar valued functional c, and \(\Pi ^-_x\) takes the form of
Here, \(\textsf{z}_k^{(a)}\) and \(\textsf{z}_\ell ^{(b)}\) denote the coordinate functionals of the nonlinearities a and b, respectively, and \(D^{(\textbf{0})}\) is now given by \(\sum _{k\ge 0}(k+1)\textsf{z}_{k+1}^{(a)}\partial _{\textsf{z}_k^{(a)}} + \sum _{\ell \ge 0}(\ell +1)\textsf{z}_{\ell +1}^{(b)}\partial _{\textsf{z}_\ell ^{(b)}}\). In this case, (1.21) and (1.41) are in line with [11] given by
and
and (1.40) is replaced by the higher order vanishing
The following lemma collects several consequences of Definition 1.1 that will be useful later on.
Lemma 1.7
Assume that \((\Pi ,\Gamma ^*)\) is a model for (1.1). Then
-
i)
the expectation of \(\Pi ^-\) is shift invariant in the sense of
$$\begin{aligned} \mathbb {E}\Pi ^-_{x\beta t}(y)=\mathbb {E}\Pi ^-_{x+z \, \beta t}(y+z) \quad \text {for all }x,y,z\in \mathbb {R}^{1+d}, \ t>0 \, , \end{aligned}$$(1.44)and \(\Pi ^-\) satisfies for all multi-indices \(\beta \) and \(p<\infty \)
$$\begin{aligned} \mathbb {E}^\frac{1}{p}|\Pi ^-_{x\beta t}(y)|^p \lesssim (\root 4 \of {t})^{\alpha -2}(\root 4 \of {t}+|x-y|)^{|\beta |-\alpha } \, . \end{aligned}$$(1.45) -
ii)
\(\Gamma ^*\) is triangular with respect to the homogeneity in the sense of (1.23). Furthermore, \(\Gamma ^*\) is transitive
$$\begin{aligned} \Gamma ^*_{xy}\Gamma ^*_{yz}=\Gamma ^*_{xz} \, , \end{aligned}$$(1.46)and in addition to the mapping property (1.20), \(\Gamma ^*_{xy}\) respects \(\bar{\textsf{T}}^*\) in the following sense,
$$\begin{aligned} (\Gamma ^*_{xy})_{e_\textbf{n}}^\gamma \ne 0 \quad \implies \quad \gamma = e_\textbf{m}\text { for some }\textbf{m}\ne \textbf{0} \, . \end{aligned}$$(1.47) -
iii)
\(\Pi \) and \(\Gamma ^*\) are (annealed) Hölder continuous, more precisely, for every \(\beta \) there exists \(\varepsilon >0\) such that for all \(x',x'',y',y''\in \mathbb {R}^{1+d}\), \(p<\infty \) and \(\gamma \)
$$\begin{aligned} \mathbb {E}^\frac{1}{p} |\Pi _{x'\beta }(y')-\Pi _{x''\beta }(y'')|^p&\lesssim |x'-x''|^\varepsilon (|x'-x''|+|x'-y'|)^{|\beta |-\varepsilon }\\&\quad + |y'-y''|^\alpha (|y'-y''|+|x''-y''|)^{|\beta |-\alpha } \, , \\ \mathbb {E}^\frac{1}{p} |(\Gamma ^*_{x'y'}-\Gamma ^*_{x''y''})_\beta ^\gamma |^p&\lesssim |x'-x''|^\varepsilon (|x'-x''|+|x'-y'|)^{|\beta |-|\gamma |-\varepsilon }\\&\quad + |y'-y''|^\varepsilon (|y'-y''|+|x''-y''|)^{|\beta |-|\gamma |-\varepsilon } \, , \end{aligned}$$with the understanding that all exponents are non-negative.
Proof
The stationarity (1.44) is an immediate consequence of \(\Pi ^-_{x\beta } = (\partial _0-\Delta )(P\Pi _x)_\beta \) and the corresponding shift invariance (1.39) of \(\mathbb {E}\Pi _{x\beta }(y)\). The estimate on \(\Pi ^-\) may be seen as follows. Since \((\partial _0-\Delta )\psi _t\) integrates to 0 on \(\mathbb {R}^{1+d}\), we obtain from the recentering (1.17) that
Applying \(\mathbb {E}^\frac{1}{p}|\cdot |^p\), the triangle inequality, (1.12) and (1.24), this yields
where the sum is over finitely many \(\gamma \) satisfying \(|\gamma |\le |\beta |\). By the moment bound (1.37), this is estimated by
which by \(|\gamma |\ge \alpha \) is further bounded by the right-hand side of (1.45).
We turn to ii). For a proof of (1.23) see [22, Lemma 3]. The transitivity is proven in [19, Proposition 5.4], at least on \(\textsf{T}^*\); it immediately generalizes to \(\mathbb {R}[[\textsf{z}_k,\textsf{z}_\textbf{n}]]\) by multiplicativity. To see (1.47), we first note that as a consequence of (1.15), (1.18) and (1.19), it is satisfied for multi-indices \(\gamma \) of length one, i.e. \(\gamma =e_k,e_\textbf{m}\). The general case follows by multiplicativity from \((\Gamma ^*_{xy})^{e_k}_0 = 0 = (\Gamma ^*_{xy})^{e_\textbf{m}}_0\), which is again a consequence of (1.15), (1.18) and (1.19).
We turn to the Hölder continuity iii). Using the recentering (1.17) we obtain
which by Hölder’s inequality together with the estimates (1.12) of \(\Pi \) and (1.24) of \(\Gamma ^*\) and the triangularity (1.23) yields
where we note that the sums over \(\gamma \) are finite. In the first sum we use this finiteness and \(|\gamma |<|\beta |\), to choose \(\varepsilon >0\) small enough such that \(|\beta |-|\gamma |\ge \varepsilon \). In the second sum we use that the homogeneity is bounded below by \(\alpha \), which implies the desired continuity for \(\Pi \). The argument for \(\Gamma ^*\) is similar, based on
which is a consequence of the transitivity (1.46). \(\square \)
Remark 1.8
(Comparison to [12]) Let us briefly compare our definition of model to Hairer’s [12]. Here, the model is a random object, which is referred to as random model in Hairer’s language. The “anchoring” (1.38) and (1.4) is identical in both settings, as well as the estimate (1.12) of \(\Pi \) apart from the fact that here it is formulated in a probabilistic as opposed to a deterministic way. The shift invariance (1.39) of \(\mathbb {E}\Pi \) is assumed to obtain the corresponding invariance (1.44) of \(\mathbb {E}\Pi ^-\). By the BPHZ-choice of renormalization, see (4.19), this expresses the fact that c has no space-time dependence. The recentering (1.17) and (1.21) and the estimate (1.24) of \(\Gamma ^*\) coincide (again up to probabilistic/deterministic formulations). The properties (1.15), (1.18) and multiplicativity of \(\Gamma ^*\) are mainly made such that Lemma 1.7 ii) holds, which is what we will use in the sequel, and all of which is also present in [12] (note that (1.47) corresponds to \(\Gamma \bar{\textsf{T}}\subset \bar{\textsf{T}}\)). Furthermore, (1.15), (1.18) and multiplicativity imply that \((\Gamma ^*_{xy})_\beta ^\gamma \) for \(\gamma \) not purely polynomial is determined by \(\Pi _\gamma \) for \(\gamma \) “smaller” than \(\beta \), see Step 1 in the proof of Proposition 2.1 for a precise statement. Similar results are also available in [12]. The main difference is the last item in Definition 1.1, which has no analogue in [12]. It is however reasonable that it can be related to the pointed modelled distribution \(H_{\tau }^x\) introduced in [13]. For a more concise comparison of the two settings we refer to [18, Section 5.3] for algebraic aspects and to [19, Section 2.6] for analytic and probabilistic aspects.
Remark 1.9
(Comparison to [21]) Note that the model in [21] rather corresponds to the equation
with \(a_0=a(0)\), which is equivalent to (1.1) modulo the substitution \({a\mapsto a+1}\). Adopting this perspective affects the model in two ways: On the one hand, the Ansatz (1.3) features only multi-indices \(\beta \) with \(\beta (k=0)=0\), hence \(\beta \) is a multi-index over the smaller set \(k\ge 1\) and \(\textbf{n}\ne \textbf{0}\). On the other hand, the model \((\Pi ,\Gamma ^*)\) inherits a dependence on \(a_0\) through the differential operator \((\partial _0-a_0\Delta )\). As shown in [19, Proposition 2.7], the stochastic estimates (1.12) of \(\Pi \) and (1.24) of \(\Gamma ^*\) hold then locally uniformly for \(a_0>0\). We could therefore work in this setting as well and modify Definition 1.1 accordingly, with all results holding verbatim (the convergence in Theorem 1.4 holds then locally uniformly for \(a_0>0\)). Note that such a model satisfies then also [21, Assumptions 1 and 2], up to the qualitative smoothness of \(\Pi _x\) which is necessary to write down the renormalized equation. The main result of [21] is that under this assumption, any smooth solution of the renormalized equation is (uniformly in the qualitative smoothness) approximated by a suitable truncation of the series in (1.3), validating our ansatz. Together with a robust reformulation of (1.1) in the space of modelled distributions, which makes sense in the rough setting as well, one would expect to be able to combine this a-priori estimate with a continuity method to construct a solution of (1.1); see an upcoming work by Broux, Otto, and Steele for an implementation of this approach in the setting of a space-time periodic semilinear equation with additive noise.
2 Uniqueness of the model
The aim of this section is to prove the following proposition, which establishes the uniqueness part of Theorem 1.3.
Proposition 2.1
(Uniqueness) Assume that \(\mathbb {E}\) satisfies Assumption 1.2. Then a model \((\Pi ,\Gamma ^*)\) for (1.1) is unique.
The proof proceeds inductive with respect to an ordering \(\prec \) on multi-indices which we shall introduce now. As is clear from the discussion in Sect. 1.3, this ordering can not be given by the homogeneity on multi-indices. Instead, we consider the following ordinal
where we fix \(1>\lambda _1>\lambda _2>0\), and \([\beta ]\) is defined in (1.9). We will then write
The following lemma collects properties of \(\prec \) that we will use in the sequel.
Lemma 2.2
The ordering \(\prec \) is coercive on populated multi-indices, meaning that \(\{\gamma \,|\,\gamma \text { populated and }\gamma \prec \beta \}\) is finite for every \(\beta \). Furthermore, for all \(\beta ,\gamma \) and populated \(\beta _1,\dots ,\beta _{k+1}\) we have
Proof
The coercitivity is an immediate consequences of the definition of \(\prec \). A proof of (2.1) can be found in [19, (8.6)]. For (2.2) we note that \(\Gamma ^*_{xy}\) defined through (1.15) and (1.18) coincides with \(\Gamma ^*_{xy}\) defined through the exponential formula [19, (2.44)]. Hence (2.2) coincides with [19, (8.9)] for populated \(\beta ,\gamma \), and follows by multiplicativity for all \(\beta ,\gamma \). The proof of (2.3) follows as in [19, (8.11)] from its definition (1.29) and the corresponding property (2.2) of \(\Gamma ^*_{xy}\). \(\square \)
Another crucial ingredient for uniqueness is the following Liouville type result.
Lemma 2.3
(Liouville) Let \(\eta >0\), \(\eta \not \in \mathbb {N}\), and assume for a random field f that \(\sup _x |x|^{-\eta }\mathbb {E}|f(x)|<\infty \), and that \((\partial _0-\Delta )f\) is a random polynomial of degree \(\le \eta +2\). Then \(f=0\).
Proof
The assumption implies \((\partial _0-\Delta )\partial ^\textbf{n}f = 0\) for \(|\textbf{n}|>\eta +2\), and by the defining property (1.34) of \(\psi _t\) we obtain \(\partial _t\partial ^\textbf{n}f_t = (\partial _0-\Delta )(\partial _0+\Delta )\partial ^\textbf{n}f_t=0\) for \(|\textbf{n}|>\eta \). In particular, \(\partial ^\textbf{n}f_t\) is independent of t for \(|\textbf{n}|>\eta \). Furthermore, by assumption and the moment bound (1.37)
From the limit \(t\rightarrow \infty \), we deduce that almost surely \(\partial ^\textbf{n}f_t(y)=0\) for all \(|\textbf{n}|>\eta \), \(y\in \mathbb {R}^{1+d}\) and \(t>0\). Hence f is a polynomial of degree \(\le \eta \), and since \(\eta \not \in \mathbb {N}\) this strengthens to \(<\eta \). Since f vanishes by assumption at the origin to order \(\eta \), we learn \(f=0\). \(\square \)
Proof of Proposition 2.1
Assume that we are given two families of random fields \((\Pi ,\Gamma ^*)\) and \(({\widetilde{\Pi }},{\widetilde{\Gamma }}^*)\), both satisfying Definition 1.1. We will prove that their respective \(\beta \)-components coincide almost surely. First, note that it is enough to consider populated multi-indices. Indeed, for \(\beta \) not populated, \(\Pi _{x\beta }=0={\widetilde{\Pi }}_{x\beta }\). For \((\Gamma ^*_{xy})_\beta ^\gamma \) we note that populated multi-indices fully determine \(\pi ^{(\textbf{n})}_{xy}\in \textsf{T}^*\) by \((\Gamma ^*_{xy}-\textrm{id})_\beta ^{e_\textbf{n}} = \pi ^{(\textbf{n})}_{xy\beta }\), see (1.15), which then determines \((\Gamma ^*_{xy})_\beta ^\gamma \) for arbitrary multi-indices by multiplicativity.
For the remaining populated multi-indices, we start with purely polynomial multi-indices \(\beta =e_\textbf{n}\). From (1.4) we learn that \(\Pi _{xe_\textbf{n}}(y) = (y-x)^\textbf{n}= {\widetilde{\Pi }}_{xe_\textbf{n}}(y)\). Turning to \((\Gamma ^*_{xy})_{e_\textbf{n}}^\gamma \), we observe that by (1.47) the only non-vanishing components are given by \((\Gamma ^*_{xy})_{e_\textbf{n}}^{e_\textbf{m}}\) for \(\textbf{m}\ne \textbf{0}\), and (1.23) restricts to \(|\textbf{m}|\le |\textbf{n}|\). From the \((\beta =e_\textbf{n})\)-component of (1.17) we learn that
which by the binomial formula shows that
The same argument can be repeated for \({\widetilde{\Gamma }}^*\), thus \((\Gamma ^*_{xy})_{e_\textbf{n}}^\gamma = ({\widetilde{\Gamma }}^*_{xy})_{e_\textbf{n}}^\gamma \) for all \(\gamma \).
For the remaining populated but not purely polynomial multi-indices we proceed by induction with respect to \(\prec \). It is convenient to explicitly include uniqueness of \(\textrm{d}\Gamma ^*_{xy}\) into the induction, we therefore assume that for \(\delta \xi \in \mathcal {S}(\mathbb {R}^{1+d})\) and \(x\ne y\) we are also given \(\textrm{d}\Gamma ^*_{xy}\) and \(\textrm{d}{\widetilde{\Gamma }}^*_{xy}\) from Definition 1.1 corresponding to \((\Pi _x,\Gamma ^*_{xy})\) and \(({\widetilde{\Pi }}_x,{\widetilde{\Gamma }}^*_{xy})\), respectively. For the base case \(\beta =0\), we recall from (1.38) that \((\partial _0-\Delta )(\Pi _{x0}-{\widetilde{\Pi }}_{x0})=0\). Using the estimate (1.12) and \(|\beta =0|=\alpha \not \in {{\mathbb {Q}}}\), Liouville’s principle Lemma 2.3 yields \(\Pi _{x0}-{\widetilde{\Pi }}_{x0}=0\). For \((\Gamma ^*_{xy})_0^\gamma \) we note that by the triangularity (1.23) the only non-vanishing component is \((\Gamma ^*_{xy})_0^0=1=({\widetilde{\Gamma }}^*_{xy})_0^0\). By the triangularity (2.3) of \(\textrm{d}\Gamma ^*_{xy}\), the only non-vanishing components of \((\textrm{d}\Gamma ^*_{xy})_0^\gamma \) are given by \((\textrm{d}\Gamma ^*_{xy})_0^{e_\textbf{n}}\) with \(|\textbf{n}|=1\). Using (1.4), this yields \((\textrm{d}\Gamma ^*_{xy}Q\Pi _y)_0 = \sum _{|\textbf{m}|=1} (\cdot -y)^\textbf{m}(\textrm{d}\Gamma ^*_{xy})_0^{e_\textbf{m}}\), which evaluated at \(y+\lambda \textbf{n}\) for \(\lambda >0\) and \(|\textbf{n}|=1\) equals \(\lambda (\textrm{d}\Gamma ^*_{xy})_0^{e_\textbf{n}}\). Since the same holds for \(\textrm{d}{\widetilde{\Gamma }}^*_{xy}\), by using the already established \(\delta \Pi _{x0} = \delta {\widetilde{\Pi }}_{x0}\) we obtain
Dividing by \(\lambda \) and taking \(\lambda \rightarrow 0\), we obtain from (1.40) that \((\textrm{d}\Gamma ^*_{xy}-\textrm{d}{\widetilde{\Gamma }}^*_{xy})_0^{e_\textbf{n}}=0\) almost surely, finishing the argument for the base case.
In the induction step we assume uniqueness of \(\Pi _{x\beta '}\), \((\Gamma ^*_{xy})_{\beta '}^\gamma \) and \((\textrm{d}\Gamma ^*_{xy})_{\beta '}^\gamma \) for \(\beta '\prec \beta \) and all populated \(\gamma \), and show uniqueness of the corresponding \(\beta \)-components. We first note that \(\Pi _{x\beta '}\) determines unique \(\Pi ^-_{x\beta '}=(\partial _0-\Delta )(P\Pi _{x})_{\beta '}\), \(\delta \Pi _{x\beta '}\) and \(\delta \Pi ^-_{x\beta ' t}\).
Step 1. \((\Gamma ^*_{xy})_\beta ^\gamma \) is unique for \(\gamma \) not purely polynomial. This is a consequence of the induction hypothesis together with the fact that for \(\gamma \) not purely polynomial
which we shall establish now. Since \(\gamma \) is assumed to be populated and not purely polynomial, we can write \(\textsf{z}^\gamma =\textsf{z}_{k_1}\cdots \textsf{z}_{k_i} \textsf{z}_{\textbf{n}_1}\cdots \textsf{z}_{\textbf{n}_j}\) for some \(k_1,\dots ,k_i\ge 0\), \(i\ge 1\), and \(\textbf{n}_1,\dots ,\textbf{n}_j\ne \textbf{0}\), \(j\ge 0\). In this case, \([\gamma ]\ge 0\) translates into \(k_1+\dots +k_i\ge j\). We learn from (1.18) and the multiplicativity of \(\Gamma ^*_{xy}\) that \((\Gamma ^*_{xy})_\beta ^\gamma \) is a sum consisting of terms of the form
where \(l_1,\dots ,l_i\ge 0\) and \( e_{k_1+l_1} + \cdots + e_{k_i+l_i} + \beta _1^1 + \cdots + \beta _{l_1}^1 + \cdots + \beta _1^i + \cdots + \beta _{l_i}^i + {\bar{\beta }}_1 + \cdots + {\bar{\beta }}_j = \beta \). We first claim that \(|e_{k+l}+\beta _1+\dots +\beta _l|_\prec \ge k\). Indeed, if \(k+l=0\), we obtain \(|e_0|_\prec = \lambda _2 > 0 = k\). If \(k+l>0\) and \(l=0\), we obtain \(|e_k|_\prec = k\), and if \(l>0\) we obtain by additivity of \(|\cdot |_\prec \ge \lambda _1-1\) that \(|e_{k+l}+\beta _1+\dots +\beta _l|_\prec \ge k+l+l(\lambda _1-1) > k\). We now show that \(\beta _1^1\prec \beta \), which in particular means that \(l_1\ge 1\). Then by additivity of \(|\cdot |_\prec \) and the above, we obtain \(|\beta |_\prec \ge k_1+l_1 + |\beta _1^1|_\prec + (l_1-1)(\lambda _1-1) + k_2+\cdots +k_i+j(\lambda _1-1)\), which by \(k_1+\dots +k_i\ge j\) is larger than \(|\beta _1^1|_\prec +1+(l_1-1)\lambda _1+j\lambda _1>|\beta _1^1|_\prec \). We come to \({\bar{\beta }}_1\prec \beta \), which in particular means that \(j\ge 1\). Similarly as above, we obtain \(|\beta |_\prec \ge k_1 + \dots + k_i + |{\bar{\beta }}_1|_\prec +(j-1)(\lambda _1-1) \ge |{\bar{\beta }}_1|_\prec +1+ (j-1)\lambda _1> |{\bar{\beta }}_1|_\prec \).By symmetry, this concludes the proof.
Step 2. \(\Pi ^-_{x\beta }\) is unique. In this step we distinguish \(|\beta |<2\) from \(|\beta |>2\).
Case \(|\beta |>2\) . We rewrite (1.21) to
and note that the \(\beta \)-component of the right-hand side is unique by Step 1, the triangular properties (2.1) and (2.2), and the induction hypothesis. We therefore obtain \((\Pi ^-_{x}-\Pi ^-_{y})_\beta = ({\widetilde{\Pi }}^-_{x}-{\widetilde{\Pi }}^-_{y})_\beta \), and hence \((\Pi ^-_{x}-{\widetilde{\Pi }}^-_{x})_{\beta \,t}(y) = (\Pi ^-_{y} - {\widetilde{\Pi }}^-_{y})_{\beta \,t}(y)\). By the triangle inequality and (1.45), we obtain \(\mathbb {E}| (\Pi ^-_{x}-{\widetilde{\Pi }}^-_{x})_{\beta \, t}(y) | \lesssim (\root 4 \of {t})^{|\beta |-2}\), which by \(|\beta |>2\) yields almost surely \(\Pi ^-_{x\beta }={\widetilde{\Pi }}^-_{x\beta }\).
Case \(|\beta |<2\) . The proof is more complex for \(|\beta |<2\) and relies on Malliavin differentiability and the extra ingredient \(\textrm{d}\Gamma ^*\).
Step 2a. \(\mathbb {E}\Pi ^-_{x\beta \, t}(y)\) is unique. Since \(|\beta |<2\), we obtain from the estimate (1.45) on \(\Pi ^-_{x\beta }\) in particular \(\lim _{t\rightarrow \infty } \mathbb {E}\Pi ^-_{x\beta \, t}(y) = 0\). We can therefore write
Hence it remains to check that \(\partial _s \mathbb {E}\Pi ^-_{x\beta \, s}(y)\) is unique. By use of the semigroup property (1.35) and the defining property (1.34) of \(\psi _t\), we obtain
Again by \(|\beta |<2\), (1.22) yields \(\Pi ^-_{x\beta }=(\Gamma ^*_{xz}\Pi ^-_z)_\beta \). Furthermore, by Lemma 1.7 we know that \(\mathbb {E}\Pi ^-_{z\beta \tau }(z)\) does not depend on z, and since \((\partial _0^2-\Delta ^2)\psi _{s-\tau }\) integrates to 0, we obtain
Since \(\Pi ^-_z\in {\tilde{\textsf{T}}}^*\) has no purely polynomial components, the right-hand side is unique by Step 1 and the strict triangularity (2.2) combined with uniqueness of \(\Pi ^-_{z\beta '}\) for \(\beta '\prec \beta \).
Step 2b. \((\textrm{d}\Gamma ^*_{xy})_\beta ^\gamma \) is unique for \(\gamma \) not purely polynomial. By definition (1.29) of \(\textrm{d}\Gamma ^*_{xy}\) we have
where \(\textrm{d}\pi ^{(\textbf{0})}_{xy\beta '}=\delta \Pi _{x\beta '}(y)\), \(\textrm{d}\pi ^{(\textbf{n})}_{xy\beta '}=(\textrm{d}\Gamma ^*_{xy})_{\beta '}^{e_\textbf{n}}\) for \(|\textbf{n}|=1\) and \((\Gamma ^*_{xy}D^{(\textbf{n})})_{\beta ''}^\gamma = \sum _{\gamma '} (\Gamma ^*_{xy})_{\beta ''}^{\gamma '}(D^{(\textbf{n})})_{\gamma '}^\gamma \). Since \(\gamma \) is not purely polynomial by assumption, and \(\gamma '\) inherits this property from \(D^{(\textbf{n})}\) as can be easily seen from its definition (1.6) and (1.26), the claim follows from Step 1 and the induction hypothesis, provided \(\beta '\prec \beta \) and \(\beta ''\preccurlyeq \beta \). Since \(\textrm{d}\pi ^{(\textbf{n})}_{xy}\in {\tilde{\textsf{T}}}^*\), we can restrict to \(\beta '\) that are not purely polynomial, which implies \(0\preccurlyeq \beta '\) and hence \(\beta ''\preccurlyeq \beta \). Similarly, since \(\gamma '\) is not purely polynomial, \(\beta ''\) inherits this property from \(\Gamma ^*_{xy}\) by (1.20), and thus \(\beta '\preccurlyeq \beta \). If \(\beta '=\beta \), then \(\beta ''=0\), and by the triangularity (1.23) of \(\Gamma ^*_{xy}\) also \(\gamma '=0\). However, \((D^{(\textbf{n})})_0^\gamma \) is only non-vanishing for purely polynomial \(\gamma \), as can be seen from its definition (1.6) and (1.26), which contradicts our assumption and thus \(\beta '\prec \beta \).
Step 2c. \(\Pi ^-_{x\beta }\) is unique. We look at (1.41) and learn from Step 2b and the triangular properties (2.1) and (2.3) that all terms besides \(\delta \Pi ^-_{x\beta t}\) are unique at this stage of the induction. Hence for \(x\ne y\)
Applying \(\mathbb {E}|\cdot |\) and the triangle inequality, we learn from (1.41) that \(\mathbb {E}|(\delta \Pi ^-_x-\delta {\widetilde{\Pi }}^-_x)_{\beta t}(y)|\rightarrow 0\) as \(t\rightarrow 0\). Note also that the semigroup property (1.35) yields
By the polynomial growth (1.33), which by [19, Lemma A.2] is inherited by \(\delta \Pi ^-_{x\beta t}\), and since \(\psi _T\) is a Schwartz function, the dominated convergence theorem thus yields for all \(T>0\), \(x\ne y\in \mathbb {R}^{1+d}\) and \(\delta \xi \in \mathcal {S}(\mathbb {R}^{1+d})\), almost surely
Since \(\mathcal {S}(\mathbb {R}^{1+d})\) is separable, we obtain \(\frac{\partial }{\partial \xi } (\Pi ^-_x - {\widetilde{\Pi }}^-_x)_{\beta \,T}(y)=0\) on \(\mathcal {S}\) almost surely, and by density of \(\mathcal {S}(\mathbb {R}^{1+d})\) in \(\dot{H}^{\alpha -2+D/2}\) we obtain \(\frac{\partial }{\partial \xi } (\Pi ^-_x - {\widetilde{\Pi }}^-_x)_{\beta \,T}(y)=0\). Together with Step 2a, an application of the spectral gap inequality (1.42) yields \((\Pi ^-_x-{\widetilde{\Pi }}^-_x)_{\beta \,T}(y)=0\) almost surely, and by continuity of \((\Pi ^-_x-{\widetilde{\Pi }}^-_x)_{\beta \,T}\) it vanishes also at \(y=x\). Since \(T>0\) and \(y\in \mathbb {R}^{1+d}\) were arbitrary, \(\Pi ^-_{x\beta }={\widetilde{\Pi }}^-_{x\beta }\).
Step 3. \(\Pi _{x\beta }\) is unique. Note that since \(\beta \) is populated and not purely polynomial, \(|\beta |\not \in \mathbb {N}\) as a consequence of \(\alpha \not \in {{\mathbb {Q}}}\). Since \((\partial _0-\Delta )(\Pi _{x\beta }-{\widetilde{\Pi }}_{x\beta })=\Pi ^-_{x\beta }-{\widetilde{\Pi }}^-_{x\beta }=0\) by Step 2, we obtain \(\Pi _{x\beta }={\widetilde{\Pi }}_{x\beta }\) as an immediate consequence of the estimate (1.12) by Liouville’s principle Lemma 2.3.
Step 4. \((\Gamma ^*_{xy})_\beta ^\gamma \) is unique. In view of Step 1 it remains to prove uniqueness of \((\Gamma ^*_{xy})_\beta ^\gamma \) for purely polynomial \(\gamma \). We rewrite (1.17) to \(\Pi _x-\Pi _x(y)-\Gamma ^*_{xy}P\Pi _y = \Gamma ^*_{xy}(\textrm{id}-P)\Pi _y\) and note that the \(\beta \)-component of the left-hand side is at this stage of the induction unique by Step 1, the triangularity (2.2) and Step 3. By (1.4), the \(\beta \)-component of the right-hand side equals
which is a finite sum due to (2.2) and establishes uniqueness of \((\Gamma ^*_{xy})_\beta ^{e_\textbf{n}}\).
Step 5. \((\textrm{d}\Gamma ^*_{xy})_\beta ^\gamma \) is unique. By Step 2b it remains to establish uniqueness of \((\textrm{d}\Gamma ^*_{xy})_\beta ^\gamma \) for purely polynomial \(\gamma \), and from the definition (1.29) of \(\textrm{d}\Gamma ^*_{xy}\) we see that this is only non vanishing for \(\gamma =e_\textbf{n}\) with \(|\textbf{n}|=1\). Together with (1.4), the already established \(\delta \Pi _{x\beta } = \delta {\widetilde{\Pi }}_{x\beta }\), Step 2b and the triangularity (2.3) of \(\textrm{d}\Gamma ^*_{xy}\), this yields
Evaluating at \(y+\lambda \textbf{n}\) for \(\lambda >0\) and \(|\textbf{n}|=1\) and dividing by \(\lambda \), we obtain from (1.40) almost surely \((\textrm{d}\Gamma ^*_{xy}-\textrm{d}{\widetilde{\Gamma }}^*_{xy})_\beta ^{e_\textbf{n}}=0\), which finishes the proof. \(\square \)
3 Existence of a model
To establish the existence part of Theorem 1.3, we consider \((\Pi ^{(\tau )},\Gamma ^{*(\tau )})\) associated to the mollified noise \(\xi _\tau :=\xi *\psi _\tau \), which was constructed in [19]. We start by checking that this construction indeed yields a model in the sense of Definition 1.1, provided \(\xi \) is replaced by the smooth \(\xi _\tau \). Notice that the construction in [19] is carried out for \(d=1\) and \(\alpha <1/2\), so that the regularity index \(2-\alpha -D/2\) of the norm \(\Vert \cdot \Vert _*\) is positive. However, by small adaptations of exponents and weights, see Appendix A, the proof holds in arbitrary dimension and for \(\alpha <1\). We refer to [11], where the same strategy is applied for \(\alpha <1\) and in arbitrary dimension – although for a slightly different equation.
Lemma 3.1
Assume that \(\mathbb {E}\) satisfies Assumption 1.2. Then \((\Pi ^{(\tau )},\Gamma ^{*(\tau )})\) constructed in [19] is a model for (1.1) with \(\xi \) replaced by \(\xi _\tau \).
Proof
For the proof we just collect all the necessary properties of \(\Pi ^{(\tau )}\) and \(\Gamma ^{*(\tau )}\) from [19]. Indeed, \(\Pi ^{(\tau )}_{x0}\) satisfies the linear stochastic heat equation (1.38) with \(\xi \) replaced by \(\xi _\tau \) as a consequence of [19, (2.18), (2.35)]. The purely polynomial part \(\Pi _{x e_\textbf{n}}\) is in agreement with (1.4) by [19, (2.21)], and the shift invariance (1.39) follows from [19, (5.2)]. The mapping properties (1.15) and (1.18) of \(\Gamma ^{*(\tau )}\) are captured by [19, (2.51)] and [19, (2.50)], respectively, where \(\pi ^{(\textbf{n})(\tau )}\) is restricted by (1.19) due to [19, (2.46)]. The recentering (1.17) of \(\Pi ^{(\tau )}\) and (1.21) of \(\Pi ^{-(\tau )}\) is stated in [19, (2.61), (2.64)] and [19, (2.63)], respectively. From [19, Section 7] we know thatFootnote 6\(\Pi ^{(\tau )}_{x\beta } \in C(\mathbb {H})\), and the ansatz (1.29) for \(\textrm{d}\Gamma ^{*(\tau )}\) is the same as in [19, (2.64), (4.40)]. The stochastic estimates (1.12) and (1.24) are established in [19, (2.36), (2.37)], where we point out the uniformity in \(\tau >0\),
It remains to check the properties (1.40) and (1.41). For this, we recall from [19, (4.86)]
where \(q'<q\le 2\), with q denoting a generic conjugate exponent of \(2\le p<\infty \). Hence, the constant in \(\lesssim \) also depends on \(q'<q\) when it comes to estimates of Malliavin derivatives. The weights \(w_x\) are a small adaptation of those in [19], and are given in Appendix A. Here, \(\kappa \) is restricted by
![figure a](http://media.springernature.com/lw685/springer-static/image/art%3A10.1007%2Fs00440-024-01292-2/MediaObjects/440_2024_1292_Figa_HTML.png)
where the restriction (3.4a) is for the weight \(w_x(y)\) to be locally (square) integrable, see [19, (4.36)], and the restriction \(2<\kappa +2\alpha \) originates from a reconstruction argument, see [19, (4.51)]. Since \(\alpha >1-D/4\), cf. Assumption 1.2 (iii), it is possible to choose such a \(\kappa \). Also the restriction (3.4b), which is not present in [19], is an admissible choice since the set of homogeneities is locally finite. The reason for this further restriction on \(\kappa \) is that then [19, (4.71)] can be stated as
with the understanding that \(|\beta |\ge 2\alpha \) unless the left-hand side vanishes, and where \(F^{(\tau )}_{xy}\) is given by
Since by assumption \(\kappa +\alpha>2-\alpha >1\), and since \(w_x(y)\) is finite for \(\delta \xi \in \mathcal {S}\) and \(x\ne y\), (3.3) implies (1.40). From (3.5) and since \(\kappa +2\alpha >2\) by (3.4), we obtain (1.41). \(\square \)
In view of the already established uniqueness result, to prove convergence of \((\Pi ^{(\tau )},\Gamma ^{*(\tau )})\) it is tempting to try to appeal to tightness, which follows from the uniform bounds (3.1) and (3.2). However, difficulties arise since on the level of the law of a random variable it seems to be tricky to preserve the relation between the random variable and its Malliavin derivative.
Instead, we follow a different approach. In fact, the estimates established in [19] upgrade to estimates of increments and yield that \((\Pi ^{(\tau )},\Gamma ^{*(\tau )})\) is Cauchy in \(\tau \). More precisely, for every \(\beta \) there exists \(\varepsilon >0\) such that for all \(p<\infty \), \(x,y\in \mathbb {R}^{1+d}\) and \(\tau ,\tau '>0\)
with the understanding that \(|\beta |-|\gamma |-\varepsilon >0\) unless the left-hand side vanishes. This allows to define the random variables
where both limits have to be understood w.r.t. \(\mathbb {E}^\frac{1}{p}|\cdot |^p\). To check that \((\Pi ,\Gamma ^*)\) defined in this way is indeed a model for (1.1), we will also make use of the corresponding estimates of the Malliavin derivative
together with
where the weights \(\bar{w}\) and \(w_x(y)\) are given in Appendix A. It is not too hard to see that the identical strategy of [19] yields these estimates. For completeness, we walk the reader through and point out the necessary adaptations, see Sect. 3.1. Before we do so, we shall however argue that \((\Pi ,\Gamma ^*)\) is indeed a model.
Proposition 3.2
(Existence) Assume that \(\mathbb {E}\) satisfies Assumption 1.2. Then there exists a model \((\Pi ,\Gamma ^*)\) for (1.1).
Proof
Assumption 1.2 is exactlyFootnote 7 the setting in which [19] was carried out. Therefore we just need to check that \((\Pi ,\Gamma ^*)\) defined by (3.9) satisfies Definition 1.1. Applying triangle inequality and the estimate (3.1), we have
Taking the limit \(\tau \rightarrow 0\) on both sides yields (1.12). The argument for (1.24) is similar.
Preliminary for what follows, we note that \(\Pi _x^-:=(\partial _0-\Delta )P\Pi _x\) satisfies for all \(t>0\) and \(y\in \mathbb {R}^{1+d}\)
Indeed, the left-hand side is bounded by
and by the already established (1.12) we may appeal to the dominated convergence theorem to obtain (3.12).
We turn to (1.38), for which we have to show that \(\Pi ^-_{x0}=\xi \). By the triangle inequality and (1.38) for \(\Pi ^{(\tau )}_{x0}\) we obtain
The first term on the right-hand side converges to 0 as \(\tau \rightarrow 0\) by (3.12), and so does the second term on the right-hand side, see e.g. the upcoming (3.16). Hence \(\mathbb {E}|\Pi ^-_{x0t}(y)-\xi _t(y)|=0\) for all \(t>0\) and \(y\in \mathbb {R}^{1+d}\), which yields (1.38). For (1.4) we note that \(\mathbb {E}|\Pi _{x e_\textbf{n}}(y) - (y-x)^\textbf{n}| = \mathbb {E}| \Pi _{x e_\textbf{n}}(y) - \Pi ^{(\tau )}_{x e_\textbf{n}}(y)|\), which converges to 0 as \(\tau \rightarrow 0\). Similarly, \(\mathbb {E}\Pi _{x\beta }(y)-\mathbb {E}\Pi _{x+z\,\beta }(y+z) = \mathbb {E}(\Pi _{x\beta }(y)-\Pi ^{(\tau )}_{x\beta }(y) ) + \mathbb {E}(\Pi ^{(\tau )}_{x+z\,\beta }(y+z)-\Pi _{x+z\,\beta }(y+z))\), where the right-hand side converges to 0 as \(\tau \rightarrow 0\). The arguments for (1.17), (1.15), (1.18) and (1.21) are similar.
We turn to Malliavin differentiability. By dualizing (3.10) we observe that \(\frac{\partial }{\partial \xi }\Pi ^{(\tau )}_{x\beta }(y)\) is a Cauchy sequence w.r.t. \(\mathbb {E}^\frac{1}{2}\Vert \cdot \Vert _*^2\). Together with (3.7) and the closability assumption of \(\frac{\partial }{\partial \xi }\), we obtain \(\Pi _{x\beta }(y)\in \mathbb {H}\) and \(\frac{\partial }{\partial \xi }\Pi ^{(\tau )}_{x\beta }(y)\rightarrow \frac{\partial }{\partial \xi }\Pi _{x\beta }(y)\) w.r.t. \(\mathbb {E}^\frac{1}{2}\Vert \cdot \Vert _*^2\) as \(\tau \rightarrow 0\). With this at hand we are in position to check (1.40) and (1.41). First, note that (3.11) implies that for fixed \(\delta \xi \in \mathcal {S}\) and \(x\ne y\), \((\textrm{d}\Gamma ^{*(\tau )}_{xy})_\beta ^\gamma \rightarrow (\textrm{d}\Gamma ^*_{xy})_\beta ^\gamma \) w.r.t. \(\mathbb {E}^\frac{1}{q'}|\cdot |^{q'}\) as \(\tau \rightarrow 0\), which serves as the definition of \(\textrm{d}\Gamma ^*_{xy}\). Since \((\textrm{d}\Gamma ^{*(\tau )}_{xy})_\beta ^{e_\textbf{n}} = \textrm{d}\pi ^{(\textbf{n})(\tau )}_{xy\beta }\), this implies in particular \(\textrm{d}\pi ^{(\textbf{n})(\tau )}_{xy\beta }\rightarrow \textrm{d}\pi ^{(\textbf{n})}_{xy\beta }\) w.r.t. \(\mathbb {E}^\frac{1}{q'}|\cdot |^{q'}\) as \(\tau \rightarrow 0\), which we again see as the definition of \(\textrm{d}\pi ^{(\textbf{n})}_{xy}\). By Hölder’s inequality and \(\Gamma ^{*(\tau )}_{xy}\rightarrow \Gamma ^*_{xy}\) w.r.t. \(\mathbb {E}^\frac{1}{p}|\cdot |^p\) as \(\tau \rightarrow 0\), one finds that the structure (1.29) is preserved in the limit. Analogous to checking that (1.12) and (1.24) are preserved in the limit, we obtain that (3.3) is preserved in the limit. In the same way we obtained (3.12), we use (3.10) to argue in favor of
which allows together with the above established convergences to also preserve (3.5) in the limit. Having (3.3) and (3.5), we can proceed as in the proof of Lemma 3.1 to obtain (1.40) and (1.41). \(\square \)
3.1 Proof of the Cauchy property
In this section we give the argument for (3.7), (3.8), (3.10) and (3.11). To not change the exponent \(\varepsilon \) from line to line, we fix a multi-index \(\beta \), and choose \(\varepsilon \) that works for all \(\beta '\preccurlyeq \beta \). As it turns out, this is the case for \(\varepsilon \) satisfying
![figure b](http://media.springernature.com/lw685/springer-static/image/art%3A10.1007%2Fs00440-024-01292-2/MediaObjects/440_2024_1292_Figb_HTML.png)
This is an admissible choice,Footnote 8 since the set of homogeneities is locally finite and since \(\kappa +2\alpha -2>0\), see (3.4). In the following, we will establish (3.7)\(_{\beta '}\), (3.8)\(_{\beta '}\), (3.10)\(_{\beta '}\) and (3.11)\(_{\beta '}\) for this choice of \(\varepsilon \) and for all \(\beta '\preccurlyeq \beta \). Similar to [19], we shall include the upcoming estimates (3.16)\(_{\beta '}\) – (3.18)\(_{\beta '}\), (3.20)\(_{\beta '}\) – (3.23)\(_{\beta '}\) and (3.26)\(_{\beta '}\) – (3.29)\(_{\beta '}\) for the same choice of \(\varepsilon \) and for all \(\beta '\preccurlyeq \beta \).
First, these estimates are verified for all purely polynomial multi-indices. In fact, all left-hand sides of all these estimates are identically zero for purely polynomial \(\beta '\). For \(\Pi _{x\beta '}\) and \(\delta \Pi _{x\beta '}\) this is the case by (1.4), and for \(\Pi ^-_{x\beta '}\) and \(\delta \Pi ^-_{x\beta '}\) this is a consequence thereof. For \((\Gamma ^*_{xy})_{\beta '}\) and \((\delta \Gamma ^*_{xy})_{\beta '}\) this follows from (1.47) and (2.4), and for \((\textrm{d}\Gamma ^*_{xy})_{\beta '}\) this is a consequence of (1.27). For \(\pi ^{(\textbf{n})}_{xy\beta '}\), \(\delta \pi ^{(\textbf{n})}_{xy\beta '}\) and \(\textrm{d}\pi ^{(\textbf{n})}_{xy\beta '}\) this follows from the corresponding statements of \((\Gamma ^*_{xy})_{\beta '}\), \((\delta \Gamma ^*_{xy})_{\beta '}\) and \((\textrm{d}\Gamma ^*_{xy})_{\beta '}\).
For the remaining multi-indices we proceed by induction w.r.t. \(\prec \), where the already established estimates for the multi-indices \(\beta =e_\textbf{n}\) with \(|\textbf{n}|=1\) serve as the base case. In the induction step, we follow the strategy of [19], where in addition we will frequently use telescoping for products in form of
The main philosophy is as follows: we use the algebraic identities of [19], apply (3.14) on the products that arise, and estimate all factors as in [19] except increments, where we apply the new established estimates. Clearly, algebraic properties like triangularity are preserved, so that the inductive structure of the proof does not change. Also the homogeneities add up in the same way as in [19], although sometimes (especially in integration) care has to be taken that the \(\varepsilon \)-loss in the exponents does not create problems. The following lemma illustrates this in a simple setup.
In all following statements, Lemmas 3.3–3.16, we shall always assume that all estimates from [19] hold, without explicitly stating them in the assumption. Furthermore, all statements of the induction hypothesis will be implicitly assumed to hold for all integrability exponents \(p<\infty \) and \(q'<2\), all space-time points \(x,y,z\in \mathbb {R}^{1+d}\), and every convolution parameter \(t>0\). For example, by (3.7)\(_{\prec \beta }\) we mean the estimate for all \(p<\infty \), \(x,y\in \mathbb {R}^{1+d}\) and for all multi-indices \(\beta '\prec \beta \).
Lemma 3.3
(Algebraic argument I’) Assume that (3.20)\(_{\prec \beta }\) holds. Then (3.8)\(_\beta ^\gamma \) holds for all \(\gamma \) not purely polynomial.
Proof
We use the exponential formula [19, (2.44)] to see
After telescoping (3.14), we follow the proof of [17, Lemma 4.3] to obtain (3.8) by Hölder’s inequality and using estimates of \(\pi ^{(\textbf{n})(\tau )}_{xy\beta '}\). The only modification is that on the increment \(\pi ^{(\textbf{n}_i)(\tau )}_{xy\beta _i}-\pi ^{(\textbf{n}_i)(\tau ')}_{xy\beta _i}\) we use the estimate (3.20). Note that the triangularity (1.23) implies that the left-hand side of (3.8) vanishes for \(\beta =\gamma \), and for all other multi-indices where the left-hand side does not vanish we have \(|\beta |>|\gamma |\), which by the choice of \(\varepsilon \) in (3.13a) implies that \(|\beta |-|\gamma |-\varepsilon >0\). \(\square \)
Next, we establish
where in the proof we distinguish multi-indices \(\beta \) satisfying \(|\beta |<2\) from \(|\beta |>2\), and start with the latter.
Lemma 3.4
(Reconstruction I’) Assume \(|\beta |>2\), that (3.16)\(_{\prec \beta }\) and (3.20)\(_{\prec \beta }\) hold, and that (3.8)\(_\beta ^{\gamma }\) holds for all \(\gamma \) not purely polynomial. Then (3.16)\(_\beta \) holds.
Proof
As in [19, Proposition 4.2], (3.16) follows by a standard reconstruction argument, provided we have continuity in the base point in form of
and the sum of the exponents \(\alpha -2-\varepsilon \) and \(|\beta |-\alpha \) is positive. This positivity is guaranteed by (3.13a) due to the assumption \(|\beta |>2\). Appealing to the recentering (1.21) of \(\Pi ^-\) and telescoping (3.14), the remaining part of the proof follows the lines of the one of [19, Proposition 4.2], with the only modification that on the increments we use the estimates (3.8), (3.16) and (3.20) on \(\Gamma ^{*(\tau )}\), \(\Pi ^{-(\tau )}\) and \(\pi ^{(\textbf{n})(\tau )}\), respectively. \(\square \)
For multi-indices \(\beta \) satisfying \(|\beta |<2\) we first note that
which follows by the same argument as in [19, Proposition 4.7], again by telescoping (3.14), and using on increments the estimates (3.16)\(_{\prec \beta }\) and the already established (3.8)\(_\beta ^{\gamma }\) for \(\gamma \) not purely polynomial. It remains to prove
which by dualization and (A.1), and using the spectral gap inequality (1.42) will together with (3.17)\(_\beta \) analogous to [19, Section 4.3] imply (3.16)\(_\beta \) for multi-indices \(\beta \) with \(|\beta |<2\). We postpone the argument for (3.18) for a moment, and continue with the following integration lemma.
Lemma 3.5
(Integration I’) Assume that (3.16)\(_\beta \) holds. Then (3.7)\(_\beta \) holds.
Proof
The proof follows the one of [19, Proposition 4.3] based on the solution formula
A little bit of care has to be taken concerning integrability at \(t=0\) and \(t=\infty \) due to the loss of \(\varepsilon \) in the exponents, which we check in the following by pointing out how the exponents change compared to the aforementioned reference. We apply (3.19) where we replace \(\Pi ^{(\tau )}_{x\beta }(y)\) by the increment \(\Pi ^{(\tau )}_{x\beta }(y)-\Pi ^{(\tau ')}_{x\beta }(y)\), and split the integral into the near field \(t\le |x-y|^4\) and the far field \(t\ge |x-y|^4\). In the near field, we split the integral into its contributions form 1 and \(\textrm{T}_x^{|\beta |}\), where \(\textrm{T}_x^{|\beta |}\) denotes the operation of taking the Taylor polynomial of degree \(<|\beta |\) centered at x. On the former, the order of vanishing in \(\root 4 \of {t}\) is given by \(\alpha -4-\varepsilon \), which is integrable at \(t=0\) since \(\alpha -\varepsilon >0\) by the choice of \(\varepsilon \) in (3.13a). On the latter, the order of vanishing in \(\root 4 \of {t}\) is given by \(|\beta |-4-|\textbf{m}|-\varepsilon \) with \(\textbf{m}\) restricted to \(|\textbf{m}|<|\beta |\). This restriction of \(\textbf{m}\) implies by (3.13a) that \(|\beta |-|\textbf{m}|-\varepsilon >0\), hence also this contribution is integrable at \(t=0\). For the far field contribution, we note that the Taylor remainder has a growth rate in \(\root 4 \of {t}\) given by \(|\beta |-4-|\textbf{m}|-\varepsilon \) where this time \(\textbf{m}\) is restricted to \(|\textbf{m}|\ge |\beta |\). Therefore \(|\beta |-|\textbf{m}|-\varepsilon <0\), which makes this contribution integrable at \(t=\infty \). \(\square \)
Using the estimates (3.7) on \(\Pi \) and (3.8) on \(\Gamma ^*\), the proof of the following lemma follows the one of [19, Proposition 4.4], in addition by telescoping (3.14), and using that \(|\beta |>|\textbf{n}|\) implies \(|\beta |>|\textbf{n}|+\varepsilon \) by the choice of \(\varepsilon \) in (3.13a).
Lemma 3.6
(Three-point argument I’) Assume that (3.7)\(_{\preccurlyeq \beta }\) holds, and that (3.8)\(_{\beta }^{\gamma }\) holds for all \(\gamma \) not purely polynomial. Then
with the understanding that \(|\beta |-|\textbf{n}|-\varepsilon >0\) unless the left-hand side vanishes. In particular, (3.8)\(_\beta ^\gamma \) holds for all \(\gamma \).
In the remaining part of this section we provide the argument for (3.18). First, by applying the derivative \(\delta \) to the exponential formula (3.15) and using Leibniz rule, we obtain the following lemma.
Lemma 3.7
(Algebraic argument II’) Assume that (3.20)\(_{\prec \beta }\) and (3.26)\(_{\prec \beta }\) hold. Then for all \(\gamma \) not purely polynomial
with the understanding that \(|\beta |-|\gamma |-\varepsilon >0\) unless the left-hand side vanishes.
Next, we observe that analogous to Lemma 3.3, the proof of [19, Proposition 4.11] in addition with telescoping (3.14), the estimates (3.8) on \(\Gamma ^*\) and (3.28) on \(\textrm{d}\pi ^{(\textbf{n})}\) yield the following.
Lemma 3.8
(Algebraic argument III’) Assume that (3.28)\(_{\prec \beta }\) holds, and that (3.8)\(_\beta ^\gamma \) holds for all \(\gamma \) not purely polynomial. Then for all \(\gamma \) not purely polynomial
Similarly, from [19, Proposition 4.16] and by telescoping (3.14) we obtain the following.
Lemma 3.9
(Algebraic argument IV’) Assume that (3.29)\(_{\prec \beta }\) holds, and that (3.8)\(_{\preccurlyeq \beta }^{\gamma }\) holds for all \(\gamma \) not purely polynomial. Then (3.11)\(_\beta ^\gamma \) holds for all \(\gamma \) not purely polynomial.
Equipped with these estimates, we can carry out a reconstruction argument.
Lemma 3.10
(Reconstruction III’) Assume that (3.7)\(_{\prec \beta }\), (3.8)\(_{\prec \beta }\), (3.16)\(_{\prec \beta }\) and (3.27)\(_{\prec \beta }\) hold, and that (3.11)\(_{\preccurlyeq \beta }^\gamma \) and (3.22)\(_{\preccurlyeq \beta }^\gamma \) hold for all \(\gamma \) not purely polynomial. Then
Proof
The proof follows the one of [19, Proposition 4.12]. We will first establish a continuity condition of \(F^{(\tau )}_{xy}-F^{(\tau ')}_{xy}\) in the secondary base point y,
where we recall that \(F^{(\tau )}_{xy}\) is defined in (3.6). Since the sum of the exponents \(\alpha -2-\varepsilon \) and \(\kappa +\alpha \) is positive, which is a consequence of (3.13b), and since
which is a consequence of [19, (4.68)] by the triangle inequality, a general reconstruction argument impliesFootnote 9
Together with
this implies the desired (3.23).
We shall now give the argument for (3.24) and (3.25). Actually, the argument for (3.24) is identical to the one of [19, (4.69)], with the same modification as always: use telescoping (3.14) and apply on increments the estimates of the assumption of the lemma. The same applies to (3.25), however, the following additional estimate has to be carried out for \(\beta =0\):
We obtain this estimate by interpolating between the two estimates
The former is a consequence of the triangle inequality and (A.2) with \(\textbf{n}=\textbf{0}\), since \(\alpha -2+\kappa \le 0\) by (3.4b). For the latter, we assume w.l.o.g. that \(\tau '\le \tau \), so that
By the defining property (1.34) of \(\psi \) we have \(\partial _s\delta \xi _s = (\partial _0^2-\Delta ^2)\delta \xi _s\), and the claim follows from (A.2) with \(|\textbf{n}|=4\). \(\square \)
As in [19, Proposition 4.18], this allows to achieve the estimate (3.18) by appealing to
and averaging in the secondary base point z using (A.4).
Lemma 3.11
Assume that (3.16)\(_{\prec \beta }\), (3.18)\(_{\prec \beta }\), (3.11)\(_{\beta }\) and (3.23)\(_{\beta }\) hold, and that (3.8)\(_\beta ^\gamma \) and (3.21)\(_\beta ^\gamma \) hold for all \(\gamma \) not purely polynomial. Then (3.18)\(_\beta \) holds.
The same arguments as in Lemma 3.5 apply to see the following.
Lemma 3.12
(Integration II’) Assume that (3.18)\(_\beta \) holds. Then (3.10)\(_\beta \) holds.
This in turn serves as input for the next lemma, the proof of which is analogous to the one of [19, Proposition 4.10], using in addition telescoping (3.14).
Lemma 3.13
(Three-point argument II’) Assume that (3.7)\(_{\prec \beta }\), (3.10)\(_{\preccurlyeq \beta }\) hold, and that (3.8)\(_\beta ^\gamma \) and (3.21)\(_\beta ^\gamma \) hold for all \(\gamma \) not purely polynomial. Then
with the understanding that \(|\beta |-|\textbf{n}|-\varepsilon >0\) unless the left-hand side vanishes. In particular, (3.21)\(_\beta ^\gamma \) holds for all \(\gamma \).
Furthermore, (3.23) can be integrated to obtain the following.
Lemma 3.14
(Integration III’) Assume that (3.23)\(_\beta \) holds. Then
Proof
The proof follows [19, Proposition 4.14], where as in Lemma 3.5, some care has to be taken concerning integrability at \(t=0\) and \(t=\infty \). In the near field range \(t\le |y-z|^4\), it is enough to mention that by (3.13b) we have \(\alpha -1+\kappa -\varepsilon >0\), and that by (3.13a) we have \(\alpha -\varepsilon >0\). In the far field range \(t\ge \max \{|y-z|^4,|x-z|^4\}\), the factor \(t^{-\varepsilon }\) actually improves integrability, and in the intermediate range \(|y-z|^4\le t\le |x-z|^4\) the question of integrability doesn’t come up. \(\square \)
Once more, we can as in [19, Proposition 4.15] use this estimate, together with (3.14), to obtain the next lemma.
Lemma 3.15
(Three-point argument III’) Assume that (3.7)\(_{\prec \beta }\), (3.20)\(_{\prec \beta }\) and (3.27)\(_\beta \) hold, and that (3.8)\(_{\prec \beta }^\gamma \), (3.11)\(_\beta ^\gamma \) and (3.22)\(_\beta ^\gamma \) hold for all \(\gamma \) not purely polynomial. Then for all \(\textbf{n}\) with \(|\textbf{n}|\le 1\),
In particular, (3.22)\(_\beta ^\gamma \) holds for all \(\gamma \).
Finally, the same strategy as in [19, Proposition 4.17] yields the following lemma, which concludes the induction step.
Lemma 3.16
(Three-point argument IV’) Assume that (3.7)\(_{\prec \beta }\), (3.10)\(_{\prec \beta }\) and (3.27)\(_{\prec \beta }\) hold, and that (3.11)\(_\beta ^\gamma \) holds for all \(\gamma \) not purely polynomial. Then for all \(\textbf{n}\) with \(|\textbf{n}|\le 1\),
In particular, (3.11)\(_\beta ^\gamma \) holds for all \(\gamma \).
4 Convergence of models
In this section we provide the proof of Theorem 1.4. A first technical ingredient is the following lemma, which is concerned with upgrading the probabilistic convergence \(\xi _n\rightarrow \xi \) in the space of tempered distributions to convergence with respect to an annealed Hölder-type norm.
Lemma 4.1
Assume that the laws of \(\xi _n,\xi \) satisfy Assumption 1.2, uniformly in n, and that \(\xi _n\rightarrow \xi \) either almost surely or w.r.t. \(\mathbb {E}^\frac{1}{p}|\cdot |^p\) on the space of tempered distributions. Then \(\xi _n\rightarrow \xi \) with respect to the annealed norm
for every \(\varepsilon >0\), where \(p'<\infty \) is arbitrary in case of assuming almost sure convergence, and \(p'=p\) in case of assuming convergence w.r.t. \(\mathbb {E}^\frac{1}{p}|\cdot |^p\).
Here, we keep track of the small scale regularity as well as the large scale regularity as an artefact of working on the whole space. The reason for losing an \(\varepsilon \) on small and large scales, although working with annealed norms, is a consequence of “trading regularity for convergence”. This was already present in (3.16), where even for \(\xi _\tau \) we had to give up an \(\varepsilon \) of regularity to achieve the convergence \(\xi _\tau \rightarrow \xi \) as \(\tau \rightarrow 0\).
The proof of Lemma 4.1 is an application of the Arzela–Ascoli theorem, and is deferred to Sect. 4.1. Another ingredient for the proof of Theorem 1.4 is the following lemma, which for \(\tau >0\) establishes continuity of the map \(\xi \mapsto (\Pi ^{(\tau )},\Gamma ^{*(\tau )})\) in a suitable (annealed) topology.
Lemma 4.2
Assume that \(\mathbb {E}\) satisfies Assumption 1.2. Fix \(\tau >0\), \(p<\infty \) and a multi-index \(\beta \). Then there exists \(\varepsilon >0\) small enough and \(p'<\infty \) large enough, such that the map \(\xi \mapsto (\Pi ^{(\tau )}_\beta ,\Gamma ^{*(\tau )}_\beta )\) is continuous, provided the space of \(\xi \) is equipped with the norm \(\Vert \cdot \Vert _{\alpha -2,\varepsilon }\) defined in (4.1) with integrability exponent \(p'\), and the space of \((\Pi ^{(\tau )}_\beta ,\Gamma ^{*(\tau )}_\beta )\) is equipped with the norm
where
One should expect Lemma 4.2 to be true, since on the one hand, (4.3) and (4.5), which analogously to (1.45) imply \(\mathbb {E}^\frac{1}{p}|\Pi ^-_{x\beta t}(y)|^p \le (\root 4 \of {t})^{\alpha -2-\varepsilon } (\root 4 \of {t}+|x-y|)^{|\beta |-\alpha } (1+\root 4 \of {t}+|x|+|y|)^{2\varepsilon }\), yield good enough estimates on small and large scales in order for the integral in the solution formula (3.19) to converge at \(t=0\) and \(t=\infty \); on the other hand, (4.4) yields enough small scale regularity in order for the products \(\Pi ^k\Delta \Pi \) in (1.5b) of \(\Pi ^{-}\) to be well defined.
We give a quantitative proof of Lemma 4.2 establishing even Hölder continuity in Sect. 4.1, the arguments of which are an adaptation of the arguments of the proof of [19, Remark 2.3], which establishes for \(|\textbf{n}|=2\)
One just has to ensure that the \(\varepsilon \)-loss in the exponents does not cause any problems.
With these two preliminary lemmas, we can give the proof of Theorem 1.4.
Proof of Theorem 1.4
Let \((\mathbb {E}_n)_{n\in \mathbb {N}}\) be a sequence of ensembles satisfying Assumption 1.2 uniformly in n, that converges weakly to an ensemble \(\mathbb {E}\) which then automatically satisfies Assumption 1.2. It is convenient for the proof to work with random variables on a common probability space. We therefore appeal to Skorohod’s representation theorem to obtain random variables on a common probability space, which we denote by \(\xi _n,\xi \), with their corresponding law given by the ensembles \(\mathbb {E}_n,\mathbb {E}\), and such that up to a subsequence that we do not relabel \(\xi _n\rightarrow \xi \) almost surely on the space of tempered distributions. A little bit of care has to be taken here, since we apply Skorohod’s representation theorem on the space of tempered distributions which is not a metric space.Footnote 10 A suitable version of this theorem can be found in [1, Example 5.1.I], and requires to check that the \(\xi _n\) are tight and the laws of \(\xi _n,\xi \) are Radon measures. The former follows from the uniform moment bound \(\mathbb {E}^\frac{1}{p}|(\xi _n)_t(x)|^p\lesssim (\root 4 \of {t})^{\alpha -2}\), see (1.45), which is a consequence of the spectral gap assumption. The latter is a consequence of the following: by Kolmogorov’s criterion and again by the moment bound, the law of \(\xi _n\) concentrates on a compact subset of \(\mathcal {S}'\), namely a weighted and sufficiently negative Hölder space; as a Borel probability measure supported on a compact set, it is therefore a Radon measure.
Recall from Lemma 4.1, that the almost sure convergence implies \(\xi _n\rightarrow \xi \) with respect to \(\Vert \cdot \Vert _{\alpha -2,\varepsilon }\) for every \(\varepsilon >0\). Furthermore, the map \(\xi \mapsto (\Pi ^{(\tau )}_\beta ,\Gamma ^{*(\tau )}_\beta )\) is continuous for fixed \(\tau >0\) by Lemma 4.2, provided the space of \(\xi \) is equipped with \(\Vert \cdot \Vert _{\alpha -2,\varepsilon }\) for \(\varepsilon >0\) small enough, and the space of \((\Pi ^{(\tau )}_\beta ,\Gamma ^{*(\tau )}_\beta )\) is equipped with \(\Vert \cdot \Vert _\beta \) defined in (4.2). Note, that the norm \(\Vert \cdot \Vert _\beta \) satisfies by definition
These preliminary remarks allow for the conclusion as we argue now. Denoting the models associated to \(\xi \), \(\xi _\tau \), \(\xi _n\) and \((\xi _n)_\tau \) by \(\Pi \), \(\Pi ^{(\tau )}\), \(\Pi ^{(n)}\) and \(\Pi ^{(n)(\tau )}\), respectively, we have
Since the Cauchy property (3.7) was uniform in the class of ensembles satisfying Assumption 1.2, for given \(\eta >0\) there exists \(\tau >0\) such that the first and the third right-hand side contribution are bounded by \(\eta \, |x-y|^{|\beta |-\varepsilon }\), uniformly in n. The second right-hand side term is by (4.9) bounded by \(|x-y|^{|\beta |-\varepsilon }(1+|x|+|y|)^{2\varepsilon } \Vert (\Pi ^{(n)(\tau )}-\Pi ^{(\tau )}, \Gamma ^{*(n)(\tau )}-\Gamma ^{*(\tau )})\Vert _\beta \), which for n large enough and fixed \(\tau >0\) is dominated by \(\eta \,|x-y|^{|\beta |-\varepsilon }(1+|x|+|y|)^{2\varepsilon }\) due to continuity. Altogether we obtain
The same game can be played with \(\Gamma ^*\), to obtain
We turn this annealed convergence by Kolmogorov’s criterion into a quenched convergence result, possibly along a further subsequence. A suitable version of Kolmogorov’s criterion can be found in [13, Proposition B.3], which we apply here with \(F_x=|z|^{-|\beta |+\varepsilon }(\Pi ^{(n)}_{x\beta }(x+z)-\Pi _{x\beta }(x+z))\), and where we note that the necessary continuity follows from Lemma 1.7 iii). This yields for \(x\in \mathbb {R}^{1+d}\) and \(\kappa >\varepsilon \)
Noting that the supremum over \(x\in \mathbb {R}^{1+d}\) in the definition (1.43) of \({\vert \hspace{-1.0625pt}\vert \hspace{-1.0625pt}\vert \Pi \vert \hspace{-1.0625pt}\vert \hspace{-1.0625pt}\vert }_\beta \) can be replaced by \(x\in (1+d)^{-1/2}\mathbb {Z}^{1+d}\), and using \(\mathbb {E}(\sup _{x\in \mathbb {Z}^{1+d}}|f(x)|)^p\le \sum _{x\in \mathbb {Z}^{1+d}}\mathbb {E}|f(x)|^p\), which is a consequence of the obvious \(\sup _{x\in \mathbb {Z}^{1+d}} |f(x)|\le (\sum _{x\in \mathbb {Z}^{1+d}} |f(x)|^p)^{1/p}\), we get
By \(\kappa >\varepsilon \) we can choose \(p<\infty \) large enough such that this series is finite. The analogous result can be obtained for \(\Gamma ^*\), hence
which in turn yields the desired convergence in law. That we had to pass to a subsequence for Skorohod’s theorem is not an issue, since for any subsequence of \((\Pi ^{(n)},\Gamma ^{*(n)})\) we can start the proof again with the corresponding subsequence of \(\mathbb {E}_n\), and obtain convergence in law of a further subsequence of \((\Pi ^{(n)},\Gamma ^{*(n)})\) to \((\Pi ,\Gamma ^*)\), establishing convergence in law of the whole sequence \((\Pi ^{(n)},\Gamma ^{*(n)})\).
To argue in favor of the same statement, with convergence in law replaced by convergence in probability, we proceed similarly. Instead of appealing to Skorohod’s representation theorem, we pass to a subsequence to obtain almost sure convergence of \(\xi _{n_k}\) to \(\xi \) in the space of tempered distributions. From here, we follow the above proof to obtain \(L^p\) convergence of \((\Pi ^{(n_k)},\Gamma ^{*(n_k)})\) to \((\Pi ,\Gamma ^*)\), which implies almost sure convergence of a further subsequence. For any other subsequence of \((\Pi ^{(n)},\Gamma ^{*(n)})\) we can repeat this argument with the corresponding subsequence of \(\xi _n\), which yields almost sure convergence of a further subsequence of \((\Pi ^{(n)},\Gamma ^{*(n)})\) to \((\Pi ,\Gamma ^*)\). By the subsequence criterion [16, Lemma 4.2], this implies convergence in probability of the whole sequence \((\Pi ^{(n)},\Gamma ^{*(n)})\) to \((\Pi ,\Gamma ^*)\).
Assuming \(L^p\) convergence for all \(p<\infty \), we infer directly from Lemma 4.1 the convergence \(\xi _n\rightarrow \xi \) with respect to the annealed norm defined in (4.1), and the desired \(L^p\) convergence of \((\Pi ^{(n)},\Gamma ^{*(n)})\) follows as above. \(\square \)
4.1 Proof of Lemmas 4.1 and 4.2
Proof of Lemma 4.1
Let \(\eta >0\), we will show that \(\lim _{n\rightarrow \infty }\Vert \xi _n-\xi \Vert _{\alpha -2,\varepsilon }<\eta \). Recall from (1.45) that the spectral gap inequality implies \(\mathbb {E}^\frac{1}{p}|\xi _t(x)|^p \lesssim (\root 4 \of {t})^{\alpha -2}\), where the implicit constant depends on p, but not on \(t>0\) or \(x\in \mathbb {R}^{1+d}\). The same holds by assumption for \(\xi _n\), where the implicit constant does not depend on n either. Therefore, for \(f_n:(0,\infty )\times \mathbb {R}^{1+d}\rightarrow \mathbb {R}\) defined by
we obtain by the triangle inequality
Hence we can find a compact set \(K\subset (0,\infty )\times \mathbb {R}^{1+d}\) such that the supremum of \(f_n\) on the complement of K is bounded by \(\eta \), uniformly in n. Notice that \(f_n\) is chosen such that \(\Vert \xi _n-\xi \Vert _{\alpha -2,\varepsilon }=\sup _{t,x}f_n(t,x)\). To conclude, it therefore remains to establish
This in turn, is a consequence of Arzelà Ascoli, which yields a uniformly convergent subsequence \(f_{n_k}\). If \(\xi _n\rightarrow \xi \) almost surely on \(\mathcal {S}'\), then the pointwise limit of \(f_n(t,x)\) is 0 by the dominated convergence theorem. If \(\xi _n\rightarrow \xi \) w.r.t. \(\mathbb {E}^\frac{1}{p}|\cdot |^p\) on \(\mathcal {S}'\), then again the pointwise limit of \(f_n(t,x)\) is 0. In either case, we conclude that the uniform limit of \(f_{n_k}\) has to be 0, and as a consequence the whole sequence \(f_n\) converges uniformly to 0. It remains to check the assumptions of the Arzelà Ascoli theorem. By (4.10), the sequence \(f_n\) is uniformly bounded on the compact set K. For equicontinuity, we first note that by the triangle inequality
The first two right-hand side terms are further bounded by
The first term satisfies the desired continuity, since \(t\mapsto (\root 4 \of {t})^{2-\alpha +\varepsilon }(1+\root 4 \of {t}+|x|)^{-2\varepsilon }\) is (uniformly) continuous on K, and since \(\mathbb {E}^\frac{1}{p}|(\xi _n)_t(x)|^p\lesssim (\root 4 \of {t})^{\alpha -2}\) is bounded uniformly in n and \((t,x)\in K\). For the second term, we note that \((\root 4 \of {t'})^{2-\alpha +\varepsilon }(1+\root 4 \of {t'}+|x'|)^{-2\varepsilon }\) is uniformly bounded for \((t',x')\in K\), and it remains to establish equicontinuity of \(\mathbb {E}^\frac{1}{p} | (\xi _n)_t(x)-(\xi _n)_{t'}(x') |^p\). This is a consequence of
which was established in [19, (6.3)], together with
for some \({\bar{t}}\) between t and \(t'\), which is a consequence of the mean value theorem, the semigroup property (1.35) and the moment bound (1.37). The terms involving \(\xi \) instead of \(\xi _n\) can be treated in the same way. \(\square \)
Proof of Lemma 4.2
Let \((\Pi ^{(\tau )},\Gamma ^{*(\tau )})\) and \(({\widetilde{\Pi }}^{(\tau )},{\widetilde{\Gamma }}^{*(\tau )})\) be the models associated to \(\xi \) and \({\tilde{\xi }}\). We will show that \(\Vert (\Pi ^{(\tau )},\Gamma ^{*(\tau )})-({\widetilde{\Pi }}^{(\tau )},{\widetilde{\Gamma }}^{*(\tau )})\Vert _\beta \) is arbitrarily small, provided \(\Vert \xi -{\tilde{\xi }}\Vert _{\alpha -2,\varepsilon }\) is sufficiently small. Since the only norm we put on \(\xi ,{\tilde{\xi }}\) is \(\Vert \cdot \Vert _{\alpha -2,\varepsilon }\), we will for the ease of notation simply write \(\Vert \xi \Vert \) instead. The proof proceeds inductively, for which it is convenient to introduce the norms
With this notation at hand, we will show in Step 1 that the linear map \(\xi \mapsto \Pi ^{-(\tau )}_0=\xi _\tau \) is continuous, i.e. we establish
where we want to emphasize that the implicit constant in \(\lesssim _\tau \) does depend on \(\tau \) (and blows up as \(\tau \rightarrow 0\)). In Step 2 we establish one part of the continuity of the map \(\Pi ^{-(\tau )}_\beta \mapsto \Pi ^{(\tau )}_\beta \), namely
In Step 3 we will prove that
where \(\Vert \cdot \Vert _{\preccurlyeq \beta }:=\sum _{\beta '\preccurlyeq \beta }\Vert \cdot \Vert _{\beta '}\). We will use this in Step 4 to show
and in Step 5 we establish for \(\beta \ne 0\)
Finally, in Step 6 we show for \(\beta \ne 0\) that
which finishes the argument.
Note that the reason for \(p'\) in the definition (4.1) of \(\Vert \xi \Vert \) being different from p in the definition (4.2) of \(\Vert (\Pi ,\Gamma ^*)\Vert _\beta \), is that along the way we appeal to Hölder’s inequality. Due to the triangularity (2.1), (2.2), (2.3) and the coercitivity of \(\prec \), we will do so a finite number of times to reach the multi-index \(\beta \), and hence it is possible to choose \(p'<\infty \) large enough. We will be cavalier in the following and not distinguish in the notation between \(p'\) and p, and implicitly always assume we made the appropriate change in exponents. For \(\varepsilon \), the following choice will turn out to be suitable. We choose \(\varepsilon >0\) satisfying (3.13a) and
which is an admissible choice since the set of homogeneities is locally finite.
Step 1 (Proof of (S1)). By the definition of \(\Vert \xi \Vert \) in (4.1), we see that \(\mathbb {E}^\frac{1}{p}| \xi _{\tau +t}(x) |^p \le (\root 4 \of {t+\tau })^{\alpha -2-\varepsilon } (1+\root 4 \of {t+\tau }+|x|)^{2\varepsilon } \Vert \xi \Vert \), which since \(\alpha -2-\varepsilon <0\) and \(\tau \lesssim 1\) yields \(\Vert \Pi ^{(\tau )}\Vert ^-_0\lesssim \Vert \xi \Vert \). To argue for \([\Pi ^{(\tau )}]^-_{0}\lesssim _\tau \Vert \xi \Vert \) we use the triangle inequality to obtain
The first right-hand side term can be rewritten as \(\mathbb {E}^\frac{1}{p}|\int _0^t ds\,\partial _s\xi _{\tau +s}(y)|^p\), which by the defining property (1.34) of \(\psi \), the moment bound (1.37) and the definition of \(\Vert \xi \Vert \) can be estimated by
Since \(\alpha -\varepsilon >0\) by (3.13a) and \(\tau \lesssim 1\), this is further bounded by
The same bound holds for the third right-hand side term with y replaced by z. The second right-hand side term we bound using the mean value theorem and the definition of \(\Vert \xi \Vert \) by
which since \(\alpha -\varepsilon -1<0\) and \(\tau \lesssim 1\) is further estimated by
Choosing \(t=|y-z|^4\), we obtain altogether
which yields the desired \([\Pi ^{(\tau )}]^-_{0}\lesssim _\tau \Vert \xi \Vert \).
Step 2 (Proof of (S2)). We use the representation formula (3.19) of \(\Pi ^{(\tau )}\). By linearity it is enough to show \(\Vert \Pi ^{(\tau )} \Vert _\beta \le \Vert \Pi ^{(\tau )}\Vert ^-_\beta \). The proof is analogous to the one of Lemma 3.5. In the near field \(0\le t\le |x-y|^4\) we split the contributions from 1 and \(\textrm{T}_x^{|\beta |}\), and bound the former by the definition of \(\Vert \Pi ^{(\tau )}\Vert _\beta ^-\) by
which since \(\alpha -\varepsilon >0\) is bounded by \(|x-y|^{|\beta |-\varepsilon }(1+|x|+|y|)^{2\varepsilon } \Vert \Pi ^{(\tau )}\Vert ^-_\beta \). The contribution from \(\textrm{T}_x^{|\beta |}\) is again by the definition of \(\Vert \Pi ^{(\tau )}\Vert _\beta ^-\) bounded by
By the choice of \(\varepsilon \) in (3.13a), \(|\beta |>|\textbf{n}|\) implies \(|\beta |-|\textbf{n}|-\varepsilon >0\), hence also this contribution can be further estimated by \(|x-y|^{|\beta |-\varepsilon }(1+|x|+|y|)^{2\varepsilon } \Vert \Pi ^{(\tau )}\Vert ^-_\beta \). In the far field \(|x-y|^4\le t<\infty \) we use the Taylor remainder to estimate this contribution by
Since \(\beta \) is not purely polynomial, \(|\beta |\le |\textbf{n}|\) implies \(|\beta |<|\textbf{n}|\), and the choice of \(\varepsilon \) in (4.11) implies \(|\beta |-|\textbf{n}|+\varepsilon <0\). Hence also this contribution is further estimated by \(|x-y|^{|\beta |-\varepsilon }(1+|x|+|y|)^{2\varepsilon } \Vert \Pi ^{(\tau )}\Vert ^-_\beta \), which finishes the proof of \(\Vert \Pi ^{(\tau )} \Vert _\beta \lesssim \Vert \Pi ^{(\tau )}\Vert ^-_\beta \).
Step 3 (Proof of (S3)). As a first ingredient we prove
for which we have to show that for \(\gamma \) not purely polynomial
The proof is analogous to the one of Lemma 3.3. After appealing to the exponential formula (3.15) and telescoping (3.14), we observe that by \(\pi ^{(\textbf{n})(\tau )}_{xy\beta '} = (\Gamma ^{*(\tau )}_{xy}-\textrm{id})_{\beta '}^{e_\textbf{n}}\), see (1.15), we have to estimate linear combinations of terms of the form
for \(\beta _1,\dots ,\beta _k\prec \beta \). By Hölder’s inequality we can use on the first \(i-1\) and the last \(k-1\) terms the bound (1.24) on \(\Gamma ^{*(\tau )}\), which was uniform in the class of ensembles satisfying Assumption 1.2, while on the increment we appeal to the definition of \(\Vert \Gamma ^{*(\tau )}\Vert _{\beta _i}\) to obtain an estimate by
As in the proof of Lemma 3.3, the exponent of \(|x-y|\) equals \(|\beta |-|\gamma |-\varepsilon \), so that we obtain (4.12).
We turn to purely polynomial multi-indices \(\gamma \), where the argument is based on [19, (4.16)], which reads
As in the proof of [19, Proposition 4.4], by using in addition telescoping (3.14), we can argue in favor of
which together with the already established (4.12) yields (S3).
Step 4 (Proof of (S4)). We will establish bounds of increments of \(\partial _1^2\Pi ^{(\tau )}_{x\beta }\), the remaining derivatives can be bounded similarly. As a first ingredient, we prove for all \(\textbf{n}\ne \textbf{0}\)
Next, we note that (4.8) yields by the moment bound (1.37)
By using \(\partial _1^2\Gamma ^{*(\tau )}_{xy}\Pi ^{(\tau )}_y = \partial _1^2\Pi ^{(\tau )}_x\), see (1.17), we observe
After applying Hölder’s inequality, we appeal for the first right-hand side term to (1.24) and to (4.13) with \(\Pi ^{(\tau )}\) replaced by \(\Pi ^{(\tau )}-{\widetilde{\Pi }}^{(\tau )}\), and for the second right-hand side term to (S3) and (4.14), to obtain an estimate by
which yields
As in [19, Proof of Remark 2.3], this weak bound can be upgraded to a pointwise bound, yielding (S4).
It remains to provide the argument for (4.13). For this, we note that the in Step 2 established \(\Vert \Pi ^{(\tau )} \Vert _\beta \lesssim \Vert \Pi ^{(\tau )}\Vert ^-_\beta \) implies by the moment bound (1.37)
Equipped with this, we first consider the regime \(t\ge 1\). In this case, (4.15) is a stronger estimate than (4.13). In the regime \(t\le 1\), we distinguish the case \(|\beta |>3\) and \(|\beta |<3\). For the former, we use again (4.15), and argue again that it is stronger than (4.13). To see this, we split the exponent of \(\root 4 \of {t}\) into \(\alpha -\varepsilon -|\textbf{n}|\) and \(|\beta |-\alpha -2\), and argue that the latter is positive: by \(|\beta |>3\), this is a consequence of \(\alpha <1\). We turn to the case \(|\beta |<3\). Here, we appeal to the solution formula (3.19) and note that the Taylor polynomial drops out due to \(|\beta |<3\) and \(\textbf{n}\ne \textbf{0}\), so that
The \(\mathbb {E}^\frac{1}{p}|\cdot |^p\)-norm of the far field contribution \(s\ge 1\) is by the definition of \(\Vert \Pi ^{(\tau )}\Vert ^-_\beta \) estimated by
Since \(|\beta |<3\), we have by the choice of \(\varepsilon \) in (4.11) that \(|\beta |+\varepsilon <3\). Thus \(|\beta |+\varepsilon -|\textbf{n}|-6 < -|\textbf{n}|-3\), which by \(\textbf{n}\ne \textbf{0}\) is \(\le -4\). Hence the integral converges at \(s=\infty \) and is \(\lesssim (1+|x|)^{2\varepsilon } \Vert \Pi ^{(\tau )}\Vert ^-_\beta \), which by \(t\le 1\) is trivially estimated by the right-hand side of (4.13). In the intermediate regime of \(t\le s\le 1\) we rewrite
By the definition of \([\Pi ^{(\tau )}]^-_{\beta }\), the \(\mathbb {E}^\frac{1}{p}|\cdot |^p\)-norm of this contribution is estimated by
which by the moment bound (1.37) is estimated by
Again, since \(t\le 1\) and \(\textbf{n}\ne \textbf{0}\), this is trivially estimated by the right-hand side of (4.13).
Step 5 (Proof of (S5)). In preparation for later, we first establish
Indeed, using that \(\psi _t\) integrates to 1, we rewrite
to see from the definitions of \(\Vert \Pi ^{(\tau )}\Vert _\beta \) and \([\Pi ^{(\tau )}]_\beta \) that
which for \(t=1\) yields (4.16). Using \(\Delta \Pi ^{(\tau )}_{x\beta }(y) = (\Gamma ^{*(\tau )}_{xy}\Delta \Pi ^{(\tau )}_{y})_{\beta }(y)\), we obtain from the definition of \(\Vert \Gamma ^{*(\tau )}\Vert _\beta \) together with (S3) and (4.7), and from (1.24) and (4.16)
where the first sum restricts to \(\gamma \prec \beta \) since \(\Gamma ^{*(\tau )}-{\widetilde{\Gamma }}^{*(\tau )}=\Gamma ^{*(\tau )}-\textrm{id}+\textrm{id}-{\widetilde{\Gamma }}^{*(\tau )}\) is strictly triangular. This implies \(|\beta |>|\gamma |\), which by the choice of \(\varepsilon \) in (3.13a) upgrades to \(|\beta |-|\gamma |-\varepsilon >0\). Thus, also using \(|\cdot |\ge \alpha \), we have \(|x-y|^{|\beta |-|\gamma |-\varepsilon } \le (1+|x-y|)^{|\beta |-|\gamma |-\varepsilon } \le (1+|x-y|)^{|\beta |-\alpha }\), which together with \(|x-y|^{|\beta |-|\gamma |} \le (1+|x-y|)^{|\beta |-\alpha }\) implies
Equipped with this, we show
Recall from [19, (2.38)] that \(c^{(\tau )}_\beta \) is chosen in such a way that
Hence, (4.18) is true for \(\beta =0\) by \(c^{(\tau )}_0 = 0 = {\tilde{c}}^{(\tau )}_0\) as a consequence of the centeredness in Assumption 1.2. For the remaining multi-indices, we proceed by induction and assume (4.18)\(_{\prec \beta }\). Note that
where the choice \(t=1\) and \(x=0\) will turn out to be convenient, and recall from [19, (4.19)] that
Using this, we obtain from (4.12) & (S3), (1.24), (1.45) and the definition of \(\Vert \Pi ^{(\tau )}\Vert ^-_\beta \)
which by the moment bound (1.37) is bounded by
Since \(|\beta |<2\) upgrades to \(|\beta |-2+\varepsilon <0\) by the choice of \(\varepsilon \) in (4.11), this yields
which for \(t=1\) and \(x=0\) is estimated by the right-hand side of (4.18).
We turn to the estimate of \({I} - \widetilde{I}\). Since we are in the smooth setting with \(\tau >0\) fixed, we can appeal to the hierarchy from (1.10) in the form of
where we note that since we are in the induction step (hence \(\beta \ne 0\)), the term \(\xi _\tau 1\) is irrelevant here. To estimate \(|\mathbb {E}({I'}-{{\widetilde{I'}}})_{\beta t}(x)|\) we appeal to telescoping (3.14) and Hölder’s inequality, to the definition of \(\Vert \Pi ^{(\tau )}\Vert _\beta \), (1.12), (4.7), (4.17)\(_{\prec \beta }\) and the moment bound (1.37) to obtain
which for \(t=1\) and \(x=0\) is again estimated by the right-hand side of (4.18). Similarly, we estimate \(|\mathbb {E}({I''}-{{\widetilde{I''}}})_{\beta t}(x)|\) by the right-hand side of (4.18): appealing to the definition of \(\Vert \Pi ^{(\tau )}\Vert _\beta \), (1.12), (4.6), (4.18)\(_{\prec \beta }\) and the moment bound (1.37) leads with the choice of \(t=1\) and \(x=0\) to the desired result. This concludes the argument for (4.18).
Having (4.17) and (4.18) we are ready to prove (S5). For this, we note that the quantity we have to estimate is the \(\beta \)-component of
We treat the terms of (4.21)–(4.26) separately. For (4.21) we appeal to Hölder’s inequality, (1.12) and the definition of \([\Pi ^{(\tau )}]^-_{\beta _{k+1}}\) to obtain
which due to \(|\beta _1|+\cdots +|\beta _{k+1}|=|\beta |\) is bounded by
We turn to (4.22), where we note that by telescoping
Hence we may appeal to Hölder’s inequality, (1.12), the definition of \(\Vert \Pi ^{(\tau )}\Vert _{\beta '}\) and (4.8) to obtain
which again by \(|\beta _1|+\cdots +|\beta _{k+1}|=|\beta |\) is bounded by
We turn to (4.23), which can be estimated as (4.22), by replacing (4.8) by (4.7) and using in addition
In turn, this last estimate follows from \(\Pi ^{(\tau )}_x(y)-\Pi ^{(\tau )}_x(z) = \Gamma ^{*(\tau )}_{xz}\Pi _z(y)\), see (1.17), together with (1.12), (1.24), (S3), and the definitions of \(\Vert \Gamma ^*\Vert _\beta \) and \(\Vert \Pi ^{(\tau )}\Vert _\beta \).
Also (4.24) can be estimated analogously, using in addition (4.17) and
which follows again from (1.17), (1.12) and (1.24).
We turn to (4.26), where we first note that by telescoping
Using this, together with Hölder’s inequality, (1.12) and (4.28), we obtain
On the one hand, the last factor is by (4.18) estimated by \( \Vert \Pi ^{(\tau )}-{\widetilde{\Pi }}^{(\tau )}\Vert _{\prec \beta } + [\Pi ^{(\tau )}-{\widetilde{\Pi }}^{(\tau )}]_{\prec \beta } + \Vert \Pi ^{(\tau )}-{\widetilde{\Pi }}^{(\tau )}\Vert ^-_{\prec \beta } \). On the other hand, the last factor is by the definition (1.6) of \(D^{(\textbf{0})}\) only non-vanishing, if \(|\beta _{k+1}|\ge \alpha (k+1)\); by the definition (1.8) of the homogeneity this implies \(|\beta _1|+\cdots +|\beta _k| = |\beta |+k\alpha -|\beta _{k+1}| \le |\beta |-\alpha \), thus \((|x-y|+|x-z|)^{|\beta _1|+\cdots +|\beta _k|-\alpha } \le (1+|x-y|+|x-z|)^{|\beta |-\alpha }\). In total we get
Finally, (4.25) can be estimated by the same arguments, using in addition (4.27).
Step 6 (Proof of (S6)). We distinguish the cases \(|\beta |>2\) and \(|\beta |<2\). For the former, we proceed identically to the proof of Lemma 3.4 and start with
Indeed, this estimate follows from (1.21), telescoping (3.14), and the already established (4.12) & (S3) and (S6)\(_{\prec \beta }\). By a general reconstruction argument, see [17, Lemma 4.8], this implies
which yields (S6) for \(|\beta |>2\).
We turn to the case \(|\beta |<2\), and start with
Indeed, this follows as the estimates of \({I'}-{\widetilde{I'}}\) and \({I''}-{{\widetilde{I''}}}\) from (4.20) in Step 5. The only modification is that here we include \(c^{(\tau )}\) into \({I''}\), hence the sum in \({I''}\) starts now with \(k=0\), and we therefore have to include in addition the already established (4.18)\(_\beta \) in our assumption.
We use (4.29) to make a step towards (S6) and prove
Indeed, (4.29) yields
which by the moment bound (1.37) and for \(t\le 1\) is bounded by
Then (4.30) follows from the fact that the map \(t\mapsto (\root 4 \of {t})^{\alpha -2-\varepsilon }(\root 4 \of {t}+|x-y|)^{|\beta |-\alpha }\) is monotone decreasing on (0, 1] and takes at 1 the value \((1+|x-y|)^{|\beta |-\alpha }\). The monotonicity is seen as follows: since \(\alpha -2-\varepsilon <0\), the map \(t\mapsto (\root 4 \of {t})^{(\alpha -2-\varepsilon )/(|\beta |-\alpha )}\) is monotone decreasing; since \(|\beta |<2\) implies \(1+\frac{\alpha -2-\varepsilon }{|\beta |-\alpha }<0\), also \(t\mapsto (\root 4 \of {t})^{1+(\alpha -2-\varepsilon )/(|\beta |-\alpha )}\) is monotone decreasing.
It remains to bound the opposite regime of (4.30), namely \(t\ge 1\). For this, we first note that by the triangle inequality
Since the estimate (1.45) implies \(\mathbb {E}^\frac{1}{p}|\Pi ^{-(\tau )}_{x\beta T}(y)|^p\rightarrow 0\) as \(T\rightarrow \infty \) for \(|\beta |<2\), we obtain
To estimate this right-hand side, we first note that the definition of \([\Pi ^{(\tau )}]^-_\beta \) together with the moment bound (1.37) yields
Interpolating with
which is a consequence of (1.45) by the triangle inequality, we obtain for every \({\bar{\varepsilon }}\in (0,1)\)
Equipped with this, we first use the defining property (1.34) of \(\psi _t\), the semigroup property (1.35), and that \((\partial _0^2-\Delta ^2)\psi _{s/2}\) integrates to 0, to obtain
which by (4.31) and the moment bound (1.37) yields
For s large, the effective exponent on \(\root 4 \of {s}\) equals \(|\beta |-6+{\bar{\varepsilon }}(2+\varepsilon )\). Choosing \({\bar{\varepsilon }}=\varepsilon /(2+\varepsilon )\), this exponent equals \(|\beta |-6+\varepsilon \). The choice of \(\varepsilon \) in (4.11) implies that \(|\beta |<2\) strengthens to \(|\beta |+\varepsilon <2\), and thus the exponent is \(<-4\), which makes \((\root 4 \of {s})^{|\beta |-6+\varepsilon }\) integrable at \(s=\infty \). We therefore get for this choice of \({\bar{\varepsilon }}\)
This is further bounded by
which yields by \({\bar{\varepsilon }}=\varepsilon /(2+\varepsilon )\)
Together with the already established (S5)\(_\beta \), this finishes the argument for (S6). \(\square \)
Notes
That there is no choice left is specific to the assumption of a spectral gap inequality with \(\alpha \not \in {{\mathbb {Q}}}\) in combination with estimating small and large scales.
e.g. \(\textsf{z}_{\textbf{n}_1},\textsf{z}_{\textbf{n}_2}\in \textsf{T}^*\) but \(\textsf{z}_1\textsf{z}_{\textbf{n}_1}\textsf{z}_{\textbf{n}_2}\not \in \textsf{T}^*\).
that is, for \(\pi ,\pi '\in \mathbb {R}[[\textsf{z}_k,\textsf{z}_\textbf{n}]]\) it holds \(\Gamma ^*_{xy}\pi \pi '=(\Gamma ^*_{xy}\pi )(\Gamma ^*_{xy}\pi ')\).
for a detailed argument see [19, (2.60) and (2.63)].
up to the restriction \(d=1\) and \(\alpha <1/2\) as mentioned at the beginning of this section.
w.l.o.g. choose \(\beta \) such that \(|\beta |\ge 2\alpha \), so that there exist \(\beta ',\beta ''\) with \(|\beta ''|<|\beta '|\le |\beta |\).
it actually implies the stronger estimate where \(\kappa \) is replaced by \(\kappa +\alpha \) and \(|\beta |-\alpha \) is replaced by \(|\beta |-2\alpha \), which however we won’t make use of.
While this could be avoided, it has the advantage that the proofs of the three cases convergences in law resp. in probability resp. in \(L^p\) are very similar.
References
Banakh, T.O., Bogachev, V.I., Kolesnikov, A.V.: Topological spaces with skorokhod representation property. Ukr. Math. J. 57, 1371–1386 (2005). https://doi.org/10.1007/s11253-006-0002-z
Bruned, Y., Hairer, M., Zambotti, L.: Algebraic renormalisation of regularity structures. Invent. Math. 215(3), 1039–1156 (2019). https://doi.org/10.1007/s00222-018-0841-x
Bruned, Y., Nadeem, U.: Diagram-free approach for convergence of tree-based models in regularity structures. Preprint arXiv:2211.11428 (2022)
Broux, L., Otto, F., Tempelmayr, M.: Lecture notes on Malliavin calculus in regularity structures. Preprint arXiv:2401.05935v1 (2024)
Chandra, A., Hairer, M.: An analytic BPHZ theorem for regularity structures. Preprint arXiv:1612.08138 (2016)
Duch, P.: Flow equation approach to singular stochastic PDEs. Preprint arXiv:2109.11380 (2021)
Erhard, D., Hairer, M.: A scaling limit of the parabolic Anderson model with exclusion interaction. Comm. Pure Appl. Math. 77(2), 1065–1125 (2024)
Furlan, M., Gubinelli, M.: Weak universality for a class of 3d stochastic reaction-diffusion models. Probab. Theory Related Fields 173(3–4), 1099–1164 (2019). https://doi.org/10.1007/s00440-018-0849-6
Gubinelli, M., Imkeller, P., Perkowski, N.: Paracontrolled distributions and singular PDEs. Forum Math. Pi 3(e6), 75 (2015). https://doi.org/10.1017/fmp.2015.2
Grazieschi, P., Matetski, K., Weber, H.: The dynamical ising-kac model in 3d converges to \(\Phi ^4_3\). Preprint arXiv:2303.10242 (2023)
Gvalani, R.S., Tempelmayr, M.: Stochastic estimates for the thin-film equation with thermal noise. Preprint arXiv:2309.15829 (2023)
Hairer, M.: A theory of regularity structures. Invent. Math. 198(2), 269–504 (2014). https://doi.org/10.1007/s00222-014-0505-4
Hairer, M., Steele, R.: The BPHZ theorem for regularity structures via the spectral gap inequality. Arch. Ration. Mech. Anal. 248(1), 81 (2024). https://doi.org/10.1007/s00205-023-01946-w
Ignat, R., Otto, F.: The magnetization ripple: a nonlocal stochastic PDE perspective. J. Math. Pures Appl. 9(130), 157–199 (2019). https://doi.org/10.1016/j.matpur.2019.01.010
Ignat, R., Otto, F., Ried, T., Tsatsoulis, P.: Variational methods for a singular SPDE yielding the universality of the magnetization ripple. Comm. Pure Appl. Math. 76(11), 2959–3043 (2023). https://doi.org/10.1002/cpa.22093
Kallenberg, O.: Foundations of modern probability. Probability and its applications (New York), 2nd edn. Springer, New York (2002). https://doi.org/10.1007/978-1-4757-4015-8
Linares, P., Otto, F.: A tree-free approach to regularity structures: The regular case for quasi-linear equations. Preprint arXiv:2207.10627 (2022)
Linares, P., Otto, F., Tempelmayr, M.: The structure group for quasi-linear equations via universal enveloping algebras. Comm. Amer. Math. Soc. 3, 1–64 (2023). https://doi.org/10.1090/cams/16
Linares, P., Otto, F., Tempelmayr, M., Tsatsoulis, P.: A diagram-free approach to the stochastic estimates in regularity structures. Preprint arXiv:2112.10739v2 (2021)
Mourrat, J.-C., Weber, H.: Convergence of the two-dimensional dynamic ising-kac model to \(\Phi ^4_2\). Commun. Pure Appl. Math. 70(4), 717–812 (2017). https://doi.org/10.1002/cpa.21655
Otto, F., Sauer, J., Smith, S., Weber, H.: A priori bounds for quasi-linear SPDEs in the full sub-critical regime. Preprint arXiv:2103.11039 (2021)
Otto, F., Seong, K., Tempelmayr, M.: Lecture notes on tree-free regularity structures. Mat. Contemp. 58, 150–196 (2023). https://doi.org/10.21711/231766362023/rmc584
Otto, F., Weber, H.: Quasilinear SPDEs via rough paths. Arch. Ration. Mech. Anal. 232(2), 873–950 (2019). https://doi.org/10.1007/s00205-018-01335-8
Shen, H., Weber, H.: Glauber dynamics of 2D Kac-Blume-Capel model and their stochastic PDE limits. J. Funct. Anal. 275(6), 1321–1367 (2018). https://doi.org/10.1016/j.jfa.2017.12.014
Acknowledgements
The author would like to thank F. Otto and R. Gvalani for many useful discussions during the course of this work, and L. Broux, A. Chandra, R. Steele, P. Tsatsoulis and C. Wagner for helpful comments. This work was completed while the author held a position at the Max–Planck Institute for Mathematics in the Sciences; funding and working conditions are gratefully acknowledged.
Funding
Open Access funding enabled and organized by Projekt DEAL.
Author information
Authors and Affiliations
Contributions
M.T. wrote and reviewed the manuscript.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendix A: Weights
Appendix A: Weights
In this section we adapt the weights \(\bar{w}, w(y), w_x(y)\) from [19, (4.23), (4.37), (4.46)] appropriately, so that we can move beyond the limitation of the regularity index \(2-\alpha -D/2\) of the norm \(\Vert \cdot \Vert _*\) from the spectral gap assumption (1.42) being positive. We provide all properties of these weights that we used along the way and that are used in the proof of the stochastic estimates in the aforementioned reference.
We start with the weight \(\bar{w}\), which we define by
Note that in view of \(q\le 2\) by Minkowski’s inequality
which is necessary to obtain from an estimate like (3.18) by an application of the spectral gap inequality (1.42) and dualization an estimate like (3.16).
We turn to w(y), which we define by
In reconstruction Lemma 3.10 we used the estimate
which by
follows from Cauchy-Schwarz and
which is a consequence of the scaling identity (1.36). Furthermore, w(y) behaves well under (square) averaging,
![](http://media.springernature.com/lw200/springer-static/image/art%3A10.1007%2Fs00440-024-01292-2/MediaObjects/440_2024_1292_Equ120_HTML.png)
The former is a consequence of
which relies on \(\kappa <D/2\), see (3.4a), and a similar bound for averages over balls establishes the latter.
Finally, we define \(w_x(y)\) by
which by (A.3) and (A.5) satisfies
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Tempelmayr, M. Characterizing models in regularity structures: a quasilinear case. Probab. Theory Relat. Fields (2024). https://doi.org/10.1007/s00440-024-01292-2
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1007/s00440-024-01292-2