Abstract
We prove a strong localized gluing result for the general relativistic constraint equations (with or without cosmological constant) in \(n\ge 3\) spatial dimensions. We glue an \(\epsilon \)-rescaling of an asymptotically flat data set \(({\hat{\gamma }},{\hat{k}})\) into the neighborhood of a point \(\mathfrak {p}\in X\) inside of another initial data set \((X,\gamma ,k)\), under a local genericity condition (non-existence of KIDs) near \(\mathfrak {p}\). As the scaling parameter \(\epsilon \) tends to 0, the rescalings \(\frac{x}{\epsilon }\) of normal coordinates x on X around \(\mathfrak {p}\) become asymptotically flat coordinates on the asymptotically flat data set; outside of any neighborhood of \(\mathfrak {p}\) on the other hand, the glued initial data converge back to \((\gamma ,k)\). The initial data we construct enjoy polyhomogeneous regularity jointly in \(\epsilon \) and the (rescaled) spatial coordinates. Applying our construction to unit mass black hole data sets \((X,\gamma ,k)\) and appropriate boosted Kerr initial data sets \(({\hat{\gamma }},{\hat{k}})\) produces initial data which conjecturally evolve into the extreme mass ratio inspiral of a unit mass and a mass \(\epsilon \) black hole. The proof combines a variant of the gluing method introduced by Corvino and Schoen with geometric singular analysis techniques originating in Melrose’s work. On a technical level, we present a fully geometric microlocal treatment of the solvability theory for the linearized constraints map.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
Let \(n\ge 3\). For a smooth Riemannian n-manifold \((X,\gamma )\) and a symmetric 2-tensor k on X, the constraint equations for \(\gamma ,k\) are
Here, \(R_\gamma \) is the scalar curvature of \(\gamma \), \(\delta _\gamma \) is the negative divergence, and \(\Lambda \in \mathbb {R}\) is the cosmological constant. We say that \((X,\gamma ,k)\) is an initial data set if (1.1) holds. Given a Lorentzian manifold (M, g) of dimension \((n+1)\) and signature \((-,+,\ldots ,+)\), and given an embedded spacelike hypersurface \(X\subset M\), the first and second fundamental form \(\gamma \) and k of X satisfy (1.1) provided g satisfies the Einstein vacuum equations
The fundamental theorems of Choquet-Bruhat and Geroch [CB52, CBG69] show, conversely, that given \((X,\gamma ,k)\) satisfying (1.1), there exists a maximal globally hyperbolic spacetime (M, g), unique up to isometries, which solves the Einstein vacuum equations and into which X embeds with first and second fundamental form given by \(\gamma \) and k, respectively.
We consider the problem of gluing the initial data of a small asymptotically flat black hole (such as a Schwarzschild or Kerr black hole with small mass), or more generally of a rescaled asymptotically flat initial data set \((\mathbb {R}^n\setminus {\hat{K}}^\circ ,{\hat{\gamma }},{\hat{k}})\) with cosmological constant 0, into a neighborhood of a point \(\mathfrak {p}\in X\) in a given generic (near \(\mathfrak {p}\)) smooth initial data set \((X,\gamma ,k)\) with arbitrary cosmological constant \(\Lambda \in \mathbb {R}\). Here \({\hat{K}}\subset \mathbb {R}^n\) is compact (and possibly empty); and the asymptotic flatness condition, in standard coordinates \({\hat{x}}\in \mathbb {R}^n\), means that \({\hat{\gamma }}(\partial _{{\hat{x}}^i},\partial _{{\hat{x}}^j})\rightarrow \delta _{i j}\) and \({\hat{k}}(\partial _{{\hat{x}}^i},\partial _{{\hat{x}}^j})\rightarrow 0\) as \(|{\hat{x}}|\rightarrow \infty \), with appropriate rates of convergence. (See Definition 4.4 for the precise definition used in this paper.)
Theorem 1.1
(Main result, rough version). Assume that X is generic in a connected neighborhood \(\mathcal {U}^\circ \) of \(\mathfrak {p}\) (in the sense that it does not admit any KIDs in \(\mathcal {U}^\circ \), see Definition 4.13). Then there exist \(\epsilon _\sharp >0\) and a family \((X_\epsilon ,\gamma _\epsilon ,k_\epsilon )\), \(\epsilon \in (0,\epsilon _\sharp )\), of initial data sets with cosmological constant \(\Lambda \) with the following properties.
-
(1)
On \(X\setminus \mathcal {U}^\circ \subset X_\epsilon \), we have \((\gamma _\epsilon ,k_\epsilon )=(\gamma ,k)\).
-
(2)
In geodesic normal coordinates \(x=(x^1,\ldots ,x^n)\in \mathbb {R}^n\) around \(\mathfrak {p}\in X\), the manifold \(X_\epsilon \) is equal to \(B(0,1)\setminus \epsilon {\hat{K}}^\circ \), and we have smooth convergence \((\gamma _\epsilon ,k_\epsilon )\rightarrow (\gamma ,k)\) as \(\epsilon \searrow 0\) in \(|x|>\delta \) for any \(\delta >0\). (That is, the matrix coefficients \((\gamma _\epsilon )_{i j}=\gamma _\epsilon (\partial _{x^i},\partial _{x^j})\) and \(k_\epsilon (\partial _{x^i},\partial _{x^j})\) converge to those of \(\gamma \) and k.)
-
(3)
The tensors \(\gamma _\epsilon |_{\epsilon {\hat{x}}}=(\gamma _\epsilon |_{\epsilon {\hat{x}}}(\partial _{x^i},\partial _{x^j}))_{i,j=1,\ldots ,n}\) and \(\epsilon k_\epsilon |_{\epsilon {\hat{x}}}\) converge, smoothly and locally uniformly in \({\hat{x}}\in \mathbb {R}^n\setminus {\hat{K}}^\circ \), to \({\hat{\gamma }}|_{{\hat{x}}}=({\hat{\gamma }}|_{{\hat{x}}}(\partial _{{\hat{x}}^i},\partial _{{\hat{x}}^j}))\) and \({\hat{k}}|_{{\hat{x}}}\), respectively, as \(\epsilon \searrow 0\).
To explain part (3), we first note that \(\epsilon ^2({\hat{\gamma }},{\hat{k}})\) is a rescaled asymptotically flat data set: asymptotically flat coordinates for it are \(\epsilon {\hat{x}}\), which as \(\epsilon \searrow 0\) become local coordinates x on X near \(\mathfrak {p}\). Note that the ADM mass of \(\epsilon ^2({\hat{\gamma }},{\hat{k}})\) is \(\epsilon \) times that of \(({\hat{\gamma }},{\hat{k}})\). Since in the coordinates \({\hat{x}}=\frac{x}{\epsilon }\) we have \(\partial _{{\hat{x}}^i}=\epsilon \partial _{x^i}\), part (3) states that \((\gamma _\epsilon ,\epsilon k_\epsilon )\approx \epsilon ^2({\hat{\gamma }},{\hat{k}})\) in x-coordinates when \(|x|\lesssim \epsilon \).Footnote 1 See Fig. 1. In the region \(\epsilon \lesssim |x|\lesssim 1\), the family \((\gamma _\epsilon ,k_\epsilon )\) transitions from the \(\epsilon \)-rescaling \(\epsilon ^2({\hat{\gamma }},{\hat{k}})\) to the original data set \((\gamma ,k)\) in an appropriate manner; we describe this more precisely in Sect. 1.2 below.
Illustration of Theorem 1.1. The data set \((\gamma ,k)\) is unchanged outside of \(\mathcal {U}^\circ \), and in \(\overline{\mathcal {U}^\circ }\setminus \{\mathfrak {p}\}\) it converges smoothly and locally uniformly to \((\gamma ,k)\) as \(\epsilon \searrow 0\). Near \(\mathfrak {p}\) on the other hand, the glued initial data are close to the \(\epsilon \)-rescaling \(\epsilon ^2({\hat{\gamma }},{\hat{k}})\) of the asymptotically flat data set \(({\hat{\gamma }},{\hat{k}})\)
A natural choice for \(({\hat{\gamma }},{\hat{k}})\) for \(n=3\) is the initial data set of a (boosted) Schwarzschild or Kerr black hole [Sch16, Ker63]; see Sect. 6.2. To illustrate Theorem 1.1 in this case, consider the initial data \(({\hat{\gamma }},{\hat{k}})\) of an unboosted mass \({\hat{\mathfrak {m}}}>0\) Schwarzschild black hole in polar coordinates \({\hat{x}}={\hat{r}}\omega \), \(\omega \in \mathbb {S}^2\),
on \(\mathbb {R}^3\setminus {\hat{K}}^\circ \) where \({\hat{K}}\) is a bounded closed ball of radius \(>\max (0,2{\hat{\mathfrak {m}}})\). Then
is the metric of a mass \(\epsilon {\hat{\mathfrak {m}}}\) Schwarzschild black hole. The initial data set \((\gamma _\epsilon ,k_\epsilon )\) thus describes a mass \(\epsilon {\hat{\mathfrak {m}}}\) black hole glued into the given data set \((X,\gamma ,k)\).
We do not concern ourselves here with the construction of appropriate initial data sets \((X,\gamma ,k)\). We recall that Beig–Chruściel–Schoen [BCS05] demonstrated the genericity of the assumption on the absence of KIDs in a large number of settings; moreover, Moncrief [Mon75] showed that the absence of KIDs is equivalent to the absence of Killing vector fields in the evolving spacetime. KIDs on \((X,\gamma ,k)\) near \(\mathfrak {p}\) are nonzero elements, defined in a neighborhood of \(\mathfrak {p}\), of the cokernel of the linearization of the constraint equations around \((\gamma ,k)\). While this cokernel is necessarily finite-dimensional, it may be non-trivial. However, a non-trivial cokernel is typically an obstruction for localized gluing constructions or deformations of initial data sets; for instance, the rigidity part of the Positive Mass Theorem [SY79] in the time-symmetric setting (\(k=0\), and \(\gamma \) is scalar-flat) implies that one cannot compactly perturb the Euclidean metric to a non-isometric scalar-flat metric. (See however the work by Czimek–Rodnianski [CR22] on how to overcome the presence of a cokernel in characteristic gluing problems by taking advantage of the nonlinear nature of the constraint equations.) In this paper, we impose the local genericity condition on \((X,\gamma ,k)\) in order to obtain a local gluing result in Theorem 1.1 via an appropriate solvability theory for the linearized constraints map.
The initial data of a subextremal Kerr (or Kerr–de Sitter or Kerr–anti de Sitter) black hole do not satisfy the genericity condition required in Theorem 1.1. Nonetheless, we show in Sect. 6.2 how to prove Theorem 1.1 also in this case, provided \(\mathcal {U}^\circ \) intersects the black hole interior; see Theorem 6.2. (The main idea in the proof is to eliminate the cokernel by allowing for a violation of the constraint equations deep inside the black hole.) In particular, we are thus able to glue a small black hole into a unit mass black hole initial data set.
1.1 Context
Starting with the construction by Majumdar–Papapetrou [Maj47, Pap45] of electrovacuum spacetimes via the superposition of extremally charged black holes (with Kastor–Traschen [KT93] performing a similar construction in \(\Lambda >0\)), several constructions of initial data sets containing several black hole regions have been proposed. By solving the constraint equations using explicit ansatzes mainly involving superpositions of scalings and translations of the harmonic function \(|x|^{-1}\), Brill–Lindquist [BL63] constructed initial data for the Einstein–Maxwell equations with any finite number N of charged wormholes (Einstein–Rosen bridges) at arbitrary points in \(\mathbb {R}^3\) with arbitrary masses; the resulting data have \(N+1\) asymptotically flat regions. Misner’s time-symmetric ‘matched throat’ vacuum initial data [Mis63] identify all but 2 asymptotically flat ends; see Lindquist [Lin63] for the Einstein–Maxwell case. There also exist hybrid approaches, such as the one developed by Brandt–Bruegmann [BB97] which involves an explicit prescription for the conformal class of k (with \(\gamma \) conformally Euclidean) together with a numerical scheme for finding the conformal factor.
The key mathematical technique allowing for flexible and localized gluing constructions for solutions of the constraint equations was introduced by Corvino [Cor00] with Schoen [CS06]; see also [CD03] and Sect. 1.2 below. It is based on the observation that the adjoint of the linearized constraint equations is overdetermined and permits coercive estimates on function spaces with very strong weights at the boundary of the gluing region. Concretely, Corvino–Schoen prove that an asymptotically flat data set can be perturbed near infinity to an exact Kerr data set for a suitable choice of Kerr parameters (mass and angular momentum). The relevant linear operator in this setting (namely, the linearization of the constraints map around the trivial Minkowski data) has a nontrivial cokernel, which is accounted for by appropriately choosing the parameters of the Kerr data set.
This technique was generalized and used by Chruściel–Delay [CD03, §8.9] to construct initial data containing many Kerr black holes. (See [CD02] for the time-symmetric case of data containing several Schwarzschild black holes.) The initial data of [CD03] are symmetric under the parity map \(x\mapsto -x\); given pairwise disjoint balls \(B(x_i,4 r_i)\), the data are equal to Kerr data (with arbitrary parameters, subject to the parity condition) in each of the \(B(x_i,2 r_i)\), and also in \(\mathbb {R}^3\setminus \bigcup B(x_i,4 r_i)\). The gluing procedure succeeds when all black hole masses are sufficiently small relative to the radii \(r_i\) and the pairwise distances of the \(x_i\). We also recall that Isenberg–Mazzeo–Pollack [IMP02, §9] constructed many-black-hole initial data by connecting Euclidean spaces to the neighborhood of any finite number of points on a given asymptotically flat maximal (i.e. \({\text {tr}}_\gamma k=0\)) data set via wormholes. Chruściel–Mazzeo [CM03] proved the presence of multiple black holes in the spacetime development of the data produced in [CD02, CD03, IMP02] under suitable smallness conditions. See the hypotheses in [CM03, §3] for details; in particular, the first fundamental form must globally be sufficiently close to the Euclidean metric.
Another construction of many-black-hole initial data was given by Chruściel–Corvino–Isenberg [CCI11]. Phrasing their main result in a manner which relates more directly to the present work, [CCI11] glues sufficiently small rescalings of large compact subsets of N given asymptotically flat initial data sets (modified to be exact Kerr data near their respective asymptotically flat ends) into N disjoint balls in \(\mathbb {R}^3\); the glued data are exact Kerr data, with carefully chosen small parameters, also near infinity in \(\mathbb {R}^3\).
The many-black-hole initial data sets obtained by (repeatedFootnote 2) application of Theorem 1.1 combine features of those of [IMP02] (in that the background geometry is arbitrary) with those of [CD03, CCI11] (in that one glues in rescaled asymptotically flat data sets into neighborhoods of points in a background data set). We stress, however, that the background data \((X,\gamma ,k)\) and the asymptotically flat data \(({\hat{\gamma }},{\hat{k}})\) in Theorem 1.1 are unrestricted except for the genericity assumption on X (which can be relaxed in some circumstances, cf. Sect. 6.2). Moreover, and importantly for intended future applications (see Sect. 1.4), our construction gives detailed control on the \(\epsilon \)-dependence of the glued initial data; see Sect. 1.2. (Moreover, this control cannot be obtained by a naive application of Corvino-type gluing methods; see Remark 1.4.)
In the presence of a positive cosmological constant \(\Lambda >0\), the author showed how to glue Schwarzschild—or Kerr–de Sitter black holes into neighborhoods of points on the future conformal boundary of de Sitter space [Hin21a], thus producing examples of spacetimes solving (1.2) with precisely controlled structure at future timelike infinity and the future conformal boundary. A fortiori, this gives initial data sets on \(\mathbb {S}^3\) (or its punctured version \(\mathbb {R}^3\)) containing any finite number of (large) regions with exact Kerr–de Sitter data, while away from these regions the data are close to de Sitter data. Apart from [Hin21a, CM03], there do not appear to be any results (beyond simple domain-of-dependence type considerations) on the structure of the future evolution of the many-black-hole initial data sets obtained by any of the aforementioned methods. The author conjectures that a subclass of initial data of the type constructed by Theorem 1.1 allows for such results; see Sect. 1.4.
Further results on initial data gluing which are not directly tied to many-black-hole settings include Cortier’s gluing into Kerr–de Sitter data sets [Cor13], and the Carlotto–Schoen gluing in asymptotic cones [CS16] which produces asymptotically flat initial data containing any finite number of non-trivial conic regions which do not interact for any desired amount of time. We also mention the problem of filling in a given asymptotically flat data set in a controlled singularity-free manner, as studied in [BC17, §5]. In more recent work, Aretakis–Czimek–Rodnianski [ACR21a, ACR21b, ACR21c] developed a gluing procedure for the characteristic initial value problem, in which case the constraint equations become a coupled system of transport equations along the null generators of the incoming and outgoing null cones emanating from a spacelike 2-sphere, see e.g. [Luk11, §2.3]. Applications include a sharp (as far as decay is concerned) improvement of the Carlotto–Schoen result as well as an alternative proof of the Corvino–Schoen gluing results. For non-vacuum initial data sets, there are further types of gluing problems one may consider; we mention in particular the results by Corvino–Huang [CH20] on promoting the dominant energy condition to a strict inequality (again under suitable genericity conditions).
Besides the problem of gluing given or known initial data sets, one may wish to construct initial data sets ab initio. This is typically done via variants of the conformal method [Lic44, YJ73] in which \(\gamma ,k\) are expressed in terms of suitable seed data and a conformal factor which satisfies a semilinear elliptic PDE. We refer the reader to Carlotto’s comprehensive review article [Car21] for a detailed discussion and further references.
1.2 Gluing and asymptotic expansions
We proceed to set up a more precise version of Theorem 1.1. To begin, we say that a function u on \(\mathbb {R}^n\) is conormal with weight \(\alpha \in \mathbb {R}\) if
for all \(j\in \mathbb {N}_0\), where \(r=|x|\).Footnote 3 (In other words, every derivative in x gains a power of \(\langle r\rangle ^{-1}\).) Polyhomogeneity of u is the stronger requirement that in \(|x|>1\), we can write
where \(z_\ell \in \mathbb {C}\), \(k_\ell \in \mathbb {N}_0\), \(u_\ell \in {\mathcal {C}}^\infty (\mathbb {S}^{n-1})\), with \({\text {Re}}z_\ell \rightarrow \infty \); the symbol ‘\(\sim \)’ means that for any N, the difference of u and the truncation of the sum to all \(\ell \) with \({\text {Re}}z_\ell \le N\) is conormal with weight N. Thus, polyhomogeneous functions have generalized Taylor expansions at the boundary at infinityFootnote 4\(r^{-1}=0\) of \(\mathbb {R}^n\).
We then call a pair \(({\hat{\gamma }},{\hat{k}})\) of smooth symmetric 2-tensors on \(\mathbb {R}^n\setminus {\hat{K}}^\circ \) (with \({\hat{K}}\Subset \mathbb {R}^n\)) \(\delta \)-asymptotically flat if the following holds: writing coordinates on \(\mathbb {R}^n\) as \({\hat{x}}=({\hat{x}}^1,\ldots ,{\hat{x}}^n)\), the coefficients \({\hat{\gamma }}_{{\hat{i}}{\hat{j}}}={\hat{\gamma }}(\partial _{{\hat{x}}^i},\partial _{{\hat{x}}^j})\) and \({\hat{k}}_{{\hat{i}}{\hat{j}}}\) are polyhomogeneous conormal at infinity, with \({\hat{\gamma }}_{{\hat{i}}{\hat{j}}}-\delta _{i j}\) conormal with weightFootnote 5\(\delta >0\) and \({\hat{k}}\) conormal with weight \(1+\delta \).
Following a standard procedure in geometric singular analysis (see e.g. [MM90]), we construct the glued metrics \((\gamma _\epsilon ,k_\epsilon )\) for all sufficiently small \(\epsilon >0\) in one fell swoop by working on an appropriate total space \(\widetilde{X}\setminus \widetilde{K}^\circ \), of dimension \(n+1\), which is a manifold with corners; here \(\widetilde{X}\) is a resolution of \([0,\epsilon _\sharp )\times X\) at \(\{0\}\times \{\mathfrak {p}\}\), and \(\widetilde{K}=\bigsqcup \{\epsilon \}\times \epsilon {\hat{K}}\) in geodesic normal coordinates on X around \(\mathfrak {p}\). See Sect. 3 for details. We only note here that the space \(\widetilde{X}\) is equipped with a map down to \([0,\epsilon _\sharp )\) which is a fibration over \((0,\epsilon _\sharp )\), and the fibers of \(\widetilde{X}\setminus \widetilde{K}^\circ \rightarrow (0,\epsilon _\sharp )\) are the manifolds \(X_\epsilon \) in Theorem 1.1. The fibers become singular as \(\epsilon \searrow 0\) however, and the fiber over \(\epsilon =0\) is the disjoint union of two manifolds: one is the compactification at infinity of \(\mathbb {R}^n\setminus {\hat{K}}^\circ \) which carries the asymptotically flat data set \(({\hat{\gamma }},{\hat{k}})\), and the other is a compactification of \(X\setminus \{\mathfrak {p}\}\) (namely, the real blow-up of X at \(\{\mathfrak {p}\}\)) which carries the original data set \((\gamma ,k)\). Local coordinates on this space are illustrated in Fig. 2. In particular, the transitional region \(\epsilon \lesssim |x|\lesssim 1\) omitted in the statement of Theorem 1.1 is simply a neighborhood of the codimension 2 corner of \(\widetilde{X}\).
We may now state a more precise version of Theorem 1.1, in which we construct a single pair \((\widetilde{\gamma },\widetilde{k})\) of sections of the bundle \(S^2\widetilde{T}^*\widetilde{X}\) which is defined as the pullback of the second symmetric power of the cotangent bundle of X to \(\widetilde{X}\):
Theorem 1.2
(Main result, more precise version) Let \({\hat{\gamma }}\) and \({\hat{k}}\) be \(\delta \)-asymptotically flat on \(\mathbb {R}^n\setminus {\hat{K}}^\circ \) for some \(\delta >0\). Suppose that the initial data set \((X,\gamma ,k)\) does not admit any KIDs in a smoothly bounded connected open neighborhood \(\mathcal {U}^\circ \subset X\) of \(\mathfrak {p}\in X\). Denote by \(x\in \mathbb {R}^n\) geodesic normal coordinates around \(\mathfrak {p}\in X\), and write \({\hat{x}}=\frac{x}{\epsilon }\). Then there exist polyhomogeneous sections \(\widetilde{\gamma },\widetilde{k}\) of \(S^2\widetilde{T}^*\widetilde{X}\) with the following properties.
-
(1)
the restriction \((\gamma _\epsilon ,k_\epsilon )\) of \((\widetilde{\gamma },\widetilde{k})\) to each fiber \(X_\epsilon \) of \(\widetilde{X}\setminus \widetilde{K}^\circ \rightarrow (0,\epsilon _\sharp )\) satisfies the constraint equations (1.1) with the same cosmological constant as \((\gamma ,k)\).
-
(2)
On \([0,\epsilon _\sharp )\times (X\setminus \mathcal {U}^\circ )\), we have \((\widetilde{\gamma },\widetilde{k})=(\gamma ,k)\).
-
(3)
The leading order terms of \((\widetilde{\gamma },\widetilde{k})\) at the \(X\setminus \{\mathfrak {p}\}\) component of the singular fiber over \(\epsilon =0\) are given by \((\gamma ,k)\); the leading order terms of \((\widetilde{\gamma }_{i j},\epsilon \widetilde{k}_{i j})\) at the \(\mathbb {R}^n\setminus {\hat{K}}^\circ \) component are \(({\hat{\gamma }}_{{\hat{i}}{\hat{j}}},{\hat{k}}_{{\hat{i}}{\hat{j}}})\). Near the codimension 2 corner of \(\widetilde{X}\), and writing \(\omega =\frac{x}{|x|}\), this means the following: the tensors
$$\begin{aligned} \widetilde{\gamma } = \widetilde{\gamma }_{i j}\Bigl (|x|,\frac{\epsilon }{|x|},\omega \Bigr )\textrm{d}x^i\,\textrm{d}x^j,\quad \widetilde{k} = \widetilde{k}_{i j}\Bigl (|x|,\frac{\epsilon }{|x|},\omega \Bigr )\textrm{d}x^i\,\textrm{d}x^j, \end{aligned}$$which are jointly polyhomogeneous in \(|x|\in [0,1)\) and \(\frac{\epsilon }{|x|}\in [0,1)\), have the following leading order behavior at \(\frac{\epsilon }{|x|}=0\), resp. \(|x|=0\):
$$\begin{aligned} \begin{aligned} \widetilde{\gamma }_{i j}(|x|,0,\omega )&=\gamma _{i j}(|x|\omega ),&\qquad \widetilde{\gamma }_{i j}\Bigl (0,\frac{1}{|{\hat{x}}|},\omega \Bigr )&={\hat{\gamma }}_{{\hat{i}}{\hat{j}}}(|{\hat{x}}|\omega ), \\ \widetilde{k}_{i j}(|x|,0,\omega )&=k_{i j}(|x|\omega ),&\qquad \epsilon \widetilde{k}_{i j}\Bigl (0,\frac{1}{|{\hat{x}}|},\omega \Bigr )&={\hat{k}}_{{\hat{i}}{\hat{j}}}(|{\hat{x}}|\omega ). \end{aligned} \end{aligned}$$(1.4)
See Theorem 5.2 for the full statement. We stress that the glued initial data \((\widetilde{\gamma },\widetilde{k})\) have exactly two singular limits, namely the two given data sets \(({\hat{\gamma }},{\hat{k}})\) and \((\gamma ,k)\); we do not need to introduce any sort of intermediate gluing region.
Remark 1.3
(Limited regularity) One can construct the families \((\widetilde{\gamma },\widetilde{k})\), and thus the glued data \((\gamma _\epsilon ,k_\epsilon )\), under weaker assumptions on regularity and asymptotics. Note first that for \({\hat{\rho }}=(\epsilon ^2+{|}x{|}^2)^{1/2}\), the rescaling \({\hat{\rho }} \widetilde{k}_{i j}\) can be restricted to \(\frac{\epsilon }{|x|}=0\) and \(|x|=0\), with leading order terms \(|x|k_{i j}\) and \(\langle {\hat{x}}\rangle {\hat{k}}_{{\hat{i}}{\hat{j}}}\), respectively. When \(({\hat{\gamma }},{\hat{k}})\) is merely conormal, then one can construct a conormal solution \((\widetilde{\gamma },\widetilde{k})\) of the gluing problem so that \((\widetilde{\gamma },\hat{\rho }\widetilde{k})\) has boundary data as in (1.4). Less restrictively still, it suffices to assume that \(\hat{\gamma }\) and \({\hat{k}}\) have weighted Hölder regularity of degree \((s,\alpha )\) with \(\mathbb {N}_0\ni s\ge 2\), \(\alpha \in (0,1)\), with weights \(\delta \) and \(1+\delta \) as above (see also [CD03, Appendix A]); the solution \((\widetilde{\gamma },\hat{\rho }\widetilde{k})\), with boundary data (1.4), given by our methods then has matching regularity on an appropriate scale of weighted Hölder spaces. One can similarly weaken the regularity requirements on \(\gamma ,k\). We shall omit a detailed treatment of such finite regularity versions of our main results in this paper.
The construction of \((\widetilde{\gamma },\widetilde{k})\) proceeds in three steps.
-
(1)
The first step is to define sections \((\widetilde{\gamma }_0,\widetilde{k}_0)\) on the total space which have the desired boundary data as in (1.4); such sections already solve the constraint equations (with cosmological constant \(\Lambda \)) to leading order at both components of the singular fiber at \(\epsilon =0\). We note here that at the \(\mathbb {R}^n\setminus {\hat{K}}^\circ \) component, the appropriately rescaled cosmological constant vanishes to leading order (see Lemma 4.19 and Proposition 4.22(1).
-
(2)
In a second step,Footnote 6 one corrects the lower order terms of \((\widetilde{\gamma }_0,\widetilde{k}_0)\) in turns at the \(\mathbb {R}^n\setminus {\hat{K}}^\circ \), resp. the \(X\setminus \{\mathfrak {p}\}\) components of the singular fiber (denoted \({\hat{X}}\setminus {\hat{K}}^\circ \), resp. \(X_\circ \) in the bulk of the paper); this requires right-inverting the linearized constraints maps on the asymptotically flat space \((\mathbb {R}^n\setminus {\hat{K}}^\circ ,\hat{\gamma },{\hat{k}})\), resp. on the punctured space \((X\setminus \{\mathfrak {p}\},\gamma ,k)\) with suitable control of the solution at infinity, resp. at \(\mathfrak {p}\). This is accomplished in Sects. 4.1 and 4.2. The genericity condition on \((X,\gamma ,k)\) is used to effect the right inversion on the punctured space with control on supports; this uses that on appropriate function spaces on the punctured space the triviality of the cokernel of the linearized constraints map can be inferred from the triviality on the original space X. We do not require any genericity for \((\mathbb {R}^n\setminus {\hat{K}}^\circ ,\hat{\gamma },{\hat{k}})\), as the relevant cokernel is trivial if we permit the solution of the linearized constraints map to have less decay than \(|{\hat{x}}|^{-n+2}\). (See Proposition 4.10.) Using an asymptotic summation argument, we can then construct a formal solution \((\widetilde{\gamma }',\widetilde{k}')\) of the gluing problem: it satisfies the constraints modulo rapidly vanishing errors as \(\epsilon \searrow 0\). Steps (1) and (2) are completed in Sect. 5.1 (the formal solution being denoted \((\widetilde{\gamma },\widetilde{k})\) there).
-
(3)
Finally, we correct the formal solution to a true solution \((\widetilde{\gamma },\widetilde{k})\) by solving the nonlinear constraint equations with rapidly vanishing (as \(\epsilon \searrow 0\)) right hand side. This is done using a standard contraction mapping argument on a nonstandard scale of Sobolev spaces with \(\epsilon \)-dependent norms. The correction term vanishes to infinite order at \(\epsilon =0\) as well. See Sect. 5.2 (the true solution being denoted \((\widetilde{\gamma }',\widetilde{k}')\) there).
Remark 1.4
(Difficulties of a bare interpolation approach) We contrast the above construction with a more traditional approach as used e.g. in [Cor00, CD03]. For brevity, we only consider the time-symmetric case, so \({\hat{k}}=0\) and \(k=0\). We use geodesic normal coordinates \(x\in \mathbb {R}^n\) on X near \(\mathfrak {p}\). For small parameters \(\epsilon ,\eta >0\), we define a bare interpolation of \(\hat{\gamma }\) and \(\gamma \) by
where the non-negative function \(\chi \in {\mathcal {C}}^\infty _{{\textrm{c}}}([0,2))\) is identically 1 on [0, 1]. The metric \(\gamma ^{(\epsilon ,\eta )}\) violates the (time-symmetric) constraint equations only in the gluing region \(U_\eta =\{\eta \le |x|\le 2\eta \}\). Taking \(\eta \) small and then \(\epsilon \ll \eta \) ensures that \(\gamma ^{(\epsilon ,\eta )}\) is Riemannian (and includes a large piece of the asymptotically flat metric \(\hat{\gamma }\)); moreover, in \(U_\eta \) we have \(\hat{\gamma }_{{\hat{i}}{\hat{j}}}(\frac{x}{\epsilon }),\gamma _{i j}(x)\approx \delta _{i j}\) and thus also \(\gamma _{i j}^{(\epsilon ,\eta )}(x)\approx \delta _{i j}\). However, the violation of the constraint equations in \(U_\eta \) is typically of size \(\mathcal {O}(1+\eta ^{-1}(\frac{\epsilon }{\eta })^{1+\delta })\),Footnote 7 and therefore is not small; hence, it is not clear how one can re-impose the constraints in \(U_\eta \) with perturbative methods, regardless of the smallness of \(\epsilon \ll \eta \ll 1\). This discussion serves to indicate that the deformation of \(\gamma ^{(\epsilon ,\eta )}\) is a delicate task.Footnote 8 But even if one can correct \(\gamma ^{(\epsilon ,\eta )}\) in \(U_\delta \) to a solution of the constraint equations in some way, the glued data set would transition from \(\gamma \) to the rescaling of \(\hat{\gamma }\) in the region \(|x|\simeq \eta \), with \(\epsilon \ll \eta \ll 1\). Thus, one would produce an intermediate gluing region \(\{\epsilon \ll |x|\ll 1\}\), which asymptotes to the codimension 2 corner of \(\widetilde{X}\).—By contrast to (1.5), the interpolation \(\widetilde{\gamma }_0\) of \(\hat{\gamma }\) and \(\gamma \) which is the starting point of our construction (see Step (1) above) can be defined as
on the \(\epsilon \)-level set of \(\widetilde{X}\) near \(x=0\) (while globally on X, one may take \(\sum _{i,j}\chi (|x|)\cdot \) (1.6) \(\textrm{d}x^i\,\textrm{d}x^j+(1-\chi (|x|))\gamma \)). Thus, there is only a single gluing region, which is of the rough form \(\epsilon \lesssim |x|\lesssim 1\); in particular, the lower gluing radius shrinks to 0. Moreover, carefully note that the modification of \(\widetilde{\gamma }_0\) to a solution of the constraint equations uses deformations which are global in a fixed punctured neighborhood of \(\mathfrak {p}\) in \(X\setminus \{\mathfrak {p}\}\) (i.e. all the way down to \(\mathfrak {p}\)) and global on \(\mathbb {R}^n_{{\hat{x}}}\) near \(|{\hat{x}}|=\infty \) (i.e. all the way up to infinity).
Remark 1.5
(Approximate solutions) The construction of \((\widetilde{\gamma },\widetilde{k})\) in (generalized) Taylor series at the two boundary hypersurfaces at \(\epsilon =0\) in Step (2) produces increasingly accurate approximate solutions \((\gamma _\epsilon ,k_\epsilon )\) for small \(\epsilon \); thus, one may be able to use part of the above procedure to produce rather accurate initial data for use in numerical evolutions.
The right inversion of the linear operators in Steps (2) and (3) is accomplished much as in the original works [Cor00, CS06, CD03] using exponential weights at the boundary of the gluing region. Our discussion of the coercivity of the adjoint of the linearized constraints map in the asymptotically flat setting takes inspiration from [Del12] in that we combine standard elliptic theory with a prolongation argument (Lemma 4.3), cf. the ‘Kernel Restriction Condition’ and Lemma 8.2 in [Del12].
We briefly comment on the choice of function spaces. In the asymptotically flat setting, one measures regularity with respect to the weighted operator \(\langle {\hat{x}}\rangle \hat{\nabla }\) (differentiation in the \({\hat{x}}\)-coordinates); this is also called b-regularity on \(\overline{\mathbb {R}^n}\) in the language of [Mel93]. Similarly, near the puncture of X at \(\mathfrak {p}\), one needs to measure regularity with respect to the weighted operator \(|x|\nabla \) (which is the same as b-regularity on the blow-up \([X;\{\mathfrak {p}\}]\)). Indeed, this is the notion of regularity which is compatible with polynomial weights or polyhomogeneous expansions at \(|x|=0\); moreover, it arises naturally by computing the form of \(\langle {\hat{x}}\rangle \hat{\nabla }\) near the asymptotically flat end in \({\hat{x}}\)-coordinates in the local coordinates x; see (1.7) below. On the total space \(\widetilde{X}\), these two notions can be merged into the notion of q-regularity, introduced by the author in a geometrically related context in [Hin21b]: this amounts to measuring regularity with respect to
The corresponding q-Sobolev spaces are standard Sobolev spaces on \(X_\epsilon \) as sets, but their norms depend on \(\epsilon \). The contraction mapping argument takes place on tensors on \(\widetilde{X}\) measured in such q-Sobolev spacesFootnote 9 (and thus the corrections to the formal solution are constructed for all small \(\epsilon >0\) simultaneously).
Melrose and Singer, in unpublished work [MS11], have developed a singular-geometric and analytic point of view for the problem of gluing constant scalar curvature Kähler (cscK) metrics into neighborhoods of points of a given compact cscK manifold which is closely related to the point of view adopted in the present paper. The gluing constructions for magnetic monopoles by Kottke–Singer [KS22] and for gravitational instantons by Schroers–Singer [SS21] are also closely related, though either the total space or the relevant rescaled tangent bundles differ from (and are in fact more delicate than) those used in the present work.
1.3 Exponential weights and geometric singular analysis
We briefly comment on a novel technical aspect of our analysis, which is due to Mazzeo [Maz]. Our discussion here shall take place in the half space \([0,\infty )_x\times \mathbb {R}^{n-1}_y\subset \mathbb {R}^n_{(x,y)}\). If one wishes to construct a smooth solution u of some underdetermined PDE \(P u=f\) (with \(x>0\) on \({\text {supp}}f\)) with support in \(x\ge 0\), one can use coercivity estimates for the overdetermined operator \(P^*\) on spaces with exponential weights \(e^{\beta /x}\), \(\beta >0\) (as done in all of the aforementioned gluing papers), or equivalently for \(e^{-\beta /x}P^* e^{\beta /x}\) on unweighted spaces. Consider, for example, the case \(P=\delta \), the (negative) divergence on 1-forms. Then \(P^*=\textrm{d}=(\partial _x,\partial _{y^1},\ldots ,\partial _{y^{n-1}})\) in the basis \(\textrm{d}x,\textrm{d}y^1,\ldots ,\textrm{d}y^{n-1}\); this is left elliptic (and indeed overdetermined). Conjugating \(P^*\) by \(e^{-\beta /x}\) produces singular terms of size \(\mathcal {O}(x^{-2})\). We thus consider the rescaling \(x^2 P^*\) as a left elliptic operator built from the 00-vector fields (‘double 0-vector fields’) \(x^2\partial _x\) and \(x^2\partial _{y^j}\); conjugation of such a 00-differential operator by \(e^{\beta /x}\) merely produces smooth lower order terms.
A 00-differential operator has a principal symbol which is an extension of the usual principal symbol map down to \(x=0\); this symbol takes the \(x^2\)-degeneration into account by passing to an appropriate rescaling of the cotangent bundle. That is, the principal symbol is a fiberwise polynomial in the momentum variables \(\xi ,\eta \) defined by writing 1-forms as \(\xi \frac{\textrm{d}x}{x^2}+\eta \frac{\textrm{d}y}{x^2}\).
The crucial observation is that 00-differential operators have yet another commutative symbol which captures their leading order behavior at any boundary point \((0,y)\in [0,\infty )\times \mathbb {R}^{n-1}\); for example, the boundary principal symbol of \(e^{-\beta /x}P^* e^{\beta /x}=(x^2\partial _x-\beta ,\partial _{y^1},\ldots ,\partial _{y^{n-1}})\) is \((i\xi -\beta ,i\eta )\), and this is well-defined for all \((\xi ,\eta )\in \mathbb {R}\times \mathbb {R}^{n-1}\). The existence of this second commutative symbol is structurally due to the fact that 00-vector fields commute to leading order at \(x=0\).
The left ellipticity of both symbols then suffices to prove a semi-Fredholm estimate for \(e^{-\beta /x}P^* e^{\beta /x}\) on a scale of 00-Sobolev spaces. Complemented with a direct analysis of the kernel, one obtains the solvability of \(P u=f\) on 00-Sobolev spaces, with u having the rapidly decaying weight \(e^{-\beta /x}\) near \(x=0\). The extension of u by 0 to \(x<0\) furnishes a solution with the desired support property. See Sects. 2.2 and 2.4 for a detailed discussion, and Examples 2.2 and 2.8 for the case of the exterior derivative mentioned here.
In Sect. 2.4, we explain the relationship of the part of 00-analysis which only uses the standard principal symbol to bounded geometry analysis as discussed in [Shu92]; this is also related to the scaling property of [CD03, Appendix B]. The existence and importance of the second, boundary, principal symbol on the other hand appears to be noted here for the first time.
1.4 Outlook: evolution of glued initial data
We make the conjecture in Sect. 1.1 about the evolution of glued initial data more precise:
Conjecture
Let \((\hat{\gamma },{\hat{k}})\) denote boosted subextremal Kerr black hole initial data with parameters \(\hat{\mathfrak {m}}\) (mass) and \(\hat{\mathfrak {a}}\) (specific angular momentum). If (M, g) is a compact globally hyperbolic subset of the maximal globally hyperbolic development of \((X,\gamma ,k)\), then the maximal globally hyperbolic development of \((X_\epsilon ,\gamma _\epsilon ,k_\epsilon )\), for a ‘suitable’ family \((\gamma _\epsilon ,k_\epsilon )\) of initial data with boundary data \((\gamma ,k)\) and \((\hat{\gamma },{\hat{k}})\), contains a region \((M_\epsilon ,g_\epsilon )\) with the following properties:
-
(1)
\(M_\epsilon \) is obtained from M by excising a size \(\epsilon \)-neighborhood of the geodesic \({\mathcal {C}}\subset M\) whose initial conditions at \(X\subset M\) are determined by the point \(\mathfrak {p}\in X\) and the boost parameter of the Kerr data set;
-
(2)
\(g_\epsilon \) tends to g away from \({\mathcal {C}}\), while in an \(\mathcal {O}(\epsilon )\)-neighborhood of \({\mathcal {C}}\), the rescaling \(\epsilon ^{-2}g_\epsilon \) tends to a family (depending on the point in \({\mathcal {C}}\)) of Kerr black hole metrics with parameters \((\hat{\mathfrak {m}},\hat{\mathfrak {a}})\);
-
(3)
the total family \(\epsilon \mapsto g_\epsilon \) is polyhomogeneous on a total space obtained by resolving \([0,1)\times M\) at \(\{0\}\times {\mathcal {C}}\).
In particular, when (M, g) is a unit mass Kerr–de Sitter black hole, and the gluing data are chosen so that \({\mathcal {C}}\) starts in the exterior region but crosses the event horizon in finite proper time, then \((M_\epsilon ,g_\epsilon )\) describes the merger of a mass \(\epsilon \) black hole with a unit mass black hole. (Moreover, once the black holes have merged, one may restrict to a suitable neighborhood of the Kerr–de Sitter exterior region and apply [HV18] to conclude that the merged black hole settles down to a Kerr–de Sitter black hole.) We hope to address this conjecture in future work. At this point, we merely remark that the conjecture in the stated form cannot hold without restricting to ‘suitable’ initial data; roughly speaking, the initial data must be such that the evolving spacetime metric is adiabatic, i.e. does not vary much on the fast time scale \({\hat{t}}\) of the small black hole.
1.5 Outline of the paper
In Sect. 2, we recall the notions of geometric singular analysis which are relevant for the present paper, namely b-vector fields and operators (and their normal operators), scattering vector fields, and real blow-ups; we further discuss 00-geometry and the associated scales of Sobolev spaces, and develop elliptic theory in this context. In Sect. 3, we define the manifold with corners \(\widetilde{X}\) on which the gluing procedure will take place, and we describe the class of q-differential operators on \(\widetilde{X}\) which will precisely capture the behavior of the (linearized) constraints map.
In Sect. 4, we study the (linearized) constraints map in detail. Estimates and solvability results for the (linearized) constraints map on asymptotically flat and punctured manifolds are combined into a uniform solvability theory on \(\widetilde{X}\). In Sect. 5, we apply these results to construct the glued initial data set first on the level of (generalized) Taylor series; we then correct this formal solution to a true solution using a suitable contraction mapping principle.
Applications and variants of the main result are discussed in Sect. 6.
2 Geometric and Analytic Background
2.1 b- and scattering structures; blow-ups
For manifolds with boundary or with corners, we demand that all boundary hypersurfaces be embedded. A defining function of a boundary hypersurface \(H\subset X\) of a manifold with corners is a function \(\rho \in {\mathcal {C}}^\infty (X)\) for which \(H=\rho ^{-1}(0)\), further \(\rho >0\) on \(X\setminus H\), and finally \(\textrm{d}\rho \ne 0\) along H. In the case that X is a manifold with boundary \(H=\partial X\), we called such a function \(\rho \) simply a boundary defining function.
The radial compactification
where we identify \(0\ne x=r\omega \in \mathbb {R}^n\) with \((\rho ,\omega )=(r^{-1},\omega )\), is a manifold with boundary ‘at infinity’ given by \(\partial \overline{\mathbb {R}^n}\cong \mathbb {S}^{n-1}\); the interior is \(\mathbb {R}^n\). Invertible linear maps on \(\mathbb {R}^n\) extend by continuity to diffeomorphisms of \(\overline{\mathbb {R}^n}\); thus, the radial compactification of a finite-dimensional real vector space is well-defined.
On a manifold with corners X, we write \(\mathcal {V}(X)\) for the Lie algebra of all smooth vector fields (sections of \(T X\rightarrow X\)), and \(\mathcal {V}_{\textrm{b}}(X)\subset \mathcal {V}(X)\) for the Lie algebra of b-vector fields [Mel93], i.e. vector fields which are tangent to \(\partial X\). In local coordinates \(x^1,\ldots ,x^k\ge 0\), \(x^{k+1},\ldots ,x^n\in \mathbb {R}\), near a codimension k corner of X, the space \(\mathcal {V}_{\textrm{b}}(X)\) is spanned over \({\mathcal {C}}^\infty (X)\) by the vector fields \(x^i\partial _{x^i}\) (\(i=1,\ldots ,k\)), \(\partial _{x^j}\) (\(j=k+1,\ldots ,n\)), and therefore these vector fields are a frame of the b-tangent bundle \({}^{{\textrm{b}}}TX\rightarrow X\). When X is a manifold with boundary and boundary defining function \(\rho \in {\mathcal {C}}^\infty (X)\), then \(\mathcal {V}_{\textrm{sc}}(X):=\rho \mathcal {V}_{\textrm{b}}(X)\) is the Lie algebra of scattering vector fields [Mel94]; in local coordinates \(x\ge 0\), \(y=(y^1,\ldots ,y^{n-1})\in \mathbb {R}^{n-1}\), this space is spanned over \({\mathcal {C}}^\infty (X)\) by the vector fields \(x^2\partial _x\) and \(x\partial _{y^j}\) (\(j=1,\ldots ,n-1\)), which are a frame of the scattering tangent bundle \({}^{{\textrm{sc}}}TX\rightarrow X\). The dual bundles \({}^{{\textrm{b}}}T^*X\rightarrow X\) and \({}^{{\textrm{sc}}}T^*X\rightarrow X\) have frames \(\frac{\textrm{d}x^i}{x^i}\) (\(i=1,\ldots ,k\)), \(\textrm{d}x^j\) (\(j=k+1,\ldots ,n\)) and \(\frac{\textrm{d}x}{x^2}\), \(\frac{\textrm{d}y^j}{x}\) (\(j=1,\ldots ,n-1\)), respectively. In the special case \(X=\overline{\mathbb {R}^n}\), a change of coordinates calculation shows that \({\mathcal {C}}^\infty (\overline{\mathbb {R}^n})=S^0_\textrm{cl}(\mathbb {R}^n)\) consists of classical symbols on \(\mathbb {R}^n\) (i.e. smooth functions which are in addition smooth in \((r^{-1},\omega )\)), and \(\mathcal {V}_{\textrm{sc}}(\overline{\mathbb {R}^n})\) is spanned over \({\mathcal {C}}^\infty (\overline{\mathbb {R}^n})\) by the (translation-invariant) coordinate vector fields \(\partial _{x^i}\) (\(i=1,\ldots ,n\)). In particular, the Euclidean metric on \(\mathbb {R}^n\) is a smooth Riemannian scattering metric, i.e. a positive definite section of \(S^2\,{}^{{\textrm{sc}}}T^*\overline{\mathbb {R}^n}\).
When X is a manifold with corners, we write
for the space of all k-th order b-differential operators, i.e. locally finite sums of up to k-fold compositions of elements of \(\mathcal {V}_{\textrm{b}}(X)\). If \(H_1,\ldots ,H_N\) is a list of the boundary hypersurfaces of X, and if \(\rho _j\in {\mathcal {C}}^\infty (X)\) denotes a defining function of \(H_j\) for \(j=1,\ldots ,N\), then we furthermore write
for the space of weighted b-differential operators; here \(\alpha _j\in \mathbb {R}\), \(j=1,\ldots ,N\). We note that conjugation by \(\rho _j^{\alpha _j}\) preserves the space \(\textrm{Diff}_{\textrm{b}}^k(X)\) and its weighted analogues. These spaces are \({\mathcal {C}}^\infty (X)\)-modules; spaces \(\textrm{Diff}_{\textrm{b}}^k(X;E,F)\) of operators acting between sections of vector bundles \(E,F\rightarrow X\) can thus be defined in the usual manner, likewise for weighted operators.
The b-principal symbol of \(A\in \textrm{Diff}_{\textrm{b}}^k(X)\) is an element \({}^{{\textrm{b}}}\upsigma ^k(A)\in P^k_\textrm{hom}({}^{{\textrm{b}}}T^*X)\), the space of smooth functions on \({}^{{\textrm{b}}}T^*X\) which are fiberwise homogeneous polynomials of degree k; it is multiplicative, given via pullback along the base projection for \(k=0\), and given by \({}^{{\textrm{b}}}\upsigma ^1(V)(z,\zeta )=i \zeta (V|_z)\) for \(V\in \mathcal {V}_{\textrm{b}}(X)\). We make this explicit in the case that X is a manifold with boundary: in local coordinates \(x\ge 0\), \(y\in \mathbb {R}^{n-1}\) as above, and writing b-covectors as \(\xi \frac{\textrm{d}x}{x}+\eta \,\textrm{d}y\),
The principal symbol captures \(A\in \textrm{Diff}_{\textrm{b}}^k(X)\) to leading order in the differential order sense, in that \({}^{{\textrm{b}}}\upsigma ^k(A)=0\) implies \(A\in \textrm{Diff}_{\textrm{b}}^{k-1}(X)\). In order to capture A to leading order at \(\partial X\), one fixes a collar neighborhood \([0,x_0)_x\times \partial X\) of \(\partial X\), and defines, in local coordinates as above, the b-normal operator of A by
This operator is dilation-invariant in x, and it differs from A on \([0,x_0)\times \partial X\) by an element of \(x\textrm{Diff}_{\textrm{b}}^k\). Formally conjugating N(A) by the Mellin transform in x (see also Sect. 2.3) gives the Mellin transformed normal operator family
We next recall the notion of (real) blow-up; see [Mel96] for a detailed account. We are given a manifold with corners X and a p-submanifold \(S\subset X\), i.e. a submanifold so that near all \(p\in S\) there exist local coordinates \(x^1,\ldots ,x^k\ge 0\), \(x^{k+1},\ldots ,x^n\in \mathbb {R}\) on X so that S is given by the vanishing of a subset of these coordinates. (If this subset always contains at least one of the \(x^1,\ldots ,x^k\), we call S a boundary p-submanifold.) The blow-up of X along S is defined as
where \(S N^+S=(N^+S\setminus o)/\mathbb {R}^+\) is the spherical inward pointing normal bundle of S; here, we set \(N^+S=T^+_S X/T S\), with \(T^+_p X\subset T_p X\) consisting of all (non-strictly) inward pointing tangent vectors on X at p. The space [X; S] is equipped with a blow-down map \(\upbeta :[X;S]\rightarrow X\), which is the identity on \(X\setminus S\) and the base projection on the front face \(S N^+S\). The space [X; S] can be given the structure of a smooth manifold by declaring polar coordinates around S to be smooth down to the origin. To make this concrete, say S is given by \(x^1=\ldots =x^p=0\), \(x^{k+1}=\ldots =x^{k+q}=0\) for some \(0\le p\le k\) and \(0\le q\le n-k\) with \((p,q)\ne (0,0)\); setting \(R=(\sum _{i=1}^p (x^i)^2+\sum _{j=k+1}^{k+q}(x^j)^2)^{1/2}\) and \(\Omega =R^{-1}(x^1,\ldots ,x^p,x^{k+1},\ldots ,x^{k+q})\in \mathbb {S}_p^{p+q-1}\) (the unit sphere in \(\mathbb {R}^{p+q}\) intersected with \([0,\infty )^p\times \mathbb {R}^q\)), the map \((x^1,\ldots ,x^n)\mapsto (R,\Omega ,x^{p+1},\ldots ,x^k,x^{k+q+1},\ldots ,x^n)\) extends from \(X\setminus S\) to a diffeomorphism from a neighborhood of the front face of [X; S] to \([0,\infty )\times \mathbb {S}_+^{p+q-1}\times \mathbb {R}^{(k-p)+(n-k-q)}\). The blow-down map is the product of the map \((R,\Omega )\mapsto R\Omega \) and the identity map in the remaining coordinates. Finally, if \(T\subset X\) is another p-submanifold so that for each point in \(S\cap T\) there exist local coordinates in which both S and T are of the above product form, then the lift of T to [X; S] is defined by \(\upbeta ^*T:=\upbeta ^{-1}(T)\) when \(T\subset S\), and \(\upbeta ^*T:={\textrm{cl}}(\upbeta ^{-1}(T\setminus S))\) otherwise (where ‘\({\textrm{cl}}\)’ denotes the closure in [X; S]). One can then define the iterated blow-up \([X;S;T]:=[[X;S];\upbeta ^*T]\). (This construction can be iterated.)
When using index notation, we use the summation convention; that is, indices appearing twice are summed over (unless otherwise noted).
2.2 00-structures
Mazzeo [Maz] defines spaces of 00-(pseudo)differential operators and studies their symbolic properties and parametrices; in the present paper, we restrict ourselves to the case of 00-differential operators, and pursue a hands-on approach to their analysis. Such operators are tailored to the analysis of overdetermined operators on spaces which feature exponential weights at hypersurfaces. Thus, on a manifold X with boundary \(\partial X\), denote by \(\rho \in {\mathcal {C}}^\infty (X)\) a boundary defining function and recall from [MM87] the Lie algebra \(\mathcal {V}_0(X)\) of 0-vector fields consisting of all \(V\in \mathcal {V}(X)\) which vanish at \(\partial X\). We then set
In local coordinates \(x\ge 0\), \(y\in \mathbb {R}^{n-1}\), near a boundary point of X, elements of \(\mathcal {V}_{0 0}(X)\) are of the form \(a(x,y)x^2\partial _x+b^j(x,y)x^2\partial _{y^j}\) for smooth a, b. Using that \(\mathcal {V}_0(X)\) is a Lie algebra (or by direct computation), one finds that commutators of 00-vector fields gain a factor of the boundary defining function,
Moreover, since \(\mathcal {V}_{0 0}(X)\subset \rho \mathcal {V}_{\textrm{b}}(X)\), we also have \(\rho ^{-\alpha }V(\rho ^\alpha )\in \rho {\mathcal {C}}^\infty (X)\) for \(\alpha \in \mathbb {R}\). For the corresponding spaces \(\textrm{Diff}_{0 0}^k(X)\) and \(\rho ^{-\alpha }\textrm{Diff}_{0 0}^k(X)\) of (weighted) 00-differential operators, this implies
i.e. composition of 00-operators is commutative to leading order in the differential order sense as usual, and also to leading order at \(\partial X\). We can use more singular weights as well: for \(\beta \in \mathbb {R}\), we have \(e^{-\beta /\rho }V(e^{\beta /\rho })\in {\mathcal {C}}^\infty (X)\), as is easily checked in local coordinates, and therefore conjugation by \(e^{\beta /\rho }\) preserves the space \(\rho ^{-\alpha }\textrm{Diff}_{0 0}^k(X)\)
The space \(\mathcal {V}_{0 0}(X)\) is the space of smooth sections of the 00-tangent bundle \({}^{0 0}T X\rightarrow X\), with local frame \(x^2\partial _x\), \(x^2\partial _{y^j}\) (\(j=1,\ldots ,n-1\)); the dual bundle is the 00-cotangent bundle \({}^{0 0}T^*X\rightarrow X\). Letting \(P^k({}^{0 0}T^*X)\) denote the space of smooth functions on \({}^{0 0}T^*X\) which are fiberwise polynomials of degree k, we can define a surjective (fullFootnote 10) principal symbol map
with kernel \(\rho \textrm{Diff}_{0 0}^{k-1}(X)\) as follows: in local coordinates \(x\ge 0\), \(y\in \mathbb {R}^{n-1}\), and writing 00-covectors as \(\xi \frac{\textrm{d}x}{x^2}+\eta _j\frac{\textrm{d}y^j}{x^2}\) (so \(\xi \in \mathbb {R}\), \(\eta \in \mathbb {R}^{n-1}\)), we assign
(The map \({}^{0 0}\upsigma ^{k,\alpha }:\rho ^{-\alpha }\textrm{Diff}^k_{0 0}\rightarrow \rho ^{-\alpha }P^k/\rho ^{-\alpha +1}P^{k-1}\) is defined analogously.) Thus, \({}^{0 0}\upsigma ^k(A)\) is the usual principal symbol over \(X^\circ \) (where \({}^{0 0}T^*X\) and \(T^*X\) are equal) and can be identified there with a homogeneous polynomial, whereas over \(x=0\), the 00-principal symbol captures the full operator A modulo 00-operators with decaying coefficients.
Lemma 2.1
(Well-definedness of the principal symbol). The map \({}^{0 0}\upsigma ^k\), defined by (2.3), is well-defined as a map (2.2), i.e. it is independent of the choice of local coordinates. It is multiplicative in the sense that \({}^{0 0}\upsigma ^{k_1+k_2}(A_1\circ A_2)={}^{0 0}\upsigma ^{k_1}(A_1)\cdot {}^{0 0}\upsigma ^{k_2}(A_2)\) for \(A_j\in \textrm{Diff}_{0 0}^{k_j}(X)\), \(j=1,2\), and it maps adjoints to adjoints (complex conjugates).
Proof
We can write coordinates \({\tilde{x}}\ge 0\), \({\tilde{y}}\in \mathbb {R}^{n-1}\) near (0, 0) as \({\tilde{x}}=x b(x,y)\) and \({\tilde{y}}=\Phi (x,y)\) with \(0<b\) and \(\Phi (x,-)\) a local diffeomorphism for small x. Then
where \(a_{1 1}=b^{-2}(b + x b'_x)\), \(a_{1 2}=b^{-2}\Phi '_x\), \(a_{2 1}=x b^{-2}b'_y\), and \(a_{2 2}=b^{-2}\Phi '_y\) are smooth. We likewise have
with \(\xi =a_{1 1}\tilde{\xi }+a_{1 2}\tilde{\eta }\) and \(\eta =a_{2 1}\tilde{\xi }+a_{2 2}\tilde{\eta }\). (Thus, the putative principal symbols of \(x^2 D_x\) and \(x^2 D_y\) are indeed well-defined.) Expanding \((x^2 D_x)^j=\sum _{p+|\beta |\le j} f_{p\beta }({\tilde{x}},{\tilde{y}})({\tilde{x}}^2 D_{{\tilde{x}}})^p \) \( ({\tilde{x}}^2 D_{{\tilde{y}}})^\beta \), all terms with \(p+|\beta |<j\) involve at least one derivative \({\tilde{x}}^2 D_{{\tilde{x}}}\) or \({\tilde{x}}^2 D_{{\tilde{y}}}\) falling onto a coefficient \(a_{1 1},a_{1 2}\), or \({\tilde{x}}^2 D_{{\tilde{x}}}\) falling onto \({\tilde{x}}^2\); any such term thus comes with at least one additional power of \({\tilde{x}}\). Therefore, modulo \({\tilde{x}}\textrm{Diff}_{0 0}^{j-1}\), only the terms with \(p+|\beta |=j\) survive, and the putative principal symbols of the two sides are thus \(\xi ^j\) and \((a_{1 1}\tilde{\xi }+a_{1 2}\tilde{\eta })^j\), which are equal. The same argument applies more generally to \((x^2 D_x)^j(x^2 D_y)^\alpha \).
The multiplicativity of the 00-principal symbol follows similarly: upon writing
in the form (2.3) by commuting \((x^2 D_x)^j(x^2 D_y)^\alpha \) through \(a_{2,k\beta }\), and further commuting \((x^2 D_y)^\alpha \) through \((x^2 D_x)^k\), each commutator drops one derivative and gains one power of x. \(\quad \square \)
The setting in which 00-operators arise in the present paper takes the following general form: let \(X^\circ \) be an open manifold which is separated into two connected components \(X_+\) and \(X_-\) by a hypersurface H; let \(\rho \in {\mathcal {C}}^\infty (X^\circ )\) be a defining function of H inside \(X_+\). Let \(L\in \textrm{Diff}^m(X^\circ )\); then
Indeed, in local coordinates \(x\in \mathbb {R}\), \(y\in \mathbb {R}^{n-1}\) in which \(X_+\) is given by \(x\ge 0\), the operator L is a linear combination (with \({\mathcal {C}}^\infty \) coefficients) of \(D_x^j D_y^\alpha \), \(j+|\alpha |\le m\), and therefore
indeed. This calculation also implies \({}^{0 0}\upsigma ^m(L_+)=M^*\upsigma ^m(L)\), where \(\upsigma ^m(L)\in P^m_\textrm{hom}(T^*X^\circ )\) denotes the standard (homogeneous) principal symbol of L, and \(M:{}^{0 0}T^*X_+\rightarrow T^*_{X_+}X^\circ \) is the bundle isomorphism given by multiplication by \(\rho ^2\). In particular, the boundary principal symbol \({}^{0 0}\upsigma ^m(L_+)|_{{}^{0 0}T^*_H X_+}\) is a homogeneous polynomial, and correspondingly \(L_+\) cannot have an injective principal symbol (except in the case \(m=0\)). A conjugation by an exponential weight can change this, since
(This follows in local coordinates by multiplicativity from the easy cases \(L_+=\rho ^2 D_\rho \)—with \(e^{-\beta /\rho }\rho ^2 D_\rho e^{\beta /\rho }=\rho ^2 D_\rho +i\beta \)—and \(L_+=\rho ^2 D_y\).) Note that the right hand side of (2.4) is not homogeneous in \(\zeta \) anymore (unless \(m=0\) and \(\beta =0\)).
Example 2.2
(Exterior derivative) Consider \(X^\circ =\mathbb {R}^n\), with coordinates \((x,y)\in \mathbb {R}\times \mathbb {R}^{n-1}\), and \(H=\{x=0\}\), \(X_\pm =\{\pm x\ge 0\}\), and \(\rho =x\); let \(L=\textrm{d}:{\mathcal {C}}^\infty (\mathbb {R}^n)\rightarrow {\mathcal {C}}^\infty (\mathbb {R}^n;T^*\mathbb {R}^n)\), with principal symbol \(\upsigma ^1(L)(z,\zeta )=i\zeta \). Then \(L_+=x^2\,\textrm{d}=(x^2\partial _x,x^2\partial _y)\), and its conjugation
has principal symbol \({}^{0 0}T^*X_+ \ni (\xi ,\eta ) \mapsto (i x^2\xi -\beta ,i x^2\eta )\). This is injective for \(\beta \ne 0\); note that \(x^2:{}^{0 0}T^*X_+\rightarrow T^*_{X_+}\mathbb {R}^n\) is an isomorphism.
2.3 Function spaces
Let X be a manifold with corners and boundary hypersurfaces \(H_1,\ldots ,H_N\), and denote a boundary defining function of \(H_j\) by \(\rho _j\). We write \(\dot{{\mathcal {C}}}^\infty (X)\subset {\mathcal {C}}^\infty (X)\) for the space of smooth functions which vanish to infinite order at each \(H_j\). For \(\alpha _j\in \mathbb {R}\) (\(j=1,\ldots ,N\)), set \(w=\prod _{j=1}^N \rho _j^{\alpha _j}\); we then define the space of conormal functions
We refer to \(\alpha _j\) as the weight at the j-th boundary hypersurface of X. Polyhomogeneous spaces refine the mere boundedness to generalized Taylor expansions. Index sets capture the exponents in the Taylor expansion; recall here that an index set is a subset \(\mathcal {E}\subset \mathbb {C}\times \mathbb {N}_0\) so that \((z,k)\in \mathcal {E}\) implies \((z+1,k)\in \mathcal {E}\), and also \((z,k-1)\in \mathcal {E}\) when \(k\ge 1\), and so that \({\text {Re}}z_j\rightarrow \infty \) whenever \((z_j,k_j)\in \mathcal {E}\) tends to infinity. (Thus, the subset of elements \((z,k)\in \mathcal {E}\) with \({\text {Re}}z<C\) is finite for any choice of \(C\in \mathbb {R}\).) Frequently occurring index sets are denoted somewhat imprecisely as
We write \(\min {\text {Re}}\mathcal {E}=\min \{{\text {Re}}z:(z,k)\in \mathcal {E}\}\); for \(a\in \mathbb {R}\), and we write \({\text {Re}}\mathcal {E}>a\) as an abbreviation for \(\min {\text {Re}}\mathcal {E}>a\). If now \(\mathcal {E}=(\mathcal {E}_1,\ldots ,\mathcal {E}_N)\), \(\mathcal {E}_j\subset \mathbb {C}\times \mathbb {N}_0\), is a collection of index sets, and \({\text {Re}}\mathcal {E}_j>a_j\), then
consists of all conormal functions u so that for all \(j=1,\ldots ,N\), the following holds in a collar neighborhood \([0,\epsilon )_{\rho _j}\times H_j\) of \(H_j\subset X\): letting \(\alpha ^{(j)}=(\alpha _1,\ldots ,\alpha _{j-1},\alpha _{j+1},\ldots ,\alpha _N)\), there exist \(a_{(z,k)}\in \mathcal {A}^{\alpha ^{(j)}}(H_j)\) for \((z,k)\in \mathcal {E}_j\) so that
for all \(C\in \mathbb {R}\). Necessarily, then, the \(a_{(z,k)}\) are themselves polyhomogeneous on \(H_j\), with index set \(\mathcal {E}_i\) at the boundary hypersurface \(H_j\cap H_i\) of \(H_j\) for those i for which \(H_j\cap H_i\ne \emptyset \). See [Mel96, Chapter 4] for a detailed treatment of polyhomogeneous distributions. If \(u\in \mathcal {A}_{\textrm{phg}}^\mathcal {E}(X)\) and \(v\in \mathcal {A}_{\textrm{phg}}^\mathcal {F}(X)\), then \(u+v\in \mathcal {A}_{\textrm{phg}}^{\mathcal {E}\cup \mathcal {F}}(X)\) and \(u\cdot v\in \mathcal {A}_{\textrm{phg}}^{\mathcal {E}+\mathcal {F}}(X)\), where we set \(\mathcal {E}+\mathcal {F}=\{(z_1+z_2,k_1+k_2): (z_1,k_1)\in \mathcal {E},\ (z_2,k_2)\in \mathcal {F}\}\). For \(k\in \mathbb {N}\), we also write
thus \(u^k\in \mathcal {A}_{\textrm{phg}}^{k\mathcal {E}}(X)\) when \(u\in \mathcal {A}_{\textrm{phg}}^\mathcal {E}(X)\). We say that an index set \(\mathcal {E}\) is nonlinearly closed if \(\mathcal {E}=\bigcup _{j\in \mathbb {N}}j\mathcal {E}\). The nonlinear closure of an index set \(\mathcal {E}\) with \({\text {Re}}\mathcal {E}>0\) is defined by \(\mathcal {F}:=\bigcup _{j\in \mathbb {N}}j\mathcal {E}\); it is the smallest index set with the property that \(u^k\in \mathcal {A}_{\textrm{phg}}^\mathcal {F}(X)\) for all \(k\in \mathbb {N}_0\) and for all \(u\in \mathcal {A}_{\textrm{phg}}^\mathcal {E}(X)\). We finally recall the extended union of index sets
We next recall weighted b-Sobolev spaces on a compact manifold X with boundary \(\partial X\ne \emptyset \). Fix a smooth positive b-density on X, i.e. a positive section of the density bundle \({}^{{\textrm{b}}}\Omega X\rightarrow X\) associated to \({}^{{\textrm{b}}}TX\rightarrow X\); in local coordinates \(x\ge 0\), \(y\in \mathbb {R}^{n-1}\), such a b-density takes the form \(a(x,y)|\frac{\textrm{d}x}{x}\,\textrm{d}y|\) where \(0<a\in {\mathcal {C}}^\infty \). Then \(L^2(X)\) denotes the corresponding \(L^2\)-space on \(X^\circ \). For \(s\in \mathbb {N}_0\), we define more generally the b-Sobolev space \(H_{{\textrm{b}}}^s(X)\) to consist of all \(u\in L^2(X)=H_{{\textrm{b}}}^0(X)\) so that \(L u\in H_{{\textrm{b}}}^0(X)\) for all \(L\in \textrm{Diff}_{\textrm{b}}^s(X)\). If \(\{V_1,\ldots ,V_N\}\subset \mathcal {V}_{\textrm{b}}(X)\) is a collection of b-vector fields which spans \(\mathcal {V}_{\textrm{b}}(X)\), then \(\Vert u\Vert _{H_{{\textrm{b}}}^s(X)}^2:=\sum \Vert V_{i_1}\ldots V_{i_m}u\Vert _{L^2(X)}^2\), with the sum taken over all \(m\le s\) and \(1\le i_1,\ldots ,i_m\le N\), gives \(H_{{\textrm{b}}}^s(X)\) the structure of a Hilbert space. Weighted b-Sobolev spaces are denoted
with \(\rho \in {\mathcal {C}}^\infty (X)\) denoting a boundary defining function; this is a Hilbert space with norm \(\Vert u\Vert _{H_{{\textrm{b}}}^{s,\alpha }(X)}=\Vert \rho ^{-\alpha }u\Vert _{H_{{\textrm{b}}}^s(X)}\). For \(s\in \mathbb {R}\), one can define \(H_{{\textrm{b}}}^{s,\alpha }(X)\) via interpolation and duality. Since conjugation by \(\rho ^\alpha \) preserves the space \(\textrm{Diff}_{\textrm{b}}^k(X)\), an operator \(A\in \rho ^{-\beta }\textrm{Diff}_{\textrm{b}}^k(X)\) defines a bounded linear map \(H_{{\textrm{b}}}^{s,\alpha }(X)\rightarrow H_{{\textrm{b}}}^{s-k,\alpha -\beta }(X)\) for all \(s,\alpha \in \mathbb {R}\). In a completely analogous fashion, one can define 00-Sobolev spaces \(H_{0 0}^s(X)\) (relative to a positive 00-density, in local coordinates \(a(x,y)|\frac{\textrm{d}x}{x^2}\frac{\textrm{d}y^1}{x^2}\cdots \frac{\textrm{d}y^{n-1}}{x^2}|\)) and their weighted analoguesFootnote 11
with (weighted) 00-operators giving bounded maps between such spaces with appropriately shifted orders. When one order, say s, takes the value \(+\infty \) (resp. \(-\infty \)), the corresponding function space is defined as the intersection (resp. union) over all \(s\in \mathbb {R}\), so e.g. \(H_{0 0}^\infty (X)=\bigcap _{s\in \mathbb {R}}H_{0 0}^s(X)\); likewise in the cases \(\alpha =\infty \) and \(\beta =\infty \).
Returning to b-Sobolev spaces, we recall their interaction with the Mellin-transform from [Vas13, §3.1] (though we opt to use large parameter spaces here instead of semiclassical ones). To wit, we work on \([0,\infty )_x\times \partial X\), with \(\partial X\) a compact manifold without boundary; this is the model for a manifold with boundary near its boundary. The Mellin transform of a function \(u=u(x,y)\) with compact support in \(x^{-1}((0,\infty ))\) is then the Fourier transform in logarithmic coordinates,
The range can be characterized using the Paley–Wiener theorem; the inverse Mellin transform is \((\mathcal {M}_\alpha ^{-1}{\hat{u}})(x,y) = \int _{{\text {Im}}\lambda =-\alpha } x^{i\alpha }{\hat{u}}(\lambda ,y)\,\textrm{d}\lambda \) for any \(\alpha \in \mathbb {R}\). The Mellin transform diagonalizes dilation-invariant operators: if \(N(A)\in \textrm{Diff}_{\textrm{b}}^k([0,\infty )\times \partial X)\) is the (dilation-invariant) b-normal operator of a b-differential operator A, then \(\mathcal {M}(N(A)u)(\lambda )=N(A,\lambda )(\mathcal {M}u)(\lambda )\).
Equipping \(\partial X\) with a volume density, and taking its product with \(|\frac{\textrm{d}x}{x}|\) to define a b-density on \([0,\infty )\times \partial X\), Plancherel’s Theorem gives an isomorphism
Upon defining higher order b-Sobolev spaces via testing with dilation-invariant vector fields, i.e. with \(x\partial _x\) and x-independent vector fields on \(\partial X\), this extends to an isomorphism
for \(s\in \mathbb {N}_0\), where we introduced the large parameter Sobolev space \(H_{\langle \lambda \rangle }^s(\partial X)\). As a set, the space \(H_{\langle \lambda \rangle }^s(\partial X)\) is equal to \(H^s(\partial X)\), but its norm depends on \(\lambda \): if \(V_1,\ldots ,V_N\in \mathcal {V}(\partial X)\) is a collection of vector fields on X which spans \(\mathcal {V}(\partial X)\), we set
Via interpolation and duality, one can extend the isomorphism (2.5) to \(s\in \mathbb {R}\).
We also recall the action of the Mellin-transform on conormal or polyhomogeneous distributions; see [Maz91, §2A] for an overview, and [Mel96, §4] for a detailed account. To wit, if \(u\in \mathcal {A}^\alpha ([0,\infty )_x\times H)\) is supported in \(x\le x_0\in (0,\infty )\), then \((\mathcal {M}u)(\lambda )\) is holomorphic in \({\text {Im}}\lambda >-\alpha \), and for any \(\epsilon >0\) we have the Paley–Wiener type bound
Conversely, if \(\mathcal {M}u\) satisfies these bounds, then u is supported in \(x\le x_0\), and \(u\in \mathcal {A}^{\alpha -\epsilon }\) for all \(\epsilon >0\).
When \(\mathcal {E}\) is an index set and \(u\in \mathcal {A}^\mathcal {E}_{\textrm{phg}}([0,\infty )_x\times H)\), with \(x\le x_0\) on \({\text {supp}}u\), then \((\mathcal {M}u)(\lambda )\) is meromorphic in \(\lambda \in \mathbb {C}\) with values in \({\mathcal {C}}^\infty (H)\), with poles of order \(\le k+1\) at \(\lambda =-i z\) for \((z,k)\in \mathcal {E}\); and the above Paley–Wiener type bound holds for any \({\text {Im}}\lambda \ge -C\) for sufficiently large \(|{\text {Re}}\lambda |\) (depending on \(\mathcal {E}\) and C). Conversely, if \(\mathcal {M}u\) satisfies these conditions, then \(u\in \mathcal {A}^\mathcal {E}_{\textrm{phg}}\) (together with the support property).
2.4 Estimates for b- and 00-differential operators
In this paper, we only work with differential operators L that have injective, surjective, or invertible (elliptic) principal symbols \(\upsigma (L)\) in the appropriate sense; note that if \(\upsigma (L)\) is injective, resp. surjective, then \(L^*L\), resp. \(L L^*\) is elliptic, and therefore (approximate) left, resp. right inverses of L can be constructed from (approximate) inverses of an appropriate elliptic operator. In the b-setting, the (large) calculus of b-pseudodifferential operators [Mel93] is a very precise tool for this purpose; one can similarly construct an equally useful algebra of 00-pseudodifferential operators (which is rather similar to the scattering ps.d.o. algebra) for the (approximate) inversion of elliptic 00-differential operators. Here, in order to keep the paper more easily accessible, we opt for a direct approach based on a bounded geometry perspective, cf. [Shu92], and (for the 00-boundary symbol) on direct Fourier inversion.
We begin with the b-setting, and indeed work in a local coordinate chart \(X=[0,\infty )_x\times \mathbb {R}^{n-1}_y\) which we equip with the b-density \(|\frac{\textrm{d}x}{x}\,\textrm{d}y|\); we moreover define b-Sobolev spaces via testing with the vector fields \(x\partial _x\) and \(\partial _y\). For \(p\in \mathbb {Z}^{n-1}\) and \(j\in \mathbb {Z}\), we define the diffeomorphism
Note that \(\Phi _{p,j}^*((\log 2)x\partial _x)=-\partial _w\) and \(\Phi _{p,j}^*(\partial _y)=\partial _z\), so \(\mathcal {D}_{p,j}\) is a unit size cell for b-vector fields. See Fig. 3.
Illustration of the unit cells in the b-setting; see (2.6)
Fix a nonnegative function \(\chi \in {\mathcal {C}}^\infty _{{\textrm{c}}}(\mathcal {D})\) so that \(\chi =1\) on \([-1,1]\times \{|z|\le 1\}\), and put \(\chi _{p,j}:=(\Phi _{p,j})_*\chi \); then the sum of all \(\chi _{p,j}\), \(p\in \mathbb {Z}^{n-1}\), \(j\in \mathbb {Z}\), is locally finite and everywhere positive, and there exists some number \(J\in \mathbb {N}\) so that the intersection of \({\text {supp}}\chi _{p,j}\) for more than J pairwise distinct pairs \((p,j)\in \mathbb {Z}^{n-1}\times \mathbb {Z}\) is empty.
Lemma 2.3
(b-Sobolev norms) Let \(s,\alpha \in \mathbb {R}\). Then we have an equivalence of norms
in the sense that there exists a constant \(C>1\) only depending on \(s,\alpha \) so that the left hand side is bounded by C times the right hand side and vice versa.
Proof
Since \(x/2^{-j}\in [\frac{1}{4},4]\) on \(\mathcal {D}_{p,j}\), we only need to consider the case \(\alpha =0\). By interpolation and duality, it moreover suffices to show, for \(s\in \mathbb {N}_0\), the boundedness of the maps
(For \(s=0\), the map \(\Xi '\) is the adjoint of \(\Xi \).) The boundedness of \(\Xi \) is equivalent to the estimate \(\sum \Vert \chi _{p,j}u\Vert _{H^s_{\textrm{b}}}^2\lesssim \Vert u\Vert _{H^s_{\textrm{b}}}^2\); this is clear for \(s=0\), and follows for \(s\in \mathbb {N}\) by induction, noting that the pullbacks of \([x\partial _x,\chi _{p,j}]\) and \([\partial _y,\chi _{p,j}]\) along \(\Phi _{p,j}\) are uniformly bounded in \({\mathcal {C}}^\infty (\mathbb {R}^n)\). The boundedness of \(\Xi '\) follows for \(s=0\) from the Cauchy–Schwarz inequality, using the finite intersection property of \({\text {supp}}((\Phi _{p,j})_*\chi )={\text {supp}}\chi _{p,j}\); for \(s\in \mathbb {N}\), one uses induction. \(\quad \square \)
Corollary 2.4
(b-Sobolev embedding). On an n-dimensional compact manifold X with boundary, equipped with a smooth positive b-density, we have \(H_{{\textrm{b}}}^{s,\alpha }(X)\hookrightarrow \rho ^\alpha L^\infty (X)\) for \(s>\frac{n}{2}\), where \(\rho \in {\mathcal {C}}^\infty (X)\) is a boundary defining function. More generally, \(H_{{\textrm{b}}}^{s,\alpha }(X)\hookrightarrow \rho ^\alpha {\mathcal {C}}_{\textrm{b}}^k(X)\) for \(s>\frac{n}{2}+k\), where \({\mathcal {C}}_{\textrm{b}}^k(X)\) denotes the space of \({\mathcal {C}}^k\)-functions on \(X^\circ \) all of whose b-derivatives of order up to k are bounded. Finally, \(H_{{\textrm{b}}}^{\infty ,\alpha }(X)\hookrightarrow \mathcal {A}^\alpha (X)\).
Proof
This follows from Lemma 2.3 and Sobolev embedding on \(\mathbb {R}^n\). \(\quad \square \)
As a consequence of elliptic regularity on bounded subsets of \(\mathbb {R}^n\), we obtain:
Lemma 2.5
(Elliptic b-estimate). Let \(L\in \rho ^{-\beta }\textrm{Diff}_{\textrm{b}}^k(X)\). Let \(\chi _1,\chi _2\in {\mathcal {C}}^\infty _{{\textrm{c}}}(X)\), with \(\chi _2\equiv 1\) near \({\text {supp}}\chi _1\), and with \(\rho ^\beta L\) having an elliptic principal symbol near \({\text {supp}}\chi _2\). Then for all \(s,s_0\in \mathbb {R}\), there exists a constant C so that
This holds in the strong sense that if the right hand side is finite, then so is the left hand side and the estimate holds.
Via a partition of unity argument, we also obtain the estimate (2.7) for elliptic b-differential operators on manifolds.
Proof of Lemma 2.5
We can reduce to the case \(\alpha =\beta =0\). Then, for \(\tilde{\chi }\in {\mathcal {C}}^\infty _{{\textrm{c}}}(\mathcal {D})\) identically 1 near \({\text {supp}}\chi \), the pullback \(\tilde{\chi }\Phi _{p,j}^*L\in \textrm{Diff}^k(\mathcal {D})\) is uniformly bounded and uniformly elliptic on \(\tilde{\chi }^{-1}(1)\). Thus, for \(s_0\le s-1\), we have a uniform bound
which gives (2.7) for \(s_0=s-1\). Iterating this estimate on the error term (upon enlarging the supports of the cutoffs) finitely many times gives (2.7) in general. \(\quad \square \)
Consider now the case \(L\in \textrm{Diff}_{\textrm{b}}^k(X)\). Even when X is compact and one takes \(\chi _1=\chi _2=1\) in (2.7), the error term is not relatively compact (i.e. the inclusion \(H_{{\textrm{b}}}^{s,\alpha }(X)\hookrightarrow H_{{\textrm{b}}}^{s_0,\alpha }(X)\) fails to be a compact map); in fact, the Fredholm property of elliptic b-operators requires the invertibility of its normal operator [Mel93, §5.17]. Note then that when \(L\in \textrm{Diff}_{\textrm{b}}^k(X)\) is elliptic, the Mellin-transformed normal operator family \(N(L,\lambda )\in \textrm{Diff}^k(\partial X)\), regarded as an operator with large parameter \({\text {Re}}\lambda \), is elliptic when \(|{\text {Im}}\lambda |\) is bounded by an arbitrary but fixed constant \(C_1\), as follows from the explicit expression (2.1) and the fact that the contributions from \({\text {Im}}\lambda \) (when writing \(\lambda ={\text {Re}}\lambda +i{\text {Im}}\lambda \)) are subprincipal. A parametrix \(Q(\lambda )\) for \(N(L,\lambda )\) can then be constructed within the class of pseudodifferential operators with large parameter [Shu87], and as such, \(Q(\lambda )\) has order \(-k\); it has the property that \(Q(\lambda )\circ N(L,\lambda )=I+R(\lambda )\) where \(R(\lambda )\) is residual, in the sense that the Schwartz kernel of \(R(\lambda )\) is smooth and rapidly vanishing as \(|{\text {Re}}\lambda |\rightarrow \infty \), \(|{\text {Im}}\lambda |<C_1\). Thus, \(I+R(\lambda )\) is invertible for large \(|{\text {Re}}\lambda |\), with inverse on \(L^2(\partial X)\) given by a convergent Neumann series whose limit is again of the form \(I+{\tilde{R}}(\lambda )\) where \({\tilde{R}}(\lambda )\) is residual; therefore, \(N(L,\lambda )^{-1}=(I+\tilde{R} (\lambda ))Q(\lambda )\) is, for large \(|{\text {Re}}\lambda |\) (depending on the bound \(|{\text {Im}}\lambda |<C_1\)) a large parameter ps.d.o. of order \(-k\). As such, it is uniformly bounded as a map
between large parameter Sobolev spaces (see Sect. 2.3). The Fredholm property of an elliptic operator L as a map \(L:H_{{\textrm{b}}}^{s,\alpha }(X)\rightarrow H_{{\textrm{b}}}^{s-k,\alpha }(X)\) is equivalent to the invertibility of \(N(L,\lambda )\) for all \(\lambda \in \mathbb {C}\) with \({\text {Im}}\lambda =-\alpha \). See [Vas18, §6] for details. (A particular instance of this is given in the proof of Lemma 4.9 below.)
We now turn to the 00-setting on \(X=[0,\infty )_x\times \mathbb {R}^{n-1}_y\). The unit cells being of size \(\sim x^2\) in the x- and y-directions at a point \((x,y)\in X^\circ \), we now define, for \(p\in \mathbb {Z}^{n-1}\) and \(3\le j\in \mathbb {N}\), the diffeomorphismFootnote 12
where \(\mathcal {D}=(-2,2)_w\times B(0,2)_z\) as before. See Fig. 4 for an illustration. The inverse map is \((x,y)\mapsto (\frac{1}{x}-j,j^2 y-p)\); thus, the 00-vector fields \(x^2\partial _x\) and \(x^2\partial _y\) pull back to the uniformly bounded vector fields \(-\partial _w\) and \((\frac{j}{j+w})^2\partial _z\) on \(\mathcal {D}\). Fixing \(\chi \in {\mathcal {C}}^\infty _{{\textrm{c}}}(\mathcal {D})\), with \(\chi \equiv 1\) on \([-1,1]\times \{|z|\le 1\}\), the pushforwards \(\tilde{\chi }_{p,j}=(\Phi _{p,j})_*\chi \) have similar properties as above: their sum is locally finite and everywhere positive for \(x\le \frac{1}{2}\), and the intersections of more than a suitable fixed number \(J\in \mathbb {N}\) of supports are empty.
Illustration of the unit cells in the 00-setting, cf. (2.8)
The restriction to \(x\le \frac{1}{2}\) above is due to the fact that the scaling of the ‘natural’ 00-vector fields \(x^2\partial _x\) and \(x^2\partial _y\) is artificial for \(x\gtrsim 1\). For technical convenience, we thus fix \(\chi _\partial \in {\mathcal {C}}^\infty _{{\textrm{c}}}([0,\frac{1}{2}))\) to be identically 1 on \([0,\frac{1}{4}]\), and we fix \(\rho =\rho (x)\) so that \(\rho (x)=x\) for \(x\le \frac{1}{2}\), \(\rho \) is increasing, and \(\rho (x)=1\) for \(x\ge 1\); we then set
The analogue of Lemma 2.3 is then for \(s,\alpha ,\beta \in \mathbb {R}\) the norm equivalence
The proof is analogous to that of Lemma 2.3. (Only \({\text {supp}}(1-\chi _\partial )\) intersects more than J of the \({\text {supp}}\chi _{p,j}\) nontrivially, which still allows for an application of the Cauchy–Schwarz inequality in the proof of the boundedness of the analogue of the map denoted \(\Xi '\) there.)
Corollary 2.6
(00-Sobolev embedding) On an n-dimensional compact manifold X with boundary, equipped with a positive 00-density, we have \(H_{0 0}^s(X)\hookrightarrow L^\infty (X)\) for \(s>\frac{n}{2}\). Moreover, if \(\beta >0\), then \(e^{-\beta /\rho }H_{0 0}^\infty (X)\hookrightarrow \dot{{\mathcal {C}}}^\infty (X)\).
Proof
(Cf. the discussion before [Cor00, Theorem 4], and also [Del12, Corollary 4.3].) We only prove the last statement. Note that \(H_{0 0}^s(X)\hookrightarrow {\mathcal {C}}_{0 0}^k(X)\) for \(s>\frac{n}{2}+k\), where \({\mathcal {C}}_{0 0}^k(X)\) consists of all elements of \({\mathcal {C}}^k(X^\circ )\) which are uniformly bounded upon application of up to k 00-derivatives. Let \(u=e^{-\beta /\rho } u_0\) with \(u_0\in H_{0 0}^s(X)\). Suppose first that \(s>\frac{n}{2}\); then \(u\in e^{-\beta /\rho }{\mathcal {C}}_{0 0}^0(X)\hookrightarrow e^{-\beta '/\rho }{\mathcal {C}}^0(X)\) for all \(\beta '<\beta \). If \(s>\frac{n}{2}+1\), and if \(V\in \mathcal {V}(X)\), then
The terms in parentheses lie in \(H_{0 0}^{s-1}(X)\), and thus \(V u\in \rho ^{-2}e^{-\beta /\rho }{\mathcal {C}}_{0 0}^0(X)\hookrightarrow e^{-\beta '/\rho }{\mathcal {C}}^0(X)\). Continuing in this manner, we conclude that \(e^{-\beta /\rho }H_{0 0}^s(X)\hookrightarrow e^{-\beta '/\rho }{\mathcal {C}}^k(X)\) for \(s>\frac{n}{2}+k\) and \(\beta '<\beta \). This implies the Corollary upon letting \(s,k\nearrow \infty \), since \(e^{-\beta '/\rho }{\mathcal {C}}^\infty (X)\hookrightarrow \dot{{\mathcal {C}}}^\infty (X)\) for \(\beta '>0\). \(\quad \square \)
The characterization (2.9) can be used to prove an analogue of Lemma 2.5 for 00-differential operators \(L\in \textrm{Diff}_{0 0}^k(X)\) which only need to be assumed to be elliptic in the differential order sense, meaning that the image of \({}^{0 0}\upsigma ^m(L)\) in the quotient space \((P^m/P^{m-1})({}^{0 0}T^*X)=P^m_\textrm{hom}({}^{0 0}T^*X)\) is elliptic. Thus,
Improving the error term—concretely, weakening the polynomial weight \(\rho ^\alpha \) to \(\rho ^{\alpha '}\) with \(\alpha '<\alpha \)—requires the ellipticity of the boundary principal symbol of L:
Proposition 2.7
(Elliptic 00-estimate near the boundary). Let \(X=[0,\infty )\times \mathbb {R}^{n-1}\) and \(L\in \textrm{Diff}_{0 0}^k(X)\). Let \(R>0\), and suppose that \(\ell ={}^{0 0}\upsigma ^k(L)\) is elliptic at \((0,y)\in X\) when \(|y|<R\) (i.e. \(\ell (0,y)\in P^m({}^{0 0}T^*_{(0,y)}X)\) is elliptic and non-vanishing). Let \(\chi _1,\chi _2\in {\mathcal {C}}^\infty _{{\textrm{c}}}([0,\frac{1}{4})_x\times B(0,R))\), with \(\chi _2\equiv 1\) near \({\text {supp}}\chi _1\). Then for all \(s,\alpha \in \mathbb {R}\) there exists a constant C so that
This holds in the strong sense that if the right hand side is finite, then so is the left hand side and the estimate holds.
Note that conjugation of L by an exponential weight may destroy ellipticity of the boundary principal symbol; this is the reason for only using polynomially weighted spaces.
Proof of Proposition 2.7
Upon conjugating L by \(x^{-\alpha }\), which preserves the ellipticity condition, it suffices to consider the case \(\alpha =0\). Let \(\delta \in (0,\frac{1}{8})\) and \(R_0<R_1<R_2<R\). It suffices to prove (2.10) in the case that \(\chi _j(x,y)=\psi _j(x)\omega _j(y)\), \(j=1,2\), where \(\psi _j\in {\mathcal {C}}^\infty _{{\textrm{c}}}([0,j\delta ))\) is identically 1 on \([0,(j-\frac{1}{2})\delta ]\), and \(\omega _j\in {\mathcal {C}}^\infty _{{\textrm{c}}}(B(0,R_j))\) is identically 1 on \(B(0,R_{j-1})\). Fix moreover \(\chi _3\) of the same product form, which is in between \(\chi _1\) and \(\chi _2\) in the sense that \(\chi _3\equiv 1\) near \({\text {supp}}\chi _1\), and \(\chi _2\equiv 1\) near \({\text {supp}}\chi _3\).
Writing 00-covectors as \(\xi \frac{\textrm{d}x}{x^2}+\eta \frac{\textrm{d}y}{x^2}\), the ellipticity of \(\ell (0,y;\xi ,\eta )\) implies an estimate \(|\ell (0,y;\xi ,\eta )|\ge c(1+|\xi |+|\eta |)^k\) for some \(c>0\). Set
which is thus a symbol of order \(-k\) in \((\xi ,\eta )\). Let \(\phi \in {\mathcal {C}}^\infty _{{\textrm{c}}}((-\frac{1}{2},\frac{1}{2}))\) be identically 1 near 0. We then define the oscillatory integral operator
A few technical comments about this definition are in order. Firstly, the two cutoffs involving \(\phi \) ensure that the Schwartz kernel of Q, given by the inverse Fourier transform of q (i.e. dropping f and the \((x',y')\)-integration), is supported in \(\frac{x}{x'}\simeq 1\) and \(|y-y'|\lesssim x,x'\) (and thus will be shown to preserve whatever weight the input f has on the scale of weighted 00-Sobolev spaces). Carefully note moreover that the cutoffs do not localize (x, y) and \((x',y')\) in the same unit cells for 00-geometry, but rather in the same unit cells for 0-geometry: they cut the Schwartz kernel off in the region where the Fourier transform of q already vanishes to infinite order in the limit \(x+x'\searrow 0\).Footnote 13
We shall prove:
Granted these estimates, we have \(\chi _1 u=Q L(\chi _1 u)-R\chi _1 u=Q(\chi _2 L\chi _1 u)-R\chi _1 u\); but since \(\chi _2 L\chi _1 u=\chi _2 L u-\chi _2[L,\chi _1]u\), with \(\Vert \chi _2[L,\chi _1]u\Vert _{H_{0 0}^{s-k}}\le C\Vert \chi _2 u\Vert _{x^{-1}H_{0 0}^{s-1}}\), this gives (2.10).
In order to prove (2.12), consider a unit cell \(\mathcal {D}_{p,j}\) (see (2.8)) with nonempty intersection with \({\text {supp}}\chi _2\), and consider points \((x,y)\in \mathcal {D}_{p,j}\). If \((x,x',y,y')\) lies in the support of the integrand of (2.11), then \(|\frac{x'}{x}|\in (\frac{1}{2},2)\) and \(|y'-y|\in B(0,\frac{1}{2} x')\), and therefore \((x',y')\in \mathcal {D}_{p',j'}\) where
for some constant \(C_1>1\) independent of p, j. Choosing the localization constant \(\delta >0\) in the definition of \(\chi _2\) small enough, we have \(j\ge 10 C_1\) and thus \(j'\ge 10\). We shall now prove the following estimate for the ‘matrix elements’ of Q:
Here, \(\tilde{\chi }_{p',j'}=\frac{\chi _{p',j'}}{S}\) where \(S=\sum \chi _{p'',j''}\) is positive on \({\text {supp}}\chi _2\); the pullbacks \(\Phi _{p,j}^*S\), \(\Phi _{p,j}^*S^{-1}\), and thus also \(\Phi _{p,j}^*\tilde{\chi }_{p',j'}\) are uniformly bounded in \({\mathcal {C}}^\infty (\mathcal {D})\), and \(\sum _{p',j'}\tilde{\chi }_{p',j'}=1\) on \({\text {supp}}\chi _2\). Once (2.15) is proved, summing over \(p',j'\) subject to (2.14) gives a square-integrable sequence in (p, j) provided \(\Vert \Phi _{p',j'}^*(\chi _{p',j'}f)\Vert _{H^{s-k}}\) is square-integrable; indeed, note that the sum of (2.15) over \(p',j'\) is of convolution type, and \((1+|j|+|p|)^{-N}\) is summable for \(N>n\). (That is, we use the fact that \(\ell _1*\ell _2\subset \ell _2\).) In view of (2.9), this establishes (2.12) upon using \(\chi _2 f\) instead of f in (2.15).
For the proof of (2.15) then, consider first the case \((p',j')=(p,j)\). Inserting \((x,y)=\Phi _{p,j}(w,z)\) and \((x',y')=\Phi _{p',j'}(w,z)\) in (2.11), the operator \(\chi _{p,j}Q\tilde{\chi }_{p',j'}\) becomes an operator on \(\mathbb {R}^n\) with Schwartz kernel
where \(\phi _2\) is the product of the two \(\phi \) factors in (2.11). Changing variables to \(\frac{\xi }{(j+w)/(j-w)}\) and \(\frac{\eta }{(j/(j+w))^2}\) gives a pseudodifferential operator on \(\mathbb {R}^n\) which obeys uniform (in p, j) bounds as a map \(H^{s-k}(\mathbb {R}^n_{(w',z')})\rightarrow H^s(\mathbb {R}^n_{(w,z)})\).
Consider next the case that the supports of \(\chi _{p,j}\) and \(\chi _{p',j'}\) are disjoint; in this case, we can integrate by parts in (2.11) using that \(|(\frac{x-x'}{x^2},\frac{y-y'}{x^2})|^{-2}(\frac{x-x'}{x^2},\frac{y-y'}{x^2})\cdot \nabla _{(\xi ,\eta )}\) preserves the exponential, and using that \(\nabla _{(\xi ,\eta )}\) reduces the symbolic order of q by 1. The operator \(\chi _{p,j}Q\tilde{\chi }_{p',j'}\) thus has a smooth Schwartz kernel which is rapidly decaying as \(|(\frac{x-x'}{x^2},\frac{y-y'}{x^2})|\rightarrow \infty \), which upon passing to \((w,z,w',z')\)-coordinates as above implies the bound (2.15) (where one can in fact replace the Sobolev orders on both sides by arbitrary but fixed numbers).
Finally, for those \((p',j')\) for which \(\chi _{p,j}\) and \(\chi _{p',j'}\) have intersecting supports—recall that the number of such \((p',j')\) is bounded independently of (p, j)—we can split \(\tilde{\chi }_{p',j'}\) in (2.15) into \(\tilde{\chi }_{p',j'}\chi ^\sharp _{p,j}+\tilde{\chi }_{p',j'}(1-\chi ^\sharp _{p,j})\), where \(\chi ^\sharp _{p,j}=\Phi _{p,j}^*\chi ^\sharp \) with \(\chi ^\sharp \in {\mathcal {C}}^\infty _{{\textrm{c}}}(\mathcal {D})\) identically 1 near \({\text {supp}}\chi \). In the first summand, we pass to \(\Phi _{p,j}\)-coordinates for both x, y and \(x',y'\), obtaining a ps.d.o. on \(\mathbb {R}^n\) as before; the second summand gives a smoothing operator using the above integration by parts argument, and (2.15) holds (with \(|j-j'|\) and \(|p-p'|\) bounded independently of (p, j) for the values of \((p',j')\) we are currently considering). This finishes the proof of (2.15), and thus of (2.12).
We next turn to the proof of (2.13). Plugging \(f=L(\chi _1 u)\) into (2.11), with L (in primed coordinates) a sum of terms \(a(x',y')(x'{}^2 D_{x'})^j(x'{}^2 D_{y'})^\alpha \), we integrate by parts in \((x',y')\). When \(-D_{x'}x'{}^2=i x'{}^2\partial _{x'}+2 i x'\) falls on the exponential, we obtain a factor of
We regard the first term as the main term; it has the same symbol as \(x^2 D_x\). Note that the third summand has a factor of \(x'\); and in the second summand, we can rewrite \(\frac{x-x'}{x^2}\) as the \(D_\xi \)-derivative of the exponential followed by integration by parts in \(\xi \), with the \(\xi \)-derivative of q dropping one symbolic order. Another possibility is that \(-x'{}^2 D_{x'}\) falls on one of the cutoffs \(\phi \) in (2.11), resulting in a term with an extra factor of \(x'\) and a differentiated cutoff which is thus equal to 0 near \((\frac{x-x'}{x'},\frac{|y-y'|}{x'})=0\). Derivatives along \(-D_{y'}x'{}^2\) are rewritten in a similar manner.
Altogether then, \(Q L(\chi _1 u)\) is the sum of three types of terms: one term which is a quantization as in (2.11) with symbol \(q(x,y;\xi ,\eta )\ell (x,y;\xi ,\eta )=\chi _2(x,y)+x r(x,y;\xi ,\eta )\) where r is a symbol of order 0; secondly, terms with undifferentiated cutoffs \(\phi \), with symbolic order 0, but with extra factors of x or \(x'\); and thirdly, terms in which at least one of the cutoffs \(\phi \) is differentiated at least once. It then suffices to note that the quantization of xr as well as terms of the second type are bounded from \(x^{-1}H_{0 0}^s\rightarrow H_{0 0}^s\) (as follows from the mapping properties we showed above for Q—which did not rely on the specific form of Q). Moreover, terms of the third type, due to the localization away from the diagonal, are bounded between any two polynomially weighted 00-Sobolev spaces; for present purposes, it suffices to note the weaker statement that upon performing one integration by parts in the momentum variables, one obtains an operator which is a 00-quantization (with slightly enlarged cutoffs \(\phi \)) whose symbol gains a power of \(|(\frac{x-x'}{x^2},\frac{y-y'}{x^2})|^{-1}=x|(\frac{x-x'}{x},\frac{y-y'}{x})|^{-1}\), and thus a power of x since \(|(\frac{x-x'}{x},\frac{y-y'}{x})|\) is bounded from below by a positive constant on . \(\quad \square \)
Example 2.8
(Exterior derivative) Slightly modifying Example 2.2, let \(\Omega \subset \mathbb {R}^n\) be a bounded domain with smooth boundary, and let \(\rho \in {\mathcal {C}}^\infty (\overline{\Omega })\) be a boundary defining function. For \(\beta \in \mathbb {R}\), we consider
with principal symbol \({}^{0 0}\upsigma ^1(L)(z,\zeta )=i\rho ^2\zeta -\beta \,\textrm{d}\rho :\mathbb {R}\rightarrow T^*_z\mathbb {R}^n\) where \(\zeta \in {}^{0 0}T^*\overline{\Omega }\); recall also that \(\rho ^2:{}^{0 0}T^*\overline{\Omega }\rightarrow T^*_{\overline{\Omega }}\mathbb {R}^n\) is an isomorphism). For \(\beta \ne 0\), this is injective. Taking adjoints with respect to the standard volume density and fiber inner products on \(\mathbb {R}^n\), we have \(L^*=e^{\beta /\rho }\delta _\mathfrak {e}\,\rho ^2 e^{-\beta /\rho }\) where \(\delta _\mathfrak {e}\) is the (negative) divergence on 1-forms. The operator \(L^*L\in \textrm{Diff}_{0 0}^2(\overline{\Omega })\) has an elliptic principal symbol (both at fiber infinity and at \(\partial \Omega \)). The elliptic estimates proved above therefore give
Note then that the inclusion \(H_{0 0}^s(\overline{\Omega })\rightarrow x^{-1}H_{0 0}^{s-1}(\overline{\Omega })\) is compact; and any \(u\in H_{0 0}^s\) with \(L^*L u=0\) automatically lies in \(\rho ^\infty H_{0 0}^\infty (\overline{\Omega })\), and therefore (via pairing with u and integrating by parts) satisfies \(L u=0\), so \(u=c e^{-\beta /\rho }\) for some constant \(c\in \mathbb {C}\). For \(\beta >0\), we have \(e^{-\beta /\rho }\in H_{0 0}^s\) indeed, and we conclude that \(L^*L: H_{0 0}^s(\overline{\Omega })\rightarrow H_{0 0}^{s-2}(\overline{\Omega })\) has closed range with 1-dimensional kernel and cokernel spanned by \(e^{-\beta /\rho }\). Let now \(f\in {\mathcal {C}}^\infty _{{\textrm{c}}}(\Omega )\) with \(\int f(x)\,\textrm{d}x=0\); then \(e^{\beta /\rho }f\in {\text {ran}}L^*L\), so there exists \(u\in H_{0 0}^\infty (\overline{\Omega })\) with \(L^*(L u)=e^{\beta /\rho }f\), or equivalently
where we used Corollary 2.6. Thus, we have solved the divergence equation with control on the support \(\omega \).
3 Geometry and Analysis on the Total Gluing Space
We now turn to the gluing problem in earnest. Fix an n-dimensional manifold X (without boundary) and a point \(\mathfrak {p}\in X\); here \(n\ge 3\). We then consider a fibration
together with a choice of identification of the fiber over 0 with X. For the purpose of doing analysis, it is convenient to immediately fix an identification of the other fibers with X as well; that is, we fix a trivialization
We denote the coordinate in the first factor by \(\epsilon \).
Definition 3.1
(Total gluing space; tangent bundle). In the above notation, we define
as the real blow-up of \(\widetilde{X}'\) at the point \(\mathfrak {p}\in X\) in the fiber over \(\epsilon =0\); we write \(\widetilde{\upbeta }:\widetilde{X}\rightarrow \widetilde{X}'\) for the blow-down map. We write
for the front face and the side face (the lift of the original boundary), respectively; the restriction of \(\widetilde{\upbeta }\) to \(X_\circ \) is denoted \(\upbeta _\circ : X_\circ \rightarrow X\). For \(\epsilon \in (0,1)\) we define the level set
which we also regard as a submanifold of \(\widetilde{X}'\). We moreover denote by \(\widetilde{T}\widetilde{X}'\rightarrow \widetilde{X}'\) the vertical tangent bundle, i.e. the bundle of tangent vectors which are tangent to the fibers, and by \(\widetilde{T}\widetilde{X}\rightarrow \widetilde{X}\) the pullback of \(\widetilde{T}\widetilde{X}'\) along \(\widetilde{\upbeta }\). We write \(\widetilde{\mathcal {V}}(\widetilde{X}):={\mathcal {C}}^\infty (\widetilde{X};\widetilde{T}\widetilde{X})\).
We denote by \(\hat{\rho }\) and \(\rho _\circ \in {\mathcal {C}}^\infty (\widetilde{X})\) defining functions of \({\hat{X}}\) and \(X_\circ \), respectively; when working in subsets of \(\widetilde{X}\), we use the same notation for local defining functions (i.e. defining functions of \({\hat{X}}\) and \(X_\circ \) inside the submanifold).
Concretely, in local coordinates \(x=(x^1,\ldots ,x^n)\in \mathbb {R}^n\) on X, valid for \(|x|<r_0\) and with \(\mathfrak {p}\) given by \(x=0\), a neighborhood of \({\hat{X}}\subset \widetilde{X}\) is covered by the two coordinate charts
We shall see in Lemma 3.2(2) below that \({\hat{x}}\) is a linear coordinate on the interior \({\hat{X}}^\circ \cong T_\mathfrak {p}X\) of \({\hat{X}}\). See Fig. 5.
For any \(\epsilon \in (0,1)\), we have \(\widetilde{X}_\epsilon \cong X\) via \((\epsilon ,p)\mapsto p\), and \(\widetilde{T}_{\widetilde{X}_\epsilon }\widetilde{X}=\widetilde{T}_{\widetilde{X}_\epsilon }\widetilde{X}'=T X\). In \(\epsilon >0\) then, elements \(V\in \widetilde{\mathcal {V}}(\widetilde{X})\) are smooth families (in \(\epsilon \)) of smooth vector fields \(V|_{\widetilde{X}_\epsilon }\) on \(\widetilde{X}_\epsilon =X\) which become singular in a specific fashion as \(\epsilon \searrow 0\): in local coordinates x on X, they are linear combinations of \(\partial _{x^i}\) (\(i=1,\ldots ,n\)) with coefficients which are smooth functions on \(\widetilde{X}\), i.e. smooth functions of \((\epsilon ,{\hat{x}})\), resp. \((\hat{\rho },\rho _\circ ,\omega )\). (Away from \({\hat{X}}\), the notions of smoothness on \(\widetilde{X}\) and \(\widetilde{X}'\) agree since \(\widetilde{X}\setminus \widehat{X}=\widetilde{X}'\setminus \{(0,\mathfrak {p})\}\).)
Lemma 3.2
(Basic properties of \(\widetilde{X}\))
-
(1)
We have \(\widetilde{T}_{X_\circ }\widetilde{X}=\upbeta _\circ ^*(T X)\). For all \(q\in {\hat{X}}\), we have \(\widetilde{T}_q\widetilde{X}=T_\mathfrak {p}X\).
-
(2)
We have \({\hat{X}}\cong \overline{T_\mathfrak {p}}X\), with the isomorphism given by continuous extension of \(T_\mathfrak {p}X\ni V\mapsto \lim _{s\searrow 0}(s,\gamma (s))\) where \(\gamma \) is a curve in X with \(\gamma (0)=\mathfrak {p}\) and \(\gamma '(0)=V\). In particular, \({\hat{X}}^\circ \cong T_\mathfrak {p}X\) is a vector space.
-
(3)
Let \(q\in {\hat{X}}^\circ \). Then the map assigning to \(V\in \widetilde{T}_q\widetilde{X}\) the restriction \((\epsilon \widetilde{V})|_q\), where \(\widetilde{V}\in \widetilde{\mathcal {V}}(\widetilde{X})\) satisfies \(\widetilde{V}(q)=V\), defines an isomorphism \(\widetilde{T}_q\widetilde{X}\xrightarrow {\cong } T_q{\hat{X}}\) which extends to an isomorphism \(\widetilde{T}_{{\hat{X}}}\widetilde{X}\xrightarrow {\cong }{}^{{\textrm{sc}}}T{\hat{X}}\).
-
(4)
We have \(\widetilde{\mathcal {V}}(\widetilde{X})\hookrightarrow \hat{\rho }^{-1}\mathcal {V}_{\textrm{b}}(\widetilde{X})\).
Proof
The first part follows directly from the definitions. For the second part, write \(V=V^i\partial _{x^i}\) in local coordinates centered at \(\mathfrak {p}\); then \(\gamma (s)=(s V^1,\ldots ,s V^n)+\mathcal {O}(s^2)\), and therefore \(s\mapsto (s,\gamma (s))\) in \((\epsilon ,{\hat{x}})\)-coordinates is given by \(s\mapsto (\epsilon ,{\hat{x}})=(s,(V^1,\ldots ,V^n)+\mathcal {O}(s))\). This establishes an isomorphism \(T_\mathfrak {p}X\cong {\hat{X}}^\circ \). Passing to inverse polar coordinates \((\rho _T,\omega _T):=|(V^1,\ldots ,V^n)|^{-1}(1,(V^1,\ldots ,V^n))\) and \((\rho _\circ ,\omega )\), \(\rho _\circ =|{\hat{x}}|^{-1}\), near \(\partial \overline{T_\mathfrak {p}}X\) and \(\partial {\hat{X}}\), respectively, this map is given by \((\rho _T,\omega _T)\mapsto (\rho _\circ ,\omega )=(\rho _T,\omega _T)\) and thus extends to a diffeomorphism as claimed.
In part (3), one assigns to \(V^i\partial _{x^i}=\epsilon ^{-1}V^i\partial _{{\hat{x}}^i}\) the tangent vector \(V^i\partial _{{\hat{x}}^i}\). It then remains to note that \(\partial _{{\hat{x}}^i}\) is a frame of \({}^{{\textrm{sc}}}T{\hat{X}}\). Part (4) finally follows from the above away from the corner \(X_\circ \cap {\hat{X}}\) of \(\widetilde{X}\); near the corner on the other hand, note that \(V=|x|\partial _{x^i}=\hat{\rho }\partial _{x^i}\) is a smooth b-vector field on \(X_\circ \), thus its lift to \(\widetilde{X}\) is a linear combination of \(\hat{\rho }\partial _{\hat{\rho }}\), \(\rho _\circ \partial _{\rho _\circ }\), \(\partial _\omega \) with \({\mathcal {C}}^\infty ([0,1)_{\hat{\rho }}\times \mathbb {S}^{n-1}_\omega )\)-coefficients. Therefore, \(\partial _{x^i}=\hat{\rho }^{-1}V\in \hat{\rho }^{-1}\mathcal {V}_{\textrm{b}}(\widetilde{X})\), as claimed.
\(\square \)
Definition 3.3
(Scaling) For \(p,q\in \mathbb {N}_0\), we write \(\textsf{s}:\widetilde{T}_{\widetilde{X}^\circ }^{p,q}\widetilde{X}\rightarrow \widetilde{T}_{\widetilde{X}^\circ }^{p,q}\widetilde{X}\) for multiplication by \(\epsilon ^{q-p}\). By an abuse of notation, we denote the direct sum of several such maps (for various values of (p, q)) by \(\textsf{s}\) as well. We write \(\hat{\textsf{s}}:{}^{{\textrm{sc}}}T^{p,q}{\hat{X}}\rightarrow \widetilde{T}_{{\hat{X}}}\widetilde{X}\) for the restriction of \(\textsf{s}\) to \({\hat{X}}\).
The map \(\hat{\textsf{s}}\) is well-defined and indeed a bundle isomorphism by Lemma 3.2(3). For example, \(\textsf{s}\) is division by \(\epsilon \) on tangent vectors (mapping \(\partial _{{\hat{x}}^i}\mapsto \partial _{x^i}\) in the above local coordinates), and multiplication by \(\epsilon ^2\) on symmetric 2-tensors (mapping \(\textrm{d}{\hat{x}}^i\,\textrm{d}{\hat{x}}^j\mapsto \textrm{d}x^i\,\textrm{d}x^j\)).
When relating objects (such as functions or differential operators) on \(\widetilde{X}\) to their restrictions to \({\hat{X}}\) or \(X_\circ \), the following geometric result will be useful; we work with local coordinates \(x\in \mathbb {R}^n\) on X near \(\mathfrak {p}\), and with \({\hat{x}}=\frac{x}{\epsilon }\).
Lemma 3.4
(Relationships between parameterized spaces) The identity map on \(\widetilde{X}'\) lifts to a diffeomorphism
and also, near \(\{0\}\times \{\mathfrak {p}\}\), to a diffeomorphism
Proof
This can be checked directly by covering the spaces on both sides with coordinate charts and writing down the identity map on \(\widetilde{X}'\) with respect to those; see Fig. 6 for the diffeomorphism (3.1). Alternatively, the first diffeomorphism arises from the commutation result [Mel96, Proposition 5.8] for iterated blow-ups, which gives
where we used in the last line that the lift of \(\{0\}\times \{\mathfrak {p}\}\) to \([0,1)\times [X;\{\mathfrak {p}\}]=[0,1)\times X_\circ \) is \(\{0\}\times \partial X_\circ \). \(\quad \square \)
The diffeomorphism (3.1) in local coordinates. Matching coordinates are indicated with matching colors
Let now
be identically 1 near, and supported in, a small collar neighborhood of \({\hat{X}}\) and \(X_\circ \), respectively. Then via Lemma 3.4, we can extend any given function \(f\in {\mathcal {C}}^\infty (X_\circ )\) to an \(\epsilon \)-independent function on \([0,1)\times X_\circ \), lift it to the resolution, and upon localizing with \(\chi _\circ \) finally define \(\chi _\circ f\) as a smooth function on \(\widetilde{X}\) (given that \({\text {supp}}\chi _\circ \) is disjoint from the lift of \(x=0\)), with support near \(X_\circ \); similarly for the extension of functions from \({\hat{X}}\). In the same manner, a differential operator such as \(a(x)D_x^\alpha \) on \(X_\circ \), resp. \(a({\hat{x}})D_{{\hat{x}}}^\alpha \) on \({\hat{X}}\), lifts upon localization to a differential operator \(\chi _\circ a(x)D_x^\alpha \), resp. \(\hat{\chi } a(x/\epsilon )(\epsilon D_x)^\alpha \) on \(\widetilde{X}\) (using coordinates \(\epsilon >0\) and \(x\in \mathbb {R}^n\)).
3.1 Analysis on the total gluing space
As we shall see in Sect. 4.3, the degeneration of the linearized constraints maps associated with a metric and second fundamental form which are (weighted) sections of \(S^2\widetilde{T}^*\widetilde{X}\) can be captured precisely using the terminology of q-analysis, which was introduced in [Hin21b, §2.1] as a close relative of the analytic surgery calculus of McDonald [McD90] and Mazzeo–Melrose [MM95]. Using the notation of Definition 3.1, we recall the Lie algebra of q-vector fields
and the associated space \(\textrm{Diff}_{\textrm{q}}^m(\widetilde{X})\) of m-th order q-differential operators; thus, elements of \(\mathcal {V}_{\textrm{q}}(\widetilde{X})\) are families of vector fields on X which depend smoothly on \(\epsilon \in (0,1)\) and degenerate in a specific manner as \(\epsilon \searrow 0\). The restriction of q-vector fields to the boundary hypersurfaces of \(\widetilde{X}\) induces surjective normal operator maps
with kernels \(\hat{\rho }\mathcal {V}_{\textrm{q}}(\widetilde{X})\) and \(\rho _\circ \mathcal {V}_{\textrm{q}}(\widetilde{X})\), respectively. These extend multiplicatively to normal operator homomorphisms
when \(E\rightarrow \widetilde{X}\) is a vector bundle. We remark that if \(E=\widetilde{T}^{p,q}\widetilde{X}\) and \(A\in \textrm{Diff}_{\textrm{q}}(\widetilde{X};E)\), then \(\hat{\textsf{s}}^{-1}\circ N_{{\hat{X}}}(A)\circ \hat{\textsf{s}}\), resp. \(N_{X_\circ }(A)\) is an element of \(\textrm{Diff}_{\textrm{b}}({\hat{X}} ;{}^{{\textrm{sc}}}T^{p,q}{\hat{X}})\), resp. \(\textrm{Diff}_{\textrm{b}}(X_\circ ;\upbeta _\circ ^* T^{p,q}X)\); cf. Lemma 3.2(1), (3). We furthermore note that Lemma 3.2(4) gives \(\widetilde{\upbeta }^*\mathcal {V}(X)\hookrightarrow \hat{\rho }^{-1}\mathcal {V}_{\textrm{q}}(\widetilde{X})\), where on the left we regard vector fields on X as \(\epsilon \)-independent vector fields on \(\widetilde{X}'\).
In local coordinates \(x\in \mathbb {R}^n\) on X, with \(x=0\) at \(\mathfrak {p}\), we can take \(\hat{\rho }=(\epsilon ^2+|x|^2)^{1/2}=\epsilon \langle {\hat{x}}\rangle \) where \({\hat{x}}=\frac{x}{\epsilon }\), and the space \(\mathcal {V}_{\textrm{q}}(\widetilde{X})\) is then spanned over \({\mathcal {C}}^\infty (\widetilde{X})\) by the vector fields
whose \({\hat{X}}\)-, resp. \(X_\circ \)-normal operators are given by \(\langle {\hat{x}}\rangle \partial _{{\hat{x}}^j}\) (cf. Lemma 3.2(3)), resp. \(|x|\partial _{x^j}\). (This can be used to prove the stated properties of the maps (3.3).) The vector fields (3.4) form a frame of the q-tangent bundle \({}^{\textrm{q}}T\widetilde{X}\rightarrow \widetilde{X}\), the smooth sections of which are exactly the q-vector fields. The principal symbol of \(A\in \textrm{Diff}_{\textrm{q}}^k(\widetilde{X})\) is then an element
defined analogously to the b-principal symbol in Sect. 2.1.
Turning to function spaces, suppose that X is compact without boundary, and \(\widetilde{X}=[[0,1)\times X;\{0\}\times \{\mathfrak {p}\}]\) as usual. Fix a smooth positive q-density \(\mu \) on \(\widetilde{X}\), which in local coordinates as above is thus a smooth positive multiple of \((\epsilon ^2+|x|^2)^{-n/2}|\textrm{d}x|=\langle {\hat{x}}\rangle ^{-n}|\textrm{d}{\hat{x}}|\). For \(\epsilon \in (0,1)\), we then define \(H_{{\textrm{q}},\epsilon }^0(X)\) as the \(L^2\)-space on X with respect to the smooth density \(\mu |_{\widetilde{X}_\epsilon }\) on \(\widetilde{X}_\epsilon \cong X\). Fixing moreover a collection \(\{V_1,\ldots ,V_N\}\subset \mathcal {V}_{\textrm{q}}(\widetilde{X})\) of q-vector fields which spans \(\mathcal {V}_{\textrm{q}}(\widetilde{X})\) over \({\mathcal {C}}^\infty (\widetilde{X})\), we define for \(s\in \mathbb {N}_0\) and \(\epsilon \in (0,1)\)
This is equivalent to the norm on \(H^s(X)\) for any fixed \(\epsilon >0\), but not uniformly equivalent when \(\epsilon \searrow 0\). Thus, we define \(H_{{\textrm{q}},\epsilon }^s(X)=H^s(X)\) with the norm (3.5). Via interpolation and duality, we can define \(H_{{\textrm{q}},\epsilon }^s(X)\) as a Hilbert space for \(s\in \mathbb {R}\); we shall also consider weighted versions \(\hat{\rho }^\alpha \rho _\circ ^\beta H_{{\textrm{q}},\epsilon }^s(X)\), where \(\alpha ,\beta \in \mathbb {R}\). Any fixed element \(A\in \textrm{Diff}_{\textrm{q}}^m(\widetilde{X})\) defines a uniformly (in \(\epsilon \)) bounded linear map \(H_{{\textrm{q}},\epsilon }^s(X)\rightarrow H_{{\textrm{q}},\epsilon }^{s-m}(X)\), similarly for weighted spaces.
In a collar neighborhood of \({\hat{X}}\), resp. \(X_\circ \), we can relate q-Sobolev spaces to b-Sobolev spaces (defined with respect to positive smooth b-densities on \({\hat{X}}\) and \(X_\circ \)); to wit, in terms of the maps
and recalling the cutoffs \(\hat{\chi }\), \(\chi _\circ \) from (3.2), we have uniform (in \(\epsilon \in (0,1)\)) norm equivalences
i.e. the left hand side is bounded by a constant (independent of \(\epsilon \) and u) times the right hand side, and vice versa. See [Hin21b, Proposition 2.13] for a pseudodifferential proof (though using differently weighted densities); alternatively, for \(s\in \mathbb {N}_0\), the equivalences (3.7) follow from the fact that lifts of b-vector fields on \({\hat{X}}\), resp. \(X_\circ \), span the space of q-vector fields near \({\text {supp}}\hat{\chi }\), resp. \({\text {supp}}\chi _\circ \), cf. the local coordinate descriptions above. For general s one then uses interpolation and duality. We can combine the two equivalences in (3.7) to the equivalence
upon (as we may) choosing \(\hat{\chi },\chi _\circ \) such that \(\chi _\circ =1\) on \({\text {supp}}(1-\hat{\chi })\). (In the second term, the power of \(\hat{\rho }\) is arbitrary since \(1-\hat{\chi }=0\) near \(\hat{\rho }=0\). One has an analogous norm equivalence upon replacing \(\hat{\chi }\), \(1-\hat{\chi }\) by \(1-\chi _\circ \), \(\chi _\circ \), respectively; in this case the power of \(\rho _\circ \) in the first term is arbitrary.) To each of the two summands, the analysis of Sect. 2.4 applies; thus we obtain a bounded geometry perspective on q-analysis. This immediately gives (elliptic) estimates for q-differential operators using the arguments in Sect. 2.4.
4 The Constraints Map and Its Linearization
Let \(\Lambda \in \mathbb {R}\). We fix an n-dimensional manifold X, \(n\ge 3\), and define, for a \({\mathcal {C}}^2\) Riemannian metric \(\gamma \) and a \({\mathcal {C}}^1\) symmetric 2-tensor k on X, the tensor
where \(\underline{\mathbb {R}}{}=X\times \mathbb {R}\rightarrow X\) is the trivial bundle. The operator P is called the constraints map, and the constraint equations (1.1) read \(P(\gamma ,k;\Lambda )=0\).
Lemma 4.1
(Linearized constraints map) The linearization of P around \((\gamma ,k)\), defined by
takes the block form
where we write \(A\circ B\) (for A, B sections of \(T^*X\otimes T^*X\)) for the section of \(T^*X\otimes T^*X\) given by \((A\circ B)_{i j}=A_{i\ell }B^\ell {}_j\); moreover, for tensors \(T\in (T^*X)^p\) and \(S\in (T^*X)^q\) with \(p<q\), and for \(1\le i_1<\ldots <i_p\le q\), we write \(\langle T,S\rangle _{i_1\ldots i_p}=\langle S,T\rangle _{i_1\ldots i_p}\in (T^*X)^{q-p}\) for the contraction of the \((i_1,\ldots ,i_p)\)-components of S with T. All inner products and covariant derivatives are with respect to \(\gamma \), and we write \((\nabla k)_{i j \ell }=k_{i j;\ell }\). Finally, \(\Delta _\gamma =-{\text {tr}}_\gamma \nabla ^2\). The adjoint is
Proof
For the formula for \(L_{\gamma ,k}\), see [FM73] (which uses the momentum tensor \(\pi =({\text {tr}}_\gamma k)\gamma -k\) instead of k), or [CD03, Equation (2.2)] (where the opposite sign convention for the Laplacian is used). The expression for the adjoint follows from a short calculation. \(\quad \square \)
As observed in [CD03], \(L_{\gamma ,k}\) is a Douglis–Nirenberg system of operators [DN55], with
and thus \((L_{\gamma ,k}^*)_{i j}\in \textrm{Diff}^{s_j+t_i}\); and indeed
At \(\xi \in T_x^*X\), the principal symbol of \(L_{\gamma ,k}\), which is a map \(S^2 T_x^*X\oplus T_x^*X\rightarrow \mathbb {R}\oplus T_x^*X\) in the Douglis–Nirenberg sense, is
where \(\iota _{\xi ^\sharp }\) is the contraction with \(\xi ^\sharp =\gamma ^{-1}(\xi ,-)\), and we do not write out the off-diagonal term explicitly. The principal symbol of the adjoint is
An important feature of the order convention (4.3) (specifically, the fact that all \(t_j\) are equal) is that the Douglis–Nirenberg principal symbol of
i.e. the matrix of principal symbols, is equal to the product \(\upsigma (L_{\gamma ,k})\upsigma (L_{\gamma ,k}^*)\). The following result thus shows that (4.6) is elliptic:
Lemma 4.2
(Left ellipticity of the adjoint) For \(\xi \in T^*_x X\setminus o\), the linear map
is injective.
Proof
This is proved in [CD03, Lemma 2.3]: if \((f^*,j^*)\in \mathbb {R}\oplus T^*_x X\) lies in the kernel of \(\upsigma (L_{\gamma ,k}^*)(x,\xi )\), then \(\xi \otimes _s j^*-\gamma \langle \xi ,j^*\rangle =0\), so upon taking traces \((1-n)\langle \xi ,j^*\rangle =0\); this then gives \(\xi \otimes _s j^*=0\), and therefore \(j^*=0\). Moreover, the fact that \(\gamma |\xi |_{\gamma ^{-1}}^2-\xi \otimes \xi \ne 0\) for \(\xi \ne 0\) (this being the difference of a rank n and a rank 1 operator, which thus has rank \(\ge n-1>0\)) forces \(f^*=0\). \(\quad \square \)
Lemma 4.3
(Prolongation of \(L_{\gamma ,k}^*\)) Fix a Riemannian metric \(\gamma \in {\mathcal {C}}^2(X;S^2 T^*X)\) and a symmetric 2-tensor \(k\in {\mathcal {C}}^1(X;T^*X)\). Let \(\alpha :[0,1]\rightarrow X\) be a \({\mathcal {C}}^2\) path. Then there exists a \({\mathcal {C}}^0\) bundle endomorphism F on \((\,(\underline{\mathbb {R}}{}\oplus T^*X)\oplus (T^*X\oplus (T^*X\otimes T^*X))\oplus (S^2 T^*X\oplus (S^2 T^*X\oplus (T^*X\otimes S^2 T^*X)))\,)|_{\alpha ([0,1])}\) with the following property: for all \(f^*\in {\mathcal {C}}^2(X)\) and \(j^*\in {\mathcal {C}}^2(X;T^*X)\), and writing \((h^*,q^*)=L_{\gamma ,k}^*(f^*,j^*)\in {\mathcal {C}}^0(X;S^2 T^*X)\oplus {\mathcal {C}}^0(X;S^2 T^*X)\), the tensor \(Z(t):=(f^*,\nabla f^*,j^*,\nabla j^*)|_{\alpha (t)}\) satisfies the ODE
In particular, if \(Z(0)=0\) and if \(h^*,q^*,\nabla q^*\) vanish on \(\alpha ([0,1])\), then \(Z(t)=0\) for \(t\in [0,1]\).
Similarly, increasing the regularity requirements on \(\gamma ,k,\alpha \) by one order, the tensor \({\tilde{Z}}(t):=(f^*,\nabla f^*,\nabla ^2 f^*,j^*,\nabla j^*)|_{\alpha (t)}\) satisfies an ODE of the form
Proof
This is an extension of analogous results for the adjoint of the linearization of the scalar curvature operator (see [Cor00, Proposition 2.3 and Corollary 2.4], following [FM75, Proof of Theorem 1], i.e. the special case \(k=0\), \(j^*=0\), to the adjoint of the full linearized constraints map. We give an ad hoc argument, inspired by the setup of [Del12]; see [BČEG06] for a systematic approach for general systems of overdetermined (semi)linear equations.
Let us write the (i, j) entry of \(L_{\gamma ,k}^*\) as \(L_{i j}^*\). First, note that \(\gamma \Delta _\gamma +\delta _\gamma ^*\textrm{d}=(1-\gamma {\text {tr}}_\gamma )\nabla ^2=(1-\frac{\gamma }{n-1}{\text {tr}}_\gamma )^{-1}\nabla ^2\). The equation \(L_{1 1}^*f^*=h^*-L_{1 2}^*j^*\) thus implies, upon applying \((1-\frac{\gamma }{n-1}{\text {tr}}_\gamma )\) to both sides and putting the terms arising from \((-\textrm{Ric}(\gamma )+2 k\circ k-2({\text {tr}}_\gamma k)k)f^*\) (the coefficient of \(f^*\) lying in \({\mathcal {C}}^0\)) on the right hand side, an equation
where \(F_1\) is linear and only depends on \(\gamma ,k\); note here that \(L_{1 2}^*\) is a first order operator (with \({\mathcal {C}}^0\) coefficients) acting on \(j^*\).
Consider next the equation \(L_{2 2}^*j^*=q^*-L_{2 1}^*f^*\), and note that \(L_{2 1}^*\) is an operator of order zero (i.e. a bundle map) and of class \({\mathcal {C}}^1\). Since \(L_{2 2}^*=\delta _\gamma ^*+\gamma \delta _\gamma =(1-\gamma {\text {tr}}_\gamma )\delta _\gamma ^*\), this can be written in the form
with \({\tilde{F}}_2\) of class \({\mathcal {C}}^1\). We now claim that we can express \(\nabla ^2 j^*\) in terms of \(j^*\), \(f^*\), \(q^*\) (through \({\tilde{F}}_2(f^*,q^*)\)), and \(\nabla f^*\), \(\nabla q^*\). To this end, let \(t\in [0,1]\) and take V, W, Y to be coordinate vector fields in normal coordinates on X centered at \(\alpha (t)\); thus, V, W, Y are \({\mathcal {C}}^1\). We then compute at \(\alpha (t)\):
The second and third terms are lower order terms in the following sense: the second summand on the right can be written in terms of the Riemann curvature tensor of \(\gamma \) and is thus tensorial in \(j^*\); the third term only involves up to first derivatives of \(f^*,q^*\). We can then repeat the same procedure for the first term, which is thus equal to \(\langle \nabla _W\nabla _Y j^*,V\rangle \) modulo lower order terms, and this in turn is equal to \(-\langle \nabla _V\nabla _W j^*,Y\rangle \) modulo lower order terms. Rearranging the resulting equation \(\langle \nabla _V\nabla _W j^*,Y\rangle =-\langle \nabla _V\nabla _W j^*,Y\rangle +\text {(l.o.t)}\) expresses \(\langle (\nabla ^2 j^*)(V,W),Y\rangle \) in terms of \(j^*\), \(f^*\), \(q^*\), \(\nabla f^*\), \(\nabla q^*\), as claimed.
In summary, we now have a closed system
so a fortiori \((\nabla ^2 f^*,\nabla ^2 j^*)={\tilde{F}}(f^*,\nabla f^*,j^*,\nabla j^*,h^*,q^*,\nabla q^*)\), with \({\tilde{F}}\) only depending on \(\gamma ,\partial \gamma ,\partial ^2\gamma ,k,\partial k\) (and \(\alpha \)). Since
this implies the first part of the Lemma.
The final statement follows by taking the covariant derivative of the first equation in (4.7); this expresses \(\nabla ^3 f^*\) in terms of \(f^*,\nabla f^*,j^*,\nabla j^*,\nabla ^2 j^*,\nabla h^*\), with \(\nabla ^2 j^*\) itself expressable via the second equation in (4.7). This implies the desired ODE for \({\tilde{Z}}(t)\). \(\square \)
In preparation for the analysis of P on the total gluing space \(\widetilde{X}\), we develop some preliminary material on the (linearized) constraint equations on the two model spaces \({\hat{X}}\) and \(X_\circ \) in Sects. 4.1 and 4.2, respectively. In Sect. 4.3, we study P, its linearization, and its model operators on \(\widetilde{X}\).
4.1 Asymptotically flat initial data
LetFootnote 14\({\hat{K}}\subset \mathbb {R}^n\) be compact (possibly empty), smoothly bounded, and having connected complement; set
All tensors and coefficients below will be required to be smooth down to \(\partial {\hat{K}}\subset {\hat{X}}\); polyhomogeneity, or b- or scattering behavior, always refer to behavior at \(\partial \overline{\mathbb {R}^n}\subset {\hat{X}}\). For example, we write \({}^{{\textrm{sc}}}T{\hat{X}}={}^{{\textrm{sc}}}T_{{\hat{X}}}\overline{\mathbb {R}^n}\). We write \({\hat{x}}=({\hat{x}}^1,\ldots ,{\hat{x}}^n)\) for the standard coordinates on \(\mathbb {R}^n\), and let \({\hat{r}}=|{\hat{x}}|\). Let \(\rho _\circ \in {\mathcal {C}}^\infty ({\hat{X}})\) denote a function which equals \(\underline{\rho }_\circ :={\hat{r}}^{-1}\) for \({\hat{r}}>1\).
Definition 4.4
(Asymptotic flatness) Let \(\mathcal {E}\subset \mathbb {C}\times \mathbb {N}_0\) be a nonlinearly closed index set with \({\text {Re}}\mathcal {E}>0\). We then call a pair \((\hat{\gamma },{\hat{k}})\) of symmetric 2-tensors on \(\mathbb {R}^n\setminus {\hat{K}}^\circ \) \(\mathcal {E}\)-asymptotically flat if
where \(\hat{\mathfrak {e}}=\sum _{j=1}^n(\textrm{d}{\hat{x}}^j)^2\) is the Euclidean metric.
In particular, \(\hat{\gamma }=\hat{\mathfrak {e}}+\mathcal {O}({\hat{r}}^{-\epsilon })\) and \({\hat{k}}=\mathcal {O}({\hat{r}}^{-1-\epsilon })\) for any \(\epsilon <{\text {Re}}\mathcal {E}\) as \({\hat{r}}\rightarrow \infty \). Since nothing is required of the index set \(\mathcal {E}\) except for \({\text {Re}}\mathcal {E}>0\), the data \((\hat{\gamma },{\hat{k}})\), apart from having strong regularity near infinity, may have rather weak decay. A more common notion of asymptotic flatness requires \(\mathcal {O}({\hat{r}}^{-q})\) decay for \(\hat{\gamma }-\hat{\mathfrak {e}}\) (and up to 2 b-derivatives thereof), \(\mathcal {O}({\hat{r}}^{-q-1})\) decay for \({\hat{k}}\) (and its first b-derivative), and \(\mathcal {O}({\hat{r}}^{-q_0})\) decay for the mass and current densities \(P_1(\hat{\gamma },{\hat{k}};0)\) and \(P_2(\hat{\gamma },{\hat{k}})\), where \(q>\frac{n-2}{2}\) and \(q>n\), cf. [Car21, Definition B.7]; this ensures the finiteness of the ADM mass and momentum. See also Remark 5.3.
We furthermore remark that whether or not \((\hat{\gamma },{\hat{k}})\) satisfies the constraint equations is immaterial for the results in this section. (By Lemma 4.6(1) below, \((\hat{\gamma },{\hat{k}})\) can satisfy the constraint equations only for \(\Lambda =0\).)
Definition 4.5
(Weights) We define
and similarly \(\hat{\underline{w}}={\text {diag}}(\underline{\rho }_\circ ^{-2},\underline{\rho }_\circ ^{-1})\) on \([0,\infty )_{\underline{\rho }_\circ }\times \mathbb {S}^{n-1}=\overline{\mathbb {R}^n}\setminus \{0\}\).
Lemma 4.6
(Constraints map in the asymptotically flat case) Let \((\hat{\gamma },{\hat{k}})\) be \(\mathcal {E}\)-asymptotically flat. Let \(({\hat{f}},{\hat{j}})=P(\hat{\gamma },{\hat{k}};0)\). Then:
-
(1)
We have \(({\hat{f}},{\hat{j}})\in \mathcal {A}_{\textrm{phg}}^{\mathcal {E}+2}({\hat{X}}\setminus {\hat{K}}^\circ ;\underline{\mathbb {R}}{}\oplus {}^{{\textrm{sc}}}T^*{\hat{X}})\).
-
(2)
The operator \(L_{\hat{\gamma },{\hat{k}}}{\hat{w}}\) is an element of
$$\begin{aligned} \begin{pmatrix} ({\mathcal {C}}^\infty {+}\mathcal {A}_{\textrm{phg}}^\mathcal {E})\textrm{Diff}_{\textrm{b}}^2({\hat{X}}\setminus {\hat{K}}^\circ ;S^2\,{}^{{\textrm{sc}}}T^*{\hat{X}},\underline{\mathbb {R}}{}) &{} \mathcal {A}_{\textrm{phg}}^\mathcal {E}({\hat{X}}\setminus {\hat{K}}^\circ ;{\text {Hom}}(S^2\,{}^{{\textrm{sc}}}T^*{\hat{X}},\underline{\mathbb {R}}{})) \\ \mathcal {A}_{\textrm{phg}}^\mathcal {E}\textrm{Diff}_{\textrm{b}}^1({\hat{X}}\setminus {\hat{K}}^\circ ;S^2\,{}^{{\textrm{sc}}}T^*{\hat{X}},{}^{{\textrm{sc}}}T^*{\hat{X}}) &{} ({\mathcal {C}}^\infty {+}\mathcal {A}_{\textrm{phg}}^\mathcal {E})\textrm{Diff}_{\textrm{b}}^1({\hat{X}}\setminus {\hat{K}}^\circ ;S^2\,{}^{{\textrm{sc}}}T^*{\hat{X}},{}^{{\textrm{sc}}}T^*{\hat{X}}) \end{pmatrix}. \end{aligned}$$The normal operator of \(L_{\hat{\gamma },{\hat{k}}}{\hat{w}}\) at \(\partial {\hat{X}}\) is \(\hat{\underline{L}}\hat{\underline{w}}\) where \(\hat{\underline{L}}:=L_{\hat{\mathfrak {e}},0}\), i.e.
$$\begin{aligned} \hat{\underline{L}} = \begin{pmatrix} \hat{\underline{L}}_1 &{} 0 \\ 0 &{} \hat{\underline{L}}_2 \end{pmatrix},\qquad \hat{\underline{L}}_1:=\Delta _{\hat{\mathfrak {e}}}{\text {tr}}_{\hat{\mathfrak {e}}}+\,\delta _{\hat{\mathfrak {e}}}\delta _{\hat{\mathfrak {e}}},\quad \hat{\underline{L}}_2:= \delta _{\hat{\mathfrak {e}}}+\textrm{d}{\text {tr}}_{\hat{\mathfrak {e}}}, \end{aligned}$$(4.8)in the sense that \(L_{\hat{\gamma },{\hat{k}}}{\hat{w}}-\chi _\circ \hat{\underline{L}}{\hat{w}}\in \mathcal {A}_{\textrm{phg}}^\mathcal {E}\textrm{Diff}_{\textrm{b}}\) when \(\chi _\circ \in {\mathcal {C}}^\infty ({\hat{X}})\) is equal to 1 in a collar neighborhood of \(\partial {\hat{X}}\) and has support in any larger neighborhood.
Since \(L_{\hat{\gamma },{\hat{k}}}{\hat{w}}\), being an unweighted b-operator, naturally acts on pairs of conormal sections of \(S^2\,{}^{{\textrm{sc}}}T^*{\hat{X}}\) which have the same weight at \({\hat{X}}\), the weight \({\hat{w}}\) encodes the relative weighting of linearized metrics \({\hat{h}}\) and linearized second fundamental forms \({\hat{q}}\), namely \({\hat{q}}\) decays faster than \({\hat{h}}\) by one order.Footnote 15
Proof of Lemma 4.6
We have \(\hat{\gamma }^{-1}-\hat{\mathfrak {e}}^{-1}=\sum _{j=1}^\infty (-1)^j(\hat{\gamma }-\hat{\mathfrak {e}})^j\in \mathcal {A}_{\textrm{phg}}^\mathcal {E}({\hat{X}};S^2\,{}^{{\textrm{sc}}}T{\hat{X}})\) since \(\mathcal {E}\supset j\mathcal {E}\) for all \(j\in \mathbb {N}\), and therefore \(\hat{\gamma }^{-1}\in \mathcal {A}_{\textrm{phg}}^{\mathbb {N}_0\cup \mathcal {E}}\). The Christoffel symbols of \(\hat{\gamma }\), in the coordinate system \({\hat{x}}=({\hat{x}}^1,\ldots ,{\hat{x}}^n)\), therefore satisfy
where we use that \(\partial _i\in \rho \mathcal {V}_{\textrm{b}}({\hat{X}})\). Therefore,
and thus also \(R_{\hat{\gamma }}=\hat{\gamma }^{j l}\textrm{Ric}(\hat{\gamma })_{j l}\in \mathcal {A}_{\textrm{phg}}^{\mathcal {E}+2}({\hat{X}})\).
We furthermore have \(|{\hat{k}}|_{\hat{\gamma }}^2=\hat{\gamma }^{i i'}\hat{\gamma }^{j j'}{\hat{k}}_{i j}{\hat{k}}_{i'j'}\), \(({\text {tr}}_{\hat{\gamma }}{\hat{k}})^2=\hat{\gamma }^{i i'}\hat{\gamma }^{j j'}{\hat{k}}_{i i'}{\hat{k}}_{j j'}\in \mathcal {A}_{\textrm{phg}}^{\mathcal {E}+2}\). Thus, \({\hat{f}}\in \mathcal {A}_{\textrm{phg}}^{\mathcal {E}+2}({\hat{X}})\). Similarly, \(\delta _{\hat{\gamma }}{\hat{k}}\) is of the schematic form \(\hat{\gamma }^{-1}(\partial {\hat{k}}+\Gamma (\hat{\gamma }){\hat{k}})\) and thus lies in \(\mathcal {A}_{\textrm{phg}}^{\mathcal {E}+2}({\hat{X}};{}^{{\textrm{sc}}}T^*{\hat{X}})\); and also \(\textrm{d}({\text {tr}}_{\hat{\gamma }}{\hat{k}})\) (schematically: \(\partial (\hat{\gamma }^{-1}{\hat{k}})\)) lies in \(\mathcal {A}_{\textrm{phg}}^{\mathcal {E}+2}\). This proves part (1).
For part (2), note that the only terms of \(L_{\hat{\gamma },{\hat{k}}}\) which are of leading order at \(\partial {\hat{X}}\) (in the sense that they contribute to the normal operator) are those in which \({\hat{k}}\) does not appear; for example, the \(\langle \nabla {\hat{k}},-\rangle _{2 3}\) term of \(L_{\hat{\gamma },{\hat{k}}}\) has the schematic form \(\hat{\gamma }^{-1}\hat{\gamma }^{-1}(\partial {\hat{k}}+\Gamma (\hat{\gamma }){\hat{k}})\) and is thus polyhomogeneous with index set \(\mathcal {E}+2\). In the terms of \(L_{\hat{\gamma },{\hat{k}}}\) then in which only \(\hat{\gamma }\) appears, only the leading part \(\hat{\mathfrak {e}}\) of \(\hat{\gamma }\) contributes to the normal operator. This gives (4.8). \(\quad \square \)
We proceed to study the solvability properties of \(L_{\hat{\gamma },{\hat{k}}}\). Using a geometric singular analysis point of view, rather than more frequently used direct coercivity estimates (as for example in [Cor00, §3] or [CS16, §4] in related contexts), we provide a technically novel perspective on this. We begin by studying the symbolic properties of the adjoint of \(L_{\hat{\gamma },{\hat{k}}}{\hat{w}}\).
Lemma 4.7
(Injective b-principal symbol) In the notation of Lemma 4.6(2), the operator
has an injective principal symbol as a b-differential operator (in the sense of Douglis–Nirenberg).
Proof
This was verified in Lemma 4.2 in \({\hat{X}}\setminus \partial \overline{\mathbb {R}^n}\); near \(\partial \overline{\mathbb {R}^n}\), the same argument applies upon noting that the principal symbol of \(\underline{\rho }_\circ ^{-1}\partial _{x^j}\) is a nonvanishing linear form on \({}^{{\textrm{b}}}T^*\overline{\mathbb {R}^n}\setminus o\). \(\quad \square \)
We next turn to the normal operator of the adjoint \(L_{\hat{\gamma },{\hat{k}}}^*\) (with respect to the volume density and bundle inner products induced by \(\hat{\gamma }\)), which is the diagonal operator
i.e. the formal adjoint of \(\hat{\underline{L}}\) with respect to \(\hat{\mathfrak {e}}\); the operator \(\hat{\underline{L}}_1^*\), resp. \(\hat{\underline{L}}_2^*\) maps real-valued functions, resp. sections of \({}^{{\textrm{sc}}}T^*\overline{\mathbb {R}^n}\) to sections of \(S^2\,{}^{{\textrm{sc}}}T^*\overline{\mathbb {R}^n}\). We work in polar coordinates \({\hat{r}}\ge 0\), \(\omega \in \mathbb {S}^{n-1}\) on \(\mathbb {R}^n\setminus \{0\}\). By trivializing \({}^{{\textrm{sc}}}T^*\overline{\mathbb {R}^n}\) by means of the standard coordinate differentials, we can extend sections of \({}^{{\textrm{sc}}}T^*\overline{\mathbb {R}^n}\) (and its tensor powers) over \(\mathbb {S}^{n-1}=\partial \overline{\mathbb {R}^n}\) to sections over \(\overline{\mathbb {R}^n}\setminus \{0\}\) by degree 0 homogeneity with respect to dilations \((\underline{\rho }_\circ ,\omega )\mapsto (s\underline{\rho }_\circ ,\omega )\), \(s>0\). Consider then the Mellin-transformed normal operator
Lemma 4.8
(Kernel and boundary spectrum of \(\underline{L}^*\)) (See also [CS06, Lemma 2.5].) For any connected open subset \(\Omega \subset \mathbb {R}^n\), the kernel of \(\underline{L}^*\) on \(\mathscr {D}'(\Omega ;\underline{\mathbb {R}}{}\oplus T^*\mathbb {R}^N)\) is
In particular, the operator \(N({\hat{w}}\hat{\underline{L}}^*,\lambda )\) is injective for \(\lambda \in \mathbb {C}\setminus \{0,i\}\). For any compact set \(\Lambda \subset \mathbb {C}\setminus \{0,i\}\), and for any \(s,s'\in \mathbb {R}\), there exists a constant C so that, for \(\lambda \in \Lambda \),
Proof
By taking the trace of \(\hat{\underline{L}}_1^*{\hat{f}}^*=0\), we obtain \((n-1)\Delta _{\hat{\mathfrak {e}}}{\hat{f}}^*=0\); this then implies \(0=\hat{\underline{L}}_1^*{\hat{f}}^*=\delta _{\hat{\mathfrak {e}}}^*\textrm{d}{\hat{f}}^*=(\partial _{{\hat{x}}^i}\partial _{{\hat{x}}^j}{\hat{f}}^*)_{i,j=1,\ldots ,n}\), so \({\hat{f}}^*\) is linear. Conversely, for linear \({\hat{f}}^*\) we do have \(\hat{\underline{L}}_1^*{\hat{f}}^*=0\).
We similarly note that taking the trace of \(\hat{\underline{L}}_2^*{\hat{j}}^*=0\) gives \((n-1)\delta _{\hat{\mathfrak {e}}}{\hat{j}}^*=0\) and thus \(\delta _{\hat{\mathfrak {e}}}^*{\hat{j}}^*=0\). Conversely, if \(\delta _{\hat{\mathfrak {e}}}^*{\hat{j}}^*=0\), then by taking the trace we have \(-\delta _{\hat{\mathfrak {e}}}{\hat{j}}^*=0\), and therefore \(\hat{\underline{L}}_2^*{\hat{j}}^*=0\). Therefore, \(\ker \hat{\underline{L}}_2^*=\ker \delta _{\hat{\mathfrak {e}}}^*\) consists of all Killing 1-forms on Euclidean space; these are the 1-forms dual to translations and rotations.
The estimate in (4.9) is the sum of two estimates, one for \({\hat{f}}^*\) (with \({\hat{j}}^*=0\)) and one for \({\hat{j}}^*\) (with \({\hat{f}}^*=0\)). The estimate for \({\hat{f}}^*\) follows from the ellipticity of \(N(\underline{\rho }_\circ ^{-2}\hat{\underline{L}}_1^*,\lambda )^*N(\underline{\rho }_\circ ^{-2}\hat{\underline{L}}_1^*,\lambda )\in \textrm{Diff}^4(\partial \overline{\mathbb {R}^n})\) (which gives the desired estimate except for an additional, relatively compact, error term \(C\Vert {\hat{f}}^*\Vert _{H^{-N}(\partial \overline{\mathbb {R}^n})}\) on the right hand side) and the triviality of \(\ker N(\underline{\rho }_\circ ^{-2}\hat{\underline{L}}_1^*,\lambda )\) (which allows one to drop this relatively compact term, upon increasing the constant C, by a standard functional analytic argument). The proof of the estimate for \({\hat{j}}^*\) is completely analogous. \(\quad \square \)
Lemma 4.9
(Cokernel of \(L_{\hat{\gamma },{\hat{k}}}\)) Let \((\hat{\gamma },{\hat{k}})\) be \(\mathcal {E}\)-asymptotically flat. Let \(\alpha ^*>0\), and supposeFootnote 16\(({\hat{f}}^*,{\hat{j}}^*)\in H_{{\textrm{b}}}^{-\infty ,\alpha ^*}(\overline{\mathbb {R}^n};\underline{\mathbb {R}}{}\oplus S^2\,{}^{{\textrm{sc}}}T^*\overline{\mathbb {R}^n})\) satisfies \(L_{\hat{\gamma },{\hat{k}}}^*({\hat{f}}^*,{\hat{j}}^*)=0\) in a connected open subset \(\Omega \subset \overline{\mathbb {R}^n}\setminus K\) which contains a neighborhood of \(\partial \overline{\mathbb {R}^n}\). Then \(({\hat{f}}^*,{\hat{j}}^*)=0\) on \(\Omega \).
Proof
By elliptic regularity for \(L_{\hat{\gamma },{\hat{k}}}^*\) (cf. Lemma 4.7), we automatically have \(({\hat{f}}^*,{\hat{j}}^*)\in H_{{\textrm{b}}}^{\infty ,\alpha ^*}\) on \(\Omega \). We first show that \(({\hat{f}}^*,{\hat{j}}^*)=0\) for large \({\hat{r}}\). To this end,.Footnote 17 we work with inverse polar coordinates \(\rho _\circ ,\omega \) near \(\partial \overline{\mathbb {R}^n}\) and consider the scaling map \(s_\epsilon :(\rho _\circ ,\omega )\mapsto (\epsilon \rho _\circ ,\omega )\). Using the weight \({\hat{w}}\) of Definition 4.5, we consider
for \(\epsilon \in (0,\rho _0)\) as a b-differential operator on \([0,1]\times \mathbb {S}^{n-1}\), where \(\rho _0\in (0,1]\) is fixed so that \(\rho _\circ ^{-1}([0,\rho _0))\subset \Omega \) and \({\hat{r}}=\rho _\circ ^{-1}<\rho _0^{-1}\) on K. As such, we have
in the space
note indeed that the pullback of the polyhomogeneous terms with index set \(\mathcal {E}\) vanish in the limit \(\epsilon \searrow 0\), since, recalling that \({\text {Re}}\mathcal {E}>0\), we have \(s_\epsilon ^*(\rho _\circ ^z(\log \rho _\circ ^{-1})^k)=\epsilon ^z\cdot \rho _\circ ^z(\log \rho _\circ ^{-1}+\log \epsilon ^{-1})^k\rightarrow 0\) when \({\text {Re}}z>0\).
We now use the uniform b-ellipticity of \((\mathcal {L}_\epsilon ^*)^*\mathcal {L}_\epsilon ^*\in (\textrm{Diff}_{\textrm{b}}^{s_j+s_i})_{i,j=1,2}\) in \(\epsilon \in (0,\rho _0)\), with \(s_1=2\), \(s_2=1\) (see (4.6)). This gives the uniform elliptic estimateFootnote 18
for any \(s_0<s\), where we fix \(\chi \in {\mathcal {C}}^\infty _{{\textrm{c}}}([0,1)_{\rho _\circ }\times \mathbb {S}^{n-1})\) to be 1 near \(\rho _\circ ^{-1}([0,\frac{1}{2}])\). We claim that we can replace the norm on \((\chi {\hat{f}}^*,\chi {\hat{j}}^*)\) by the \({\bar{H}}_{{\textrm{b}}}^{s_0,\alpha ^*-\delta }(\rho _\circ ^{-1}([0,1]))\)-norm on \(({\hat{f}}^*,{\hat{j}}^*)\), where \(\delta >0\) is such that \({\text {Re}}\mathcal {E}>\delta \); we accomplish this via a standard b-normal operator argument [Vas18, §6]. To wit, passing to the Mellin-transform in \(\rho _\circ \) and using the material of Sect. 2.3, we have an equivalence of norms
Using a parametrix for the Mellin-transformed normal operator of \((\mathcal {L}_\epsilon ^*)^*\mathcal {L}_\epsilon ^*\), which thus has an error term which tends to zero in operator norm on \(H_{\langle \lambda \rangle }^{s_0+2}\oplus H_{\langle \lambda \rangle }^{s_0+1}\) as \(|{\text {Re}}\lambda |\rightarrow \infty \) along the contour \({\text {Im}}\lambda =-\alpha ^*\), we can estimate
for \({\text {Im}}\lambda =-\alpha ^*\) when \(|{\text {Re}}\lambda |\ge C_0\) (independently of \(\epsilon \)). For \(|{\text {Re}}\lambda |\le C_0\) on the other hand, the large parameter norms are equivalent to standard Sobolev norms, and we can then appeal to (4.9) to conclude the estimate (4.11) for all \(\lambda \) with \({\text {Im}}\lambda =-\alpha ^*\). We have thus proved
We then commute \(\mathcal {L}_0^*\) through \(\chi \), producing an error term \(\Vert (\chi ^\flat {\hat{f}}^*,\chi ^\flat {\hat{j}}^*)\Vert _{H_{{\textrm{b}}}^{s_0+1}\oplus H_{{\textrm{b}}}^{s_0}}\) where \(\chi ^\flat \in {\mathcal {C}}^\infty _{{\textrm{c}}}((0,1)_{\rho _\circ }\times \mathbb {S}^{n-1})\), with , is disjoint from \(\partial \overline{\mathbb {R}^n}\). We can further write \(\chi \mathcal {L}_0^*({\hat{f}}^*,{\hat{j}}^*)=\chi \mathcal {L}_\epsilon ^*({\hat{f}}^*,{\hat{j}}^*)-\chi (\mathcal {L}_\epsilon ^*-\mathcal {L}_0^*)({\hat{f}}^*,{\hat{j}}^*)\) and estimate the error using the fact that
is uniformly bounded in \(\epsilon \). Altogether, we obtain the uniform estimate
By the final part of Lemma 4.3, we can express \(({\hat{f}}^*,{\hat{j}}^*,\nabla {\hat{f}}^*,\nabla {\hat{j}}^*,\nabla ^2 {\hat{f}}^*)(\rho _\circ ,\omega )\) for \(\rho _\circ \in [\frac{1}{2},1]\) in terms of the values of the tensors \({\hat{f}}^*,{\hat{j}}^*,\nabla {\hat{f}}^*,\nabla {\hat{j}}^*\) at the point \((\rho _0,\omega )\) and of \(\mathcal {L}_\epsilon ^*({\hat{f}}^*,{\hat{j}}^*),\nabla (\mathcal {L}_\epsilon ^*({\hat{f}}^*,{\hat{j}}^*))\) at all points \((\rho _1,\omega )\), \(\rho _1\in [\rho _0,\rho _\circ ]\), where \(\rho _0\in [\frac{1}{4},\frac{1}{2}]\) is arbitrary, as the solution of an ODE whose coefficients depend only on \(\rho _0\), \(\omega \), \(\epsilon \), \(\gamma \), and k. Therefore, upon integrating over the \((n-1)\)-sphere and averaging over \(\rho _0\),
Applying the Cauchy–Schwarz inequality in the last integral, performing the integral over \(\rho _0\), and subsequently integrating over \(\rho _\circ \in [\frac{1}{2},1]\) gives
Estimating the first term on the right here by means of (4.12) with \(s=0\) and \(s_0=-1\), and adding the resulting estimate to (4.12), we finally deduce the uniform semi-Fredholm estimate
for all \(\epsilon \in [0,1]\). Now the error term is relatively compact, in that the inclusion \(({\bar{H}}_{{\textrm{b}}}^{2,\alpha ^*}\oplus {\bar{H}}_{{\textrm{b}}}^{1,\alpha ^*})(\rho _\circ ^{-1}([0,1]))\hookrightarrow ({\bar{H}}_{{\textrm{b}}}^{1,\alpha ^*-\delta }\oplus {\bar{H}}_{{\textrm{b}}}^{0,\alpha ^*-\delta })(\rho _\circ ^{-1}([0,1]))\) is compact.
We now make use of the limiting behavior (4.10). Note that the distributional kernel of \(\mathcal {L}_0^*\) is given by \(({\hat{f}}^*,{\hat{j}}^*)\) where \({\hat{f}}^*\in \ker \underline{L}_1^*\) and \({\hat{j}}^*\in \ker \underline{L}_2^*\) are described in Lemma 4.8; but elements of the kernel of \(\mathcal {L}_0^*\) on \((H_{{\textrm{b}}}^{2,\alpha ^*}\oplus H_{{\textrm{b}}}^{1,\alpha ^*})(\rho _\circ ^{-1}([0,1]))\) lie in \(H_{{\textrm{b}}}^{\infty ,\alpha ^*}(\rho _\circ ^{-1}([0,1]))\) by elliptic regularity, and thus decay (as scattering tensors, i.e. with respect to the frame \(\textrm{d}x^1,\ldots ,\textrm{d}x^n\)) to zero as \(|x|\rightarrow \infty \), and hence they must be zero. A standard functional analytic argument allows one to drop the relatively compact error term in (4.13) for \(\epsilon =0\), and then also for all sufficiently small \(\epsilon \ge 0\). Translated back to \(L_{\gamma ,k}^*\), this means that for sufficiently small \(\epsilon >0\), we have
Thus, if \(L_{\hat{\gamma },{\hat{k}}}^*({\hat{f}}^*,{\hat{j}}^*)\) vanishes in \(\rho _\circ \le \epsilon \), then so does \(({\hat{f}}^*,{\hat{j}}^*)\). By Lemma 4.3, this then implies the vanishing of \(({\hat{f}}^*,{\hat{j}}^*)\) in \(\Omega \), completing the proof. \(\quad \square \)
Proposition 4.10
(Solvability of the linearized constraints) Let \((\hat{\gamma },{\hat{k}})\) be \(\mathcal {E}\)-asymptotically flat. Let \({\hat{R}}_0>0\) be such that \({\hat{K}}\subset B(0,{\hat{R}}_0)\), and put \(\hat{\rho }_2={\hat{R}}_0^{-1}-{\hat{r}}^{-1}\) and \({\hat{w}}_2={\text {diag}}(\hat{\rho }_2^4,\hat{\rho }_2^2)\); recall also \({\hat{w}}={\text {diag}}(\rho _\circ ^{-2},\rho _\circ ^{-1})\) from Definition 4.5. Let \(\hat{\Omega }=\rho _\circ ^{-1}([0,{\hat{R}}_0^{-1}])\), and define b-00-Sobolev spaces on \(\hat{\Omega }\) (with b-character at \(\rho _\circ ^{-1}(0)\) and 00-character at \({\hat{r}}^{-1}({\hat{R}}_0)\)) using a positive b-00-density on \(\hat{\Omega }\). Fix \(\alpha _\circ <n-2\).
-
(1)
(Basic solvability.) Fix \(\beta >0\) and \(\alpha _2\in \mathbb {R}\). Put
$$\begin{aligned} \mathcal {L}_{\hat{\gamma },{\hat{k}}}:= e^{\beta /\hat{\rho }_2}\hat{\rho }_2^{-\alpha _2}\rho _\circ ^{-\tilde{\alpha }}{\hat{w}}_2 L_{\hat{\gamma },{\hat{k}}}{\hat{w}} \rho _\circ ^{\tilde{\alpha }}\hat{\rho }_2^{\alpha _2} e^{-\beta /\hat{\rho }_2},\qquad \tilde{\alpha }:=\alpha _\circ -n+2+\frac{n}{2}.\nonumber \\ \end{aligned}$$(4.14)Define adjoints with respect to the volume density and fiber inner products induced by \(\hat{\gamma }\). Then for all \(s\in \mathbb {R}\), the operator \(\mathcal {L}_{\hat{\gamma },{\hat{k}}}\mathcal {L}_{\hat{\gamma },{\hat{k}}}^*\) is invertible as a map
$$\begin{aligned} \mathcal {L}_{\hat{\gamma },{\hat{k}}}\mathcal {L}_{\hat{\gamma },{\hat{k}}}^*: \rho _\circ ^{\frac{n}{2}}H_{{\textrm{b}},0 0}^{s+2}(\hat{\Omega })\oplus \rho _\circ ^{\frac{n}{2}}H_{{\textrm{b}},0 0}^{s+1}(\hat{\Omega };{}^{{\textrm{sc}}}T^*_{\hat{\Omega }}{\hat{X}}) \rightarrow \rho _\circ ^{\frac{n}{2}}H_{{\textrm{b}},0 0}^{s-2}(\hat{\Omega })\oplus \rho _\circ ^{\frac{n}{2}}H_{{\textrm{b}},0 0}^{s-1}(\hat{\Omega };{}^{{\textrm{sc}}}T^*_{\hat{\Omega }}{\hat{X}}). \nonumber \\ \end{aligned}$$(4.15) -
(2)
(Polyhomogeneity.) There existsFootnote 19 an index set \(\hat{\mathcal {S}}\subset \mathbb {C}\times \mathbb {N}_0\) only depending on \(\mathcal {E}\) and \(\alpha _\circ \) and satisfying \({\text {Re}}\hat{\mathcal {S}}>\alpha _\circ \) so that the following holds. If \(\mathcal {G}\subset \mathbb {C}\times \mathbb {N}_0\) is an index set with \({\text {Re}}\mathcal {G}>\alpha _\circ \), and if \(({\hat{f}},{\hat{j}})\in \mathcal {A}_{\textrm{phg}}^{\mathcal {G}+2}({\hat{X}};\underline{\mathbb {R}}{}\oplus {}^{{\textrm{sc}}}T^*{\hat{X}})\) vanishes in a neighborhood of \({\hat{r}}\le {\hat{R}}_0\), then there exists
$$\begin{aligned} ({\hat{h}},{\hat{q}}) \in \mathcal {A}_{\textrm{phg}}^{\mathcal {G}\mathrel {\overline{\cup }}\hat{\mathcal {S}}}({\hat{X}};S^2\,{}^{{\textrm{sc}}}T^*{\hat{X}}) \oplus \mathcal {A}_{\textrm{phg}}^{(\mathcal {G}\mathrel {\overline{\cup }}\hat{\mathcal {S}})+1}({\hat{X}};S^2\,{}^{{\textrm{sc}}}T^*{\hat{X}}) \end{aligned}$$so that \(L_{\hat{\gamma },{\hat{k}}}({\hat{h}},{\hat{q}})=({\hat{f}},{\hat{j}})\), and so that \(({\hat{h}},{\hat{q}})\) vanishes in a neighborhood of \({\hat{r}}\le {\hat{R}}_0\).
Remark 4.11
(Right inverse on b-00-Sobolev spaces) Part (1) is formulated in the form needed later on. For the orientation of the reader, we note here that this produces a right inverse \(S_{\hat{\gamma },{\hat{k}}}\) of \(L_{\hat{\gamma },{\hat{k}}}\) which is bounded as a map
such a right inverse is given by \(S_{\hat{\gamma },{\hat{k}}} = e^{-\beta /\hat{\rho }_2}\hat{\rho }_2^{\alpha _2}\rho _\circ ^{\tilde{\alpha }}{\hat{w}}\mathcal {L}_{\hat{\gamma },{\hat{k}}}^*(\mathcal {L}_{\hat{\gamma },{\hat{k}}}\mathcal {L}_{\hat{\gamma },{\hat{k}}}^*)^{-1}{\hat{w}}_2 \rho _\circ ^{-\tilde{\alpha }}\hat{\rho }_2^{-\alpha _2}e^{\beta /\hat{\rho }_2}\).
Remark 4.12
(Weights and KIDs) The assumption on \(\alpha _\circ \) eliminates the cokernel of \(L_{\hat{\gamma },{\hat{k}}}\), as we demonstrate in the proof below. Note that for Kerr data sets \((\hat{\gamma },{\hat{k}})\), the kernel of \(L_{\hat{\gamma },{\hat{k}}}^*\) is indeed non-trivial [Mon75]. If however \(\ker L_{\hat{\gamma },{\hat{k}}}^*\) is trivial, then Proposition 4.10 holds for all \(\alpha _\circ \in \mathbb {R}\).
Proof of Proposition 4.10
Using Lemma 4.6(2), and as in Sect. 2.2, we have
Away from the boundary \(\hat{\rho }_2^{-1}(0)\), the operator \(\mathcal {L}_{\hat{\gamma },{\hat{k}}}^*\) has an injective b-principal symbol by Lemma 4.7; near \(\hat{\rho }_2=0\) on the other hand, the Douglis–Nirenberg 00-principal symbol at a point \(({\hat{x}},\xi _{0 0})\in {}^{0 0}T^*\hat{\Omega }\) in \(\rho >0\) is w times
by (2.4), where we use the notation (4.5) and the normalization (4.3). By the same arguments as in the proof of Lemma 4.2 upon setting \(\xi =\hat{\rho }_2^2\xi _{0 0}\), this is injective, including for \(\xi _{0 0}=0\), since \(\hat{\rho }_2^2\xi _{0 0}+i\beta \,\textrm{d}\hat{\rho }_2\ne 0\) for all (real) 00-covectors \(\xi _{0 0}\). Therefore, the b-00-operator
is elliptic in the Douglis–Nirenberg sense (including at \(\hat{\rho }_2=0\) as a 00-operator), the (i, j)-entry having differential order \(s_j+s_i\) where \(s_1=2\), \(s_2=1\); see also the comments following (4.6). Given \(s_0,s\in \mathbb {R}\) with \(s_0<s\), the elliptic estimates of Lemma 2.5 and Proposition 2.7 thus give
While this estimate is valid for any weight, the significance of \(\frac{n}{2}\) is that it translates between b-densities (used in the function spaces) and asymptotically Euclidean densities (used in the definition of the adjoint), in that \(\rho ^{\frac{n}{2}}H_{{\textrm{b}}}^0(\overline{\mathbb {R}^n})=L^2(\mathbb {R}^n;|\textrm{d}\gamma |)\) (with equivalent norms).
As in the proof of Lemma 4.9, we can weaken the second, error, term in (4.16) further provided that the Mellin-transformed b-normal operator of \(\mathcal {L}_{\hat{\gamma },{\hat{k}}}\mathcal {L}_{\hat{\gamma },{\hat{k}}}^*\) is invertible for all Mellin-dual parameters \(\lambda \in \mathbb {C}\) with \({\text {Im}}\lambda =-\frac{n}{2}\). But the invertibility of
is equivalent to that of its constant multiple
Note now that \(N(\hat{\underline{L}}\hat{\underline{w}},\mu )=N(\hat{\underline{w}}\hat{\underline{L}}^*,\bar{\mu }-i n)^*\); this follows from the fact that the adjoint of \(\underline{\rho }_\circ ^{-i\mu }\hat{\underline{L}}\hat{\underline{w}}\underline{\rho }_\circ ^{i\mu }\) with respect to a dilation-invariant b-density on \([0,\infty )_{\underline{\rho }_\circ }\times \partial {\hat{X}}\)—which is thus \(\underline{\rho }_\circ ^n\) times the Euclidean density—is given by \(\underline{\rho }_\circ ^{-n}\underline{\rho }_\circ ^{-i\bar{\mu }}\hat{\underline{w}}\hat{\underline{L}}^*\underline{\rho }_\circ ^{i\bar{\mu }}\underline{\rho }_\circ ^n\). For \(\mu =\lambda +i\tilde{\alpha }\) with \({\text {Im}}\lambda =-\frac{n}{2}\), thus \(\bar{\lambda }=\lambda +i n\), we have \(\bar{\mu }-i n=\lambda -i\tilde{\alpha }\), and thus we can write
But since \({\text {Im}}(\lambda +i\tilde{\alpha })=\alpha _\circ -n+2<0\), Lemma 4.8 implies the invertibility of \(\mathcal {N}(\alpha _\circ ,\lambda )\) for \({\text {Im}}\lambda =-\frac{n}{2}\). We may thus improve (4.16) to
where \(0<\delta <{\text {Re}}\mathcal {E}\); the error term is now relatively compact.
We next argue that the error term can be dropped altogether, which follows if we prove that \(\mathcal {L}_{\hat{\gamma },{\hat{k}}}\mathcal {L}_{\hat{\gamma },{\hat{k}}}^*({\hat{f}}^*,{\hat{j}}^*)=0\) for \(({\hat{f}}^*,{\hat{j}}^*)\in \rho _\circ ^{\frac{n}{2}}H_{{\textrm{b}},0 0}^{s+2}\oplus \rho _\circ ^{\frac{n}{2}}H_{{\textrm{b}},0 0}^{s+1}\) implies \(({\hat{f}}^*,{\hat{j}}^*)=0\). First, elliptic regularity implies that \(({\hat{f}}^*,{\hat{j}}^*)\) has infinite b-00-regularity and vanishes to infinite order at \(\hat{\rho }_2^{-1}(0)\). But by the comment following (4.17), we may then integrate by parts and deduce \(0=\langle \mathcal {L}_{\hat{\gamma },{\hat{k}}}\mathcal {L}_{\hat{\gamma },{\hat{k}}}^*({\hat{f}}^*,{\hat{j}}^*),({\hat{f}}^*,{\hat{j}}^*)\rangle _{L^2}=\Vert \mathcal {L}_{\hat{\gamma },{\hat{k}}}^*({\hat{f}}^*,{\hat{j}}^*)\Vert _{L^2}^2\); therefore, \(L_{\hat{\gamma },{\hat{k}}}^*({\tilde{f}}^*,{\tilde{j}}^*)=0\) for \(({\tilde{f}}^*,{\tilde{j}}^*):={\hat{w}}_2\rho _\circ ^{-\tilde{\alpha }}\hat{\rho }_2^{-\alpha _2}e^{\beta /\hat{\rho }_2}({\hat{f}}^*,{\hat{j}}^*)\). But since \(({\hat{f}}^*,{\hat{j}}^*)\in \mathcal {A}^{\frac{n}{2}}([0,R_0^{-1})\times \partial {\hat{X}})\) by Corollary 2.4, we have \(({\tilde{f}}^*,{\tilde{j}}^*)\in \mathcal {A}^{\frac{n}{2}-\tilde{\alpha }}([0,R_0^{-1})\times \partial {\hat{X}})=\mathcal {A}^{-(\alpha _\circ -n+2)}\). Lemma 4.9 gives \(({\tilde{f}}^*,{\tilde{j}}^*)=0\), and therefore \(({\hat{f}}^*,{\hat{j}}^*)=0\).
The dual space of \(\rho _\circ ^{\frac{n}{2}}H_{{\textrm{b}},0 0}^s(\hat{\Omega })\) with respect to the \(L^2\)-inner product with volume density \(|\textrm{d}\hat{\gamma }|\) (which we stress was also used in the definition of \(\mathcal {L}_{\hat{\gamma },{\hat{k}}}^*\)) is \(\rho _\circ ^{\frac{n}{2}}H_{{\textrm{b}},0 0}^{-s}(\hat{\Omega })\), i.e. the weight remains unchanged. Thus, \(\mathcal {L}_{\hat{\gamma },{\hat{k}}}\mathcal {L}_{\hat{\gamma },{\hat{k}}}^*\) has trivial cokernel too, and we deduce the invertibility of (4.15). This proves part (1).
For part (2), we claim that \(({\hat{h}},{\hat{q}})=e^{-\beta /\hat{\rho }_2}\hat{\rho }_2^{\alpha _2}\rho _\circ ^{\tilde{\alpha }}{\hat{w}}\mathcal {L}_{\hat{\gamma },{\hat{k}}}^*(h',q')\) is the desired polyhomogeneous solution, where
Note that \((f',j')\) vanishes near \(r\le R_0\), and note that \({\text {Re}}\mathcal {G}'>\frac{n}{2}\). The extension by 0 of \(h',q'\) to \({\hat{r}}\le {\hat{R}}_0\) satisfies \((h',q')\in \rho ^{\frac{n}{2}}H_{{\textrm{b}}}^\infty ({\hat{X}})\subset \mathcal {A}^{\frac{n}{2}}({\hat{X}})\). In view of Lemma 4.6(2), it thus suffices to show that \((h',q')\) is polyhomogeneous with index set \(\mathcal {G}'\mathrel {\overline{\cup }}\mathcal {S}'\), where \(\mathcal {S}'\subset \mathbb {C}\times \mathbb {N}_0\) with \({\text {Re}}\mathcal {S}'>\frac{n}{2}\) depends only on \(\hat{\gamma }\), \({\hat{k}}\), and \(\alpha _\circ \). This is a standard result for elliptic b-differential operators such as \(\mathcal {L}:=\mathcal {L}_{\hat{\gamma },{\hat{k}}}\mathcal {L}_{\hat{\gamma },{\hat{k}}}^*\), see e.g. [Mel93, Proposition 5.61] in the case that \(\mathcal {L}\) has smooth (rather than smooth plus decaying polyhomogeneous) coefficients. Briefly, one rewrites \(\mathcal {L}(h',q')=(f',j')\) in terms of its b-normal operator \(N(\mathcal {L})\) at \(\partial {\hat{X}}\) as
and extracts the polyhomogeneous expansion of \((h',q')\) at \(\partial \overline{\mathbb {R}^n}\) iteratively, noting that the coefficients of \(\mathcal {L}-N(\mathcal {L})\) are of class \(\mathcal {A}_{\textrm{phg}}^\mathcal {E}\) (and thus decay). Indeed, having obtained an expansion up to conormal errors with decay rate \(N>\frac{n}{2}\), the Mellin-transform of \((h',q')\) is meromorphic in \({\text {Im}}\lambda >-N\) (see the end of Sect. 2.3). One then passes to the Mellin transform and inverts \(N(\mathcal {L},\lambda )\), or equivalently the operator \(\mathcal {N}(\alpha _\circ ,\lambda )\) in (4.17); the set of poles of \(\mathcal {N}(\alpha _\circ ,\lambda )^{-1}\) is finite in any strip of bounded \({\text {Im}}\lambda \) (cf. (4.11)). This demonstrates that the Mellin-transform of \((h',q')\) is meromorphic, with finitely many poles, in the larger half space \({\text {Im}}\lambda >-N-\delta \) (where \(0<\delta <{\text {Re}}\mathcal {E}\)), since this is true for the right hand side of (4.18). This shows the polyhomogeneity of \((h',q')\) modulo conormal errors with the faster rate \(N+\delta \). This finishes the proof, except \(({\hat{h}},{\hat{q}})\) vanishes only in \({\hat{r}}\le {\hat{R}}_0\); to get the vanishing in a neighborhood of \({\hat{r}}\le {\hat{R}}_0\), one applies the above arguments with \({\hat{R}}_0\) replaced by \({\hat{R}}_0+\eta \) for \(0<\eta \ll 1\). \(\quad \square \)
4.2 Manifolds with punctures
We now turn to the analysis of the linearized constraints map on a manifold \(X_\circ =[X;\{\mathfrak {p}\}]\) arising via the blow-up of a smooth n-dimensional manifold X (without boundary) at a point; here \(n\ge 3\).
First, however, let \(\gamma ,k\in {\mathcal {C}}^\infty (X;S^2 T^*X)\), with \(\gamma \) Riemannian; they do not need to satisfy the constraint equations (though in our application they do).
Definition 4.13
(No KIDs)(See [Mon75].) Denote by \(L_{\gamma ,k}\) the linearization of the constraints map \(P(-,-;\Lambda )\) (see Lemma 4.1) at \((\gamma ,k)\), and let \(\mathcal {U}^\circ \subset X\) be a smoothly bounded precompact connected open set containing \(\mathfrak {p}\). We then say that \((X,\gamma ,k)\) has no KIDs (‘Killing Initial Data’) in \(\mathcal {U}^\circ \) if the kernel of \(L_{\gamma ,k}^*\) on \({\mathcal {C}}^\infty (\mathcal {U};\underline{\mathbb {R}}{}\oplus T^*X)\) is trivial, where \(\mathcal {U}=\overline{\mathcal {U}^\circ }\).
Beig–Chruściel–Schoen showed in [BCS05, Theorem 1.3] that the subset of initial data sets without KIDs is generic (open and dense) for a variety of classes of initial data sets.
Remark 4.14
(Shrinking \(\mathcal {U}\)) If \(\mathcal {U}_\epsilon ^\circ \subset \mathcal {U}^\circ \) is the subset consisting of all points which have distance \(>\epsilon \) away from \(\partial \mathcal {U}\), then \((X,\gamma ,k)\) has no KIDs in \(\mathcal {U}_\epsilon ^\circ \) when \(\epsilon \in (0,\epsilon _0]\) where \(\epsilon _0>0\) is small enough. Indeed, \(\mathcal {U}_\epsilon ^\circ \) is smoothly bounded, connected, and precompact for all sufficiently small \(\epsilon >0\). Moreover, if there existed a nonzero element \((f_\epsilon ^*,j_\epsilon ^*)\in {\mathcal {C}}^\infty (\overline{\mathcal {U}_\epsilon ^\circ };\underline{\mathbb {R}}{}\oplus T^*X)\) with \(L_{\gamma ,k}^*(f_\epsilon ^*,j_\epsilon ^*)=0\) for a sequence of \(\epsilon \) tending to 0, then we could rescale a subsequence of \((f_\epsilon ^*,j_\epsilon ^*)\) so that \((f_\epsilon ^*,\nabla f_\epsilon ^*,j_\epsilon ^*,\nabla j_\epsilon ^*)\) converged to a nonzero tensor at some point in \(\mathcal {U}_{\epsilon _0}^\circ \); and then Lemma 4.3 would imply that this rescaled subsequence would converge to a (nonzero) element of \({\mathcal {C}}^\infty (\mathcal {U}^\circ ;\underline{\mathbb {R}}{}\oplus T^*X)\) which, again by Lemma 4.3 extends smoothly to \(\mathcal {U}\), in contradiction to the absence of KIDs in \(\mathcal {U}^\circ \).
Write now \(\upbeta _\circ : X_\circ \rightarrow X\) for the blow-down map. Denote by \(\hat{\rho }\in {\mathcal {C}}^\infty (X_\circ )\) a defining function of \(\partial X_\circ \). In geodesic normal coordinates \(x=(x^1,\ldots ,x^n)\), \(|x|<r_0\), around \(\mathfrak {p}\in X\), we can locally take \(\hat{\rho }=r:=|x|\) (Euclidean norm). Setting \(\omega =\frac{x}{|x|}\in \mathbb {S}^{n-1}\), we identify a collar neighborhood of \(\partial X_\circ \) with \([0,r_0)_r\times \mathbb {S}^{n-1}_\omega \); we furthermore fix an identification \(\upbeta _\circ ^*T^*X\cong T^*_\mathfrak {p}X\) in this collar neighborhood which is the identity over \(\partial X_\circ \). Put
where \(\underline{w}_\circ \) is an operator on \([0,\infty )_r\times \mathbb {S}^{n-1}_\omega \) acting on pairs of sections of the pullback of \(S^2 T_\mathfrak {p}^*X\) along \((r,\omega )\mapsto \mathfrak {p}\).
Lemma 4.15
(Linearized constraints map) Denote the lift of \(L_{\gamma ,k}\) toFootnote 20\(X_\circ \) by \(L_{\circ ,\gamma ,k}\). Then
Letting \(\mathfrak {e}=\sum _{j=1}^n(\textrm{d}x^j)^2\) in geodesic normal coordinates, the normal operator of \(L_{\circ ,\gamma ,k}w_\circ \) at \(\partial X_\circ \) is given by \(\underline{L}_\circ \underline{w}_\circ \), where \(\underline{L}_\circ ={\text {diag}}(\underline{L}_{\circ ,1},\underline{L}_{\circ ,2})\) with \(\underline{L}_{\circ ,1}=\Delta _\mathfrak {e}{\text {tr}}_\mathfrak {e}+\delta _\mathfrak {e}\delta _\mathfrak {e}\) and \(\underline{L}_{\circ ,2}=\delta _\mathfrak {e}+\textrm{d}{\text {tr}}_\mathfrak {e}\) (cf. (4.8)).
The weight \(w_\circ \) at \(r=0\) now encodes the opposite relative weighting of the linearized metric and the linearized second fundamental form as compared to the situation at infinity in Lemma 4.6: the latter is one order more singular (less decaying) than the former.Footnote 21
Proof of Lemma 4.15
Note that \(L_{\gamma ,k}\) is a smooth coefficient operator on X; and since in local coordinates on X near \(\mathfrak {p}\), the lift of \(r\partial _{x^i}\) to \(X_\circ \) is a smooth b-vector field, we have \(\partial _{x^i}\in r^{-1}\mathcal {V}_{\textrm{b}}(X_\circ )\). In view of (4.4), this gives (4.20). Moreover, the only terms contributing to the normal operators of the (1, 1) and (2, 2) components of \(L_{\circ ,\gamma ,k}w_\circ \) are those with the maximal number of derivatives—so \(\Delta _\gamma {\text {tr}}_\gamma +\delta _\gamma \delta _\gamma \) and \(\delta _\gamma +\textrm{d}{\text {tr}}_\gamma \)—and we may moreover freeze the metric \(\gamma \) at the point \(\mathfrak {p}\) (since the difference of \(\gamma \) and \(\gamma (\mathfrak {p})\) vanishes at \(r=0\)). This gives \(\underline{L}_\circ \underline{w}_\circ \), as claimed. \(\quad \square \)
Note now that the Mellin-transformed normal operator family of \(\underline{w}_\circ \underline{L}_\circ ^*\) at \(r=0\) satisfies
where the right hand side was studied in Lemma 4.8. Thus, \(N(\underline{w}_\circ \underline{L}_\circ ^*,\lambda )\) is injective for \(\lambda \in \mathbb {C}\setminus \{-i,0\}\), and the estimate (4.9) holds for the operator \(\underline{w}_\circ \underline{L}_\circ ^*\) for \(\lambda \in \Lambda \Subset \mathbb {C}\setminus \{-i,0\}\).
Proposition 4.16
(Solvability of the linearized constraints) In the notation of Definition 4.13, suppose that \((X,\gamma ,k)\) has no KIDs in \(\mathcal {U}^\circ \). Let \(\mathcal {U}_\circ =\upbeta _\circ ^{-1}(\mathcal {U})\) (where \(\mathcal {U}=\overline{\mathcal {U}^\circ }\)), and denote by \(\rho _2\in {\mathcal {C}}^\infty (\mathcal {U}_\circ )\) a defining function of \(\partial \mathcal {U}\); put \(w_2:={\text {diag}}(\rho _2^4,\rho _2^2)\). Define b-00-Sobolev spaces on \(\mathcal {U}_\circ \) (with b-character at \(\partial X_\circ =\hat{\rho }^{-1}(0)\) and 00-character at \(\partial \mathcal {U}\)) using a positive b-00-density on \(\mathcal {U}_\circ \). Let \(\hat{\alpha }\in \mathbb {R}\), \(\hat{\alpha }\ne -n+2,-n+1\).
-
(1)
(Basic solvability.) Let \(\alpha _2\in \mathbb {R}\) and \(\beta >0\). Put
$$\begin{aligned} \mathcal {L}_{\circ ,\gamma ,k}:= e^{\beta /\rho _2}\rho _2^{-\alpha _2}\hat{\rho }^{-\tilde{\alpha }}w_2 L_{\circ ,\gamma ,k}w_\circ \hat{\rho }^{\tilde{\alpha }}\rho _2^{\alpha _2}e^{-\beta /\rho _2},\qquad \tilde{\alpha }:=\hat{\alpha }+n-2-\frac{n}{2}.\nonumber \\ \end{aligned}$$(4.21)Define adjoints with respect to the volume density and fiber inner products induced by \(\gamma \). Then for all \(s\in \mathbb {R}\), the operator \(\mathcal {L}_{\circ ,\gamma ,k}\mathcal {L}_{\circ ,\gamma ,k}^*\) is invertible as a mapFootnote 22
$$\begin{aligned}{} & {} \mathcal {L}_{\circ ,\gamma ,k}\mathcal {L}_{\circ ,\gamma ,k}^*: \hat{\rho }^{-\frac{n}{2}}H_{{\textrm{b}},0 0}^{s+2}(\mathcal {U}_\circ )\oplus \hat{\rho }^{-\frac{n}{2}}H_{{\textrm{b}},0 0}^{s+1}(\mathcal {U}_\circ ;\upbeta _\circ ^*T^*X) \nonumber \\{} & {} \quad \rightarrow \hat{\rho }^{-\frac{n}{2}}H_{{\textrm{b}},0 0}^{s-2}(\mathcal {U}_\circ )\oplus \hat{\rho }^{-\frac{n}{2}}H_{{\textrm{b}},0 0}^{s-1}(\mathcal {U}_\circ ;\upbeta _\circ ^*T^*X). \end{aligned}$$(4.22) -
(2)
(Polyhomogeneity.) There exists an index set \(\mathcal {S}_\circ \subset \mathbb {C}\times \mathbb {N}_0\), only depending on \(\hat{\alpha }\) and satisfying \({\text {Re}}\mathcal {S}_\circ >\hat{\alpha }\) so that the following holds. If \(\mathcal {G}\subset \mathbb {C}\times \mathbb {N}_0\) is an index set with \({\text {Re}}\mathcal {G}>\hat{\alpha }\), and if \((f,j)\in \mathcal {A}_{\textrm{phg}}^{\mathcal {G}-2}(X_\circ ;\underline{\mathbb {R}}{}\oplus \upbeta _\circ ^*T^*X)\) vanishes in a neighborhood of \(\partial \mathcal {U}\), then there exists
$$\begin{aligned} (h,q)\in \mathcal {A}_{\textrm{phg}}^{\mathcal {G}\mathrel {\overline{\cup }}\mathcal {S}_\circ }(X_\circ ;\upbeta _\circ ^*S^2 T^*X)\oplus \mathcal {A}_{\textrm{phg}}^{(\mathcal {G}\mathrel {\overline{\cup }}\mathcal {S}_\circ )-1}(X_\circ ;\upbeta _\circ ^*S^2 T^*X) \end{aligned}$$with \(L_{\circ ,\gamma ,k}(h,q)=(f,j)\), and so that (h, q) vanishes in a neighborhood of \(X_\circ \setminus \mathcal {U}_\circ \).
Proof
The proof of part (1) is very close to that of Proposition 4.10(1), except for the analysis of the cokernel. The operator \(\mathcal {L}_{\circ ,\gamma ,k}\) is a smooth coefficient b-00-differential operator on \(\mathcal {U}_\circ \), and the principal symbol of \(\mathcal {L}_{\circ ,\gamma ,k}^*\) is injective. For the Douglis–Nirenberg-elliptic operator \(\mathcal {L}_{\circ ,\gamma ,k}^*\mathcal {L}_{\circ ,\gamma ,k}\), we thus have the estimate (4.16) where we replace \(\rho _\circ \) and \(\frac{n}{2}\) by \(\hat{\rho }\) and \(-\frac{n}{2}\), respectively. Improving the error term requires the invertibility of the Mellin-transformed normal operator on the line \({\text {Im}}\lambda =\frac{n}{2}\), i.e. the invertibility of \(N(\underline{w}_\circ \underline{L}_\circ ^*,\lambda +i\tilde{\alpha })^*N(\underline{w}_\circ \underline{L}_\circ ^*,\lambda +i\tilde{\alpha })\), which holds because of \({\text {Im}}(\lambda +i\tilde{\alpha })=\hat{\alpha }+n-2\notin \{0,-1\}\). This gives the estimate
Once we prove the triviality of the kernel of \(\mathcal {L}_{\circ ,\gamma ,k}^*\), one can drop the relatively compact error term on the right. But if \(\mathcal {L}_{\circ ,\gamma ,k}^*(f^*,j^*)=0\), then \((f^*,j^*)\) enjoys infinite b-00-regularity and infinite decay at \(\rho _2^{-1}(0)\), and in particular it is smooth in \((\mathcal {U}_\circ )^\circ \setminus \partial X_\circ \) and conormal at \(\partial X_\circ \). Moreover, \(({\tilde{f}}^*,{\tilde{j}}^*):=w_2\hat{\rho }^{-\tilde{\alpha }}\rho _2^{-\alpha _2}e^{\beta /\rho _2}(f^*,j^*)\) satisfies \(L_{\circ ,\gamma ,k}^*({\tilde{f}}^*,{\tilde{j}}^*)=0\). By Lemma 4.3, \(({\tilde{f}}^*,{\tilde{j}}^*)\) is necessarily smooth down to \(\partial \mathcal {U}\). Near \(\partial X_\circ \) on the other hand, where \(({\tilde{f}}^*,{\tilde{j}}^*)\in \mathcal {A}^{-\frac{n}{2}-\tilde{\alpha }}=\mathcal {A}^{-(\hat{\alpha }+n-2)}\) is polyhomogeneous (as follows from the same argument, mutatis mutandis, that we already used in the proof of Proposition 4.10(2)), we write
and use the injectivity of \(N(\underline{w}_\circ \underline{L}_\circ ^*,\lambda )\) for \(\lambda \ne -i,0\) to conclude thatFootnote 23\(({\tilde{f}}^*,{\tilde{j}}^*)\in \mathcal {A}^{-\epsilon }\) for all \(\epsilon >0\). Thus, \(({\tilde{f}}^*,{\tilde{j}}^*)\) is the pullback of a distribution \((f^\sharp ,j^\sharp )\) on \(\mathcal {U}\) which is smooth away from \(\mathfrak {p}\) and satisfies \(|D_x^\alpha (f^\sharp ,j^\sharp )|\lesssim |x|^{-\epsilon -|\alpha |}\) near \(\mathfrak {p}\) for all \(\alpha \). Therefore, \((f^\sharp ,j^\sharp )\in H^{\frac{n}{2}-\epsilon }(\mathcal {U})\). But \(L_{\gamma ,k}^*(f^\sharp ,j^\sharp )\in H^{\frac{n}{2}-2-\epsilon }(\mathcal {U})\) vanishes on \(\mathcal {U}\setminus \{\mathfrak {p}\}\) and is therefore a distribution with support in \(\{\mathfrak {p}\}\); since \(\frac{n}{2}-2>-\frac{n}{2}\), this forces \(L_{\gamma ,k}^*(f^\sharp ,j^\sharp )=0\). Again applying Lemma 4.3, this implies that \((f^\sharp ,j^\sharp )\) is smooth on \(\mathcal {U}\). Using the absence of KIDs, this finally implies \((f^\sharp ,j^\sharp )=0\) and thus \((f^*,j^*)=0\).
We have now shown the invertibility of (4.22), completing the proof of part (1). The proof of part (2) is the same as that of Proposition 4.10(2); to arrange for the vanishing of (h, q) in a neighborhood of \(X_\circ \setminus \mathcal {U}_\circ \), we work in a slight shrinking of \(\mathcal {U}\), cf. Remark 4.14.
\(\square \)
Remark 4.17
(Weights and KIDs) When \(-(\hat{\alpha }+n-2)>1\), i.e. \(\hat{\alpha }<-n+1\), then elements of \(\ker \mathcal {L}_{\circ ,\gamma ,k}^*\) give rise to elements \((f^\sharp ,j^\sharp )\) of the distributional nullspace of \(L_{\gamma ,k}^*\) with \(|D_x^\alpha (f^\sharp ,j^\sharp )|\lesssim |x|^{1+\epsilon -|\alpha |}\) for \(|\alpha |=0,1\); one can then directly apply Lemma 4.3 (i.e. unique continuation at \(x=0\)) to conclude that \((f^\sharp ,j^\sharp )=0\). When \(\hat{\alpha }>-n+1\) however, the a priori bounds for \(({\tilde{f}}^*,{\tilde{j}}^*)\) in the above proof do not enforce the vanishing, together with all first derivatives, of \(({\tilde{f}}^*,{\tilde{j}}^*)\) at \(x=0\); and indeed when \(\gamma \) is the Euclidean metric and \(k=0\), then \(\ker \mathcal {L}_{\circ ,\gamma ,k}^*\) is not trivial for such \(\hat{\alpha }\) since it contains \(e^{-\beta /\rho _2}\rho _2^{\alpha _2}\hat{\rho }^{\tilde{\alpha }}w_2^{-1}\) times \((0,{\tilde{j}}^*)\) where \({\tilde{j}}^*\) (regarded as a vector field) is a spatial rotation vector field (which is a Killing vector field vanishing at \(x=0\)). By contrast, in the proof of Proposition 4.10, the triviality of \(\ker \mathcal {L}_{\hat{\gamma },{\hat{k}}}^*\) was proved using the assumed fast order of decay as \(|{\hat{x}}|\rightarrow \infty \) and unique continuation at \(|{\hat{x}}|=\infty \); see also Remark 4.12.
Remark 4.18
(Explicit index set of (h, q)) An alternative argument for Proposition 4.16(2) proceeds as follows. One first solves the equation \(L_{\circ ,\gamma ,k}(h,q)=(f,j)\) formally, i.e. modulo smooth errors vanishing to infinite order at \(\partial X_\circ \), using the fact that \(N(\underline{w}_\circ \underline{L}_\circ ,\lambda )\) is surjective for all \(\lambda \) except for \(\lambda =i(n-1),i n\). (The presence of a cokernel for \(\lambda =i(n-1),i n\) necessitates the introduction of logarithmic terms.) To solve away the remaining error, which is the lift of a smooth tensor on \(\mathcal {U}\), one can blow down \(\partial X_\circ \), i.e. work on \(\mathcal {U}\); the resulting correction to the formal solution is smooth on \(\mathcal {U}\), and thus lifts to be polyhomogeneous with index set \(\mathbb {N}_0\) on \(X_\circ \). We leave the details to the interested reader.
4.3 Constraints map on the total gluing space
We use the notation of Sect. 3; near \(\mathfrak {p}\in X\), we work with geodesic normal coordinates \(x=(x^1,\ldots ,x^n)\), and we recall the local coordinate charts \((\epsilon ,{\hat{x}})=(\epsilon ,\frac{x}{\epsilon })\) near \({\hat{X}}^\circ \) and \((\hat{\rho },\rho _\circ ,\omega )=(|x|,\frac{\epsilon }{|x|},\frac{x}{|x|})\) near \({\hat{X}}\cap X_\circ \). We first note:
Lemma 4.19
(Scaling). For \(\lambda >0\), we have \(P(\lambda ^2\gamma ,\lambda k;\lambda ^{-2}\Lambda )={\text {diag}}(\lambda ^{-2},\lambda ^{-1}) P(\gamma ,k;\Lambda )\).
Proof
Since \(\textrm{Ric}(\lambda ^2\gamma )=\textrm{Ric}(\gamma )\), we have \(R_{\lambda ^2\gamma }=\lambda ^{-2}R_\gamma \). Replacing in \(|k|_\gamma ^2=\gamma ^{i i'}\gamma ^{j j'}k_{i j}k_{i' j'}\) the term \(\gamma ^{i i'}\) by \(\lambda ^{-2}\gamma ^{i i'}\) and \(k_{i j}\) by \(\lambda k_{i j}\), one similarly picks up an overall factor of \(\lambda ^{-2}\). For the second component of \(P(\lambda ^2\gamma ,\lambda k;\lambda ^{-2}\Lambda )\), we note that \(\textrm{d}({\text {tr}}_{\lambda ^2\gamma }\lambda k)=\lambda ^{-1}\,\textrm{d}{\text {tr}}_\gamma k\), similarly \(\delta _{\lambda ^2\gamma }(\lambda k)=\lambda ^{-1}\delta _\gamma k\). \(\quad \square \)
In a collar neighborhood of \({\hat{X}}\) and recalling the scaling map \(\textsf{s}\) from Definition 3.3, we deduce, for \(\lambda =\epsilon ^{-1}\), that
note here that the right hand side is \({\text {diag}}(1,\epsilon )P(\epsilon ^{-2}\widetilde{\gamma },\epsilon ^{-1}\widetilde{k};\epsilon ^2\Lambda )\). Conversely, for \(\lambda =\epsilon \), we find that \(P(\hat{\gamma },{\hat{k}};0)=0\) implies \(P(\textsf{s}\hat{\gamma },\epsilon ^{-1}\textsf{s}{\hat{k}};0)=0\); notice the \(\epsilon ^{-1}\) scaling of the second fundamental form.
Definition 4.20
(Total families). Let \(\hat{\mathcal {E}},\mathcal {E}\subset \mathbb {C}\times \mathbb {N}_0\) be nonlinearly closed index sets with \({\text {Re}}\hat{\mathcal {E}}>0\) and \({\text {Re}}\mathcal {E}>0\). Let \({\hat{K}}\Subset {\hat{X}}^\circ \) be a compact (possibly empty) subset, smoothly bounded and with connected complement, and set \(\widetilde{K}:=\{(\epsilon ,{\hat{x}}):{\hat{x}}\in {\hat{K}}\}\subset \widetilde{X}\). Then a pair \((\widetilde{\gamma },\widetilde{k})\) of sections of \(S^2\widetilde{T}^*\widetilde{X}\rightarrow \widetilde{X}\setminus \widetilde{K}^\circ \) is called an \((\hat{\mathcal {E}},\mathcal {E})\)-smooth total family if
where, with the index sets referring to \({\hat{X}}\) and \(X_\circ \) (in this order),
with \(\gamma ,k\) regarded as \(\epsilon \)-independent elements of \({\mathcal {C}}^\infty (\widetilde{X}';S^2\widetilde{T}^*\widetilde{X}')\), and where \(\widetilde{\gamma }|_{{\hat{X}}}\) and \(\gamma \) (and thus \(\widetilde{\gamma }|_{X^\circ }=\upbeta _\circ ^*\gamma \)) are positive definite. The boundary data of \((\widetilde{\gamma },\widetilde{k})\) are \((\hat{\gamma },{\hat{k}})\) and \((\gamma ,k)\), where
See Fig. 7.
Recalling from Lemma 3.2(2) that \({\hat{X}}^\circ \) is a vector space, denote by
the constant (i.e. translation-invariant) metric given by \(\hat{\textsf{s}}^{-1}(\widetilde{\upbeta }^*\widetilde{\gamma }_{(0)}(\mathfrak {p}))\) where \(\hat{\textsf{s}}\) is given in Definition 3.3. Since \(\widetilde{\gamma }_{(0)}(\mathfrak {p})=\sum _{j=1}^n(\textrm{d}x^j)^2\), we have \(\hat{\mathfrak {e}}=\sum _{j=1}^n(\textrm{d}{\hat{x}}^j)^2\), so \(({\hat{X}}^\circ ,\hat{\mathfrak {e}})\) is isometric to \(\mathbb {R}^n\) with the Euclidean metric. The boundary data of \((\widetilde{\gamma },\widetilde{k})\) at \({\hat{X}}\) then satisfy
Thus, \((\hat{\gamma },{\hat{k}})\) is \(\mathcal {E}\)-asymptotically flat in the sense of Definition 4.4. The pair \((\gamma ,k)\) on X immediately fits into the setup of Sect. 4.2.
The goal of this paper is to construct a total family \((\widetilde{\gamma },\widetilde{k})\), with prescribed boundary data \((\hat{\gamma },{\hat{k}})\) and \((\gamma ,k)\), which satisfies the constraint equationsFootnote 24\(P(\widetilde{\gamma },\widetilde{k};\Lambda )=0\) for small \(\epsilon >0\). Note here that \(\widetilde{\gamma }\) is automatically positive definite for all sufficiently small \(\epsilon >0\); in the sequel, when we write \(P(\widetilde{\gamma },\widetilde{k};\Lambda )\), we shall implicitly always restrict to the parameter range \(\epsilon \le \epsilon _0\) where \(\epsilon _0>0\) is so small that \(\widetilde{\gamma }\) is Riemannian. Our first task is to analyze the constraints map on total families, and the structure and normal operators of its linearization at a total family.
Definition 4.21
(Total weight). With \(\hat{\rho }\in {\mathcal {C}}^\infty (\widetilde{X})\) denoting a defining function of \({\hat{X}}\), we set
This matches \(w_\circ \) in (4.19) at \(X_\circ \). Since \(\hat{\rho }=\epsilon \rho _\circ ^{-1}\) for a defining function \(\rho _\circ \in {\mathcal {C}}^\infty (\widetilde{X})\) of \(X_\circ \), we have \(\widetilde{w}={\text {diag}}(\epsilon ^2\rho _\circ ^{-2},\epsilon \rho _\circ ^{-1})={\text {diag}}(\epsilon ^2,\epsilon ){\hat{w}}\), which at \({\hat{X}}\) thus combines the weight in Definition 4.5 with the scaling from Lemma 4.19 (with \(\lambda =\epsilon ^{-1}\)).
Proposition 4.22
(Constraints map on the total gluing space). Let \((\widetilde{\gamma },\widetilde{k})\) be a \((\hat{\mathcal {E}},\mathcal {E})\)-smooth total family, and define \((\hat{\gamma },{\hat{k}})\) and \((\gamma ,k)\) by (4.25).
-
(1)
(Nonlinear constraints.) We have \(P(\widetilde{\gamma },\widetilde{k};\Lambda )=\widetilde{\upbeta }^*p + p^{(1)}\), where, in the notation of Definition 4.20,
$$\begin{aligned} p:=P(\gamma ,k;\Lambda )\in {\mathcal {C}}^\infty (X;\underline{\mathbb {R}}{}\oplus T^*X),\qquad p^{(1)}\in \mathcal {A}_{\textrm{phg}}^{(\mathbb {N}_0\cup \hat{\mathcal {E}})-2,\mathcal {E}}(\widetilde{X}\setminus \widetilde{K}^\circ ;\underline{\mathbb {R}}{}\oplus \widetilde{T}^*\widetilde{X}). \end{aligned}$$Moreover,
$$\begin{aligned} (\epsilon ^2 P(\widetilde{\gamma },\widetilde{k};\Lambda ))|_{{\hat{X}}} = (\epsilon ^2 p^{(1)})|_{{\hat{X}}} = \hat{\textsf{s}}\bigl (P(\hat{\gamma },{\hat{k}};0)\bigr ). \end{aligned}$$(4.26) -
(2)
(Linearization as a q-differential operator.) The linearization \(L_{\widetilde{\gamma },\widetilde{k}}\) of \(P(-,-;\Lambda )\) at \((\widetilde{\gamma },\widetilde{k})\) satisfies
$$\begin{aligned} \begin{aligned}&L_{\widetilde{\gamma },\widetilde{k}}\widetilde{w} \in (\widetilde{\upbeta }^*{\mathcal {C}}^\infty (X){+}\mathcal {A}_{\textrm{phg}}^{\mathbb {N}_0\cup \hat{\mathcal {E}},\mathcal {E}}(\widetilde{X}\setminus \widetilde{K}^\circ )) \\&\qquad \times \begin{pmatrix} \textrm{Diff}_{\textrm{q}}^2(\widetilde{X}\setminus \widetilde{K}^\circ ;S^2\widetilde{T}^*\widetilde{X},\underline{\mathbb {R}}{}) &{} {\mathcal {C}}^\infty (\widetilde{X}\setminus \widetilde{K}^\circ ;{\text {Hom}}(S^2\widetilde{T}^*\widetilde{X},\underline{\mathbb {R}}{})) \\ \textrm{Diff}_{\textrm{q}}^1(\widetilde{X}\setminus \widetilde{K}^\circ ;S^2\widetilde{T}^*\widetilde{X},\widetilde{T}^*\widetilde{X}) &{} \textrm{Diff}_{\textrm{q}}^1(\widetilde{X}\setminus \widetilde{K}^\circ ;S^2\widetilde{T}^*\widetilde{X},\widetilde{T}^*\widetilde{X}) \end{pmatrix}. \end{aligned} \end{aligned}$$(4.27)Using the notation of Sect. 3.1 and Lemma 4.15, we have
$$\begin{aligned} \hat{\textsf{s}}^{-1}\circ N_{{\hat{X}}}(L_{\widetilde{\gamma },\widetilde{k}}\widetilde{w})\circ \hat{\textsf{s}}=L_{\hat{\gamma },{\hat{k}}}{\hat{w}},\qquad N_{X_\circ }(L_{\widetilde{\gamma },\widetilde{k}}\widetilde{w})=L_{\circ ,\gamma ,k}w_\circ . \end{aligned}$$(4.28)Moreover, omitting the bundles and differential orders,
$$\begin{aligned} \begin{aligned} L_{\widetilde{\gamma },\widetilde{k}}\widetilde{w} - \hat{\chi }\textsf{s}\circ L_{\hat{\gamma },{\hat{k}}}{\hat{w}}\circ \textsf{s}^{-1}&\in \mathcal {A}_{\textrm{phg}}^{(\mathbb {N}_0+1)\cup \hat{\mathcal {E}},\mathbb {N}_0\cup \mathcal {E}}\textrm{Diff}_{\textrm{q}}(\widetilde{X}\setminus \widetilde{K}^\circ ), \\ L_{\widetilde{\gamma },\widetilde{k}}\widetilde{w}-\chi _\circ L_{\circ ,\gamma ,k}w_\circ&\in \mathcal {A}_{\textrm{phg}}^{\mathbb {N}_0\cup \hat{\mathcal {E}},\mathcal {E}}\textrm{Diff}_{\textrm{q}}(\widetilde{X}\setminus \widetilde{K}^\circ ). \end{aligned} \end{aligned}$$(4.29)Here, \(\hat{\chi } L_{\hat{\gamma },{\hat{k}}}\) and \(\chi _\circ L_{\circ ,\gamma ,k}\) are regarded as differential operators on \(\widetilde{X}\) via Lemma 3.4.
Proof
The arguments for part (1) are similar to those in the proof of Lemma 4.6. Near \({\hat{X}}^\circ \), we work in the coordinates \((\epsilon ,{\hat{x}})\). Since \(\widetilde{\gamma }_{i j}\in \mathcal {A}_{\textrm{phg}}^{\mathbb {N}_0\cup \hat{\mathcal {E}}}([0,1)_\epsilon \times {\hat{X}}^\circ )\), we have \(\widetilde{\gamma }^{i j}\in \mathcal {A}_{\textrm{phg}}^{\mathbb {N}_0\cup \hat{\mathcal {E}}}\). Since \(\partial _\ell \widetilde{\gamma }_{i j}\in \mathcal {A}_{\textrm{phg}}^{(\mathbb {N}_0\cup \hat{\mathcal {E}})-1}\), this gives \(\Gamma (\widetilde{\gamma })_{i j}^\ell \in \mathcal {A}_{\textrm{phg}}^{(\mathbb {N}_0\cup \hat{\mathcal {E}})-1}\). We moreover have \(\widetilde{k}_{i j}\in \mathcal {A}_{\textrm{phg}}^{(\mathbb {N}_0\cup \hat{\mathcal {E}})-1}\) and thus \(\partial _\ell \widetilde{k}_{i j}\in \mathcal {A}_{\textrm{phg}}^{(\mathbb {N}_0\cup \hat{\mathcal {E}})-2}\); this gives \(R(\widetilde{\gamma })\), \(|{\text {tr}}_{\widetilde{\gamma }}\widetilde{k}|^2\), \(|\widetilde{k}|_{\widetilde{\gamma }}^2\), \(\nabla _\ell \widetilde{k}_{i j}\in \mathcal {A}_{\textrm{phg}}^{(\mathbb {N}_0\cup \hat{\mathcal {E}})-2}\) (where \(\nabla \) is given by \(\nabla ^{\widetilde{g}_\epsilon }\) on \(\widetilde{X}_\epsilon \)). Therefore, \(P(\widetilde{\gamma },\widetilde{k})\in \mathcal {A}_{\textrm{phg}}^{(\mathbb {N}_0\cup \hat{\mathcal {E}})-2}\). Tracking the leading order terms of \(\widetilde{\gamma }\), \(\widetilde{k}\), and their derivatives at \(\epsilon =0\), one finds that the restriction of \(\epsilon ^2 P_1(\widetilde{\gamma },\widetilde{k};\Lambda )\), resp. \(\epsilon ^2 P_2(\widetilde{\gamma },\widetilde{k})_i\) to \(\epsilon =0\) is given by \(P_1(\hat{\gamma },{\hat{k}};\Lambda )\), resp. \(P_2(\widetilde{\gamma },\widetilde{k})_{{\hat{i}}}\) indeed, where we write \({\hat{i}}\) for indices in the \({\hat{x}}\)-variables; this uses that \(\partial _i=\epsilon ^{-1}\partial _{{\hat{i}}}\). More conceptually, Lemma 4.19 gives, in a collar neighborhood of \({\hat{X}}\),
as sections of \(\underline{\mathbb {R}}{}\oplus \widetilde{T}_{{\hat{X}}}^*\widetilde{X}\); restriction to \({\hat{X}}\) gives (4.26).
Near \((X_\circ )^\circ \), note that \(\epsilon \mapsto (\widetilde{\gamma }|_{\widetilde{X}_\epsilon },\widetilde{k}|_{\widetilde{X}_\epsilon })\), composed with \(\widetilde{X}_\epsilon \cong X\), is a equal to an \(\epsilon \)-independent smooth symmetric 2-tensor on \(X\setminus \{\mathfrak {p}\}\) plus a polyhomogeneous family (with index set \(\mathcal {E}\)) of such tensors, and the conclusion follows via arguments much as in the previous paragraph. Near the corner \(X_\circ \cap {\hat{X}}\) finally, we have a combination of both settings. Indeed,
where \(\gamma _{i j}\), \(k_{i j}\) are smooth in the blown-down coordinates \(x=\hat{\rho }\omega \in \mathbb {R}^n\), while \(\widetilde{\gamma }^{(1)}_{i j}\in \mathcal {A}_{\textrm{phg}}^{\mathbb {N}_0\cup \hat{\mathcal {E}},\mathcal {E}}\) and \(\widetilde{k}^{(1)}_{i j}\in \mathcal {A}_{\textrm{phg}}^{(\mathbb {N}_0\cup \hat{\mathcal {E}})-1,\mathcal {E}}\). Thus, \(\widetilde{\gamma }^{-1}\in {\mathcal {C}}^\infty (\mathbb {R}^n_x)+\mathcal {A}_{\textrm{phg}}^{\mathbb {N}_0\cup \hat{\mathcal {E}},\mathcal {E}}(\widetilde{X})\) in this chart. Recall from Lemma 3.2(4) that \(\partial _i\in \widetilde{\mathcal {V}}(\widetilde{X})\hookrightarrow \hat{\rho }^{-1}\mathcal {V}_{\textrm{b}}(\widetilde{X})\); therefore, \(\partial _\ell \widetilde{\gamma }_{i j} \in {\mathcal {C}}^\infty (\mathbb {R}^n_x) + \mathcal {A}_{\textrm{phg}}^{(\mathbb {N}_0\cup \hat{\mathcal {E}})-1,\mathcal {E}}(\widetilde{X})\) and \(\partial _\ell \widetilde{k}_{i j} \in {\mathcal {C}}^\infty (\mathbb {R}^n_x) + \mathcal {A}_{\textrm{phg}}^{(\mathbb {N}_0\cup \hat{\mathcal {E}})-2,\mathcal {E}}(\widetilde{X})\). This gives
We conclude that \(P(\widetilde{\gamma },\widetilde{k};\Lambda )\in {\mathcal {C}}^\infty (\mathbb {R}^n_x)+\mathcal {A}_{\textrm{phg}}^{(\mathbb {N}_0\cup \hat{\mathcal {E}})-2,\mathcal {E}}(\widetilde{X})\), as claimed.
For part (2), we use the expression for \(L_{\widetilde{\gamma },\widetilde{k}}\) given in (4.2), in conjunction with (4.31) and \(\partial _{x^i}\in \hat{\rho }^{-1}\mathcal {V}_{\textrm{q}}(\widetilde{X})\). The normal operators of \(L_{\widetilde{\gamma },\widetilde{k}}\widetilde{w}\) at \({\hat{X}}\) and \(X_\circ \) can be computed by evaluating the limit of \(L_{\widetilde{\gamma },\widetilde{k}}\widetilde{w}\) at the interiors \({\hat{X}}^\circ \) and \((X_\circ )^\circ \), respectively. Since near \((X_\circ )^\circ \), \((\widetilde{\gamma },\widetilde{k})\) is a polyhomogeneous (with index set \(\mathbb {N}_0\cup \mathcal {E}\)) family of symmetric 2-tensors (see also Lemma 3.4) with leading term \((\gamma ,k)\), we immediately obtain the second equation in (4.28). For the first equation, we use the scaling relation (4.30) and \(\hat{\rho }=\epsilon \rho _\circ ^{-1}\); thus,
where in the second line we introduced \({\tilde{t}}=t\epsilon ^2\). Since near \({\hat{X}}^\circ \) the pair \((\textsf{s}^{-1}\widetilde{\gamma },\textsf{s}^{-1}(\epsilon \widetilde{k}))\) is a polyhomogeneous (with index set \(\mathbb {N}_0\cup \hat{\mathcal {E}}\)) family of symmetric 2-tensors on \({\hat{X}}^\circ \) with leading term \((\hat{\gamma },{\hat{k}})\), we obtain the first equation in (4.28).
Since \(\widetilde{\upbeta }^*{\mathcal {C}}^\infty (X)+\mathcal {A}_{\textrm{phg}}^{\mathbb {N}_0\cup \hat{\mathcal {E}},\mathcal {E}}(\widetilde{X})\subset \mathcal {A}_{\textrm{phg}}^{\mathbb {N}_0\cup \hat{\mathcal {E}},\mathbb {N}_0\cup \mathcal {E}}(\widetilde{X})\), we immediately obtain the first membership in (4.29). For the second membership, we note that the \(\epsilon \)-independent extension of \(\hat{\chi } L_{\circ ,\gamma ,k}w_\circ \) to an operator on \(\widetilde{X}{\setminus } x^{-1}(0)\) has coefficients in \(\hat{\chi }\widetilde{\upbeta }^*{\mathcal {C}}^\infty (X_\circ )\), and it is equal to \(L_{\widetilde{\gamma },\widetilde{k}}\widetilde{w}\) at \(\epsilon =0\); therefore, its deviation from \(\hat{\chi } L_{\widetilde{\gamma },\widetilde{k}}\widetilde{w}\) is given by the polyhomogeneous term in (4.27). \(\quad \square \)
This result will suffice for the construction of a formal solution of the gluing problem. In order to correct a formal solution to a true solution of the nonlinear constraint equations, we need a solvability theory for \(L_{\widetilde{\gamma },\widetilde{k}}\) on spaces of tensors with finite regularity:
Proposition 4.23
(Uniform estimates for the linearized constraints map). Let \((\widetilde{\gamma },\widetilde{k})\) be a \((\hat{\mathcal {E}},\mathcal {E})\)-smooth total family on \(\widetilde{X}\setminus \widetilde{K}^\circ \). Let \({\hat{R}}_0>0\) be such that \({\hat{K}}\subset B(0,{\hat{R}}_0)\), and denote by \(\hat{\rho }_2\in {\mathcal {C}}^\infty (\widetilde{X})\) a function which vanishes simply at \({\hat{r}}^{-1}({\hat{R}}_0)\) and is positive for \({\hat{r}}>{\hat{R}}_0\) (e.g. one can take \(\hat{\rho }_2={\hat{r}}^{-1}-{\hat{R}}_0^{-1}\) near \({\hat{X}}\)). Denote by \(\rho _2\in {\mathcal {C}}^\infty (\widetilde{X})\) a function which vanishes simply at \([0,1)\times \partial \mathcal {U}\) and is positive on \([0,1)\times \mathcal {U}^\circ \); put \(\tilde{\rho }_2=\hat{\rho }_2\rho _2\). Put \(\widetilde{w}_2:={\text {diag}}(\tilde{\rho }_2^4,\tilde{\rho }_2^2)\), and recall \(\widetilde{w}={\text {diag}}(\hat{\rho }^2,\hat{\rho })\). Define q-00-Sobolev spaces on \(\widetilde{\Omega }:=\{\hat{\rho }_2\ge 0,\ \rho _2\ge 0\}\subset \widetilde{X}\setminus \widetilde{K}^\circ \) (with 00-behavior at \({\hat{r}}={\hat{R}}_0\) and \(\partial \mathcal {U}\)) using a positive q-00-density; write \(\widetilde{\Omega }_\epsilon =\widetilde{\Omega }\cap \widetilde{X}_\epsilon \). Let \(\alpha _2\in \mathbb {R}\) and \(\beta >0\), and fix \(d\in (-\infty ,n-2)\). Then there exist \(\epsilon _0>0\) so that for all \(s\in \mathbb {R}\) and \(\hat{\alpha },\alpha _\circ \in \mathbb {R}\) satisfying \(\alpha _\circ -\hat{\alpha }=d\), and setting
the map
is invertible and has a uniformly bounded inverse for \(\epsilon <\epsilon _1\) for all \(\epsilon _1<\epsilon _0\).Footnote 25 Here, adjoints are defined with respect to the volume density and fiber inner products induced by \(\widetilde{\gamma }\).
The definition (4.32) combines the conjugations and re-weightings (4.14) and (4.21) performed in the analysis of the two normal operators.
Proof of Proposition 4.23
Since \(\epsilon /\rho _\circ \) is a defining function of \({\hat{X}}\), we may arrange that \(\hat{\rho }\rho _\circ =\epsilon \). Since \(L_{\widetilde{\gamma },\widetilde{k}}\) commutes with multiplication by functions of \(\epsilon =\hat{\rho }\rho _\circ \), we may therefore replace \((\hat{\alpha },\alpha _\circ )\) by \((\hat{\alpha }-c,\alpha _\circ -c)\) for any c; for \(c=\alpha _\circ \), we may thus reduce to the case \(\alpha _\circ =0\), which we assume henceforth. In particular, \(\hat{\alpha }=-d>-n+2\) and thus also \(\hat{\alpha }\ne -n+2,-n+1\).
\(\bullet \) Symbolic estimate. Note that \(\mathcal {L}_{\widetilde{\gamma },\widetilde{k}}\) is a q-00-operator with coefficients in \(\mathcal {A}_{\textrm{phg}}^{\mathbb {N}_0\cup \hat{\mathcal {E}},\mathbb {N}_0\cup \mathcal {E}}(\widetilde{X})\), and the q-00-principal symbol of \(\mathcal {L}_{\widetilde{\gamma },\widetilde{k}}^*\) is injective in the Douglis–Nirenberg sense: this follows either by inspection of the form of \(\widetilde{w}_2 L_{\widetilde{\gamma },\widetilde{k}}\widetilde{w}\), or for small \(\epsilon >0\) from the injectivity of the b-00-principal symbols of the normal operators of \(\mathcal {L}_{\widetilde{\gamma },\widetilde{k}}\). Note that the \(X_\circ \)-normal operator is
where \(c_\circ ^L\), resp. \(c_\circ ^R\) is a smooth bundle isomorphism on \(\underline{\mathbb {R}}{}\oplus \upbeta _\circ ^*T^*X\), resp. \(\upbeta _\circ ^*S^2 T^*X\oplus \upbeta _\circ ^*S^2 T^*X\); here \(\mathcal {L}_{\circ ,\gamma ,k}\) is given by (4.21), and the presence of \(c_\circ ^{L/R}\) arises from the fact that \(\rho _2|_{X_\circ }\) is not necessarily equal to, but certainly a smooth positive multiple of, \(\tilde{\rho }_2|_{X_\circ }\). Similarly, writing \(\hat{\rho }=\epsilon \rho _\circ ^{-1}\) in (4.32), one finds using (4.28) that the \({\hat{X}}\)-normal operator satisfies
with \({\hat{c}}^L\), resp. \({\hat{c}}^R\) a smooth bundle isomorphism of \(\underline{\mathbb {R}}{}\oplus {}^{{\textrm{sc}}}T^*{\hat{X}}\), resp. \(S^2\,{}^{{\textrm{sc}}}T^*{\hat{X}}\oplus S^2\,{}^{{\textrm{sc}}}T^*{\hat{X}}\), and where \(\mathcal {L}_{\hat{\gamma },{\hat{k}}}\) is defined by (4.14), with \(\alpha _\circ \) there equal to \(-\hat{\alpha }\) and thus \(\tilde{\alpha }\) there equal to \(-\tilde{\alpha }\) in present notation.
Elliptic estimates for the Douglis–Nirenberg elliptic operator
give uniform (in \(\epsilon \)) estimates
for any fixed \(s_0<s\). (The weights here can be chosen arbitrarily, but with foresight we choose them to match those in (4.23).) One could weaken the error term by estimating it in spaces with weight \(\tilde{\rho }_2^{-1}(0)\) (due to the 00-ellipticity of \(\mathcal {L}_{\widetilde{\gamma },\widetilde{k}}\mathcal {L}_{\widetilde{\gamma },\widetilde{k}}^*\) there), but we shall not need this improvement here.
\(\bullet \) \(\underline{{ Normal}\ { operator}\ { argument}\ { at}\ {\hat{X}}.}\) In the second, error, term on the right hand side of (4.35), we insert a partition of unity \(1=\hat{\chi }+(1-\hat{\chi })\) and apply the triangle inequality; in the term involving \(1-\hat{\chi }\), the weight at \({\hat{X}}\) can then be changed arbitrarily since \(\hat{\rho }>0\) on \({\text {supp}}(1-\hat{\chi })\). In the term involving \(\hat{\chi }\), the first norm equivalence in (3.7) gives the uniform estimate
Observe now that
is invertible as a map (4.15). We can thus estimate the right hand side of (4.36) by an \(\epsilon \)-independent constant times
Using (3.8) to pass back to q-00-Sobolev spaces on \(\widetilde{X}\), we may replace the normal operator here by the operator \(\mathcal {L}_{\widetilde{\gamma },\widetilde{k}}\mathcal {L}_{\widetilde{\gamma },\widetilde{k}}^*\) itself, and as a consequence of Proposition 4.22(2) pick up an error term whose weight at \({\hat{X}}\) is reduced by an amount \(0<\eta <\min (1,\min {\text {Re}}\hat{\mathcal {E}})\). This gives
We now drop the cutoff in the second term on the right; note that multiplication by \(\hat{\chi }\) is uniformly bounded on every q-00-Sobolev space. Since the coefficients \([\mathcal {L}_{\widetilde{\gamma },\widetilde{k}}\mathcal {L}_{\widetilde{\gamma },\widetilde{k}}^*,\hat{\chi }]\) are supported away from \({\hat{X}}\), we can omit the cutoff \(\hat{\chi }\) in the first term as well since the contribution from the commutator is controlled by a constant times the second term (with \(\hat{\chi }\) dropped).
As a consequence, we can now strengthen the estimate (4.35) by replacing the norm of the error term by a weaker norm at \({\hat{X}}\):
\(\bullet \) \(\underline{{ Normal}\ { operator}\ { argument}\ { at}\ X_\circ .}\) We improve the estimate (4.37) further by using the \(X_\circ \)-normal operator. To wit, inserting a partition of unity \(1=\chi _\circ +(1-\chi _\circ )\) in the second term on the right and applying the triangle inequality, the weight at \(X_\circ \) of the term with \(1-\chi _\circ \) can be changed arbitrarily; for the term with \(\chi _\circ \) on the other hand, we use the second norm equivalence in (3.7) to get an upper bound by an \(\epsilon \)-independent constant times
We now use that, analogously to (4.22), the normal operator
is invertible as a map
if \(\eta >0\) is sufficiently small. (Note that (4.22) gives this for \(\eta =0\); furthermore, the requirements on the \(\hat{\rho }\)-weight \(-\frac{n}{2}\) used in the proof of (4.22) are open in the weight, and therefore are satisfied for \(-\frac{n}{2}-\eta \) as well when \(\eta >0\) is small enough.) We can thus estimate (4.38) by an \(\epsilon \)-independent constant times
Passing back to q-00-Sobolev spaces on \(\widetilde{X}\) via (3.7) and replacing the normal operator by \(\mathcal {L}_{\widetilde{\gamma },\widetilde{k}}\mathcal {L}_{\widetilde{\gamma },\widetilde{k}}^*\) produces an error with reduced weight at \(X_\circ \).
The grand total is the uniform estimate
for some \(\eta >0\). But the second term on the right is bounded by \(C\epsilon ^\eta \) times the left hand side and thus can be absorbed for sufficiently small \(\epsilon >0\). This gives a uniform coercive estimate for \(\mathcal {L}_{\widetilde{\gamma },\widetilde{k}}\mathcal {L}_{\widetilde{\gamma },\widetilde{k}}^*\). The same estimate applies on the dual function spaces as well, and we conclude the uniformly invertibility of (4.32) for \(\epsilon <\epsilon _0\) when \(\epsilon _0>0\) is sufficiently small (depending only on the choice of \(s,s_0\)). Fixing \(\epsilon _0\) for the choice \(s=0\), \(s_0=-1\), say, the uniform invertibility follows for any other choice of \(s\in \mathbb {R}\) by elliptic regularity (when \(s>0\)) and duality (when \(s<0\)). \(\quad \square \)
We shall need to understand the regularity properties in \(\epsilon \) of the inverse of (4.33). Here, regularity is meant in the following sense:
Definition 4.24
(Regularity of families of elements of q-00-Sobolev spaces). Define the domain \(\widetilde{\Omega }\) as in Proposition 4.23; let \(\hat{\Omega }=\{\hat{\rho }_2\ge 0\}\subset {\hat{X}}\) and \(\Omega _\circ =\{\rho _2\ge 0\}\subset X_\circ \) as in Propositions 4.10 and 4.16. Fix cutoffs \(\hat{\chi },\chi _\circ \in {\mathcal {C}}^\infty (\widetilde{X})\) as in (3.2) with the additional property that \(\hat{\rho }_2>0\) on \({\text {supp}}\chi _\circ \), and \(\rho _2>0\) on \({\text {supp}}\hat{\chi }\). Recall the maps \(\hat{\phi }\), \(\phi _\circ \) from (3.6). Let \(\epsilon _0>0\) and \(s\in \mathbb {R}\).
-
(1)
(Continuity.) The space
$$\begin{aligned} {\mathcal {C}}^0([0,\epsilon _0),H_{{\textrm{q}},0 0,\epsilon }^s(\widetilde{\Omega }_\epsilon )) \end{aligned}$$consists of all families \((u_\epsilon )_{\epsilon \in (0,\epsilon _0)}\) such that \(\Vert u_\epsilon \Vert _{H_{{\textrm{q}},0 0,\epsilon }^s(\widetilde{\Omega }_\epsilon )}\) is uniformly bounded for \(\epsilon \in (0,\epsilon _1)\) for all \(\epsilon _1<\epsilon _0\), and so that \(\hat{\phi }^*(\hat{\chi } u_\epsilon )\in H_{{\textrm{b}},0 0}^s(\hat{\Omega })\) and \(\phi _\circ ^*(\chi _\circ u_\epsilon )\in H_{{\textrm{b}},0 0}^s(\Omega _\circ )\) depend continuously on \(\epsilon \).
-
(2)
(Conormal regularity.) For \(m\in \mathbb {N}_0\), define \(\mathcal {A}_m([0,\epsilon _0),H_{{\textrm{q}},0 0,\epsilon }^s(\widetilde{\Omega }_\epsilon ))\) to consist of all \(u\in {\mathcal {C}}^0([0,\epsilon _0),H_{{\textrm{q}},0 0,\epsilon }^s(\widetilde{\Omega }_\epsilon ))\) so that \((\epsilon \partial _\epsilon )^j \hat{\phi }^*(\hat{\chi } u_\epsilon )\in H_{{\textrm{b}},0 0}^{s-j}(\hat{\Omega })\) and \((\epsilon \partial _\epsilon )^j \phi _\circ ^*(\chi _\circ u_\epsilon )\in H_{{\textrm{b}},0 0}^{s-j}(\Omega _\circ )\) are continuous and uniformly bounded for all \(j\le m\) and \(\epsilon \in (0,\epsilon _1)\), \(\epsilon _1<\epsilon _0\). (In the first expression, \(\partial _\epsilon \) is the coordinate derivative in \((\epsilon ,{\hat{x}})\)-coordinates; in the second expression, it is defined as the derivative along the first factor of \([0,1)_\epsilon \times X_\circ \).)
Spaces of families of elements of weighted q-00-Sobolev spaces are defined analogously (cf. the relationship (3.8)).
Thus, \(\mathcal {A}_0={\mathcal {C}}^0\). We also note that the derivatives in part (2) are well-defined as distributional derivatives due to the a priori assumption that \(u\in {\mathcal {C}}^0\). The regularity condition in part (2) can be rewritten as follows. Let \(\mathcal {R}\in \mathcal {V}_{\textrm{b}}(\widetilde{X})\) denote a vector field whose push-forward to \([0,1)_\epsilon \) is given by \(\epsilon \partial _\epsilon \) and which is tangent to \(\partial \widetilde{\Omega }\). One possible choice for \(\mathcal {R}\), in \((\epsilon ,x)\)-coordinates, is \(\epsilon \partial _\epsilon +\hat{\chi } x\partial _x\); indeed, this equals \(\epsilon \partial _\epsilon -(1-\hat{\chi }){\hat{x}}\partial _{{\hat{x}}}\) in \((\epsilon ,{\hat{x}})\)-coordinates. Then \(u\in {\mathcal {C}}^0([0,\epsilon _0),H_{{\textrm{q}},0 0,\epsilon }^s(\widetilde{\Omega }_\epsilon ))\) lies in \(\mathcal {A}_m\) iff \(\mathcal {R}^j u\in {\mathcal {C}}^0([0,\epsilon _0),H_{{\textrm{q}},0 0,\epsilon }^{s-j}(\widetilde{\Omega }_\epsilon ))\) for \(j=1,\ldots ,m\).
Lemma 4.25
(Regularity of the inverse). Let \(m\in \mathbb {N}_0\). In the notation of Proposition 4.23, the inverse of the map (4.33) is bounded as a map
Proof
Suppose that \(\mathcal {L}_{\widetilde{\gamma },\widetilde{k}}\mathcal {L}_{\widetilde{\gamma },\widetilde{k}}^*u=f\) with \(f\in \mathcal {A}_0={\mathcal {C}}^0\). To prove the continuity of \(u=u(\epsilon )\) at some value \(\epsilon =\epsilon _\infty \in (0,\epsilon _0)\), note that the weak compactness of the unit ball in the \(H_{{\textrm{b}},0 0}\)-spaces on \(\hat{\Omega }\) and \(\Omega _\circ \) implies that for any given sequence \(\epsilon _i\rightarrow \epsilon _\infty \), we can pass to a subsequence so that \(\epsilon ^{\frac{n}{2}}\hat{\phi }^*(\hat{\chi } u(\epsilon _i))\) and \(\phi _\circ ^*(\chi _\circ u(\epsilon _i))\) converge weakly to some
Moreover, \(\epsilon ^{-\frac{n}{2}}\hat{\phi }_*({\hat{u}})=(\phi _\circ )_*(u_\circ )\) on \(\hat{\chi }^{-1}(1)\cap \chi _\circ ^{-1}(1)\). Thus, there exists \(u_\infty \in (H_{0 0}^{s+2}\oplus H_{0 0}^{s+1})(\widetilde{\Omega }\cap \widetilde{X}_{\epsilon _\infty })\) with \(\epsilon ^{\frac{n}{2}}\hat{\phi }^*(\hat{\chi } u_\infty )={\hat{u}}\) and \(\phi _\circ ^*(\chi _\circ u_\infty )=u_\circ \). Since the coefficients of \(\mathcal {L}_{\widetilde{\gamma },\widetilde{k}}\mathcal {L}_{\widetilde{\gamma },\widetilde{k}}^*\) on \(\widetilde{\Omega }^\circ \cap \{\epsilon >0\}\) depend smoothly on \(\epsilon \), we find, upon taking the limit \(i\rightarrow \infty \), that \(\mathcal {L}_{\widetilde{\gamma },\widetilde{k}}\mathcal {L}_{\widetilde{\gamma },\widetilde{k}}^*u_\infty =f(\epsilon _\infty )\). The invertibility of \(\mathcal {L}_{\widetilde{\gamma },\widetilde{k}}\mathcal {L}_{\widetilde{\gamma },\widetilde{k}}^*\) implies that \(u_\infty =u(\epsilon _\infty )\), as desired.
When \(f\in \mathcal {A}_k\) and \(m=1\), we note that
The first term lies in \(\mathcal {A}_0\); furthermore, the operator acting on u in the second term is a q-00-operator of class (4.34), and thus the second term lies in \(\mathcal {A}_0\) as well. Inverting \(\mathcal {L}_{\widetilde{\gamma },\widetilde{k}}\mathcal {L}_{\widetilde{\gamma },\widetilde{k}}^*\) therefore implies \(\mathcal {R}u\in \mathcal {A}_0\). The case of \(m\ge 2\) follows by induction. \(\quad \square \)
Remark 4.26
(Relaxing the regularity of \(\widetilde{\gamma },\widetilde{k}\) in \(\epsilon \)) Later, we shall use slight extension of Proposition 4.22, in which we work with \(\widetilde{\gamma }+\widetilde{h},\widetilde{k}+\widetilde{q}\) where \(\widetilde{h},\widetilde{q}\) are continuous families (in \(\epsilon \)) of fiberwise smooth tensors on \(\widetilde{X}\) which decay rapidly as \(\epsilon \searrow 0\). Lemma 4.25 similarly remains valid for \(m=0\). Note indeed that in the proof of Proposition 4.22, the regularity of \(\widetilde{\gamma },\widetilde{h}\) in \(\epsilon \) is not used except when arguing that \(\mathcal {L}_{\widetilde{\gamma },\widetilde{k}}\mathcal {L}_{\widetilde{\gamma },\widetilde{k}}^*\) differs from its two normal operators by error terms which decay, as q-00-differential operators, at the respective boundary hypersurface of \(\widetilde{X}\). This decay is still guaranteed due to the rapid decay of \(\widetilde{h},\widetilde{q}\) as \(\epsilon \searrow 0\).
Since in the gluing problem the boundary data (4.25) will be given, and satisfy the constraint equations, our task (in view of Proposition 4.22(1) is to find subleading corrections so as produce a solution of the constraint equations. At the subleading level, the nonlinear constraints map is well approximated by its linearization:
Lemma 4.27
(Accuracy of the linearization) Let \((\widetilde{\gamma },\widetilde{k})\) be a \((\hat{\mathcal {E}},\mathcal {E})\)-smooth total family. Put
for all \(\widetilde{h},\widetilde{q}\) for which the right hand side is defined for \(\epsilon <\epsilon _0\) with some \(\epsilon _0>0\) (depending on \(\widetilde{\gamma },\widetilde{k},\widetilde{h},\widetilde{q}\)).
-
(1)
(Polyhomogeneous version.) Suppose \(\hat{\mathcal {E}}'\subset \hat{\mathcal {E}}\) and \(\mathcal {E}'\subset \mathcal {E}\) are index sets. Put \(\hat{\mathcal {F}}'=2\hat{\mathcal {E}}'+(\mathbb {N}_0\cup \hat{\mathcal {E}})\) and \(\mathcal {F}'=2\mathcal {E}'+(\mathbb {N}_0\cup \mathcal {E})\). Let \(({\tilde{h}},{\tilde{q}})\in \mathcal {A}_{\textrm{phg}}^{\hat{\mathcal {E}}',\mathcal {E}'}(\widetilde{X}\setminus \widetilde{K}^\circ ;S^2\widetilde{T}^*\widetilde{X})\oplus \mathcal {A}_{\textrm{phg}}^{\hat{\mathcal {E}}'-1,\mathcal {E}'}(\widetilde{X}\setminus \widetilde{K}^\circ ;S^2\widetilde{T}^*\widetilde{X})\). ThenFootnote 26
$$\begin{aligned} Q(\widetilde{\gamma },\widetilde{k};\widetilde{h},\widetilde{q}) \in \mathcal {A}_{\textrm{phg}}^{\hat{\mathcal {F}}'-2,\mathcal {F}'}(\widetilde{X}\setminus \widetilde{K}^\circ ;\underline{\mathbb {R}}{}\oplus \widetilde{T}^*\widetilde{X}). \end{aligned}$$(4.40) -
(2)
(q-00-Sobolev version.) In the notation of Proposition 4.23, suppose that \(\alpha _0,\hat{\alpha }>0\) and \(s>\frac{n}{2}+2\). Let \(\alpha '_2\in \mathbb {R}\). Then we have a uniform estimate
$$\begin{aligned} \begin{aligned}&\Vert Q(\widetilde{\gamma },\widetilde{k};\widetilde{h}_1,\widetilde{q}_1)-Q(\widetilde{\gamma },\widetilde{k};\widetilde{h}_2,\widetilde{q}_2)\Vert _{\hat{\rho }^{2\hat{\alpha }-2}\rho _\circ ^{2\alpha _\circ }\tilde{\rho }_2^{\alpha '_2}e^{-\beta /\tilde{\rho }_2}H_{{\textrm{q}},0 0,\epsilon }^{s-2}\oplus \hat{\rho }^{2\hat{\alpha }-2}\rho _\circ ^{2\alpha _\circ }\tilde{\rho }_2^{\alpha '_2}e^{-\beta /\tilde{\rho }_2}H_{{\textrm{q}},0 0,\epsilon }^{s-1}} \\&\quad \le C\Vert (\widetilde{h}_1,\widetilde{q}_1)-(\widetilde{h}_2,\widetilde{q}_2)\Vert _{\hat{\rho }^{\hat{\alpha }}\rho _\circ ^{\alpha _\circ }\tilde{\rho }_2^{\alpha _2}e^{-\beta /\tilde{\rho }_2}H_{{\textrm{q}},0 0,\epsilon }^s\oplus \hat{\rho }^{\hat{\alpha }-1}\rho _\circ ^{\alpha _\circ }\tilde{\rho }_2^{\alpha _2}e^{-\beta /\tilde{\rho }_2}H_{{\textrm{q}},0 0,\epsilon }^s} \\&\qquad \times \max _{j=1,2} \Vert (\widetilde{h}_j,\widetilde{q}_j)\Vert _{\hat{\rho }^{\hat{\alpha }}\rho _\circ ^{\alpha _\circ }\tilde{\rho }_2^{\alpha _2}e^{-\beta /\tilde{\rho }_2}H_{{\textrm{q}},0 0,\epsilon }^s\oplus \hat{\rho }^{\hat{\alpha }-1}\rho _\circ ^{\alpha _\circ }\tilde{\rho }_2^{\alpha _2}e^{-\beta /\tilde{\rho }_2}H_{{\textrm{q}},0 0,\epsilon }^s} \end{aligned} \end{aligned}$$(4.41)for all \(({\tilde{h}}_j,{\tilde{q}}_j)\), \(j=1,2\), whose norm in the function space \(\hat{\rho }^{\hat{\alpha }}\rho _\circ ^{\alpha _\circ }\tilde{\rho }_2^{\alpha _2}e^{-\beta /\tilde{\rho }_2}H_{{\textrm{q}},0 0,\epsilon }^{s_0}\oplus \hat{\rho }^{\hat{\alpha }-1}\rho _\circ ^{\alpha _\circ }\tilde{\rho }_2^{\alpha _2}e^{-\beta /\tilde{\rho }_2}H_{{\textrm{q}},0 0,\epsilon }^{s_0}\) is sufficiently small depending on \(s_0>\frac{n}{2}+2\), \(\alpha _2\in \mathbb {R}\), \(\beta >0\), and \(\alpha _\circ -\hat{\alpha }\) (as well as on \(\widetilde{\gamma },\widetilde{k}\)). In particular, using the same norms as in (4.41),
$$\begin{aligned} \Vert Q(\widetilde{\gamma },\widetilde{k};\widetilde{h},\widetilde{q})\Vert \le C\Vert (\widetilde{h},\widetilde{q})\Vert ^2. \end{aligned}$$
In part (1), note that \(L_{\widetilde{\gamma },\widetilde{k}}=L_{\widetilde{\gamma },\widetilde{k}}\widetilde{w}\circ \widetilde{w}^{-1}\) maps \((\widetilde{h},\widetilde{q})\) into
Thus, (4.40) shows that \(P(\widetilde{\gamma }+\widetilde{h},\widetilde{k}+\widetilde{q};\Lambda )-P(\widetilde{\gamma },\widetilde{k};\Lambda )\) is given by \(L_{\widetilde{\gamma },\widetilde{k}}(\widetilde{h},\widetilde{q})\) modulo quadratically small error terms; similarly for part (2).
Proof of Lemma 4.27
Consider the schematic form of \(P(\widetilde{\gamma },\widetilde{k};\Lambda )\) in (4.1). In local coordinates on X, lifted to (singular) coordinates on \(\widetilde{X}\), the Christoffel symbols of \(\widetilde{\gamma }\) are of the form \(\Gamma (\widetilde{\gamma })\sim \widetilde{\gamma }^{-1}D\widetilde{\gamma }\) where D is a coordinate derivative, and where \(\widetilde{\gamma },\widetilde{\gamma }^{-1}\) denote components of the (inverse) metric in these coordinates. The scalar curvature is thus of the form
Now, in the neighborhood of any point on X and for sufficiently small \(|\widetilde{h}|\) (maximum norm of the components of \(\widetilde{h}\)), we have \((\widetilde{\gamma }+\widetilde{h})^{-1}=\sum _{j=0}^\infty (-1)^j\widetilde{\gamma }^{-1}(\widetilde{h}\widetilde{\gamma }^{-1})^j\); therefore, for twice continuously differentiable \(\widetilde{h}\) with small \(|\widetilde{h}|\), we can expand \(R_{\widetilde{\gamma }+\widetilde{h}}\sim (\widetilde{\gamma }+\widetilde{h})^{-1}D^2(\widetilde{\gamma }+\widetilde{h})+(\widetilde{\gamma }+\widetilde{h})^{-2}(D\widetilde{\gamma }+D\widetilde{h})^2\) into a convergent (in \({\mathcal {C}}^0\)) power series around \(\widetilde{h}=0\), giving
Here, we have \(c_j(\widetilde{\gamma })\in \mathcal {A}_{\textrm{phg}}^{\mathbb {N}_0\cup \hat{\mathcal {E}},\mathbb {N}_0\cup \mathcal {E}}\) since \(\widetilde{\gamma }\in \mathcal {A}_{\textrm{phg}}^{\mathbb {N}_0\cup \hat{\mathcal {E}},\mathbb {N}_0\cup \mathcal {E}}\). In a similar vein, since \(({\text {tr}}_{\widetilde{\gamma }}\widetilde{k})^2\sim (\widetilde{\gamma }^{-1}\widetilde{k})^2\), the expression \(({\text {tr}}_{\widetilde{\gamma }+\widetilde{h}}(\widetilde{k}+\widetilde{q}))^2\) is equal to the sum of the nonlinear leading term \(({\text {tr}}_{\widetilde{\gamma }}\widetilde{k})^2\), its linearization in \((\widetilde{\gamma },\widetilde{k})\) evaluated at \((\widetilde{h},\widetilde{q})\), and a nonlinear error term
the term \(|\widetilde{k}|_{\widetilde{\gamma }}^2\sim (\widetilde{\gamma }^{-1}\widetilde{k})^2\) has a completely analogous description.
For the second component of \(P(\widetilde{\gamma },\widetilde{k};\Lambda )\), the terms \(\delta _{\widetilde{\gamma }}\widetilde{k}\) and \(\textrm{d}{\text {tr}}_{\widetilde{\gamma }}\widetilde{k}\) have the schematic forms \(\widetilde{\gamma }^{-1}(D\widetilde{k}+(\widetilde{\gamma }^{-1}D\widetilde{\gamma })\widetilde{k})\) and \(D(\widetilde{\gamma }^{-1}\widetilde{k})\), respectively, which are both schematically equal to \(\widetilde{\gamma }^{-1}D\widetilde{k}+\widetilde{\gamma }^{-2}(D\widetilde{\gamma })\widetilde{k}\). Thus, the quadratic and higher order (in \((\widetilde{h},\widetilde{q})\)) terms of \(\delta _{\widetilde{\gamma }+\widetilde{h}}(\widetilde{k}+\widetilde{q})\) and \(\textrm{d}{\text {tr}}_{\widetilde{\gamma }+\widetilde{h}}(\widetilde{k}+\widetilde{q})\sim (\widetilde{\gamma }+\widetilde{h})^{-1}D(\widetilde{k}+\widetilde{q})+(\widetilde{\gamma }+\widetilde{h})^{-2}(D(\widetilde{\gamma }+\widetilde{h}))(\widetilde{k}+\widetilde{q})\) are of the schematic form
Using
and the fact that \(D\in \hat{\rho }^{-1}\mathcal {V}_{\textrm{q}}(\widetilde{X})\), we can now prove the Lemma.
For part (1), consider the first term of \(Q_1\), which is \(\widetilde{h}^j(D^2\widetilde{\gamma })\widetilde{h}^2\in \mathcal {A}_{\textrm{phg}}^{(\mathbb {N}_0\cup \hat{\mathcal {E}})-2,\mathbb {N}_0\cup \mathcal {E}}\cdot \mathcal {A}_{\textrm{phg}}^{(2+j)\hat{\mathcal {E}}',(2+j)\mathcal {E}'}\); since \(\hat{\mathcal {E}}'\subset \hat{\mathcal {E}}\), we have \((\mathbb {N}_0\cup \hat{\mathcal {E}})+(2+j)\hat{\mathcal {E}}'\subset (\mathbb {N}_0\cup \hat{\mathcal {E}})+2\hat{\mathcal {E}}'\), similarly for the un-hatted index sets. Thus, \(\widetilde{h}^j(D^2\widetilde{\gamma })\widetilde{h}^2\in \mathcal {A}_{\textrm{phg}}^{\hat{\mathcal {F}}'-2,\mathcal {F}'}\) indeed. Structurally, note that this term, and indeed all terms comprising \(Q_1\) and \(Q_2\), contain a total of exactly 2 factors involving \(D,\widetilde{k},\widetilde{q}\). Each of these factors reduces the index set at \({\hat{X}}\) by 1 compared to \(\widetilde{\gamma },\widetilde{h}\); therefore, \(Q_1,Q_2\in \mathcal {A}_{\textrm{phg}}^{\hat{\mathcal {F}}'-2,\mathcal {F}'}\). The terms comprising \(Q_3\) have the same structure (for instance, \(\widetilde{h}^{2+j}(D\widetilde{k})\) involves D and \(\widetilde{k}\), while \(\widetilde{h}^j(D\widetilde{h})\widetilde{q}\) involves D and \(\widetilde{q}\)), and thus also \(Q_3\in \mathcal {A}_{\textrm{phg}}^{\hat{\mathcal {F}}'-2,\mathcal {F}'}\).
Turning to part (2) and considering the case \((\widetilde{h}_2,\widetilde{q}_2)=0\) first, we use the multiplicative properties of q-00-Sobolev spaces (defined relative to a positive q-00-density), which for unweighted spaces are the same as those of standard Sobolev spaces on \(\mathbb {R}^n\) (as follows from the bounded geometry perspective), while for weighted spaces the weights are additive. In particular, \(H_{{\textrm{q}},0 0,\epsilon }^s\) is an algebra under pointwise multiplication for \(s>\frac{n}{2}\), and there exists an \(\epsilon \)-independent constant C so that \(\Vert u v\Vert _{H_{{\textrm{q}},0 0,\epsilon }^s}\le C\Vert u\Vert _{H_{{\textrm{q}},0 0,\epsilon }^s}\Vert v\Vert _{H_{{\textrm{q}},0 0,\epsilon }^s}\). Thus, for example, using the membership \(D^2\widetilde{\gamma }\in \mathcal {A}^{-2,0}\), we have
Since for \(\beta >0\) multiplication by \(e^{-\beta /\tilde{\rho }_2}\tilde{\rho }_2^\alpha \) is uniformly bounded on any fixed weighted q-00-Sobolev space for any \(\alpha \), we can replace the weight \(\tilde{\rho }_2^{(2+j)\alpha _2}e^{-(2+j)\beta /\tilde{\rho }_2}\) on the left hand sides by \(C_2^j\tilde{\rho }_2^{\alpha '_2}e^{-\beta /\tilde{\rho }_2}\) with \(C_2>0\). Turning to the term \(c_j(\widetilde{\gamma })\widetilde{h}^{j+1}(D^2\widetilde{h})\) in \(Q_1(\widetilde{\gamma },\widetilde{h})\), observe that D, relative to a q-00-derivative, loses two powers of \(\tilde{\rho }_2\) in addition to one power of \(\hat{\rho }\); we thus have
Arguing as before, the \(\tilde{\rho }_2\)-weight on the left can be replaced by \(\tilde{\rho }_2^{\alpha '_2}e^{-\beta /\tilde{\rho }_2}\) upon suitably enlarging the constant C. The analysis of the remaining terms comprising \(Q_1,Q_2,Q_3\) is analogous. In order to estimate the difference \(Q(\widetilde{\gamma },\widetilde{k};\widetilde{h}_1,\widetilde{q}_1)-Q(\widetilde{\gamma },\widetilde{k};\widetilde{h}_2,\widetilde{q}_2)\), one uses identities such as \(\widetilde{h}_1^j-\widetilde{h}_2^j=(\widetilde{h}_1-\widetilde{h}_2)\Sigma \) where \(\Sigma :=\sum _{k=0}^{j-1}\widetilde{h}_1^k\widetilde{h}_2^{j-1-k}\); and the norm of \(\Sigma \) is bounded by Cj times the \((j-1)\)-st power of the maximum of the norms of \({\tilde{h}}_1\) and \({\tilde{h}}_2\). \(\quad \square \)
For the construction of a formal solution to the gluing problem, we only need:
Corollary 4.28
(Accuracy of the normal operators) Let \((\widetilde{\gamma },\widetilde{k})\) be a \((\hat{\mathcal {E}},\mathcal {E})\)-smooth total family with boundary data \((\hat{\gamma },{\hat{k}})\) and \((\gamma ,k)\); let further \(\hat{\mathcal {E}}'\subset \hat{\mathcal {E}}\) and \(\mathcal {E}'\subset \mathcal {E}\) be index sets, and let \((\widetilde{h},\widetilde{q})\in \mathcal {A}_{\textrm{phg}}^{\hat{\mathcal {E}}',\mathcal {E}'}(\widetilde{X}\setminus \widetilde{K}^\circ ;S^2\widetilde{T}^*\widetilde{X})\oplus \mathcal {A}_{\textrm{phg}}^{\hat{\mathcal {E}}'-1,\mathcal {E}'}(\widetilde{X}\setminus \widetilde{K}^\circ ;S^2\widetilde{T}^*\widetilde{X})\). Using the notation of Sect. 3.1, we then have
Proof
This follows from Lemma 4.27 upon using (4.29). Indeed, acting with
on \(\widetilde{w}^{-1}(\widetilde{h},\widetilde{q})\in \mathcal {A}_{\textrm{phg}}^{\hat{\mathcal {E}}'-2,\mathcal {E}'}\) gives an element of the space in (4.43a); note also that acting on the pair \((\widetilde{h},\widetilde{q})\) of symmetric 2-tensors, we have
Moreover, the quadratic error term (4.40) lies in the space (4.43a) as well, since in the notation of Lemma 4.27 we have \(\hat{\mathcal {F}}'\subset \hat{\mathcal {E}}'+\hat{\mathcal {E}}\) and \(\mathcal {F}'\subset \mathcal {E}'+\mathcal {E}\). The argument for (4.43b) is completely analogous, now using that \(w_\circ \widetilde{w}^{-1}={\text {Id}}\). \(\quad \square \)
5 Gluing Construction
The data for our gluing problem are as follows.
Definition 5.1
(Gluing data) Let X be an n-dimensional smooth manifold, \(n\ge 3\); let \(\Lambda \in \mathbb {R}\). Let \(\mathcal {E}\subset \mathbb {C}\times \mathbb {N}_0\) be an index set with \({\text {Re}}\mathcal {E}>0\); let \({\hat{K}}\subset \mathbb {R}^n\) be compact with smooth boundary and connected complement. Then \(\mathcal {E}\)-gluing data (with cosmological constant \(\Lambda \)) consist of a point \(\mathfrak {p}\in X\) and two pairs \((\hat{\gamma },{\hat{k}})\), \((\gamma ,k)\) with the following properties: \((\hat{\gamma },{\hat{k}})\) is \(\mathcal {E}\)-asymptotically flat on \(\mathbb {R}^n\setminus {\hat{K}}^\circ \) (see Definition 4.4); \(\gamma ,k\) are smooth symmetric 2-tensors on X, with \(\gamma \) Riemannian; and both pairs satisfy the nonlinear constraint equations
Given X, we recall from Definition 3.1 the notation \(\widetilde{X}\) for the total gluing space and \({\hat{X}}\cong \overline{\mathbb {R}^n}\) and \(X_\circ =[X;\{\mathfrak {p}\}]\) for the boundary hypersurfaces of \(\widetilde{X}\).
Theorem 5.2
(Main result, detailed version) Let \(\mathfrak {p}\in X\) and \((\hat{\gamma },{\hat{k}})\), \((\gamma ,k)\) be \(\mathcal {E}\)-gluing data (with cosmological constant \(\Lambda \)) as in Definition 5.1. Suppose \(\mathcal {U}^\circ \subset X\) is a smoothly bounded connected open set containing \(\mathfrak {p}\) with compact closure \(\mathcal {U}=\overline{\mathcal {U}^\circ }\), and suppose that \((X,\gamma ,k)\) has no KIDs in \(\mathcal {U}^\circ \) (see Definition 4.13). Let \(0<\delta <\min ({\text {Re}}\mathcal {E},n-2)\). Then there exist index sets \(\hat{\mathcal {E}}_\sharp ,\mathcal {E}_\sharp \subset \mathbb {C}\times \mathbb {N}_0\) with \({\text {Re}}\hat{\mathcal {E}}_\sharp >0\) and \({\text {Re}}\mathcal {E}_\sharp >\delta \), and a \((\hat{\mathcal {E}}_\sharp ,\mathcal {E}_\sharp )\)-smooth total family \((\widetilde{\gamma },\widetilde{k})\) (defined on \(\widetilde{X}\setminus \widetilde{K}^\circ \), with \(\widetilde{K}^\circ \) given in Definition 4.20) with the following properties:
-
(1)
the boundary data of \((\widetilde{\gamma },\widetilde{k})\) are equal to \((\hat{\gamma },{\hat{k}})\) and \((\gamma ,k)\);
-
(2)
we have \((\widetilde{\gamma },\widetilde{k})=(\gamma ,k)\) near \(\widetilde{X}\setminus ([0,1)\times \mathcal {U}^\circ )\), and \((\textsf{s}^{-1}\widetilde{\gamma },\textsf{s}^{-1}(\epsilon \widetilde{k}))=(\hat{\gamma },{\hat{k}})\) near \({\hat{r}}\le {\hat{R}}_0\) where \({\hat{R}}_0>0\) is any fixed constant such that \(B(0,{\hat{R}}_0)\supset {\hat{K}}\);
-
(3)
there exists \(\epsilon _\sharp >0\) so that \((\widetilde{\gamma },\widetilde{k})\) solves the constraint equations in \(\epsilon <\epsilon _\sharp \), i.e. \(P(\widetilde{\gamma },\widetilde{k};\Lambda )=0\) on every \(\epsilon \)-level set of \(\widetilde{X}\setminus \widetilde{K}^\circ \) for \(\epsilon <\epsilon _\sharp \).
Remark 5.3
(\(\mathcal {E}\) and the Positive Mass Theorem) If \({\text {Re}}\mathcal {E}>n-2\), then the ADM energy E and ADM linear momentum \(P=(P_1,P_2,P_3)\) of the data set \((\hat{\gamma },{\hat{k}})\) are equal to 0 (which implies that the ADM mass vanishes); see e.g. [EHLS15, §2] for the definitions. This conjecturally implies that \(({\hat{X}},\hat{\gamma },{\hat{k}})\) can be isometrically embedded as a spacelike hypersurface in \((n+1)\)-dimensional Minkowski space. This is known for \(n\le 7\) [Eic13, HL20], following earlier work in the case of spin manifolds [Wit81, BC96, CM06]. See also [SY79, EHLS15, SY17, Loh16, HZ22]. We also remark that while we assume strong regularity of \(\hat{\gamma },{\hat{k}}\) at infinity, the decay may be too weak to allow for a definition of the ADM energy and linear momentum.
5.1 Formal solution
By appropriately alternating the solution operators for the linearized constraints map of the two given initial data sets, we shall now prove:
Proposition 5.4
(Polyhomogeneous formal solution of the gluing problem) Under the assumptions of Theorem 5.2 and using its notation, there exists a \((\hat{\mathcal {E}}_\sharp ,\mathcal {E}_\sharp )\)-smooth total family \((\widetilde{\gamma },\widetilde{k})\) satisfying properties (1), (2), but only:
- (3’):
-
\((\widetilde{\gamma },\widetilde{k})\) is a formal solution of the constraint equations, i.e. \(P(\widetilde{\gamma },\widetilde{k};\Lambda )\in \dot{{\mathcal {C}}}^\infty (\widetilde{X}\setminus \widetilde{K}^\circ ;\underline{\mathbb {R}}{}\oplus \widetilde{T}^*\widetilde{X})\) vanishes to infinite order at \({\hat{X}}\) and \(X_\circ \).
Proof of Proposition 5.4
Recall that \(\hat{\rho }\) and \(\rho _\circ \in {\mathcal {C}}^\infty (\widetilde{X})\) are boundary defining functions of \({\hat{X}}\) and \(X_\circ \), respectively; for convenience, we choose them so that \(\hat{\rho }\rho _\circ =\epsilon \). Moreover, by replacing \(\mathcal {E}\) with \(\mathcal {E}\cup (\mathbb {N}_0+n-2)\), we may assume that \(\min {\text {Re}}\mathcal {E}\le n-2\); and we replace \(\mathcal {E}\) by its nonlinear closure \(\bigcup _{j\in \mathbb {N}}j\mathcal {E}\). Lastly, we choose the localizer \(\hat{\chi }\in {\mathcal {C}}^\infty (\widetilde{X})\) to a collar neighborhood of \({\hat{X}}\) so that \({\text {supp}}\hat{\chi }\subset \widetilde{\upbeta }^{-1}([0,1)\times \mathcal {U}^\circ )\).
\(\bullet \) Naive gluing. Fix geodesic normal coordinates \(x\in \mathbb {R}^n\), \(|x|<r_0\), around \(\mathfrak {p}\) with respect to the metric \(\gamma \), and set \(\mathfrak {e}=\sum _{j=1}^n(\textrm{d}x^j)^2\); thus, \(\gamma =\mathfrak {e}\) at \(\mathfrak {p}\). We regard \({\hat{x}}=\frac{x}{\epsilon }\) as a map from a neighborhood of \({\hat{X}}\subset \widetilde{X}\) to \(\overline{\mathbb {R}^n}\). Writing then
we regard \(\textsf{s}(\hat{\gamma })=\hat{\gamma }_{{\hat{i}}{\hat{j}}}({\hat{x}})\,\textrm{d}x^i\,\textrm{d}x^j\) (in the notation of Definition 3.3) as a section of \(S^2\widetilde{T}^*\widetilde{X}\rightarrow \widetilde{X}\) in a collar neighborhood of \({\hat{X}}\); since \(\hat{\gamma }_{{\hat{i}}{\hat{j}}}-\delta _{i j}\) is polyhomogeneous, with index set \(\mathcal {E}\), in the defining function \(|{\hat{x}}|^{-1}\) of \(X_\circ \), we have
where \(\widetilde{K}=\{(\epsilon ,{\hat{x}}):{\hat{x}}\in {\hat{K}}\}\) as in Definition 4.20, and where \(\hat{\chi }\in {\mathcal {C}}^\infty (\widetilde{X})\) is 1 near \({\hat{X}}\) and is supported in \(|x|<r_0\). Similarly,
Therefore, the naive gluing
of the two pairs of initial data is an \((\hat{\mathcal {E}}_0,\mathcal {E}_0)\)-smooth total family in the sense of Definition 4.20, where we set \((\hat{\mathcal {E}}_0,\mathcal {E}_0)=(\mathbb {N}_0+1,\mathcal {E})\); its boundary data are \((\hat{\gamma },{\hat{k}})\) and \((\gamma ,k)\). By Proposition 4.22(1), we have
The absence of the \(\mathcal {O}(\hat{\rho }^{-2})\) leading order term of the error \(P(\widetilde{\gamma }_0,\widetilde{k}_0;\Lambda )\) at \({\hat{X}}\) and the strong vanishing at \(X_\circ \) are due to the fact that \((\hat{\gamma },{\hat{k}})\) and \((\gamma ,k)\) satisfy the constraint equations. This is the starting point for an iterative construction of a formal solution to the gluing problem. We record here that
\(\bullet \) \(\underline{{ Solving}\ { away}\ { to}\ { leading}\ { order}\ { at}\ {\hat{X}}.}\) Fix any \(0<\eta <\min (1,\delta )\). Suppose \((\widetilde{\gamma }_\ell ,\widetilde{k}_\ell )\) (for some \(\ell \in \mathbb {N}_0\), starting with \(\ell =0\)) is an \((\hat{\mathcal {E}}_\ell ,\mathcal {E}_\ell )\)-smooth total family, with boundary data equal to \((\hat{\gamma },{\hat{k}})\) and \((\gamma ,k)\), and so that
where the index sets satisfy
Moreover, we assume that \((\widetilde{\gamma }_\ell ,\widetilde{k}_\ell )=(\gamma ,k)\) in a neighborhood of \([0,1)\times (X_\circ \setminus \mathcal {U})\), and \((\textsf{s}^{-1}\widetilde{\gamma }_\ell ,\textsf{s}^{-1}(\epsilon \widetilde{k}_\ell ))=(\hat{\gamma },{\hat{k}})\) in a neighborhood of \({\hat{r}}\le {\hat{R}}_0\).
Consider \((z,m)\in \hat{\mathcal {F}}_\ell \) so that \((z,m+1)\notin \hat{\mathcal {F}}_\ell \) and \({\text {Re}}z=\min {\text {Re}}\hat{\mathcal {F}}_\ell \). Then there exists \((\widetilde{f},\widetilde{j})\in \mathcal {A}_{\textrm{phg}}^{\mathbb {N}_0,\mathcal {F}_\ell }(\widetilde{X}\setminus \widetilde{K}^\circ ;\underline{\mathbb {R}}{}\oplus \widetilde{T}^*\widetilde{X})\), vanishing near \({\hat{r}}\le {\hat{R}}_0\), so that
Our present aim is to correct \((\widetilde{\gamma }_\ell ,\widetilde{k}_\ell )\) so as to remove the term \(\hat{\rho }^{z-2}(\log \hat{\rho })^m(\widetilde{f},\widetilde{j})\) from \(P(\widetilde{\gamma }_\ell ,\widetilde{k}_\ell ;\Lambda )\). To this end, we write \(\hat{\rho }=\epsilon \rho _\circ ^{-1}\) and thus
Consider the i-th summand; we have
where \(\mathcal {G}_i=\mathcal {F}_\ell -z+(0,i)\). By (5.2), we have \({\text {Re}}\mathcal {G}_i>\delta -1\). We can thus apply Proposition 4.10(2) (with \(\alpha _\circ :=\delta -1<n-2\) indeed) to produce \(({\hat{h}}_i,{\hat{q}}_i)\in \mathcal {A}_{\textrm{phg}}^{\mathcal {G}_i\mathrel {\overline{\cup }}\hat{\mathcal {S}}}\oplus \mathcal {A}_{\textrm{phg}}^{(\mathcal {G}_i\mathrel {\overline{\cup }}\hat{\mathcal {S}})+1}\), vanishing near \({\hat{r}}\le {\hat{R}}_0\), with
the index set \(\hat{\mathcal {S}}\) here only depends on \(\delta \) and satisfies \({\text {Re}}\hat{\mathcal {S}}>\delta -1\). Put then
where
Note that \(\mathcal {F}^\sharp _\ell \supset \mathcal {F}_\ell \) and \(\min {\text {Re}}\mathcal {F}^\sharp _\ell =\min (\min {\text {Re}}\mathcal {F}_\ell ,\min {\text {Re}}\hat{\mathcal {F}}_\ell +\min {\text {Re}}\hat{\mathcal {S}})\). Therefore, \(\min {\text {Re}}\mathcal {F}^\sharp _\ell -\min {\text {Re}}\hat{\mathcal {F}}_\ell >\delta -1\), and \({\text {Re}}\mathcal {F}_\ell ^\sharp >\min (\delta -\eta ,(1-\eta )+(\delta -1))=\delta -\eta \); we may thus replace \(\mathcal {E}_\ell \) by the nonlinear closure of \(\mathcal {E}_\ell \cup (\mathcal {F}_\ell ^\sharp +(\mathbb {N}_0\cup \mathcal {E}_\ell ))\), and replace \(\mathcal {F}_\ell \) by \(\mathcal {F}^\sharp _\ell +(\mathbb {N}_0\cup \mathcal {E}_\ell )\), without affecting the validity of (5.2).
Having adjusted the index sets in this fashion, we now apply Corollary 4.28 (concretely, the membership (4.43a)), with \((\hat{\mathcal {E}},\mathcal {E})=(\hat{\mathcal {E}}_\ell ,\mathcal {E}_\ell )\) and \((\hat{\mathcal {E}}',\mathcal {E}')=((z,m),\mathcal {F}_\ell )\). This gives
Note here that \(\hat{\mathcal {F}}'_\ell \subsetneq \hat{\mathcal {F}}_\ell \), and \(\min {\text {Re}}\hat{\mathcal {F}}'_\ell >\min {\text {Re}}\hat{\mathcal {F}}_\ell +(1-\eta )\).
Proceeding in this manner for all \((z,m)\in \hat{\mathcal {F}}_\ell \) with the property that \(\min {\text {Re}}\hat{\mathcal {F}}_\ell \le {\text {Re}}z\le \min {\text {Re}}\hat{\mathcal {F}}_\ell +(1-\eta )\) (and adjusting \(\mathcal {E}_\ell \) and \(\mathcal {F}_\ell \) as above) produces \(({\tilde{h}},{\tilde{q}})\in \mathcal {A}_{\textrm{phg}}^{\hat{\mathcal {F}}_\ell ,\mathcal {F}_\ell }\oplus \mathcal {A}_{\textrm{phg}}^{\hat{\mathcal {F}}_\ell -1,\mathcal {F}_\ell }\) so that for the \((\hat{\mathcal {E}}_\ell ,\mathcal {E}_\ell )\)-smooth total family
we have
Thus, the remaining error has improved decay at \({\hat{X}}\) by more than \(1-\eta \) orders as compared to \(P(\widetilde{\gamma }_\ell ,\widetilde{k}_\ell ;\Lambda )\), while the adjustments of \(\mathcal {E}_\ell \) and \(\mathcal {F}_\ell \) were mild enough so as to preserve (5.2).
If \(\min {\text {Re}}\mathcal {F}_\ell -\min {\text {Re}}\hat{\mathcal {F}}^\ell >\delta -1\) still, we may repeat this procedure (with \(\hat{\mathcal {F}}^\ell \) in place of \(\hat{\mathcal {F}}_\ell \)), thus producing a further correction (with yet stronger decay at \({\hat{X}}\)) to \((\widetilde{\gamma }^\ell ,\widetilde{k}^\ell )\) whose remaining error has yet another \(1-\eta \) orders of vanishing at \({\hat{X}}\); and so on. Upon repeating this procedure the maximal number of times,Footnote 27 and by a minor abuse of notation again calling the final approximate solution \((\widetilde{\gamma }^\ell ,\widetilde{k}^\ell )\), we thus still have (5.4) and \((\widetilde{\gamma }^\ell ,\widetilde{k}^\ell )=(\gamma ,k)\) near \([0,1)\times (X_\circ \setminus \mathcal {U})\), and furthermore
\(\bullet \) \(\underline{{ Solving}\ { away}\ { to}\ { leading}\ { order}\ { at}\ X_\circ .}\) The arguments near \(X_\circ \) are very similar to (and somewhat simpler, due to the absence of scaling considerations, than) those near \({\hat{X}}\). We thus consider \((z,m)\in \mathcal {F}_\ell \) with \((z,m+1)\notin \mathcal {F}_\ell \) and \({\text {Re}}z=\min {\text {Re}}\mathcal {F}_\ell \), and \((\widetilde{f},\widetilde{j})\in \mathcal {A}_{\textrm{phg}}^{\hat{\mathcal {F}}^\ell -2,\mathbb {N}_0}(\widetilde{X}{\setminus }\widetilde{K}^\circ ; \underline{\mathbb {R}}{}\oplus \widetilde{T}^*\widetilde{X})\) with the property that
In the expansion
we consider the i-th summand. Note that
and \((f_i,j_i)\) has support in \(\upbeta _\circ ^{-1}(\mathcal {U}^\circ )\); moreover, \(\min {\text {Re}}\mathcal {G}_i\ge -\delta +1\) by (5.5). Proposition 4.16(2) (with \(\hat{\alpha }=-\delta +1-\eta >-n+2\)) then produces
with \({\text {supp}}h_i\), \({\text {supp}}q_i\subset \mathcal {U}^\circ \), and where the index set \(\mathcal {S}_\circ \) only depends on \(\delta ,\eta \) and satisfies \({\text {Re}}\mathcal {S}_\circ >-\delta +1-\eta \). We then put
where
We have \(\hat{\mathcal {F}}^{\ell \sharp }\supset \hat{\mathcal {F}}^\ell \) and \(\min {\text {Re}}\hat{\mathcal {F}}^{\ell \sharp }=\min (\min {\text {Re}}\hat{\mathcal {F}}^\ell ,\min {\text {Re}}\mathcal {F}_\ell +\min {\text {Re}}\mathcal {S}_\circ )\). Therefore, \(\min {\text {Re}}\mathcal {F}_\ell -\min {\text {Re}}\hat{\mathcal {F}}^{\ell \sharp }<\delta -1+\eta \), and \({\text {Re}}\hat{\mathcal {F}}^{\ell \sharp }>\min (1-\eta ,(\delta -2+\eta )+(2-\eta )+(-\delta +1-\eta ))=1-\eta \). Thus, we can replace \(\hat{\mathcal {E}}_\ell \) by the nonlinear closure of \(\hat{\mathcal {E}}_\ell \cup (\hat{\mathcal {F}}^{\ell \sharp }+(\mathbb {N}_0\cup \hat{\mathcal {E}}_\ell ))\), and replace \(\hat{\mathcal {F}}^\ell \) by \(\hat{\mathcal {F}}^{\ell \sharp }+(\mathbb {N}_0\cup \hat{\mathcal {E}}_\ell )\), without affecting the validity of the first two lines of (5.5).
Corollary 4.28 (concretely, the membership (4.43b)), with \((\hat{\mathcal {E}},\mathcal {E})=(\hat{\mathcal {E}}_\ell ,\mathcal {E}_\ell )\) and \((\hat{\mathcal {E}}',\mathcal {E}')=(\hat{\mathcal {F}}^\ell ,(z,m))\) gives
Thus, \({\text {Re}}\mathcal {F}_\ell '>\min {\text {Re}}\mathcal {F}_\ell +(\delta -\eta )\). In this fashion, we solve away all terms of \(P(\widetilde{\gamma }^\ell ,\widetilde{k}^\ell ;\Lambda )\) at \(X_\circ \) corresponding to \((z,m)\in \mathcal {F}_\ell \) with \(\min {\text {Re}}\mathcal {F}_\ell \le {\text {Re}}z\le \min {\text {Re}}\mathcal {F}_\ell +(\delta -\eta )\) at once. If subsequently we still have \(\min {\text {Re}}\mathcal {F}_\ell '-\min {\text {Re}}\hat{\mathcal {F}}^\ell \in (\delta -2+\eta ,\delta -1]\), we repeat this procedure. We thus produce \((\widetilde{h},\widetilde{q})\in \mathcal {A}_{\textrm{phg}}^{\hat{\mathcal {F}}^\ell ,\mathcal {F}_\ell }\oplus \mathcal {A}_{\textrm{phg}}^{\hat{\mathcal {F}}^\ell -1,\mathcal {F}_\ell }\) (with a new index set \(\hat{\mathcal {E}}_\ell \)) so that \((\widetilde{\gamma }_{\ell +1},\widetilde{k}_{\ell +1})=(\widetilde{\gamma }^\ell ,\widetilde{k}^\ell )+(\widetilde{h},\widetilde{q})\) is a \((\hat{\mathcal {E}}_{\ell +1},\mathcal {E}_{\ell +1})\)-smooth total family (with \(\hat{\mathcal {E}}_{\ell +1},\mathcal {E}_{\ell +1}\) denoting the adjustments of the index sets \(\hat{\mathcal {E}}_\ell ,\mathcal {E}_\ell \) performed in the course of the argument) for which
where \({\text {Re}}\hat{\mathcal {F}}_{\ell +1}>\min {\text {Re}}\hat{\mathcal {F}}_\ell +(1-\eta )\) and \({\text {Re}}\mathcal {F}_{\ell +1}>\min {\text {Re}}\mathcal {F}_\ell +(\delta -\eta )\), and (5.2) holds for \(\ell +1\) in place of \(\ell \). This completes the inductive step.
\(\bullet \) Asymptotic summation. We now have a sequence \((\widetilde{\gamma }_\ell ,\widetilde{k}_\ell )\), \(\ell =0,1,2,\ldots \), with the property that \(P(\widetilde{\gamma }_\ell ,\widetilde{k}_\ell ;\Lambda )\) is polyhomogeneous on \(\widetilde{X}\) with the index sets at \({\hat{X}}\) and \(X_\circ \) having minimal real parts tending to \(+\infty \), and so that \((\widetilde{h}_\ell ,\widetilde{q}_\ell ):=(\widetilde{\gamma }_{\ell +1},\widetilde{k}_{\ell +1})-(\widetilde{\gamma }_\ell ,\widetilde{k}_\ell )\) is polyhomogeneous with index sets having real parts larger than any desired constant C for all \(\ell \ge \ell _0(C)\). The latter property enables us to take \((\widetilde{\gamma },\widetilde{k})\) to be an asymptotic sum
which is an \((\hat{\mathcal {E}}_\sharp ,\mathcal {E}_\sharp )\)-smooth family for some index sets \(\hat{\mathcal {E}}_\sharp ,\mathcal {E}_\sharp \) which satisfy \({\text {Re}}\hat{\mathcal {E}}_\sharp >1-\eta \) and \({\text {Re}}\mathcal {E}_\sharp >\delta -\eta \) (from (5.2)). The former property implies, in view of Lemma 4.27(1) and (4.42), that \(P(\widetilde{\gamma },\widetilde{k};\Lambda )\) is polyhomogeneous on \(\widetilde{X}\) with index sets which have arbitrarily large minimal real parts; thus, \(P(\widetilde{\gamma },\widetilde{k};\Lambda )\in \dot{{\mathcal {C}}}^\infty (\widetilde{X}\setminus \widetilde{K}^\circ )\), as desired. The proof is complete. \(\quad \square \)
Remark 5.5
(Better regularity at \({\hat{X}}\)) In Proposition 5.4, one can arrange, for any desired value \(N\in \mathbb {N}\), that \({\text {Re}}\hat{\mathcal {E}}_\sharp >N\); thus, \((\widetilde{\gamma },\epsilon \widetilde{k})\) is of class \({\mathcal {C}}^N\) down to \({\hat{X}}\). Indeed, starting with the naive gluing \((\widetilde{\gamma }_0,\widetilde{k}_0)\) in (5.1), one solves away the error \(P(\widetilde{\gamma }_0,\widetilde{k}_0;\Lambda )\) to leading order at \({\hat{X}}\) (where it has index set \((\mathbb {N}_0+1)-2\)); arguing iteratively, once the error \(P(\widetilde{\gamma }_\ell ,\widetilde{k}_\ell ;\Lambda )\) has index set \((\mathbb {N}_0+\ell +1)-2\) at \({\hat{X}}\), and an index set \(\mathcal {E}_\ell \) at \(X_\circ \) with \({\text {Re}}\mathcal {E}_\ell >{\text {Re}}\mathcal {E}-\eta \), one solves away the leading term at \({\hat{X}}\) using Proposition 4.10(2) with \(\alpha _\circ =({\text {Re}}\mathcal {E}-\eta )-(\ell +1)\) (which satisfies the bound \(\alpha _\circ <n-2\) by an increasingly wide margin as \(\ell \) increases). Stopping this procedure at step \(\ell \gg N\), one then solves away the error term to a high order at \(X_\circ \) by means of Proposition 4.16(2) with large values of \(\hat{\alpha }\) (so that the corrections vanish to high order at \({\hat{X}}\)). Once one has thus constructed a \((\hat{\mathcal {E}}_\flat ,\mathcal {E}_\flat )\)-smooth total family \((\widetilde{\gamma }_\flat ,\widetilde{k}_\flat )\), with \({\text {Re}}\hat{\mathcal {E}}_\flat >N\), and with \(P(\widetilde{\gamma }_\flat ,\widetilde{k}_\flat ;\Lambda )\) vanishing to order \(\gg N\) at \({\hat{X}}\) and \(X_\circ \), one can correct \((\widetilde{\gamma }_\flat ,\widetilde{k}_\flat )\) to a formal solution by solving away the error in turns at \({\hat{X}}\) and \(X_\circ \) as in the proof of Proposition 5.4; an inspection of the proof shows that these corrections are polyhomogeneous and vanish to order \(\gg N\) at \({\hat{X}}\) and \(X_\circ \).
5.2 True solution
In this section, we prove Theorem 5.2. We start with the formal solution \((\widetilde{\gamma },\widetilde{k})\) from Proposition 5.4. Theorem 5.2 is then a consequence of the following result:
Proposition 5.6
(Correction of a formal solution) With \((\widetilde{\gamma },\widetilde{k})\) as in Proposition 5.4, there exists \((\widetilde{h},\widetilde{q})\in \dot{{\mathcal {C}}}^\infty (\widetilde{X}\setminus \widetilde{K}^\circ ;S^2\widetilde{T}^*\widetilde{X}\oplus S^2\widetilde{T}^*\widetilde{X})\) with the following properties:
-
(1)
we have \((\widetilde{h},\widetilde{q})=0\) near \(\widetilde{X}\setminus ([0,1)\times \mathcal {U}^\circ )\) and near \(\{(\epsilon ,x): \epsilon x\in B(0,{\hat{R}}_0)\}\subset \widetilde{X}\);
-
(2)
for \((\widetilde{\gamma }',\widetilde{k}')=(\widetilde{\gamma }+\widetilde{h},\widetilde{k}+\widetilde{q})\), we have \(P(\widetilde{\gamma }',\widetilde{k}';\Lambda )=0\) in \(\epsilon <\epsilon _\sharp \) for some \(\epsilon _\sharp >0\).
Proof
We choose \(\hat{\rho }\) and \(\rho _\circ \) so that \(\hat{\rho }\rho _\circ =\epsilon \). Fix \(\alpha _2\in \mathbb {R}\), \(\beta >0\), and \(\hat{\alpha },\alpha _\circ >0\) with \(\alpha _\circ -\hat{\alpha }<n-2\). Recall \(\widetilde{w}\) and \(\widetilde{w}_2\) from Proposition 4.23. Set \(\tilde{\alpha }=\hat{\alpha }+n-2-\frac{n}{2}\) and write
for the weights appearing in the definition (4.32) of \(\mathcal {L}_{\widetilde{\gamma },\widetilde{k}}=w_L L_{\widetilde{\gamma },\widetilde{k}}w_R\). We shall construct \((\widetilde{h},\widetilde{q})\) of the form
where \((\widetilde{f},\widetilde{j})\) solves the equation
Recalling the notation \(Q(\widetilde{\gamma },\widetilde{k};\widetilde{h},\widetilde{q})\) for the quadratic error (4.39), this equation is equivalent to
where \(\mathcal {L}:=\mathcal {L}_{\widetilde{\gamma },\widetilde{k}}\mathcal {L}_{\widetilde{\gamma },\widetilde{k}}^*\) is an elliptic unweighted q-00-operator (see (4.34)) which is uniformly invertible as a map (4.33). This lends itself to a fixed point formulation using the map
as we proceed to demonstrate.
\(\bullet \) Existence of a solution with fixed regularity. Fix \(s>\frac{n}{2}+2\); let \(N>0\) be arbitrary. We claim that there exist \(\epsilon _0>0\) and \(\delta >0\) so that T is a uniform contraction on the \(\delta \)-ball in the space
for \(\epsilon <\epsilon _0\), where \(\widetilde{\Omega }\subset \widetilde{X}\) is the domain defined in Proposition 4.23. To show this, note first that the norm of \(w_L P(\widetilde{\gamma },\widetilde{k};\Lambda )\) in the space
is bounded by \(C_1\epsilon \) since \(P(\widetilde{\gamma },\widetilde{k};\Lambda )\) lies in \(\dot{{\mathcal {C}}}^\infty (\widetilde{X})\) and vanishes near \(\tilde{\rho }_2=0\). (Here, \(C_1\) depends on s, N, and tacitly of course on the fixed constants \(\alpha _2,\beta ,\alpha _\circ ,\hat{\alpha }\) and on the tensors \(\widetilde{\gamma },\widetilde{k}\).) Next, when \(\Vert (\widetilde{f},\widetilde{j})\Vert _{\mathcal {X}_\epsilon ^{s,N}}<\delta \) and thus
an application of Lemma 4.27(2) gives, a fortiori,
Here, \(C_2\) and \(C'_2\) are constants depending only on s, N. Finally, write \(C_3\) for an upper bound on the (N-independent) operator norm of \(\mathcal {L}^{-1}:\mathcal {Y}_\epsilon ^{s,N}\rightarrow \mathcal {X}_\epsilon ^{s,N}\) for \(\epsilon <\epsilon _0\), where \(\epsilon _0\) is given by Proposition 4.23. Fix \(\delta _0>0\) so that \(C_3 C_2\delta _0^2<\frac{\delta _0}{2}\); for all \(\delta \le \delta _0\), there then exists \(\epsilon _1\in (0,\epsilon _0)\) so that \(C_3(C_1\epsilon +C_2\delta ^2)<\delta \) for \(\epsilon <\epsilon _1\). The map T thus maps the \(\delta \)-ball in \(\mathcal {X}_\epsilon ^{s,N}\) into itself.
Next, consider the difference
for \((\widetilde{h}_j,\widetilde{q}_j)\in \mathcal {X}_\epsilon ^{s,N}\) of norm less than \(\delta \). Using the estimate (4.41), the \(\mathcal {X}_\epsilon ^{s,N}\)-norm of (5.8) is bounded by \(C_3 C\delta \Vert (\widetilde{f}_1,\widetilde{j}_1)-(\widetilde{f}_2,\widetilde{j}_2)\Vert _{\mathcal {X}_\epsilon ^{s,N}}\). If we choose \(\delta \) so that \(C_3 C\delta <\frac{1}{2}\) (and then \(\epsilon _1\) as above), we conclude that T is a contraction on the \(\delta \)-ball of \(\mathcal {X}_\epsilon ^{s,N}\), uniformly for \(\epsilon <\epsilon _1\).
We therefore obtain a solution \((\widetilde{f},\widetilde{j})\) of the equation (5.6) in \(\epsilon <\epsilon _1=\epsilon _1(s,N)\) which is uniformly bounded in the space \(\mathcal {X}_\epsilon ^{s,N}\). Moreover, the solution produced by the contraction mapping principle is independent of the choices of N and s (though the constant \(\epsilon _1\) given by the above arguments may depend on these choices); this follows from the fact that the tensors \(T^k(0,0):=T(T(\cdots (T(0,0))\cdots ))\) (k compositions) are independent of s, N (since s, N do not enter in the definition of T), and hence so is their limit. But note that whether or not a family of tensors is uniformly bounded in \(\mathcal {X}_\epsilon ^{s,N}\) depends on N only in an arbitrarily small neighborhood of \(\epsilon =0\); therefore, we can take \(\epsilon _1=\epsilon _1(s)\) to be independent of N. Furthermore, since the construction of \((\widetilde{f},\widetilde{j})\) is independent of s, we automatically have
\(\bullet \) Higher q-00-regularity. We proceed to show that, in fact, \((\widetilde{f},\widetilde{j})\in \mathcal {X}_\epsilon ^{s,N}\) for all \(s,N\in \mathbb {R}\) for \(\epsilon <\epsilon _1(s_0)\) where we fix \(s_0>\frac{n}{2}+2\) (e.g. \(s_0=\frac{n}{2}+3\)). For any fixed s, we only need to prove this for \(\epsilon \in [\epsilon _1(s),\epsilon _1(s_0))\) in view of (5.9), but we give an argument which directly works in the full range \(\epsilon <\epsilon _1(s_0)\). To wit, consider the localization of \((\widetilde{f},\widetilde{j})\) to a q-00-unit cell on \(\widetilde{\Omega }\), or equivalently, in the notation of (3.2) and (3.6), the localization of \(\hat{\phi }^*(\hat{\chi }\widetilde{f},\hat{\chi }\widetilde{j})\) to a b-00-unit cell on \(\{{\hat{r}}\ge {\hat{R}}_0\}\subset {\hat{X}}\) or of \(\hat{\phi }_\circ ^*(\chi _\circ \widetilde{f},\chi _\circ \widetilde{j})\) to a b-00-unit cell on \(\upbeta _\circ ^{-1}(\mathcal {U})\subset X_\circ \). Identifying such a unit cell with the unit ball in \(\mathbb {R}^n\) via coordinate charts of the bounded geometry perspective, this localization has \({\mathcal {C}}^{4,\alpha }\oplus {\mathcal {C}}^{3,\alpha }\)-norms (with \(\alpha \in (0,s-\frac{n}{2}-2)\)) uniformly bounded by \(C_N\epsilon ^N\) for all N; we use this bound for \(N=1\). The ellipticity of \(\mathcal {L}\) as an unweighted q-00-operator, and this \(C_1\epsilon \)-bound of the localizations of \((\widetilde{f},\widetilde{j})\) implies that for sufficiently small \(\epsilon \) (independently of s) the PDE (5.6) in such charts is a quasilinear Douglis–Nirenberg elliptic system (with uniform dependence on the unit cell); such systems are discussed in Appendix A. By Proposition A.1, this implies \(\mathcal {O}(\epsilon ^N)\) bounds in \({\mathcal {C}}^{k+2,\alpha }\oplus {\mathcal {C}}^{k+1,\alpha }\) on the localizations of \((\widetilde{f},\widetilde{j})\) for all k, as claimed.
\(\bullet \) \(\underline{{ Conormality}\ { in}\ \epsilon .}\) Recalling Definition 4.24, we may run the above contraction mapping argument on the space \({\mathcal {C}}^0([0,\epsilon _1(s_0)),\mathcal {X}_\epsilon ^{s_0,N})\) using Lemma 4.25 and deduce that \((\widetilde{f},\widetilde{j})\) is in fact continuous with values in \(\mathcal {X}_\epsilon ^{s_0,N}\) (for all N); continuity in \(\mathcal {X}_\epsilon ^{s,N}\) for all \(s\ge s_0\) follows from a parameter-dependent variant of Proposition A.1 (the proof of which requires choosing the small parameter \(\delta \) and the elliptic parametrix Q there in a continuous fashion). We conclude that
for all s, N, with continuous dependence on \(\epsilon \in (0,\epsilon _\sharp )\);
We can improve this further: for some \(\epsilon _\sharp <\epsilon _1(s_0)\) depending only on \(s_0\), we have \((\widetilde{f},\widetilde{j})\in \mathcal {A}_1([0,\epsilon _\sharp ),\mathcal {X}_\epsilon ^{s_0,N})\). Indeed, we may run the contraction mapping argument on the space \(\mathcal {A}_1([0,\epsilon _\sharp ),\mathcal {X}_\epsilon ^{s_0,N})\) by using Lemma 4.25 and direct differentiation of the argument of \(\mathcal {L}^{-1}\) in (5.7) along the testing operator \(\mathcal {R}\) (see the paragraph preceding Lemma 4.25). Proposition A.1, with added \({\mathcal {C}}^1\) dependence on the parameter \(\epsilon \), improves this to
We claim that \((\widetilde{f},\widetilde{j})\in \mathcal {A}_k([0,\epsilon _\sharp ),\mathcal {X}_\epsilon ^{s,N})\) for all \(k\in \mathbb {N}\), with \(\epsilon _\sharp \) fixed. Using the testing operator \(\mathcal {R}\in \mathcal {V}_{\textrm{b}}(\widetilde{X})\) again, this follows most easily by direct differentiation of (5.6). We explain this schematically, for simplicity of notation. The PDE (5.6) is of the form \(\mathcal {P}(\epsilon ,u(\epsilon );\Lambda )=0\), so differentiating once along \(\epsilon \partial _\epsilon \) (which is justified in view of (5.10)), a stand-in for \(\mathcal {R}\), gives \(\mathcal {L}_{\epsilon ,u(\epsilon )}(\epsilon \partial _\epsilon u)=-(\epsilon \partial _\epsilon \mathcal {P})(\epsilon ,u(\epsilon ))\) where \(\mathcal {L}_{\epsilon ,u(\epsilon )}\) is the linearization of \(\mathcal {P}\) in the second argument. Differentiating this once more gives an equation of the form \(\mathcal {L}_{\epsilon ,u(\epsilon )}((\epsilon \partial _\epsilon )^2 u)=f\) where f depends only on \(\epsilon ,u(\epsilon ),\partial _\epsilon u(\epsilon )\) and in particular depends continuously on \(\epsilon \). While a priori \((\epsilon \partial _\epsilon )^2 u\) is defined only as a distribution in \(\epsilon \) with values in \(\mathcal {X}_\epsilon ^{s,N}\), the invertibility of \(\mathcal {L}\) (which in our concrete setting is the operator \(\mathcal {L}_{\widetilde{\gamma }+\widetilde{h},\widetilde{k}+\widetilde{q}}\mathcal {L}_{\widetilde{\gamma }+\widetilde{h},\widetilde{k}+\widetilde{q}}^*\)) implies that \((\epsilon \partial _\epsilon )^2 u\in {\mathcal {C}}^0\) after all. (See Remark 4.26.) This can be iterated, giving the desired conormal regularity of \((\widetilde{f},\widetilde{j})\).
By Sobolev embedding (Corollary 2.6), we can now conclude that \((\widetilde{h},\widetilde{q})\in \epsilon ^N{\mathcal {C}}^\infty (\widetilde{X}\setminus \widetilde{K}^\circ ;S^2\widetilde{T}^*\widetilde{X}\oplus S^2\widetilde{T}^*\widetilde{X})\) (for all N) vanishes in \({\hat{r}}<{\hat{R}}_0\) and outside of \([0,1)\times \mathcal {U}\). The proof is complete. \(\quad \square \)
6 Applications
As an immediate application of Theorem 5.2, we explain in Sect. 6.1 how to glue exact Kerr initial data into a given generic initial data set. On the other hand, all known explicit initial data sets are contained in spacetimes with symmetries, and thus the initial data do admit KIDs; we thus demonstrate in Sect. 6.2 how to extend our methods so as to glue any asymptotically flat data set into the initial data of an exact Kerr–de Sitter (or Kerr, or Kerr–anti de Sitter) black hole. Combining these two applications, we can thus produce initial data sets by gluing small Kerr black holes into Kerr (or Kerr–de Sitter, or Kerr–anti de Sitter) initial data. For brevity, we only discuss the Kerr and Kerr–de Sitter cases here, leaving the (purely notational) changes required to treat the Kerr–anti de Sitter case (\(\Lambda <0\)) to the interested reader.
For \(\Lambda \ge 0\) and parameters \(\mathfrak {m}\in \mathbb {R}\), \(\mathfrak {a}\in \mathbb {R}\), the Kerr (for \(\Lambda =0\)) or Kerr–de Sitter (for \(\Lambda >0\)) metric \(g_{\Lambda ,\mathfrak {m},\mathfrak {a}}\) describes a rotating black hole with mass \(\mathfrak {m}\) and specific angular momentum \(\mathfrak {a}\); in the case \(\mathfrak {a}=0\), the metric \(g_{\Lambda ,\mathfrak {m},0}\) reduces to the Schwarzschild or Schwarzschild–(anti) de Sitter metric [Sch16, Ker63, Car68]. In Boyer–Lindquist coordinates, and using the notation from [Hin21b, §1], we have
We call the parameters \((\Lambda ,\mathfrak {m},\mathfrak {a})\) subextremal if \(\mu _{\Lambda ,\mathfrak {m},\mathfrak {a}}\) has two (when \(\Lambda =0\)) or four (when \(\Lambda >0\)) distinct real roots. The metrics \(g_{\Lambda ,\mathfrak {m},\mathfrak {a}}\) are stationary and axisymmetric: the vector fields \(\partial _t\) and \(\partial _\phi \) are Killing vector fields. When \(\mathfrak {a}=0\), the metric is spherically symmetric. We shall henceforth write K(dS) for Kerr and Kerr–de Sitter (depending on \(\Lambda \)).
For a discussion of the geometry of K(dS) spacetimes, we refer the reader to [DR08, Vas13]. We mention here only that the above metric of a subextremal K(dS) black hole can be extended analytically past the larger two roots \(0<r_{\Lambda ,\mathfrak {m},\mathfrak {a}}^e<r_{\Lambda ,\mathfrak {m},\mathfrak {a}}^c\) of \(\mu _{\Lambda ,\mathfrak {m},\mathfrak {a}}(r)\), with the level set \(r=r_{\Lambda ,\mathfrak {m},\mathfrak {a}}^e\) containing the future event horizon.
6.1 Gluing Kerr initial data into a given data set
Fix the parameters \(\mathfrak {m},\mathfrak {a}\in \mathbb {R}\) of a Kerr black hole (not necessarily subextremal). Fix \(r^e>0\) large enough so that the Kerr metric, in Boyer–Lindquist coordinates, is defined on the manifold
We may take \(r^e\) to be any number larger than \(\mathfrak {m}+\sqrt{\mathfrak {m}^2-\mathfrak {a}^2}\) in the subextremal case \(|\mathfrak {a}|<\mathfrak {m}\). We introduce the usual Cartesian spatial coordinates \(x=(x^1,x^2,x^3)\) on \([r^e,\infty )\times \mathbb {S}^2_{\theta ,\phi }\). Fix now the generators \(B_j\in \mathfrak {so}(1,3)\subset \mathbb {R}^{4\times 4}\), \(j=1,2,3\), of Lorentz boosts in the three spatial coordinate directions \(x^1,x^2,x^3\) to be \((B_j)_{p q}=\delta _{0 p}\delta _{j q}+\delta _{0 q}\delta _{j p}\); thus, for example, \(\exp (w_1 B_1)\) is a Lorentz boost in the \(x^1\)-direction with rapidity \(w_1\). Let \(\textbf{B}=(B_1,B_2,B_3)\); given a rapidity vector \(\textbf{w}\in \mathbb {R}^3\), we then write \(t_\textbf{w}=\exp (-\textbf{w}\cdot \textbf{B})^*t\) and \(x_\textbf{w}=\exp (-\textbf{w}\cdot \textbf{B})^*x\) for the time and spatial coordinates of an observer that is boosted with rapidity \(\textbf{w}\). Note that since \(\exp (-\textbf{w}\cdot \textbf{B})\in SO(1,3)\), the pullback of the Minkowski metric \(-\textrm{d}t^2+\textrm{d}x^2\) is equal to \(-\textrm{d}t_\textbf{w}^2+\textrm{d}x_\textbf{w}^2\).
Lemma 6.1
(Boosted Kerr initial data) Let \(\Sigma \subset M\) denote a spacelike hypersurface which is a graph over \([r^e,\infty )\times \mathbb {S}^2\), and which for large r is equal to a level set of \(t_\textbf{w}\). Let \({\hat{K}}\subset \mathbb {R}^3\) denote the closed ball of radius \(r^e\), and identify \(\Sigma \cong \mathbb {R}^3\setminus {\hat{K}}^\circ \) via a diffeomorphism which is given by x, resp. \(x_\textbf{w}\) near \({\hat{K}}\), resp. for large r. Then the initial data at \(\Sigma \) of the metric \(g_{0,\mathfrak {m},\mathfrak {a}}\) on M are \(\mathcal {E}\)-asymptotically flat on \(\mathbb {R}^3\setminus {\hat{K}}^\circ \), where \(\mathcal {E}=\mathbb {N}_0+1\).
See also the closely related discussion in [CD03, Appendices E and F].
Proof of Lemma 6.1
For \(\Lambda =0\), the coefficients \(b_{\Lambda ,\mathfrak {m},\mathfrak {a}}\) and \(c_{\Lambda ,\mathfrak {m},\mathfrak {a}}\) in (6.1) are equal to 1. Let \(\rho =r^{-1}\). One can then check that \(g_{0,\mathfrak {m},\mathfrak {a}}=-\textrm{d}t^2+\textrm{d}x^2+g'\) where \(g'\in \rho {\mathcal {C}}^\infty (\overline{\mathbb {R}^3}\setminus {\hat{K}}^\circ ;S^2\,{}^{{\textrm{sc}}}T^*\overline{\mathbb {R}^4})\) is independent of t; that is, the coefficients of \(g'\) with respect to \(\partial _t\), \(\partial _{x^1}\), \(\partial _{x^2}\), \(\partial _{x^3}\) are smooth functions of \(r^{-1}\) and \(\omega =\frac{x}{|x|}\) and vanish at \(r^{-1}=0\). A fortiori, \(g_{0,\mathfrak {m},\mathfrak {a}}\) is thus for large r of the form
where we write \(\rho {\mathcal {C}}^\infty _\textrm{ext}\) for the space of smooth symmetric 2-tensors whose coefficients, for large r and for \(|\frac{t}{r}|<1\), are smooth functions of \(\frac{t}{r}\), \(r^{-1}\), and \(\omega \). The Lemma now follows from the following two claims:
-
(1)
the initial data of any metric g of the form (6.2) at \(t=0\) are \(\mathcal {E}\)-asymptotically flat for large r;
-
(2)
the class of metrics of the form (6.2) is preserved under pullback along a Lorentz boost.
Regarding claim (1), note that the metric induced by g on \(t^{-1}(0)\) satisfies
for some large R indeed. The future unit normal vector field n at \(t^{-1}(0)\) is \(n=\partial _t+n'\), \(n'\in \rho {\mathcal {C}}^\infty (\overline{\mathbb {R}^3}{\setminus } B(0,R);{}^{{\textrm{sc}}}T\overline{\mathbb {R}^4})\) (i.e. a linear combination of \(\partial _t\), \(\partial _{x^1}\), \(\partial _{x^2}\), \(\partial _{x^3}\) with coefficients of class \(\rho {\mathcal {C}}^\infty \)). But the Christoffel symbols of g (in the coordinates t, x) lie in \(\rho ^2{\mathcal {C}}^\infty _\textrm{ext}\) (the space of smooth functions of \(\frac{t}{r}\), \(r^{-1}\), \(\omega \) in \(r\gg 1\) which vanish quadratically at \(r^{-1}=0\)), since \(\partial _t,\partial _{x^j}:\rho {\mathcal {C}}^\infty _\textrm{ext}\rightarrow \rho ^2{\mathcal {C}}^\infty _\textrm{ext}\). Therefore, the coefficients of the second fundamental form satisfy \(k_{i j}=g(\nabla _i\partial _j,n)\in \rho ^2{\mathcal {C}}^\infty (\overline{\mathbb {R}^3}\setminus B(0,R))\).
Regarding claim (2), note that \(\phi :=\exp (-\textbf{w}\cdot \textbf{B})\) is an invertible linear map on \(\mathbb {R}^4\), and thus extends by continuity to a diffeomorphism of \(\overline{\mathbb {R}^4}\) (cf. [Mel96, Lemma 5.1.1]) preserving the boundary at infinity. As a consequence, push-forward along \(\phi \) or \(\phi ^{-1}\) maps \(\mathcal {V}_{\textrm{b}}(\overline{\mathbb {R}^4})\) into itself, and also maps functions of class \(\rho {\mathcal {C}}^\infty _\textrm{ext}\) into itself, and therefore \(\phi ^*\) preserves symmetric 2-tensors of class \(\rho {\mathcal {C}}^\infty _\textrm{ext}\). Furthermore, as already noted, we have \(\phi ^*(-\textrm{d}t^2+\textrm{d}x^2)=-\textrm{d}t^2+\textrm{d}x^2\). This completes the proof that the pullback of (6.2) under \(\phi ^*\) is again of the form (6.2). \(\quad \square \)
Therefore, we may apply Theorem 5.2 to glue a Kerr black hole (whether subextremal or not) into the neighborhood of a point \(\mathfrak {p}\) in any given initial data set \((X,\gamma ,k)\) (with or without cosmological constant), provided X does not have KIDs in a neighborhood of \(\mathfrak {p}\).
6.2 Gluing a given data set into subextremal Kerr and Kerr–(anti) de Sitter initial data
Fix \(\Lambda \ge 0\) and the parameters \(\mathfrak {m},\mathfrak {a}\in \mathbb {R}\) of a subextremal K(dS) black hole. We write \(r^e=r_{\Lambda ,\mathfrak {m},\mathfrak {a}}^e\) for the radius of the event horizon. For sufficiently small \(\eta >0\), the K(dS) metric can then be analytically extended to the manifold
where \(\mathfrak {t}=t-T(r)\) and \(\varphi =\phi -\Phi (r)\) for suitable functions \(T,\Phi \); see e.g. [Hin21b, Equation (1.5)] in the Kerr–de Sitter case. We require that the level sets of \(\mathfrak {t}\) are spacelike and transversal to the future event horizon. For \(\delta \in (0,2\eta ]\), let
and denote by
the initial data on \(X_\delta \) induced by \(g_{\Lambda ,\mathfrak {m},\mathfrak {a}}\). Suppose we are given an \(\mathcal {E}\)-asymptotically flat initial data set \((\hat{\gamma },{\hat{k}})\) on \(\mathbb {R}^3\setminus {\hat{K}}^\circ \) as in Definition 4.4, which satisfies the constraint equations,. Since \((M,g_{\Lambda ,\mathfrak {m},\mathfrak {a}})\) has global Killing vector fields, the data set \((X_\delta ,\gamma ,k)\) does have KIDs, i.e. the kernel of \(L_{\gamma ,k}^*\) is nontrivial on any nonempty open subset of \(X_\delta \) [Mon75]. Therefore, Theorem 5.2 is not directly applicable. Note however that in the spacetime evolving from a putative gluing \((\widetilde{\gamma },\widetilde{k})\) of \((\hat{\gamma },{\hat{k}})\) into a neighborhood of a point \(\mathfrak {p}\in X_\delta \), the exterior domain is independent of the glued data in \(r<r^e-\eta \), provided \(\epsilon >0\) is small enough, in view of finite speed of propagation for solutions of the Einstein field equations. Thus, we shall relax the gluing requirements by tolerating violations of the constraint equations in \(r<r^e-\eta \):
Theorem 6.2
(Gluing into K(dS)) Let \((\hat{\gamma },{\hat{k}})\) be \(\mathcal {E}\)-asymptotically flat data on \(\mathbb {R}^3\setminus {\hat{K}}^\circ \) with \(P(\hat{\gamma },{\hat{k}};0)=0\), and let \((\gamma ,k)\) be K(dS) data on \(X_{2\eta }\) as above. Let \(\mathfrak {p}\in X_0\), and let \(\mathcal {U}^\circ \subset X_{2\eta }\) be a smoothly bounded connected open set containing \(\mathfrak {p}\) with compact closure \(\mathcal {U}=\overline{\mathcal {U}^\circ }\) disjoint from \(\{r=r^e-2\eta \}\), and so that \(\mathcal {U}^\circ \cap (X_{2\eta }\setminus X_\eta )\ne \emptyset \). Then the conclusions of Theorem 5.2 hold for \(n=3\) and \(\widetilde{X}=\widetilde{X}_\eta \).
See Fig. 8.
Illustration of Theorem 6.2 and its proof; shown is a part of \(X_{2\eta }\). We permit violations of the constraint equations in \(r<r^e-\eta \); in the gluing procedure, which modifies the data \((\gamma ,k)\) in \(\mathcal {U}\), we add suitable error terms (which we may take to be supported in the red region) to the constraints into \(X_{2\eta }\setminus X_\eta \)
Remark 6.3
(Mass) If we start with Kerr data \((\gamma ,k)\), then the glued initial data \((\widetilde{\gamma },\widetilde{k})\), for all \(\epsilon >0\) (in the notation of Theorem 5.2), agree with \((\gamma ,k)\) for large radii, and therefore their ADM mass is always equal to \(\mathfrak {m}\). Note however that \((\widetilde{\gamma },\widetilde{k})\) differs from \((\gamma ,k)\) near the Kerr event horizon, and thus Bartnik-type notions of (quasilocal) mass (see e.g. [Bar89, Bar97, AJ19]) which depend on properties of minimal surfaces or MOTS depend on \(\epsilon \). Heuristically, the proof below thus takes mass away from \((\gamma ,k)\) and uses it to glue in the data set \((\hat{\gamma },{\hat{k}})\).
Remark 6.4
(Other gluing procedures) Another possible avenue for accounting for the cokernel of \(L_{\gamma ,k}\) is to modulate the parameters of the K(dS) black hole in the gluing procedure, as has been done in related contexts for \(\Lambda =0\) in [Cor00, CS06], and for \(\Lambda >0\) in [CP08, Cor13]; we shall not pursue this possibility in this paper. Conjecturally, in such a gluing procedure in the case \(\Lambda =0\), in which one moreover leaves \((\gamma ,k)\) unchanged near the event horizon, the ADM mass of the glued data set is \(\mathfrak {m}+\epsilon \hat{\mathfrak {m}}+o(\epsilon )\) where \(\hat{\mathfrak {m}}\) is the ADM mass of \((\hat{\gamma },{\hat{k}})\).
Proof of Theorem 6.2
The main task is to find a suitable replacement for Proposition 4.16. Let \(\mathcal {U}_\circ =\upbeta _\circ ^*(\mathcal {U})\) where \(\mathcal {U}=\overline{\mathcal {U}^\circ }\). Let \(N<\infty \) denote the dimension of the kernel \(K^*\) of \(L_{\gamma ,k}^*\) on \({\mathcal {C}}^\infty (\mathcal {U}^\circ )\). We claim that we may select \((f_i^\sharp ,j_i^\sharp )\in {\mathcal {C}}^\infty _{{\textrm{c}}}(\mathcal {U}^\circ {\setminus } X_\eta ;\underline{\mathbb {R}}{}\oplus T^* X_{2\eta })\), \(i=1,\ldots ,N\), so that
restricts to a linear isomorphism \(\mathcal {K}^*\cong \mathbb {R}^N\) where \(\mathcal {K}^*\) is the kernel of \(\mathcal {L}_{\circ ,\gamma ,k}^*\) on the domain of the map (4.22). To prove this, note first that elements of \(\mathcal {K}^*\) are rescaled versions of restrictions of pullbacks of elements of \(K^*\) to \(\mathcal {U}_\circ \) along \(\upbeta _\circ \). But by Lemma 4.3, elements of \(K^*\) are uniquely determined by their restriction to any non-empty open subset of \(\mathcal {U}^\circ \); thus, if \((f_i^*,j_i^*)\), \(i=1,\ldots ,N\), is a basis of \(\mathcal {K}^*\), then the restrictions of \((f_i^*,j_i^*)\) to \(\mathcal {U}^\circ \setminus X_\eta \) are linearly independent—a fortiori also as distributions on \(\mathcal {U}^\circ \setminus X_\eta \). This implies the existence of \((f_i^\sharp ,j_i^\sharp )\in {\mathcal {C}}^\infty _{{\textrm{c}}}(\mathcal {U}^\circ {\setminus } X_\eta )\) with \(\langle (f_i^*,j_i^*),(f_k^\sharp ,j_k^\sharp )\rangle =\delta _{i k}\) for \(1\le i,k\le N\).
Define now \(E_1:\mathbb {R}^N\rightarrow {\mathcal {C}}^\infty _{{\textrm{c}}}(\mathcal {U}^\circ {\setminus } X_\eta ;\underline{\mathbb {R}}{}\oplus T^*X_{2\eta })\) by \((c_i)_{i=1,\ldots ,N}\mapsto \sum _{i=1}^N c_i(f_i^\sharp ,j_i^\sharp )\). Since \({\text {ran}}E_1\) spans the cokernel of \(\mathcal {L}_{\circ ,\gamma ,k}\mathcal {L}_{\circ ,\gamma ,k}^*\), we conclude that the map
is invertible. Regarding part (2) of Proposition 4.16, we may now uniquely solve, for any given polyhomogeneous (f, j), the systemFootnote 28
for \(c\in \mathbb {R}^N\) and polyhomogeneous (h, q). Since \({\text {supp}}E_1(c)\subset \mathcal {U}^\circ {\setminus } X_\eta \), this implies that \(L_{\circ ,\gamma ,k}(h,q)=(f,j)\) on \(X_\eta \).
We similarly modify Proposition 4.23: we may take the same \((f_i^\sharp ,j_i^\sharp )\), regarded as \(\epsilon \)-independent tensors on \(\widetilde{X}'\), and consider the map
between the direct sums of the spaces in (4.33) with \(\mathbb {R}^N\). This can be shown to be invertible with uniformly bounded inverse for all sufficiently small \(\epsilon >0\) by following the proof of Proposition 4.23: for the normal operator argument at \({\hat{X}}_{2\eta }\), one works only with \(\mathcal {L}_{\widetilde{\gamma },\widetilde{k}}\mathcal {L}_{\widetilde{\gamma },\widetilde{k}}^*\) and obtains an improvement of the weight at \({\hat{X}}_{2\eta }\), while the normal operator argument at \((X_{2\eta })_\circ \) provides the estimate for the \(\mathbb {R}^N\)-summand, and gives the improvement of the weight at \((X_{2\eta })_\circ \) as before. Lemma 4.25 remains valid, mutatis mutandis, for the map (6.3).
The arguments of Sect. 5 now apply with minor modifications; for instance, instead of (5.6), one solves the system
where \(\widetilde{c}=\widetilde{c}(\epsilon )\). We leave the details to the reader. \(\quad \square \)
If we take \((\hat{\gamma },{\hat{k}})\) to be (boosted) Kerr initial data, then Theorem 6.2 produces initial data sets in which a small mass Kerr black hole is glued into a given K(dS) initial data set. These are natural initial data sets which (depending on the choice of location and boost parameter of the small black hole, which determine the geodesic along which the small Kerr black hole moves) conjecturally evolve into spacetimes describing extreme mass ratio inspirals; see Sect. 1.4.
Data Availability
Data sharing is not applicable to this article as no datasets were generated or analysed during the current study.
Notes
The \(\epsilon ^{-1}\)-scaling of the second fundamental form in \((\gamma _\epsilon ,k_\epsilon )\approx (\epsilon ^2{\hat{\gamma }},\epsilon ^{-1}\epsilon ^2{\hat{k}})\) arises from the scaling properties of (1.1), see Lemma 4.19; heuristically, it follows from the fact that the future unit normal for the embedding of \((X,\gamma ,k)\) into the spacetime \(M=\mathbb {R}_t\times X\) is, near \(\mathfrak {p}\), scaled by \(\epsilon ^{-1}\) relative to the future unit normal of the embedding of \(({\hat{X}},{\hat{\gamma }},{\hat{k}})\), \({\hat{X}}=\mathbb {R}_{{\hat{x}}}^n\), into \(\mathbb {R}_{{\hat{t}}}\times {\hat{X}}\); that is, the time scale \({\hat{t}}\) of the small black hole is related to the time scale t of the ambient spacetime M by \({\hat{t}}=\frac{t}{\epsilon }\).
One can extend Theorem 1.1 to simultaneously glue any finite number of asymptotically flat data sets into neighborhoods of a matching number of distinct points in X, under a local genericity condition near each of these points. One may also apply Theorem 1.1 to a data set \((X_\epsilon ,\gamma _\epsilon ,k_\epsilon )\), for some fixed small \(\epsilon >0\), in place of \((X,\gamma ,k)\), or more generally iterate a combination of these two procedures finitely many times.
Later on, we use the notation \(\mathcal {A}^\alpha (\overline{\mathbb {R}^n})\) for the space of such functions.
We make this more precise using the radial compactification \(\overline{\mathbb {R}^n}\) of \(\mathbb {R}^n\) in Sect. 2.1.
That is, the exponents \(z_\ell \) in the expansion (1.3) for \({\hat{\gamma }}_{{\hat{i}}{\hat{j}}}\) satisfy \({\text {Re}}z_\ell \ge \delta \) (with strict inequality if \(k_\ell \ge 1\)), except for a single term \((z_0,k_0)=(0,0)\) with coefficient \(u_0=\delta _{i j}\). The definition used in the main part of the paper records the set \(\{(z_\ell ,k_\ell )\}\), see Definition 4.4.
If one is satisfied with less than full polyhomogeneity of \((\widetilde{\gamma },\widetilde{k})\), one skip all, or part, of this step.
Indeed, the scalar curvature of \(\gamma ^{(\epsilon ,\eta )}\) is schematically of the form \(\sum _{j=1}^2\partial ^j(\chi (\frac{|x|}{\eta }))\partial ^{2-j}(\gamma (x)-\hat{\gamma }(\frac{x}{\epsilon }))\), and thus—using that \(\partial ^k(\gamma _{i j}(x)-\delta _{i j})=\mathcal {O}(|x|^{2-k})\) and \(\partial ^k(\hat{\gamma }_{i j}(\frac{x}{\epsilon })-\delta _{i j})\lesssim |\frac{x}{\epsilon }|^{-\delta -k}\) for \(k=0,1,2\)—of size \(\sum _{j=1}^2\eta ^{-j}(\eta ^{2-(2-j)}+(\frac{\eta }{\epsilon })^{-\delta -(2-j)})\simeq 1+\epsilon ^{1+\delta }\eta ^{-2-\delta }\) for \(|x|\simeq \eta \) indeed.
Passing to the rescaled coordinates \({\tilde{x}}=\frac{x}{\eta }\) (and regarding \(\gamma _{i j}(\eta {\tilde{x}})\) and \(\hat{\gamma }_{{\hat{i}}{\hat{j}}}(\eta \epsilon ^{-1}{\tilde{x}})\) as the coefficients of a metric in the frame \(\textrm{d}{\tilde{x}}^1,\ldots ,\textrm{d}{\tilde{x}}^n\)) does not eliminate this issue. The violation of the constraints in the gluing region \({\tilde{U}}=\{1\le |{\tilde{x}}|\le 2\}\) is now of size \(\mathcal {O}(\eta ^2+(\frac{\epsilon }{\eta })^\delta )\), which is small for \(\epsilon \ll \eta \ll 1\). However, the metric \(\gamma ^{(\epsilon ,\eta )}(\eta {\tilde{x}})\) converges to the Euclidean metric in \({\tilde{U}}\) as \(\eta \rightarrow 0\), and hence one faces an approximate cokernel of the linearized constraints map when attempting to deform \(\gamma ^{(\epsilon ,\eta )}\) to a solution of the constraints (cf. [Cor00]). If one relaxed the localization requirements of the deformation and attempted to take advantage of the local genericity of X in a fixed neighborhood of \(\mathfrak {p}\), one would have to work on a domain with diameter \(\simeq \eta ^{-1}\rightarrow \infty \) in \({\tilde{x}}\)-coordinates, which would lead to different analytic subtleties.
Hölder spaces would work just as well; we use Sobolev spaces for convenience only.
This map combines the two symbols alluded to in Sect. 1.3 into a single object.
The sign convention is such that increasing any one of \(\alpha ,\beta ,s\) gives a smaller space.
We repurpose the notation previously used in the b-setting.
The definition (2.11), for a general symbol \(q=q(x,y;x^2\hat{\xi },x^2\hat{\eta })\) (with compact support in (x, y)) of order k, is a quantization map \({{\,\textrm{Op}\,}}_{0 0}(q)\) for 00-pseudodifferential operators of order k; adding to the space of all such quantizations the space of residual operators—those having smooth Schwartz kernels on \(X\times X\) which vanish to infinite order at \(\partial X\times X\) and \(X\times \partial X\)—gives the full space \(\Psi _{0 0}^k(X)\). Starting from the perspective of [MM87], Schwartz kernels of elements of \(\Psi _{0 0}^k(X)\) are conormal distributions on the ‘00-double space’ \(X^2_{0 0}:=[X^2_0;\partial {\text {diag}}_0]\), where \(X^2_0=[X^2;\partial {\text {diag}}_X]\) is the 0-double space, with \({\text {diag}}_X\subset X^2\) the diagonal, and \({\text {diag}}_0\subset X^2_0\) is the lift of \({\text {diag}}_X\). These conormal distributions are required to vanish to infinite order at all boundary hypersurfaces of \(X^2_{0 0}\) except the front face of \(X^2_{0 0}\). From this perspective, the necessity of localizing in the weak manner provided by the factors of \(\phi \) is easily seen: the supports of the differentials of these cutoffs are disjoint from the 00-front face (and indeed instead intersect the 0-front face away from the 00-front face and the left and right boundaries).
The somewhat cumbersome notation involving hats and circles is used here for the sake of consistency with later sections.
The fact that \(L_{\hat{\gamma },{\hat{k}}}{\hat{w}}\) is unweighted can be re-interpreted as the statement that \(L_{\hat{\gamma },{\hat{k}}}\) is a Douglis–Nirenberg system of weighted b-differential operators not only in the differential order sense, but also in the sense of weights, in that the (i, j)-block has weight \(\rho _\circ ^{a_j+b_i}\) where \(a_1=2\), \(a_2=1\), \(b_1=0\), \(b_2=0\). We opt for the explicit weight \({\hat{w}}\) for clarity here, however.
Recall that we are using b-densities throughout. With respect to the Euclidean volume density, the assumption would read \(({\hat{f}}^*,{\hat{j}}^*)\in H_{{\textrm{b}}}^{-\infty ,\frac{n}{2}+\alpha ^*}\), with \(\frac{n}{2}+\alpha ^*>\frac{n}{2}\).
Using the information about the boundary spectrum of \(\hat{\underline{L}}^*\), one could improve the conormality of \(({\hat{f}}^*,{\hat{j}}^*)\) at infinity to rapid vanishing of \(({\hat{f}}^*,{\hat{j}}^*)\), whence the vanishing of \({\hat{f}}^*,{\hat{j}}^*\) near infinity would follow from a unique continuation result; this is the path taken in [CS06, Proposition 3.1] We present a different argument here which directly exploits elliptic estimates and the overdetermined nature of \(L_{\hat{\gamma },{\hat{k}}}^*\).
Here, the b-Sobolev norm on \(\rho _\circ ^{-1}([0,\rho _1])\), for \(\rho _1>0\), is defined via differentiation along \(\rho _\circ \partial _{\rho _\circ }\) and spherical vector fields; that is, we test with all smooth vector fields near the ‘artificial boundary’ at \(\rho _\circ =\rho _1\). The usage of the notation \({\bar{H}}_{{\textrm{b}}}\) instead of \(H_{{\textrm{b}}}\) stresses this extendible character of the function space in the terminology of Hörmander [Hör07, Appendix B].
One can in principle compute the set \(\hat{\mathcal {S}}\) produced in the proof explicitly (the main ingredient being the computation of the set of poles of the operator \(\mathcal {N}(\alpha _\circ ,\lambda )^{-1}\) in (4.17) below), though we shall not present the details here as the result depends on \(\alpha _\circ \) in a rather complicated manner. We leave open the interesting problem of finding the smallest possible \(\hat{\mathcal {S}}\); we conjecture that one can ensure \(\hat{\mathcal {S}}\subset -i(\mathbb {N}_0+n-2)\times \mathbb {N}_0\). See also Remark 4.18.
This lift is the differential operator on \(X_\circ \) given by the original operator \(L_{\gamma ,k}\) acting on smooth compactly supported sections of \(\upbeta _\circ ^*(S^2 T^*X\oplus S^2 T^*X)\) over the interior \((X_\circ )^\circ \) of \(X_\circ \).
This sign switch arises naturally from the total gluing space perspective, where it is an instance of the typical relationship between small ends of cones (\(\partial X_\circ \) from the perspective of \(X_\circ \)) which emanate from the large end of a cone (\(\partial {\hat{X}}\) from the perspective of \({\hat{X}}\)).
Paralleling Remark 4.11, this gives a right inverse of \(L_{\circ ,\gamma ,k}\) which is bounded as a map
$$\begin{aligned}&\hat{\rho }^{\hat{\alpha }-2}\rho _2^{\alpha _2-4}e^{-\beta /\rho _2}H_{{\textrm{b}},0 0}^{s-2}(\mathcal {U}_\circ ) \oplus \hat{\rho }^{\hat{\alpha }-2}\rho _2^{\alpha _2-2}e^{-\beta /\rho _2}H_{{\textrm{b}},0 0}^{s-1}(\mathcal {U}_\circ ;\upbeta _\circ ^*T^*_\mathcal {U}X) \\&\quad \rightarrow \hat{\rho }^{\hat{\alpha }}\rho _2^{\alpha _2} e^{-\beta /\rho _2}H_{{\textrm{b}},0 0}^s(\mathcal {U}_\circ ;\upbeta _\circ ^*S^2 T^*_\mathcal {U}X) \oplus \hat{\rho }^{\hat{\alpha }-1}\rho _2^{\alpha _2}e^{-\beta /\rho _2}H_{{\textrm{b}},0 0}^s(\mathcal {U}_\circ ;\upbeta _\circ ^*S^2 T^*_\mathcal {U}X). \end{aligned}$$If \(-(\hat{\alpha }+n-2)>1\), we could even conclude that \(({\tilde{f}}^*,{\tilde{j}}^*)\) vanishes to infinite order.
The tensor \(P(\widetilde{\gamma },\widetilde{k};\Lambda )\) on \(\widetilde{X}\setminus \widetilde{K}^\circ \) is equal to \(P(\widetilde{\gamma }|_{\widetilde{X}_\epsilon {\setminus }\widetilde{K}^\circ },\widetilde{k}|_{\widetilde{X}_\epsilon {\setminus }\widetilde{K}^\circ };\Lambda )\) on \(\widetilde{X}_\epsilon {\setminus }\widetilde{K}^\circ \).
This implies the existence of a right inverse of \(L_{\widetilde{\gamma },\widetilde{k}}\) which is uniformly bounded as a map
$$\begin{aligned}&\hat{\rho }^{\hat{\alpha }-2}\rho _\circ ^{\alpha _\circ }\tilde{\rho }_2^{\alpha _2-4}e^{-\beta /\tilde{\rho }_2}H_{{\textrm{q}},0 0,\epsilon }^{s-2}(\widetilde{\Omega }_\epsilon ) \oplus \hat{\rho }^{\hat{\alpha }-2}\rho _\circ ^{\alpha _\circ }\tilde{\rho }_2^{\alpha _2-2}e^{-\beta /\tilde{\rho }_2}H_{{\textrm{q}},0 0,\epsilon }^{s-1}(\widetilde{\Omega }_\epsilon ;\widetilde{T}^*\widetilde{X}) \\&\quad \rightarrow \hat{\rho }^{\hat{\alpha }}\rho _\circ ^{\alpha _\circ }\tilde{\rho }_2^{\alpha _2}e^{-\beta /\tilde{\rho }_2}H_{{\textrm{q}},0 0,\epsilon }^s(\widetilde{\Omega }_\epsilon ;S^2\widetilde{T}^*\widetilde{X}) \oplus \hat{\rho }^{\hat{\alpha }-1}\rho _\circ ^{\alpha _\circ }\tilde{\rho }_2^{\alpha _2}e^{-\beta /\tilde{\rho }_2}H_{{\textrm{q}},0 0,\epsilon }^s(\widetilde{\Omega }_\epsilon ;S^2\widetilde{T}^*\widetilde{X}). \end{aligned}$$For all sufficiently small \(\epsilon >0\), the sum \(\widetilde{\gamma }+\widetilde{h}\) is a positive definite section of \(S^2\widetilde{T}^*\widetilde{X}\), and therefore the left hand side is well-defined.
Since \(\mathcal {F}_\ell \ne \emptyset \), cf. (5.3), this maximal number is finite for any fixed \(\ell \).
The choice \(0\in \mathbb {R}^N\) of the right hand side of the equation involving \(E_2\) is arbitrary.
References
Aretakis, S., Czimek, S., Rodnianski, I.: The characteristic gluing problem for the Einstein equations and applications. Preprint arXiv:2107.02441 (2021)
Aretakis, S., Czimek, S., Rodnianski, I.: The characteristic gluing problem for the Einstein vacuum equations. Linear and non-linear analysis. Preprint, arXiv:2107.02449, (2021)
Aretakis, S., Czimek, S., Rodnianski, I.: Characteristic gluing to the Kerr family and application to spacelike gluing. arXiv preprint arXiv:2107.02456, (2021)
Anderson, M.T., Jauregui, J.L.: Embeddings, immersions and the bartnik quasi-local mass conjectures. Annales Henri Poincaré 20(5), 1651–1698 (2019). https://doi.org/10.1007/s00023-019-00786-3
Bartnik, R.: New definition of quasilocal mass. Phys. Rev. Lett. 62, 2346–2348 (1989). https://doi.org/10.1103/PhysRevLett.62.2346
Bartnik, R.: Energy in general relativity. In: Tsing Hua lectures on geometry & analysis (Hsinchu. 1990–1991), pp. 5–27. Int. Press, Cambridge, MA (1997)
Brandt, S., Bruegmann, B.: A Simple construction of initial data for multiple black holes. Phys. Rev. Lett. 78, 3606–3609 (1997). https://doi.org/10.1103/PhysRevLett.78.3606. arXiv: gr-qc/9703066
Beig, R., Chruściel, P.T.: Killing vectors in asymptotically flat space-times.I. Asymptotically translational Killing vectors and the rigid positive energy theorem. Journal of Mathematical Physics 37(4), 1939–1961 (1996)
Bieri, L., Chruściel, P.T.: Future-complete null hypersurfaces, interior gluings, and the trautman-bondi mass. Proceedings of the Harvard Center for Mathematical Sciences and Applications 1, 1–31 (2017)
Branson, T., Čap, A., Eastwood, M., Rod Gover, A.: Prolongations of geometric overdetermined systems. International Journal of Mathematics, 17(06):641–664, (2006)
Beig, R., Chruściel, P.T., Schoen, R.: KIDs are non-generic. Annales Henri Poincaré 6(1), 155–194 (2005)
Brill, D.R., Lindquist, R.W.: Interaction energy in geometrostatics. Physical Review 131(1), 471 (1963)
Carter, B.: Hamilton-Jacobi and Schrödinger separable solutions of Einstein’s equations. Communications in Mathematical Physics 10(4), 280–310 (1968)
Carlotto, A.: The general relativistic constraint equations. Living Reviews in Relativity 24(1), 1–170 (2021)
Choquet-Bruhat, Y.: Théorème d’existence pour certains systèmes d’équations aux dérivées partielles non linéaires. Acta mathematica 88(1), 141–225 (1952)
Choquet-Bruhat, Y., Geroch, R.: Global aspects of the Cauchy problem in general relativity. Communications in Mathematical Physics 14(4), 329–335 (1969)
Chruściel, P.T., Corvino, J., Isenberg, J.: Construction of n-body initial data sets in general relativity. Communications in Mathematical Physics 304(3), 637–647 (2011). https://doi.org/10.1007/s00220-011-1244-7
Chruściel, P.T., Delay, E.: Existence of non-trivial, vacuum, asymptotically simple spacetimes. Classical and Quantum Gravity 19(9), L71 (2002)
Chruściel, P.T., Delay, E.: On mapping properties of the general relativistic constraints operator in weighted function spaces, with applications. Mem. Soc. Math. France 94, 1–103 (2003)
Corvino, J., Huang, L.-H.: Localized deformation for initial data sets with the dominant energy condition. Calculus of Variations and Partial Differential Equations 59(1), 1–43 (2020)
Chruściel, P.T., Mazzeo, R.: On ‘many-black-hole’ vacuum spacetimes. Classical and Quantum Gravity 20(4), 729 (2003)
Chruściel, P.T., Maerten, D.: Killing vectors in asymptotically flat space-times. II. Asymptotically translational Killing vectors and the rigid positive energy theorem in higher dimensions. Journal of Mathematical Physics 47(2), 022502 (2006)
Corvino, J.: Scalar curvature deformation and a gluing construction for the Einstein constraint equations. Comm. Math. Phys. 214(1), 137–189 (2000)
Cortier, J.: Gluing construction of initial data with Kerr-de Sitter ends. Ann. Henri Poincaré 14(5), 1109–1134 (2013)
Chruściel, P.T., Pollack, D.: Singular Yamabe metrics and initial data with exactly Kottler-Schwarzschild-de Sitter ends. Ann. Henri Poincaré 9(4), 639–654 (2008)
Czimek, S., Rodnianski, I.: Obstruction-free gluing for the Einstein equations. Preprint, arXiv:2210.09663, (2022)
Corvino, J., Schoen, R.M.: On the asymptotics for the vacuum Einstein constraint equations. J. Differential Geom. 73(2), 185–217 (2006)
Carlotto, A., Schoen, R.: Localizing solutions of the Einstein constraint equations. Inventiones mathematicae 205(3), 559–615 (2016)
Delay, E.: Smooth compactly supported solutions of some underdetermined elliptic PDE, with gluing applications. Communications in Partial Differential Equations 37(10), 1689–1716 (2012)
Douglis, A., Nirenberg, L.: Interior estimates for elliptic systems of partial differential equations. Commun. Pure Appl. Math. 8(4), 503–538 (1955). https://doi.org/10.1002/cpa.3160080406
Dafermos, M., Rodnianski, I.: Lectures on black holes and linear waves. Evolution equations, Clay Mathematics Proceedings 17, 97–205 (2008)
Eichmair, M., Huang, L.-H., Lee, D.A., Schoen, R.: The spacetime positive mass theorem in dimensions less than eight. Journal of the European Mathematical Society 18(1), 83–121 (2015)
Eichmair, M.: The Jang equation reduction of the spacetime positive energy theorem in dimensions less than eight. Communications in Mathematical Physics 319(3), 575–593 (2013)
Fischer, A.E., Marsden, J.E.: Linearization stability of the Einstein equations. Bulletin of the American Mathematical Society 79(5), 997–1003 (1973)
Fischer, A.E., Marsden, J.E.: Deformations of the scalar curvature. Duke Mathematical Journal 42(3), 519–547 (1975)
Hintz, P.: Black hole gluing in de Sitter space. Communications in Partial Differential Equations 46(7), 1280–1318 (2021). https://doi.org/10.1080/03605302.2020.1871368
Hintz, P.: Mode stability and shallow quasinormal modes of Kerr–de Sitter black holes away from extremality. Preprint, arXiv:2112.14431, (2021)
Huang, L.-H., Lee, D.A.: Equality in the spacetime positive mass theorem. Communications in Mathematical Physics 376(3), 2379–2407 (2020)
Hörmander, L.: The analysis of linear partial differential operators. III. Classics in Mathematics. Springer, Berlin (2007)
Hintz, P., Vasy, A.: The global non-linear stability of the Kerr-de Sitter family of black holes. Acta mathematica 220, 1–206 (2018). https://doi.org/10.4310/acta.2018.v220.n1.a1
Hirsch, S., Zhang, Y.: The case of equality for the spacetime positive mass theorem. Preprint, arXiv:2203.01984, (2022)
Isenberg, J., Mazzeo, R., Pollack, D.: Gluing and wormholes for the Einstein constraint equations. Communications in Mathematical Physics 231(3), 529–568 (2002)
Kerr, R.P.: Gravitational field of a spinning mass as an example of algebraically special metrics. Physical Review Letters 11(5), 237 (1963)
Kottke, C., Singer, M. (2022) Partial compactification of monopoles and metric asymptotics. Memoirs of the AMS, to appear
Kastor, D., Traschen, J.: Cosmological multi-black-hole solutions. Physical Review D 47(12), 5370 (1993)
Lichnerowicz, A.: L’intégration des équations de la gravitation relativiste et le probleme des n corps. J. Math. Pures Appl. 23(37), 4 (1944)
Lindquist, R.W.: Initial-value problem on Einstein-Rosen manifolds. Journal of Mathematical Physics 4(7), 938–950 (1963)
Lohkamp, J.: The higher dimensional positive mass theorem II. Preprint, arXiv:1612.07505, (2016)
Luk, J.: On the Local Existence for the Characteristic Initial Value Problem in General Relativity. International Mathematics Research Notices 2012(20), 4625–4678 (2011). https://doi.org/10.1093/imrn/rnr201
Majumdar, S.D.: A class of exact solutions of Einstein’s field equations. Physical Review 72(5), 390 (1947)
Mazzeo, R.R.: Personal communication
Mazzeo, R.: Elliptic theory of differential edge operators I. Communications in Partial Differential Equations 16(10), 1615–1664 (1991). https://doi.org/10.1080/03605309108820815
McDonald, P.T.: The Laplacian for spaces with cone-like singularities. PhD thesis, Massachusetts Institute of Technology, (1990)
Melrose, R.B.: The Atiyah-Patodi-Singer index theorem, volume 4 of Research Notes in Mathematics. A K Peters, Ltd., Wellesley, MA, (1993). https://doi.org/10.1016/0377-0257(93)80040-i
Melrose, R.B. Spectral and scattering theory for the Laplacian on asymptotically Euclidian spaces. In Spectral and scattering theory (Sanda, 1992), volume 161 of Lecture Notes in Pure and Appl. Math., pages 85–130. Dekker, New York, (1994)
Melrose, R.B.: Differential analysis on manifolds with corners. Book, in preparation, available online, 1996. URL: https://math.mit.edu/~rbm/daomwcf.ps
Misner, C.W.: The method of images in geometrostatics. Annals of Physics 24, 102–117 (1963)
Mazzeo, R.R., Melrose, R.B.: Meromorphic extension of the resolvent on complete spaces with asymptotically constant negative curvature. J. Funct. Anal. 75(2), 260–310 (1987). https://doi.org/10.1016/0022-1236(87)90097-8
Mazzeo, R.R., Melrose, R.B.: The adiabatic limit, Hodge cohomology and Leray’s spectral sequence for a fibration. Journal of Differential Geometry 31, 185–213 (1990)
Mazzeo, R.R., Melrose, R.B.: Analytic surgery and the eta invariant. Geometric & Functional Analysis 5(1), 14–75 (1995). https://doi.org/10.1007/BF01928215
Moncrief, V.: Spacetime symmetries and linearization stability of the Einstein equations. I. Journal of Mathematical Physics 16(3), 493–498 (1975). https://doi.org/10.1063/1.522572
Melrose, R.B., Singer, M.: Gluing Kähler–Einstein metrics of constant curvature—a worked example. Unpublished work, (2011)
Papapetrou, A.: A Static Solution of the Equations of the Gravitational Field for an Arbitary Charge-Distribution. Proceedings of the Royal Irish Academy. Section A: Mathematical and Physical Sciences, 51:191–204, (1945). URL: http://www.jstor.org/stable/20488481
Schwarzschild, K.: Über das Gravitationsfeld eines Massenpunktes nach der Einsteinschen Theorie. Sitzungsberichte der Königlich Preußischen Akademie der Wissenschaften (Berlin), pages 189–196, (1916)
Shubin, M.A.: Pseudodifferential operators and spectral theory, volume 200. Springer, (1987)
Shubin, M.A.: Spectral theory of elliptic operators on non-compact manifolds. Astérisque 207, 37–108 (1992)
Schroers, B.J., Singer, M.A.: \(D_k\) gravitational instantons as superpositions of Atiyah-Hitchin and Taub-NUT geometries. The Quarterly Journal of Mathematics 72(1–2), 277–337 (2021)
Schoen, R., Yau, S.-T.: On the proof of the positive mass conjecture in general relativity. Communications in Mathematical Physics 65(1), 45–76 (1979)
Schoen, R., Yau, S.-T.: Positive scalar curvature and minimal hypersurface singularities. Preprint, arXiv:1704.05490, (2017)
Taylor, M.E.: Partial differential equations III. Nonlinear equations, volume 117 of Applied Mathematical Sciences. Springer, New York, second edition, (2011). https://doi.org/10.1007/978-1-4419-7049-7
Vasy, A.: Microlocal analysis of asymptotically hyperbolic and Kerr-de Sitter spaces (with an appendix by Semyon Dyatlov). Invent. Math. 194(2), 381–513 (2013). https://doi.org/10.1007/s00222-012-0446-8
Vasy, A.: A minicourse on microlocal analysis for wave propagation. In Thierry Daudé, Dietrich Häfner, and Jean-Philippe Nicolas, editors, Asymptotic Analysis in General Relativity, volume 443 of London Mathematical Society Lecture Note Series, pages 219–373. Cambridge University Press, (2018)
Witten, E.: A new proof of the positive energy theorem. Communications in Mathematical Physics 80(3), 381–402 (1981)
York, J.W.: Jr. Conformally invariant orthogonal decomposition of symmetric tensors on Riemannian manifolds and the initial-value problem of general relativity. Journal of Mathematical Physics, 14(4):456–464, (1973)
Acknowledgements
I would like to thank Richard Melrose and András Vasy for a number of conversations about black hole gluing related to the present work. I am also grateful to Rafe Mazzeo for suggesting the use of his 00-calculus [Maz] to facilitate a geometric singular analysis style treatment of localized gluing. Many thanks are also due to Alessandro Carlotto and Piotr Chruściel for a number of helpful comments and suggestions on content and exposition, to Michael Singer for important pointers to existing literature, and to an anonymous referee for a careful reading of the manuscript and several useful suggestions.
Funding
Open access funding provided by Swiss Federal Institute of Technology Zurich
Author information
Authors and Affiliations
Corresponding author
Additional information
Communicated by E. Smith.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendix A. Quasilinear Elliptic Douglis–Nirenberg Systems
Appendix A. Quasilinear Elliptic Douglis–Nirenberg Systems
We work on \(\mathbb {R}^n\), \(n\ge 1\), with points in \(\mathbb {R}^n\) denoted by x. Let \(N\in \mathbb {N}\). Consider \(u(x)=(u_k(x))_{k=1,\ldots ,N}\), with \(u_k\) valued in \(\mathbb {R}^{d_k}\). Let \(t_1,\ldots ,t_N,s_1,\ldots ,s_N\in \mathbb {N}_0\). We consider a system \((P_j(u))_{j=1,\ldots ,N}=0\) of N nonlinear partial differential equations for u which is quasilinear in the following Douglis–Nirenberg sense: writing
for the vector of all (mixed) x-derivatives of u up to order \(m-1\), we have
where \(L_{j k,\alpha }\) and \({\tilde{P}}_j\) are smooth functions of their arguments. We furthermore assume that P is elliptic at u in the sense that the linear operator
is elliptic in the Douglis–Nirenberg sense.
Proposition A.1
(Bootstrap). Let \({\bar{s}}=\max s_j\). Suppose that \(u\in \bigoplus _{k=1}^N{\mathcal {C}}^{t_k+{\bar{s}},\alpha }(\mathbb {R}^n;\mathbb {R}^{d_k})\), \(\alpha \in (0,1)\), is a solution of \(P(x,u)=0\). Then \(u\in {\mathcal {C}}^\infty \).
Proof
This is a standard scaling and linear approximation argument, but we give a proof here for completeness. Since the claim is local, it suffices to work near \(x=0\). Assume that \(u_k\in {\mathcal {C}}^{t_k+{\bar{s}}+m,\alpha }\) with \(m\in \mathbb {N}_0\). For \(\delta >0\) to be determined later, consider the rescaled coordinate \({\hat{x}}=\frac{x}{\delta }\), and write \({\hat{D}}=\delta ^{-1}D\) for its coordinate derivatives. Setting \({\hat{u}}={\text {diag}}(\delta ^{-t_k})u\), the rescaling \((\delta ^{s_j}P_j(x,u))_{j=1,\ldots ,N}=0\) then reads
Working in \(|{\hat{x}}|\le 2\), we have \(f(\delta )\in \bigoplus _j {\mathcal {C}}^{m+1+{\bar{s}}-s_j,\alpha }\) (with norm depending on \(\delta \)). Moreover, since the norm of \(D^{<t_\ell +s_j}u_\ell (\delta {\hat{x}})-D^{<t_\ell +s_j}u_\ell (0)\in {\mathcal {C}}^{m+1+{\bar{s}}-s_j,\alpha }\) is of size \(\mathcal {O}(\delta )\), we can rewrite this further as
where \(\mathcal {L}=L(0,(D^{<t_\ell +s_j}u_\ell (0)),{\hat{D}})\) is Douglis–Nirenberg elliptic, and where the remainder term \(\tilde{\mathcal {L}}\in (\textrm{Diff}^{t_k+s_j})_{j,k=1,\ldots ,N}\) has coefficients of size \(\mathcal {O}(\delta )\) in \(|{\hat{x}}|\le 2\). Let \(\chi \in {\mathcal {C}}^\infty _{{\textrm{c}}}(B(0,2))\) be identically 1 on B(0, 1); then
Let \(Q\in (\Psi ^{-s_k-t_j})_{j,k=1,\ldots ,N}\) denote a parametrix of \(\mathcal {L}\) near \({\text {supp}}\chi \), with Schwartz kernel compactly supported in \(B(0,2)\times B(0,2)\), and so that \(Q\mathcal {L}=I+R\) where \(R\circ \chi \in \Psi ^{-1}\). We then have
But \(Q{\tilde{f}}(\delta ),R(\chi {\hat{u}})\in \bigoplus _k {\mathcal {C}}^{m+1+{\bar{s}}+t_k,\alpha }\), while the operator norm of \(Q\tilde{\mathcal {L}}\) on \(\bigoplus _k {\mathcal {C}}^{m+\eta +{\bar{s}}+t_k,\alpha }\) is of size \(\mathcal {O}(\delta )\) uniformly for \(\eta \in [0,1]\). (We use here the boundedness of ps.d.o.s on Hölder spaces, see e.g. [Tay11, §13.8].) Thus, for sufficiently small \(\delta \), we can invert \(I+Q\tilde{\mathcal {L}}\) (with the choice \(\eta =0\)) via a Neumann series, giving
But the right hand side is now also defined for the choice \(\eta =1\) (upon shrinking \(\delta \) further if necessary). This improves the regularity of \(\chi {\hat{u}}\), and thus of \(\chi (x/\delta )u(x)\), by 1 order relative to the inductive hypothesis. The proof is complete. \(\quad \square \)
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Hintz, P. Gluing Small Black Holes into Initial Data Sets. Commun. Math. Phys. 405, 114 (2024). https://doi.org/10.1007/s00220-024-04989-6
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s00220-024-04989-6