1 Introduction

We use a combination of high-low frequency analysis and induction to prove a sharp \(L^{7}\) square function estimate for the moment curve \(\mathcal {M}^{3}=\{(t,t^{2},t^{3}):0\le t\le 1\}\). As in [GWZ20] for the cone in \(\mathbb{R}^{3}\), this is an example of using techniques developed in decoupling theory [BD15] to prove square function estimates. It is worth noting that the only previous sharp square function estimates in restriction theory are for even \(L^{p}\) exponents, which often makes Fourier analysis problems more approachable.

In Fourier restriction theory, we are concerned with functions \(f=\sum _{\theta }f_{\theta}\) where \(\mathrm {supp}\widehat{f_{\theta}}\subset \theta \) and the \(\theta \) are pairwise disjoint. The shape of the Fourier support \(\cup \theta \) and the sets \(\theta \) determines the kinds of \(L^{p}\) bounds that \(f\) satisfies. By an \(L^{p}\) square function estimate, we mean an inequality of the form \(\|f\|_{L^{p}}\le C\|(\sum _{\theta }|f_{\theta}|^{2})^{1/2}\|_{L^{p}}\), while an \((\ell ^{2},L^{p})\) decoupling inequality has the form \(\|f\|_{L^{p}}\le C(\sum _{\theta }\|f_{\theta}\|_{L^{p}}^{2})^{1/2}\). Taking the \(L^{p}\) norm of each summand \(f_{\theta}\) separately makes it natural to approach decoupling using rescaling and induction-on-scales arguments, which Bourgain and Demeter did in [BD15]. Indeed, organize the \(\theta \) into larger (pairwise disjoint) sets \(\tau \) and suppose that for \(f_{\tau}:=\sum _{\theta \subset \tau}f_{\theta}\),

$$ \|f\|_{L^{p}} \lessapprox (\sum _{\tau}\|f_{\tau}\|_{L^{p}}^{2})^{1/2}. $$

The desired decoupling inequality into the \(f_{\theta}\) then follows by rescaling each \(f_{\tau}\) and invoking induction to justify \(\|f_{\tau}\|_{L^{p}}\lessapprox (\sum _{\theta \subset \tau}\|f_{ \theta}\|_{L^{p}}^{2})^{1/2}\), since this inequality is equivalent to the initial problem but at a coarser scale. It is much less obvious how to use a multi-scale approach to prove square function estimates since we cannot rescale each \(f_{\tau}\) individually in the analogous intermediate expression \(\|(\sum _{\tau}|f_{\tau}|^{2})^{1/2}\|_{L^{p}}\). A major innovation of Guth-Wang-Zhang’s approach to the \(L^{4}\) square function estimate for the cone was to exploit Plancherel’s theorem and study \(\int (\sum _{\theta }|f_{\theta}|^{2})^{2}\) on the Fourier side. A detailed analysis of the Fourier support of \(\sum _{\theta }|f_{\theta}|^{2}\) then leads to an approximately equivalent expression for the \(L^{4}\) square function estimate called a wave envelope estimate. The advantage of the wave envelope expression is that it behaves well under rescaling and permits an induction-on-scales argument inspired by decoupling.

Guth-Wang-Zhang’s approach for the cone does not adapt to \(\mathcal {M}^{3}\). One immediate difference is that while it is possible to have a good understanding of the Fourier support of \(\sum _{\theta}|f_{\theta}|^{2}\), where \(\theta \) are pieces of a neighborhood of \(\mathcal {M}^{3}\), there is no Plancherel’s theorem for \(L^{7/2}\). This makes wave envelope expressions for \(\int (\sum _{\theta}|f_{\theta}|^{2})^{7/2}\) more complicated than in the \(p=4\) case, though it is still possible to identify a natural candidate. The main barrier to applying the Guth-Wang-Zhang inductive approach to \(\mathcal {M}^{3}\) is that there is no base case for the induction. The base case for the cone follows from approximating the cone by a cylindrical neighborhood of a parabola and invoking the classical Cordoba-Fefferman \(L^{4}\) square function estimate for the parabola [Có82]. There is no similar \(L^{7}\) estimate available for \(\mathcal {M}^{3}\) to establish a base case, so the inductive scheme breaks down.

Our argument builds on the high-low proof of decoupling for the moment curve, which is contained in [GM22b] and is based on the argument from [GMW20] for the parabola. The high-low method involves a stopping-time iteration depending on the behavior of square functions at many different scales. Nevertheless, it was not clear whether a high-low approach could be used to obtain the sharp \(L^{7}\) square function estimate for \(\mathcal {M}^{3}\). By introducing an algorithm inspired by wave envelope estimates, we are able to show that each case of the high-low iteration eventually leads to the desired \(L^{7}\) square function estimate or the initial problem at a coarser scale, which allows one to exploit induction. We provide an overview of the proof technique in §1.1.

For \(R\ge 1\), consider the anisotropic neighborhood

$$ \mathcal {M}^{3}(R)=\{(\xi _{1},\xi _{2},\xi _{3}): \xi _{1}\in [0,1],\,| \xi _{2}-\xi _{1}^{2}|\le R^{-{2/3}},\,|\xi _{3}-3\xi _{1}\xi _{2}+2 \xi _{1}^{3}|\le R^{-1} \}. $$

Partition this neighborhood of \(\mathcal {M}^{3}\) into canonical blocks \(\theta \), which have the form

$$ \begin{aligned} \theta ={}&\{(\xi _{1},\xi _{2},\xi _{3}): lR^{-1/3}\le \xi _{1}< (l+1)R^{-1/3}, \,|\xi _{2}-\xi _{1}^{2}|\le R^{-2/3},\,\\ &|\xi _{3}-3\xi _{1}\xi _{2}+2 \xi _{1}^{3}|\le R^{-1} . \} \end{aligned}$$
(1)

For Schwartz functions \(f:\mathbb{R}^{3}\to \mathbb{C}\), define the Fourier projection onto \(\theta \) by \(f_{\theta}(x)=\int _{\theta }\widehat{f}(\xi )e^{2\pi i x\cdot \xi} d \xi \). Our main result is the following.

Theorem 1

For any \(\varepsilon >0\), there exists \(C_{\varepsilon }<\infty \) such that

$$ \int _{\mathbb{R}^{3}}|f|^{7}\le C_{\varepsilon }R^{\varepsilon }\int _{\mathbb{R}^{3}}|\sum _{\theta}|f_{ \theta}|^{2}|^{\frac{7}{2}} $$
(2)

for any Schwartz function \(f:\mathbb{R}^{3}\to \mathbb{C}\) with Fourier transform supported in \(\mathcal {M}^{3}(R)\).

We also obtain a version of Theorem 1 for curves in \(\mathbb{R}^{3}\) with torsion, which is explained in Appendix B. Theorem 1 is sharp, up to the \(C_{\varepsilon }R^{\varepsilon }\) factor. This may be seen from the constructive interference example which we now describe. Let \(\psi :\mathbb{R}^{3}\to [0,\infty )\) be a smooth bump function supported in the unit ball. For each \(\theta \), let \(\psi _{\theta}=|\theta |^{-1}\psi \circ T_{\theta}\) where \(T_{\theta}: \mathbb{R}^{3}\to \mathbb{R}^{3}\) is an affine transformation mapping an ellipsoid comparable to \(\theta \) to the unit ball. For a small universal constant \(c>0\), the support of \(\psi _{\theta}\) is completely contained in a canonical block at scale \(cR\), so Theorem 1 implies that

(3)

The function has amplitude \(\sim 1\) on the set \(\theta ^{*}=\{x\in \mathbb{R}^{3}:|x\cdot \xi |\le 1\quad \forall \xi \in \theta -\theta \}\) and decays rapidly away from \(\theta ^{*}\). The union of the sets \(\theta ^{*}\) is a bush centered at the origin of \(\sim R^{1/3}\times R^{2/3}\times R\) planks which are tangent to the light cone. A calculation shows that the right hand side of (3) is dominated by the portion of the bush at a distance \(\sim R\) from the origin, where the \(\theta ^{*}\) do not overlap, so

Finally, it is easy to see that in a neighborhood of radius \(\sim 1\) at the origin, , so the left hand side of (3) is bounded below by \(R^{7/3}\). This example also shows that no estimate of the form (2) holds in \(L^{p}\) if \(p>7\).

Estimates of the form (2) have a long history in harmonic analysis. The classical \(L^{4}\) square function estimate for the parabola is based on geometric observations by Fefferman [Fef73] and is recorded in [Có82]. The square function conjecture for paraboloids in \(\mathbb{R}^{n}\) (Conjecture 5.19 of [Dem20]) is known to imply sharp results for the Kakeya maximal function, the Bochner-Riesz means, and the Fourier restriction operator; see [Car15] and the references therein. In future work, we intend to explore analogous applications for the moment curve. Although square function estimates are known to have many applications, there are few sharp results of the form (2) in the literature. There has, however, been some partial progress on establishing square function estimates for the moment curve. In an unpublished work that was shared with the author, H. Jung proved a non-sharp version of Theorem 1 with a positive power of \(R\) in the upper bound. In [GGP21], the authors used approximate solution counting for Vinogradov systems to obtain square function estimates for the moment curve in \(\mathbb{R}^{n}\), but for non-sharp exponents \(2\le p\le 2n\) which are even. The only other sharp square function estimate we are aware of is for the cone in \(\mathbb{R}^{3}\) [GWZ20], which, by the work of [MSS92], resolves the local smoothing conjecture for the wave equation in \(\mathbb{R}^{2+1}\). Our proof of Theorem 1 uses both the square function estimate for the parabola in \(\mathbb{R}^{2}\) and for the cone in \(\mathbb{R}^{3}\).

The organization of this paper is as follows. In §1.1, we describe the main ideas behind the proof of Theorem 1. The proof of Theorem 1 happens in two steps. First, we prove a version of (2) where the terms in the square function are convolved with certain weights. Then we show that this averaged version implies Theorem 1. We introduce the square function constants \(\text{S}_{1}(R)\) and \(\text{S}_{2}(R)\) corresponding to these two versions of (2) in §1.2. We set up tools for the high-low argument in §2 and discuss the relevant geometry related to the moment curve and to the cone in §3. We devote §4 to a key technical step which unwinds the pruning process (see §1.1). The high-low method then allows us to bound the broad part of the left hand side of (2), when the integrand is dominated by a trilinear version of itself, which we carry out in §5. Finally we prove Theorem 1 by bounding \(\text{S}_{1}(R)\) and then \(\text{S}_{2}(R)\) in §6. Appendix A contains proofs of auxiliary square function and wave envelope estimates we require for the parabola and the cone. In Appendix B, we explain how to adapt the argument to obtain Theorem 1 for general curves \(\gamma (t)\) with torsion.

1.1 Overview of the proof of Theorem 1

In this section, we fix \(\varepsilon >0\) and a Schwartz function \(f:\mathbb{R}^{3}\to \mathbb{C}\). Let \(\theta \) denote canonical \(R^{-1/3}\times R^{-2/3}\times R^{-1}\) moment curve blocks. By pigeonholing arguments, it suffices to assume that either \(\|f_{\theta}\|_{\infty}\sim 1\) or \(\|f_{\theta}\|_{\infty}=0\) for each \(\theta \), and that for some \(\alpha >0\) and \(\beta >0\),

$$ \int _{\mathbb{R}^{3}}|f|^{7}\lesssim (\log R)\alpha ^{7}|U_{\alpha ,\beta }| $$

where \(U_{\alpha ,\beta }=\{x\in \mathbb{R}^{3}:|f(x)|>\alpha ,\quad \frac{\beta }{2}\le \sum _{ \theta}|f_{\theta}(x)|^{2}\le \beta \}\). Our goal is to show that

$$ \alpha ^{7}|U_{\alpha ,\beta }|\lesssim _{\varepsilon }R^{\varepsilon }\int _{\mathbb{R}^{3}}|\sum _{ \theta}|f_{\theta}|^{2}|^{7/2}. $$
(4)

The initial step in bounding \(|U_{\alpha ,\beta }|\) involves the \(L^{6}\) trilinear restriction theorem for \(\mathcal {M}^{3}\). Since this theorem bounds trilinear expressions, we are actually bounding the broad part of \(U_{\alpha ,\beta }\) (which is the subset where \(|f|\lesssim |f_{1}f_{2}f_{3}|^{1/3}\) for Fourier projections \(f_{i}\) of \(f\) onto separated neighborhoods of \(\mathcal {M}^{3}\)). The narrow (or not broad) part of \(U_{\alpha ,\beta }\) is dealt with using a standard inductive argument.

The high-low method partitions \(U_{\alpha ,\beta }\) into \(\le \varepsilon ^{-1}\) many subsets \(\Omega _{k}\) which we bound in separate cases. Let \(R_{N-1}=R^{1-\varepsilon }\), let \(\tau _{N-1}\) denote \(R_{N-1}^{-1/3}\times R_{N-1}^{-2/3}\times R_{N-1}\) moment curve blocks, and let \(g_{N-1}=\sum _{\tau _{N-1}}|f_{\tau _{N-1}}|^{2}\). The high-low decomposition for \(g_{N-1}\) is \(g_{N-1}=g_{N-1}^{\ell}+g_{N-1}^{h}\) where for a bump function \(\eta _{N}\) equal to 1 on \(B_{R^{-1/3}}(0)\) and supported in \(B_{2R^{-1/3}}(0)\). A simple local \(L^{2}\)-orthogonality argument shows that \(|g_{N-1}^{\ell}|\lesssim \sum _{\theta}|f_{\theta}|^{2}\). Then since \(\sum _{\theta}|f_{\theta}|^{2}\sim \beta \) on \(U_{\alpha ,\beta }\), the assumption that \(A\beta \le g_{N-1}(x)\) implies that \(g_{N-1}(x)\le 2|g_{N-1}^{h}(x)|\), if \(A\ge 1\) is a sufficiently large constant. This leads to the first subset we bound:

$$ \Omega _{N-1}=\{x\in U_{\alpha ,\beta }:A\beta \le g_{N-1}(x)\}, $$

which we will now describe how to control.

Applying the \(L^{6}\) multilinear restriction essentially yields

$$ \alpha ^{6}|\Omega _{N-1}|\lesssim \int _{R_{N-1}^{1/3}-\Omega _{N-1}}|g_{N-1}|^{3} $$

where \(R_{N-1}^{1/3}-\Omega _{N-1}\) means the \(R_{N-1}^{1/3}\)-neighborhood of \(\Omega _{N-1}\). Since the Fourier support of \(g_{N-1}\) is contained in \(\cup _{\tau _{N-1}}(\tau _{N-1}-\tau _{N-1})\subset B_{2R_{N-1}^{-1/3}}(0)\), \(g_{N-1}\) has roughly constant modulus on \({R_{N-1}^{1/3}}\)-balls on the spatial side. Therefore, since \(A\beta \le g_{N-1}\lesssim |g_{N-1}^{h}|\) on \(\Omega _{N-1}\),

$$ \int _{R_{N-1}^{1/3}-\Omega _{N-1}}|g_{N-1}|^{3} \lesssim \frac{1}{A\beta }\int _{R_{N-1}^{1/3}-\Omega _{N-1}}|g_{N-1}^{h}|^{4} . $$

Note that we chose to go from an \(L^{3}\) expression of \(g_{N-1}\) to an \(L^{4}\) expression. We did this because the Fourier support of \(g_{N-1}^{h}\) is contained in a neighborhood of the truncated cone, so we would like to use the sharp \(L^{4}\) square function estimate for the cone [GWZ20]. This allows us to control the integral on the right hand side above by

Since and there are \(\lesssim R^{\varepsilon /3}\) many \(\theta \) contained in each \(\tau _{N-1}\), the integral on the right hand side is bounded using Cauchy-Schwarz and Young’s convolution inequality by a constant factor times

$$ R^{2\varepsilon }\int _{\mathbb{R}^{3}}|\sum _{\theta}|f_{\theta}|^{4}|^{2}. $$

The summary of the argument so far is that

$$ \alpha ^{6}|\Omega _{N-1}|\lesssim \frac{1}{A\beta }R^{3\varepsilon }\int _{\mathbb{R}^{3}}| \sum _{\theta}|f_{\theta}|^{4}|^{2}. $$

Comparing with (4), we see that it suffices to check that \(\alpha \lesssim \beta \) and \(\int _{\mathbb{R}^{3}}|\sum _{\theta}|f_{\theta}|^{4}|^{2}\lesssim \int _{ \mathbb{R}^{3}}|\sum _{\theta}|f_{\theta}|^{2}|^{7/2}\). The first inequality is justified since morally, each \(|f_{\theta}|\) may be thought of as a sum amplitude 1 wave packets localized to non-overlapping translates of dual planks \(\theta ^{*}\), so \(|f_{\theta}|\lesssim |f_{\theta}|^{2}\). Then for \(x\in U_{\alpha ,\beta }\), we have \(\alpha \lesssim |\sum _{\theta }f_{\theta}(x)|\lesssim \sum _{\theta}|f_{ \theta}(x)|\lesssim \sum _{\theta}|f_{\theta}(x)|^{2}\lesssim \beta \). The second inequality is justified by the assumption that \(\|f_{\theta}\|_{\infty}\lesssim 1\), so \(\sum _{\theta}|f_{\theta}|^{4}\lesssim \sum _{\theta}|f_{\theta}|^{7/2}\), and then using \(\|\cdot \|_{\ell ^{7/2}}\le \|\cdot \|_{\ell ^{2}}\). This concludes the bound of \(|\Omega _{N-1}|\), which only involved a local multilinear restriction estimate, high-frequency dominance with the locally constant property, and applying the \(L^{4}\) square function estimate for the cone. For the remaining parts of \(U_{\alpha ,\beta }\), we will see that an additional idea is required.

Intermediate scales: Let \(R_{k}=R^{k\varepsilon }\), let \(\tau _{k}\) be canonical \(R_{k}^{-1/3}\times R_{k}^{-2/3}\times R_{k}^{-1}\) moment curve blocks, and let \(g_{k}=\sum _{\tau _{k}}|f_{\tau _{k}}|^{2}\). Decompose \(g_{k}\) into high-low parts \(g_{k}=g_{k}^{\ell}+g_{k}^{h}\) by defining , where \(\eta _{k}\) is a bump function equal to 1 on \(B_{R_{k+1}^{-1/3}}(0)\). The \((N-k)\)th subset of \(U_{\alpha ,\beta }\) that we consider is

$$ \Omega _{k}=\{x\in U_{\alpha ,\beta }:A^{N-k}\beta \le g_{k}\quad \text{and} \quad g_{l}\le A^{N-l}\beta \quad \forall l=k+1,\ldots ,N-1\}. $$

As in the analysis of \(\Omega _{N-1}\), we can show that on \(\Omega _{k}\), \(g_{k}\) is high-dominated. After applying the \(L^{6}\) trilinear restriction and using that \(A^{N-k}\beta \le g_{k}\lesssim |g_{k}^{h}|\) on \(\Omega _{k}\), we have

$$ \alpha ^{6}|\Omega _{k}|\lesssim \frac{1}{A^{N-k}\beta }\int _{\mathbb{R}^{3}}|g_{k}^{h}|^{4}. $$

Again, \(g_{k}^{h}\) is Fourier supported on the \(R_{k}^{-1/3}\)-dilation of the truncated cone, so we may apply the \(L^{4}\) square function estimate for the cone, yielding

$$ \alpha ^{6}|\Omega _{k}|\lesssim \frac{1}{A^{N-k}\beta }C_{\varepsilon }R^{\varepsilon }\int _{ \mathbb{R}^{3}}|\sum _{\tau _{k}}|f_{\tau _{k}}|^{4}|^{2}. $$
(5)

Here, unlike in the analysis of \(\Omega _{N-1}\), \(\tau _{k}\) may be much coarser than \(\theta \), so we cannot use trivial inequalities and the assumption that \(\|f_{\theta}\|_{L^{\infty}}\lesssim 1\) to arrive at the right hand side of (4). To provide an alternative \(L^{\infty}\) bound for each \(f_{\tau _{k}}\), we perform a pruning process on the wave packets. This pruning process is the same as the one from [GMW20], in which we argue that on \(\Omega _{k}\subset U_{\alpha ,\beta }\), \(f\) may be replaced by a version \(f^{k}=\sum _{\tau _{k}} f_{\tau _{k}}^{k}\) where each \(f_{\tau _{k}}^{k}\) only has wave packets with amplitude \(\lesssim \beta /\alpha \). The pruned \(f_{\tau _{k}}^{k}\) satisfy the property that \(\|f_{\tau _{k}}^{k}\|_{\infty}\lesssim \frac{\beta }{\alpha }\). Using \(f^{k}\) in place of \(f\) in (5) and the good \(L^{\infty}\) bound for each \(f_{\tau _{k}}^{k}\), we arrive at the inequality

$$ \alpha ^{6}|\Omega _{k}|\lesssim _{\varepsilon }\frac{1}{A^{N-k}\beta }R^{\varepsilon } \frac{\beta }{\alpha }\int _{\mathbb{R}^{3}}|\sum _{\tau _{k}}|f_{\tau _{k}}^{k}|^{7/2}|^{2}, $$

which, using \(\|\cdot \|_{\ell ^{7/2}}\le \|\cdot \|_{\ell ^{2}}\), implies that

$$ \alpha ^{7}|\Omega _{k}|\lesssim _{\varepsilon }R^{\varepsilon }\int _{\mathbb{R}^{3}}|\sum _{ \tau _{k}}|f_{\tau _{k}}^{k}|^{2}|^{7/2}. $$
(6)

The left hand side looks good because it is an \(L^{7}\) expression. It remains to consider how to bound the right hand side by the \(L^{7}\) integral of the square function at our desired scale \(\theta \). It looks as though the right hand side is partial progress towards \(\sum _{\theta}|f_{\theta}|^{2}\), so we would like to invoke induction to finish the argument. Indeed, if our goal were to prove an \((\ell ^{2},L^{7})\) decoupling estimate and we had shown

$$ \alpha ^{7}|\Omega _{k}|\lesssim _{\varepsilon }R^{\varepsilon }\Big(\sum _{\tau _{k}}\|f_{ \tau _{k}}\|_{L^{7}(\mathbb{R}^{3})}^{2} \Big)^{7/2}, $$

then we could rescale each \(f_{\tau _{k}}\) and invoke induction on scales to justify \(\|f_{\tau _{k}}\|_{L^{7}(\mathbb{R}^{3})}\lesssim (\sum _{\theta \subset \tau _{k}}\|f_{\theta}\|_{L^{7}(\mathbb{R}^{3})}^{2})^{1/2}\), which combines with the displayed inequality to give the desired \((\ell ^{2},L^{7})\) estimate. The main difficulty of proving a square function estimate compared to a decoupling estimate is that the integrand on the right hand side of (6) involves all of the \(f_{\tau _{k}}^{k}\), so we cannot rescale each \(f_{\tau _{k}}^{k}\) individually and invoke induction.

To address this problem, we introduce an algorithm which either isolates the different \(f_{\tau _{k}}^{k}\) to permit rescaling and induction, or allows us to further refine \(\sum _{\tau _{k}}|f_{\tau _{k}}^{k}|^{2}\) pointwise to \(\sum _{\tau '}|f_{\tau '}|^{2}\), where the \(\tau '\) are closer to our goal \(\theta \) than the \(\tau _{k}\). Our algorithm begins by using the wave envelope estimate Theorem 1.3 from [GWZ20] in place of the \(L^{4}\) square function estimate to bound \(g_{k}^{h}\). The wave envelope estimate for the cone replaces the standard square function integral by a controlled number of more-refined expressions. To simplify the explanation of our strategy, we will consider the special case coming from the wave envelope estimate in which \(\int |\sum _{\tau _{k}}|f_{\tau _{k}}^{k}|^{4}|^{2}\) from (5) is dominated by the part of the domain on which only one summand \(|f_{\tau _{k}}^{k}|\) is large at a time. Otherwise repeating the reasoning that led to (6), this yields

$$ \alpha ^{7}|\Omega _{k}|\lesssim _{\varepsilon }R^{\varepsilon }\int \sum _{\tau _{k}}|f_{ \tau _{k}}^{k}|^{7}. $$
(7)

At first sight, the integral on the right hand side of (7) looks similar to

$$ \sum _{\tau _{k}}\int |f_{\tau _{k}}|^{7}. $$
(8)

Since each \(f_{\tau _{k}}\) is integrated individually, this expression can be handled by induction as follows. Let \(\text{S}(R)\) be the smallest constant for which

$$ \int |\sum _{\theta }f_{\theta}|^{7}\le S(R)\int |\sum _{\theta}|f_{ \theta}|^{2}|^{7/2} $$

for any \(f\) satisfying the hypotheses of Theorem 1. Using affine rescaling of the moment curve, (8) is bounded by

$$ S(R/R_{k})\sum _{\tau _{k}}\int |\sum _{\theta \subset \tau _{k}}|f_{ \theta}|^{2}|^{7/2}\le S(R/R_{k})\int |\sum _{\theta}|f_{\theta}|^{2}|^{7/2} . $$

Supposing that \(R_{k}>R^{C_{0}\varepsilon }\), this would be a favorable scenario since the multi-scale inequality \(S(R)\lesssim _{\varepsilon }R^{C\varepsilon } S(R/R_{k})\) implies that \(S(R)\lesssim _{\delta }R^{\delta }\) for any \(\delta >0\). The issue with this argument is that the \(f_{\tau _{k}}^{k}\) in (7) are different from the \(f_{\tau _{k}}\) in (8). The pruning process which leads to the favorable \(L^{\infty}\) bounds for \(f_{\tau _{k}}^{k}\) also changes the Fourier support from \(\cup _{\theta \subset \tau _{k}}\theta \) to potentially all of \(\tau _{k}\). A rescaling argument may still be used to argue that

$$ \sum _{\tau _{k}}\int |f_{\tau _{k}}^{k}|^{7}\le S(R^{\varepsilon })\sum _{ \tau _{k}}\int |\sum _{\tau _{k+1}}|f_{\tau _{k+1}}^{k+1}|^{2}|^{7/2}, $$

but then we need to analyze the expression on the right hand side. The bulk of the new technical work in this paper is to perform a delicate algorithm addressing all of the cases from the wave envelope estimate (which is applied repeatedly) while “unpruning” \(f^{k}\) (going from expressions with \(f^{k}\) to \(f^{k+1}\) and eventually to \(f^{N}=f\)) and carefully keeping track of constants to ensure the induction closes. Eventually, we bound the right hand side of (7) by an expression like

$$ C_{\delta ,\varepsilon } R^{\delta +\varepsilon } [S(R^{\varepsilon })]^{\varepsilon ^{-1}-C_{0}}\int |\sum _{\theta}|f_{ \theta}|^{2}|^{7/2} $$

where \(\delta >0\) may be arbitrarily small. This is again a favorable case since \(S(R)\le C_{\delta ,\varepsilon } R^{\delta +\varepsilon } S(R^{\varepsilon })^{\varepsilon ^{-1}-C_{0}}\) for any \(\delta >0\) and any \(\varepsilon >0\) implies the desired bound for \(S(R)\).

Small scale: \(R_{k}\le R^{C_{0}\varepsilon }\). In this case, the remaining subset of \(U_{\alpha ,\beta }\) is

$$ L=\{x\in U_{\alpha ,\beta }:g_{l}\le A^{N-l}\beta \quad \forall l=C_{0},\ldots ,N-1 \}. $$

The argument for the intermediate scales case no longer works when \(R_{k}\) is too small since we could conclude, for example, that \(S(R)\le C_{\delta ,\varepsilon }R^{\delta }S(R^{\varepsilon })^{\varepsilon ^{-1}}\). This would not lead to the desired bound \(S(R)\lesssim _{\delta }R^{\delta }\). Instead, we use trivial bounds based on the definition of \(L\). For each \(x\in L\), for any \(l=C_{0},\ldots ,N-1\), we may write

$$ \alpha ^{2}=|\sum _{\tau _{l}}f_{\tau _{l}}(x)|^{2}\lesssim \#\tau _{l} \sum _{\tau _{l}}|f_{\tau _{l}}|^{2}(x)\lesssim R_{l}^{1/3}A^{N-l}\beta \lesssim R_{l}^{1/3}A^{N-l}\sum _{\theta}|f_{\theta}|^{2}. $$

Then using \(l=C_{0}\),

$$ \alpha ^{7}|L|\lesssim R^{C_{0}\varepsilon } \int _{L}|\sum _{\theta}|f_{\theta}|^{2}|^{7/2}, $$

which gives the bound \(S(R)\lesssim R^{C_{0}\varepsilon }\) directly.

The conclusion of the high-low argument from each of the previous cases is that for any \(\delta ,\varepsilon >0\) and \(C_{0}>0\),

$$ S(R)\lesssim _{\delta ,\varepsilon } R^{\delta +\varepsilon }\Big[R^{C_{0}\varepsilon }+S(R^{\varepsilon })^{\varepsilon ^{-1}-C_{0}} \Big]. $$

The exact version of the multi-scale inequality we prove is in Lemma 6.7. We show in §6.2 how the multi-scale inequality implies the desired bound for \(S(R)\).

1.2 Definitions of the square function constants \(S_{1}(R)\) and \(S_{2}(R)\)

Technically, the result of our high-low argument is

$$ \int _{\mathbb{R}^{3}}|f|^{7}\lesssim _{\varepsilon }R^{\varepsilon }\int _{\mathbb{R}^{3}}|\sum _{ \theta}|f_{\theta}|^{2}*\omega _{\theta}|^{7/2} $$
(9)

for appropriate \(L^{1}\)-normalized weight functions \(\omega _{\theta}\) adapted to \(\theta ^{*}\). While the locally constant property (see Lemma 2.2) tells us that pointwise,

$$ \sum _{\theta}|f_{\theta}|^{2}(x)\lesssim \sum _{\theta}|f_{\theta}|^{2}* \omega _{\theta}(x), $$

it is not generally true that the reverse inequality holds, either pointwise or in \(L^{7/2}\). Indeed, let \({\mathbf{{v}}}_{\theta}\in \mathbb{R}^{3}\) be a unit vector in the direction of the \(R\)-long side of \(\theta ^{*}\), let \(T_{\theta}\) be an \(R^{2/3}\times R^{2/3}\times R\) tube centered at the origin with orientation \({\mathbf{{v}}}_{\theta}\), and let each \(|f_{\theta}|^{2}(x)\approx \chi _{T_{\theta}}(x-R{\mathbf{{v}}}_{\theta})\). Then

$$ \int |\sum _{\theta}|f_{\theta}|^{2}|^{7/2}\sim \sum _{\theta}\int |f_{ \theta}|^{7}\sim R^{8/3} $$

since the \(|f_{\theta}|^{2}\) are essentially disjointly supported. On the other hand, after averaging, \(\sum _{\theta}|f_{\theta}|^{2}*\omega _{\theta}\gtrsim R^{1/3}\) on a ball of radius \(\sim R^{2/3}\) centered at the origin, giving the lower bound

$$ \int |\sum _{\theta}|f_{\theta}|^{2}*\omega _{\theta}|^{7/2}\gtrsim R^{2}(R^{1/3})^{7/2}=R^{19/6}. $$

Therefore, the version of a square function estimate (9) that we obtain from the high-low argument is ostensibly weaker than our goal. After proving (9), we use an inductive argument to upgrade it to (2).

Definition 1

Let \(R\ge 10\). Let \(S_{1}(R)\) be the infimum of \(A>0\) such that

$$ \int _{\mathbb{R}^{3}}|f|^{7}\le A\int _{\mathbb{R}^{3}}|\sum _{\theta}|f_{\theta}|^{2}* \omega _{\theta}|^{7/2} $$

for any Schwartz function \(f:\mathbb{R}^{3}\to \mathbb{C}\) with Fourier transform supported in \(\mathcal {M}^{3}(R)\).

Definition 2

Let \(R\ge 10\). Let \(S_{2}(R)\) be the infimum of \(B>0\) such that

$$ \int _{\mathbb{R}^{3}}|f|^{7}\le B\int _{\mathbb{R}^{3}}|\sum _{\theta}|f_{\theta}|^{2}|^{7/2} $$

for any Schwartz function \(f:\mathbb{R}^{3}\to \mathbb{C}\) with Fourier transform supported in \(\mathcal {M}^{3}(R)\).

2 Set-up for the high-low analysis

In this section, we set-up the notation and basic properties of the high-low analysis of square functions at various scales. This is analogous to the high-low set-up from [GMW20].

Begin with precise definitions of canonical blocks of the moment curve.

Definition 3

Canonical moment curve blocks

For \(S\in 2^{\mathbb{N}}\), consider the anisotropic neighborhood

$$ \mathcal {M}^{3}(S^{3})=\{(\xi _{1},\xi _{2},\xi _{3}): \xi _{1}\in [0,1], \,|\xi _{2}-\xi _{1}^{2}|\le S^{-2},\,|\xi _{3}-3\xi _{1}\xi _{2}+2 \xi _{1}^{3}|\le S^{-3} \}. $$

Define \({\mathbf{{S}}}(S^{-1})\) to be the following collection of canonical moment curve blocks at scale \(S\) which partition \(\mathcal {M}^{3}(S^{3})\):

$$ \bigsqcup \limits _{l=0}^{S-1}\{(\xi _{1},\xi _{2},\xi _{3}): lS^{-1} \le \xi _{1}< (l+1)S^{-1},\,|\xi _{2}-\xi _{1}^{2}|\le S^{-2},\,|\xi _{3}-3 \xi _{1}\xi _{2}+2\xi _{1}^{3}|\le S^{-3} \}. $$

The (not unit-normalized) Frenet frame for the moment curve \(\mathcal {M}^{3}\) at \(t\in [0,1]\) is

$$\begin{aligned} {\mathbf{{T}}}(t)&=(1,2t,3t^{2}) \\ {\mathbf{{N}}}(t)&=(-2t-9t^{3}, 1-9t^{4}, 3t+6t^{3}) \\ {\mathbf{{B}}}(t)&=(3t^{2},-3t,1). \end{aligned}$$
(10)

If \(\tau \in{\mathbf{{S}}}(S^{-1})\) is the \(\ell \)th moment curve block

$$ \{(\xi _{1},\xi _{2},\xi _{3}): lS^{-1}\le \xi _{1}< (l+1)S^{-1},\,| \xi _{2}-\xi _{1}^{2}|\le S^{-2},\,|\xi _{3}-3\xi _{1}\xi _{2}+2\xi _{1}^{3}| \le S^{-3} \}, $$

then \(\tau \) is comparable to the set

$$ \{\gamma (lS^{-1})+A{\mathbf{{T}}}(lS^{-1})+B{\mathbf{{N}}}(lS^{-1})+C{\mathbf{{B}}}(lS^{-1}): |A|\le S^{-1},|B|\le S^{-2},|C|\le S^{-3} \} . $$

By comparable, we mean that there is an absolute constant \(C>0\) for which \(C^{-1}\tau \) is contained in the displayed set and \(C\tau \) contains the displayed set, where the dilations are taken with respect to the centroid of \(\tau \). Define the dual set \(\tau ^{*}\) by

$$ \tau ^{*} = \{A{\mathbf{{T}}}(lS^{-1})+B{\mathbf{{N}}}(lS^{-1})+C{\mathbf{{B}}}(lS^{-1}): |A|\le S,|B|\le S^{2},|C|\le S^{3} \}. $$
(11)

We sometimes refer to the set \(\tau ^{*}\) as well as its translates as wave packets.

Next, we fix some notation for the scales. Let \(\varepsilon >0\). To prove Theorem 1, it suffices to assume that \(R\) is larger than a constant which depends on \(\varepsilon \). Consider scales \(R_{k}\in 8^{\mathbb{N}}\) closest to \(R^{k\varepsilon }\), for \(k=1,\ldots ,N\) and \(R_{N}\le R\le R^{\varepsilon }R_{N}\). Since \(R\) differs from \(R_{N}\) at most by a factor of \(R^{\varepsilon }\), we will assume that \(R=R_{N}\). The relationship between the parameters is

$$ 1=R_{0}\le R_{k}^{\frac{1}{3}}\le R_{k+1}^{\frac{1}{3}}\le R_{N}^{ \frac{1}{3}}. $$

Fix notation for moment curve blocks of various sizes.

  1. (1)

    Let \(\theta \) denote \(\sim R^{-\frac{1}{3}}\times R^{-\frac{2}{3}}\times R^{-1}\) moment curve blocks from the collection \({\mathbf{{S}}}(R^{-1/3})\).

  2. (2)

    Let \(\tau _{k}\) denote \(\sim R_{k}^{-\frac{1}{3}}\times R_{k}^{-\frac{2}{3}}\times R_{k}^{-1}\) moment curve blocks from the collection \({\mathbf{{S}}}(R_{k}^{-1/3})\).

The definitions of \(\theta ,\tau _{k}\) provide the additional property that if \(k< m\) and \(\tau _{k}\cap \tau _{m}\neq\emptyset \), then \(\tau _{m}\subset \tau _{k}\).

Fix a ball \(B_{R}\subset \mathbb{R}^{3}\) of radius \(R\) as well as a Schwartz function \(f:\mathbb{R}^{3}\to \mathbb{C}\) with Fourier transform supported in \(\mathcal {M}^{3}(R)\). The parameters \(\alpha ,\beta >0\) describe the set

$$ U_{\alpha ,\beta }=\{x\in B_{R}:|f(x)|\ge \alpha ,\quad \frac{\beta }{2}\le \sum _{ \theta \in{\mathbf{{S}}}(R^{-1/3})}|f_{\theta}|^{2}*\omega _{\theta}(x)\le \beta \}. $$

The weight function \(\omega _{\theta}\) is defined in Definition 5 below. We assume throughout this section (and until §6.1) that the \(f_{\theta}\) satisfy the extra condition that

$$ \frac{1}{2}\le \|f_{\theta}\|_{L^{\infty}(\mathbb{R}^{3})}\le 2\qquad \text{or}\qquad \|f_{\theta}\|_{L^{\infty}(\mathbb{R}^{3})}=0. $$
(12)

2.1 A pruning step

We define wave packets associated to \(f_{\tau _{k}}\) and sort them according to an amplitude condition which depends on the parameters \(\alpha \) and \(\beta \).

For each \(\tau _{k}\), let \(\mathbb{T}_{\tau _{k}}\) contain \(\tau _{k}^{*}\) and its translates \(T_{\tau _{k}}\) which tile \(\mathbb{R}^{3}\). Fix an auxiliary function \(\varphi (\xi )\) which is a bump function supported in \([-\frac{1}{4},\frac{1}{4}]^{3}\). For each \(m\in \mathbb{Z}^{3}\), let

where \(c\) is chosen so that . Since is a rapidly decaying function, for any \(n\in \mathbb{N}\), there exists \(C_{n}>0\) such that

$$ \psi _{m}(x)\le c\int _{[-\frac{1}{2},\frac{1}{2}]^{3}} \frac{C_{n}}{(1+|x-y-m|^{2})^{n}}dy \le \frac{\tilde{C}_{n}}{(1+|x-m|^{2})^{n}}. $$

Define the partition of unity \(\psi _{T_{\tau _{k}}}\) associated to \({\tau _{k}}\) to be \(\psi _{T_{\tau _{k}}}(x)=\psi _{m}\circ A_{\tau _{k}}\), where \(A_{\tau _{k}}\) is a linear transformations taking \(\tau _{k}^{*}\) to \([-\frac{1}{2},\frac{1}{2}]^{3}\) and \(A_{\tau _{k}}(T_{\tau _{k}})=m+[-\frac{1}{2},\frac{1}{2}]^{3}\). The important properties of \(\psi _{T_{\tau _{k}}}\) are (1) rapid decay off of \(T_{\tau _{k}}\) and (2) Fourier support contained in \(\tau _{k}\) translated to the origin. We sort the wave packets \(\mathbb{T}_{\tau _{k}}=\mathbb{T}_{\tau _{k}}^{g}\sqcup \mathbb{T}_{\tau _{k}}^{b}\) into “good” and “bad” sets, and define corresponding versions of \(f\), as follows.

Remark 1

In the following definitions, let \(K\ge 1\) be a large parameter which will be used to define the broad set in Proposition 5.1. Also, \(A=A(\varepsilon )\gg 1\) is a large enough constant (determined by Lemma 2.6) which also satisfies \(A\ge D\), where \(D\) is from Lemma 2.4.

Definition 4

Pruning with respect to \(\tau _{k}\)

Let \(f^{N}=f\), \(f^{N}_{\tau _{N}}=f_{\theta}\). For each \(1\le k\le N-1\), let

$$\begin{aligned} \mathbb{T}_{\tau _{k}}^{g}&=\{T_{\tau _{k}}\in \mathbb{T}_{\tau _{k}}:\|\psi _{T_{ \tau _{k}}}^{1/2}f_{\tau _{k}}^{k+1}\|_{L^{\infty}(R^{3})}\le K^{3}A^{N-k+1} \frac{\beta }{\alpha }\}, \\ f_{\tau _{k}}^{k}=\sum _{T_{\tau _{k}}\in \mathbb{T}_{\tau _{k}}^{g}}&\psi _{T_{ \tau _{k}}}f^{k+1}_{\tau _{k}}\qquad \textit{and}\qquad f_{\tau _{k-1}}^{k}= \sum _{\tau _{k}\subset \tau _{k-1}}f_{\tau _{k}}^{k} . \end{aligned}$$

For each \(k\), the \(k\)th version of \(f\) is \(f^{k}=\underset{\tau _{k}}{\sum} f_{\tau _{k}}^{k}\).

Remark 2

We may assume that \(\alpha \lesssim R^{C_{0}}\beta \). This will be discussed in Proposition 6.4and Corollary 6.5, which involve pigeonholing the wave packets of \(f\).

In the following lemma, we assume that the \(f_{\theta}\) satisfy (12) and \(f=\sum _{\theta }f_{\theta}\) has been pruned according to the above definition.

Lemma 2.1

Properties of \(f^{k}\)

  1. (1)

    \(| f_{\tau _{k}}^{k} (x) | \le |f_{ \tau _{k}}^{k+1}(x)|\lesssim \# \theta \subset \tau _{k}\).

  2. (2)

    \(\| f_{\tau _{k}}^{k} \|_{L^{\infty}(\mathbb{R}^{3})} \le K^{3}A^{N-k+1} \frac{\beta }{\alpha }\).

  3. (3)

    For \(R\) sufficiently large depending on \(\varepsilon \), \(\textit{supp} \widehat{f_{\tau _{k}}^{k}}\subset 3\tau _{k} \).

Proof

For the first property, recall that \(\sum _{T_{\tau _{k}} \in \mathbb{T}_{\tau _{k}}}\psi _{T_{\tau _{k}}}\) is a partition of unity so we may iterate the inequalities

$$\begin{aligned} |f_{\tau _{k}}^{k}|\le |f_{\tau _{k}}^{k+1}|&\le \sum _{\tau _{k+1} \subset \tau _{k}}|f_{\tau _{k+1}}^{k+1}|\le \cdots \le \sum _{\tau _{N} \subset \tau _{k}}|f_{\tau _{N}}^{N}|= \sum _{\theta \subset \tau _{k}}|f_{ \theta}|. \end{aligned}$$

The first property follows from our assumption (12) that each \(\|f_{\theta}\|_{L^{\infty}(\mathbb{R}^{3})}\lesssim 1\). For the \(L^{\infty}\) bound in the second property, write

$$\begin{aligned} |f_{ \tau _{k}}^{k}(x)|& = |\sum _{ \substack{T_{\tau _{k}} \in \mathbb{T}_{\tau _{k}^{g}}}} \psi _{T_{\tau _{k}}}(x) f_{ \tau _{k}}^{k+1}(x)|\le \sum _{ \substack{T_{\tau _{k}} \in \mathbb{T}_{\tau _{k}^{h}}}} \psi _{T_{\tau _{k}}}^{1/2}(x) \|\psi _{T_{\tau _{k}}}^{1/2}f_{ \tau _{k}}^{k+1}\|_{\infty}\\ &\lesssim \| \psi _{T_{\tau _{k}}}^{1/2}f_{ \tau _{k}}^{k+1}\|_{\infty}. \end{aligned}$$

By the definition of \(\mathbb{T}_{\tau _{k}}^{g}\), \(\|\psi _{T_{\tau _{k}}}^{1/2}f_{\tau _{k}}^{k+1}\|_{\infty}\le K^{3} A^{N-k+1} \frac{\beta }{\alpha }\).

The third property depends on the Fourier support of \(\psi _{T_{\tau _{k}}}\), which is contained in \(\tau _{k}\) shifted to the origin. Note if each \(f_{\tau _{k}}^{k+1}\) has Fourier support in \(\cup _{\tau _{k+1}\subset \tau _{k}}3\tau _{k+1}\), then \(\mathrm {supp}\widehat{f_{\gamma _{k}}^{k}}\) is contained in \(3\tau _{k}\). □

Definition 5

Let \(\phi :\mathbb{R}^{3}\to \mathbb{R}\) be a smooth, radial function supported in \([-\frac{1}{4},\frac{1}{4}]^{3}\) and satisfying when \(|x|\le 1\). Then define \(w:\mathbb{R}^{3}\to [0,\infty )\) by

Let \(B\subset \mathbb{R}^{3}\) denote the unit ball centered at the origin. For any set \(U=T(B)\) where \(T\) is an affine transformation \(T:\mathbb{R}^{3}\to \mathbb{R}^{3}\), define

$$ w_{U}(x)=|U|^{-1}w(T^{-1}(x)). $$

For each \(\tau _{k}\), let \(A_{\tau _{k}}\) be a linear transformation mapping \(\tau _{k}^{*}\) to the unit cube and define \(\omega _{\tau _{k}}\) by

$$ \omega _{\tau _{k}}(x)=|\tau _{k}^{*}|^{-1}w(A_{\tau _{k}}(x)). $$

Let the capital-W version of weight functions denote the \(L^{\infty}\)-normalized (as opposed to \(L^{1}\)-normalized) versions, so for example, for any ball \(B_{s}\), \(W_{B_{s}}(x)=|B_{s}|w_{B_{s}}(x)\). If a weight function has subscript which is only a scale, say \(s\), then the functions \(w_{s},W_{s}\) are weight functions localized to the \(s\)-ball centered at the origin.

Remark 3

Note the additional property that \(\widehat{w}\) is supported in \([-\frac{1}{2},\frac{1}{2}]^{3}\), so each \(w_{B_{s}}\) is Fourier supported in an \(s^{-1}\)-neighborhood of the origin. Finally, note the property that if \(A_{1},A_{2}\) are affine transformations of the unit ball and \(A_{1}\subset A_{2}\), then \(w_{A_{1}}*w_{A_{2}}\lesssim w_{A_{2}}\).

Next, we record the locally constant property. By locally constant property, we mean that if a function \(f\) has Fourier transform supported in a convex set \(A\), then \(|f|\) is bounded above by an averaged version of \(|f|\) over a dual set \(A^{*}\).

Lemma 2.2

Locally constant property

For each \(\tau _{k}\) and \(T_{\tau _{k}}\in \mathbb{T}_{\tau _{k}}\),

$$\begin{aligned} \|f_{\tau _{k}}\|_{L^{\infty}(T_{\tau _{k}})}^{2}\lesssim |f_{\tau _{k}}|^{2}* \omega _{\tau _{k}}(x)\qquad \textit{for any}\quad x\in T_{\tau _{k}} . \end{aligned}$$

Also, for any \(R_{k}^{1/3}\)-ball \(B_{R_{k}^{1/3}}\),

$$\begin{aligned} \|\sum _{\tau _{k}}|f_{\tau _{k}}|^{2}\|_{L^{\infty}(B_{R_{k}^{1/3}})} \lesssim |f_{\tau _{k}}|^{2}*w_{B_{R_{k}^{1/3}}}(x)\qquad \textit{for any}\quad x\in B_{R_{k}^{1/3}} . \end{aligned}$$

Because the pruned versions of \(f\) and \(f_{\tau _{k}}\) have essentially the same Fourier supports as the unpruned versions, the locally constant lemma applies to the pruned versions as well.

Proof of Lemma 2.2

For the first claim, we write the argument for \(f_{\tau _{k}}\) in detail. Let \(\rho _{\tau _{k}}\) be a bump function equal to 1 on \(\tau _{k}\) and supported in \(2\tau _{k}\). Then using Fourier inversion and Hölder’s inequality,

Since \(\rho _{\tau _{k}}\) may be taken to be an affine transformation of a standard bump function adapted to the unit ball, is a constant. The function decays rapidly off of \(\tau _{k}^{*}\), so . Since for any \(T_{\tau _{k}}\in \mathbb{T}_{\tau _{k}}\), \(\omega _{\tau _{k}}(y)\sim \omega _{\tau _{k}}(y')\) for all \(y,y'\in T_{\tau _{k}}\), we have

$$\begin{aligned} \sup _{x\in T_{\tau _{k}}}|f_{\tau _{k}}|^{2}*\omega _{\tau _{k}}(x)&\le \int |f_{\tau _{k}}|^{2}(y)\sup _{x\in T_{\tau _{k}}}\omega _{\tau _{k}}(x-y)dy \\ &\sim \int |f_{\tau _{k}}|^{2}(y)\omega _{\tau _{k}}(x-y)dy\qquad \text{for all}\quad x\in T_{\tau _{k}}. \end{aligned}$$

For the second part of the lemma, repeat analogous steps as above, except begin with \(\rho _{\tau _{k}}\) which is identically 1 on a ball of radius \(2R_{k}^{-1/3}\) containing \(\tau _{k}\). Then

where we used that each \(\rho _{\tau _{k}}\) is a translate of a single function \(\rho _{R^{-1/3}}\). The rest of the argument is analogous to the first part. □

The following local \(L^{2}\)-orthogonality lemma which is Lemma 3 in [GM22b].

Lemma 2.3

Local \(L^{2}\) orthogonality

Let \(U=T(B)\) where \(B\) is the unit ball centered at the origin and \(T:\mathbb{R}^{3}\to \mathbb{R}^{3}\) is an affine transformation. Let \(h:\mathbb{R}^{3}\to \mathbb{C}\) be a Schwartz function with Fourier transform supported in a disjoint union \(X=\sqcup _{k} X_{k}\), where \(X_{k}\subset B\) are Lebesgue measurable. If the maximum overlap of the sets \(X_{k}+U^{*}\) is \(L\), then

$$ \int |h_{X}|^{2}w_{U}\lesssim L\sum _{X_{k}}\int |h_{X_{k}}|^{2}w_{U}, $$

where \(h_{X_{k}}=\int _{X_{k}}\widehat{h}(\xi )e^{2\pi i x\cdot \xi}d\xi \).

Here, we may take \(\{x:|x\cdot \xi |\le 1\quad \forall \xi \in U-U\}\) as the definition of \(U^{*}\). We will include a sketch of the proof for future reference.

Proof

By Plancherel’s theorem, we have

$$\begin{aligned} \int |h_{X}|^{2}w_{U}&=\int h_{X} \overline{h_{X}w_{U}}=\int \widehat{h_{X}}\overline{\widehat{h_{X}}*\widehat{w_{U}}}. \end{aligned}$$

Since \(\widehat{h_{X}}=\sum _{k}\widehat{h_{X_{k}}}\), \(\int \widehat{h_{X}}\overline{\widehat{h_{X}}*\widehat{w_{U}}}=\sum _{X_{k}} \sum _{X_{k}'}\int \widehat{h_{X_{k}}} \overline{\widehat{h_{X_{k}'}}*\widehat{w_{U}}}\). For each \(X_{k}\), the integral on the right hand side vanishes except for \(\lesssim L\) many choices of \(X_{k}'\). □

2.2 High-low frequency decomposition of square functions

Definition 6

Auxiliary functions

Let \(\eta :\mathbb{R}^{3}\to [0,\infty )\) be a radial, smooth bump function satisfying \(\eta (x)=1\) on \(B_{1/2}\) and \(\mathrm {supp}\eta \subset B_{1}\). Then for each \(s>0\), let

$$ \eta _{\le s}(\xi ) =\eta (s^{-1}\xi ) . $$

We will sometimes abuse notation by denoting , where \(h\) is some Schwartz function. Also define \(\eta _{s}(x)=\eta _{\le s}-\eta _{\le s/2}\).

Fix \(N_{0}< N-1\) which will be specified in §6.2.

Definition 7

For \(N_{0}\le k\le N-1\), let

In the following definition, \(A\gg 1\) is the same constant that goes into the pruning definition of \(f^{k}\).

Definition 8

Define the high set by

$$ H=\{x\in U_{\alpha ,\beta }: A \beta \le g_{N-1}(x)\}. $$

For each \(k=N_{0},\ldots ,N-2\), let \(H=\Omega _{N-1}\) and let

$$ \Omega _{k}=\{x\in U_{\alpha ,\beta }\setminus \cup _{l=k+1}^{N-1}\Omega _{l}: A^{N-k}\beta \le g_{k}(x) \}. $$

Define the low set to be

$$ L=U_{\alpha ,\beta }\setminus [\cup _{k=N_{0}}^{N-1}\Omega _{k}]. $$

Lemma 2.4

Low lemma

There is an absolute constant \(D>0\) so that for each \(x\), \(|g_{k}^{\ell}(x)|\le D g_{k+1}(x)\).

Proof

We perform a pointwise version of the argument in the proof of local/global \(L^{2}\)-orthogonality (Lemma 2.3). For each \(f_{\tau _{k}}^{k+1}\), by Plancherel’s theorem,

The integrand is supported in \((2\tau _{k+1}-2\tau _{k+1}')\cap B_{R_{k+1}^{-1/3}}\). This means that the integral vanishes unless \(\tau _{k+1}\) is within \(\sim R_{k+1}^{-1/3}\) of \(\tau _{k+1}'\), in which case we write \(\tau _{k+1}\sim \tau _{k+1}'\). Then

$$\begin{aligned} &\sum _{\tau _{k+1},\tau _{k+1}'\subset \tau _{k}}\int _{\mathbb{R}^{2}}e^{-2 \pi i x\cdot \xi}\widehat{f}_{\tau _{k+1}}^{k+1}* \widehat{\overline{f}_{\tau _{k+1}'}^{k+1}}(\xi )\eta _{< R_{k+1}^{-1/3}}( \xi )d\xi\\ &\quad =\sum _{ \substack{\tau _{k+1},\tau _{k+1}'\subset \tau _{k}\\ \tau _{k+1}\sim \tau _{k+1}'}}\int _{\mathbb{R}^{2}}e^{-2\pi i x\cdot \xi} \widehat{f}_{\tau _{k+1}}^{k+1}* \widehat{\overline{f}_{\tau _{k+1}'}^{k+1}}(\xi )\eta _{< R_{k+1}^{-1/3}}( \xi )d\xi . \end{aligned}$$

Use Plancherel’s theorem again to get back to a convolution in \(x\) and conclude that

By the locally constant property (Lemma 2.2) and (1) of Lemma 2.1,

It remains to note that

since \(\tau _{k}^{*}\subset \tau _{k+1}^{*}\) and is an \(L^{1}\)-normalized function that is rapidly decaying away from \(B_{R_{k+1}^{1/3}}(0)\). □

Corollary 2.5

High-dominance on \(\Omega _{k}\)

For \(R\) large enough depending on \(\varepsilon \), \(g_{k}(x)\le 2|g_{k}^{h}(x)|\) for all \(x\in \Omega _{k}\).

Proof

This follows directly from Lemma 2.4. Indeed, since \(g_{k}(x)=g_{k}^{\ell}(x)+g_{k}^{h}(x)\), the inequality \(g_{k}(x)>2|g_{k}^{h}(x)|\) implies that \(g_{k}(x)<2|g_{k}^{\ell}(x)|\). Then by Lemma 2.4, \(|g_{k}(x)|<2D g_{k+1}(x)\). Since \(x\in \Omega _{k}\), \(g_{k+1}(x)\le A^{N-k-1}\beta \), which altogether gives the upper bound

$$ g_{k}(x)\le 2D A^{N-k-1}\beta . $$

The contradicts the property that on \(\Omega _{k}\), \(A^{N-k}\beta \le g_{k}(x)\), for \(A\) sufficiently larger than \(D\), which finishes the proof. □

Lemma 2.6

Pruning lemma

For any \(s\ge R^{-\varepsilon /3}\) and \(\tau \in{\mathbf{{S}}}(s)\),

$$\begin{aligned} |\sum _{\tau _{k}\subset \tau}f_{\tau _{k}}-\sum _{\tau _{k}\subset \tau}f_{\tau _{k}}^{k+1}(x)|&\le \frac{\alpha }{A^{1/2}K^{3}}\\ & \qquad \textit{for all $x\in \Omega _{k}$}, \qquad N_{0}\le k\le N-1, \\ \textit{and}\qquad |\sum _{\tau _{N_{0}}\subset \tau}f_{\tau _{N_{0}}}- \sum _{\tau _{N_{0}}\subset \tau}f_{\tau _{N_{0}}}^{N_{0}}(x)|&\le \frac{\alpha }{A^{1/2}K^{3}}\qquad \textit{ for all $x\in L$}. \end{aligned}$$

Proof

Begin by proving the first claim about \(\Omega _{k}\). By the definition of the pruning process, we have

$$ f_{\tau}=f^{N-1}_{\tau}+(f_{\tau}^{N}-f^{N-1}_{\tau})=\cdots =f^{k+1}_{ \tau}(x)+\sum _{m=k+1}^{N-1}(f^{m+1}_{\tau}-f^{m}_{\tau})$$
(13)

where formally, the subscript \(\tau \) means \(f_{\tau}=\sum _{\theta \subset \tau}f_{\theta}\) and \(f_{\tau}^{m}=\sum _{\tau _{m}\subset \tau}f_{\tau _{m}}^{m}\). We will show that each difference in the sum is much smaller than \(\alpha \). For each \(N-1\ge m\ge k+1\) and \(\tau _{m}\),

$$\begin{aligned} &|f_{\tau _{m}}^{m}(x)-f_{\tau _{m}}^{m+1}(x)|\\ &\quad=|\sum _{T_{\tau _{m}} \in \mathbb{T}_{\tau _{m}}^{b}}\psi _{T_{\tau _{m}}}(x)f_{\tau _{m}}^{m+1}(x)| = \sum _{T_{\tau _{m}}\in T_{\tau _{m}}^{b}} |\psi _{T_{\tau _{m}}}^{1/2}(x)f_{ \tau _{m}}^{m+1}(x)|\psi _{T_{\tau _{m}}}^{1/2}(x) \\ &\quad \le \sum _{T_{\tau _{m}}\in \mathbb{T}_{\tau _{m}}^{b}} K^{-3}A^{-(N-m+1)} \frac{\alpha }{\beta } \| \psi _{T_{\tau _{m}}}^{1/2}f_{{\tau _{m}}}^{m+1} \|_{L^{ \infty}(\mathbb{R}^{3})}^{2} \psi _{T_{\tau _{m}}}^{1/2}(x) \\ &\quad \lesssim K^{-3}A^{-(N-m+1)}\frac{\alpha }{\beta }\sum _{T_{\tau _{m}}\in \mathbb{T}_{ \tau _{m}}^{b}} \sum _{\tilde{T}_{{\tau _{m}}}\in \mathbb{T}_{\tau _{m}}} \| \psi _{T_{\tau _{m}}}|f_{{\tau _{m}}}^{m+1}|^{2} \|_{L^{\infty}( \tilde{T}_{{\tau _{m}}})} \psi _{T_{\tau _{m}}}^{1/2}(x) \\ &\quad \lesssim K^{-3}A^{-(N-m+1)}\frac{\alpha }{\beta } \sum _{T_{\tau _{m}}, \tilde{T}_{\tau _{m}}\in \mathbb{T}_{\tau _{m}}} \| \psi _{T_{\tau _{m}}}\|_{L^{ \infty}(\tilde{T}_{\tau _{m}})}\||f_{{\tau _{m}}}^{m+1} |^{2}\|_{{L}^{ \infty}(\tilde{T}_{{\tau _{m}}})} \psi _{T_{\tau _{m}}}^{1/2}(x) . \end{aligned}$$

Let \(c_{\tilde{T}_{\tau _{m}}}\) denote the center of \(\tilde{T}_{\tau _{m}}\) and note the pointwise inequality

$$ \sum _{{T}_{\tau _{m}}}\|\psi _{T_{\tau _{m}}}\|_{L^{\infty}(\tilde{T}_{ \tau _{m}})}\psi _{T_{\tau _{m}}}^{1/2}(x)\lesssim |\tau _{m}^{*}|\omega _{ \tau _{m}}(x-c_{\tilde{T}_{\tau _{m}}}) , $$

which means that

$$\begin{aligned} &|f_{\tau _{m}}^{m}(x)-f_{\tau _{m}}^{m+1}(x)| \\ &\quad \lesssim K^{-3}A^{-(N-m+1)} \frac{\alpha }{\beta } |\tau _{m}^{*}|\sum _{\tilde{T}_{\tau _{m}}\in \mathbb{T}_{ \tau _{m}}} \omega _{\tau _{m}}(x-c_{\tilde{T}_{\tau _{m}}})\||f_{{\tau _{m}}}^{m+1} |^{2}\|_{{L}^{\infty}(\tilde{T}_{{\tau _{m}}})} \\ &\quad\lesssim K^{-3}A^{-(N-m+1)}\frac{\alpha }{\beta }|\tau _{m}^{*}| \sum _{ \tilde{T}_{\tau _{m}}\in \mathbb{T}_{\tau _{m}}} \omega _{\tau _{m}}(x-c_{ \tilde{T}_{\tau _{m}}})|f_{{\tau _{m}}}^{m+1} |^{2}*\omega _{\tau _{m}}(c_{ \tilde{T}_{\tau _{m}}}) \\ &\quad\lesssim K^{-3}A^{-(N-m+1)}\frac{\alpha }{\beta } |f_{{\tau _{m}}}^{m+1} |^{2}* \omega _{\tau _{m}}(x) \end{aligned}$$

where we used the locally constant property in the second to last inequality. The last inequality is justified by the fact that \(\omega _{\tau _{m}}(x-c_{\tilde{T}_{\tau _{m}}})\sim \omega _{\tau _{m}}(x-y)\) for any \(y\in \tilde{T}_{\tau _{m}}\), and we have the pointwise relation \(\omega _{\tau _{m}}*\omega _{\tau _{m}}\lesssim \omega _{\tau _{m}}\). Then

$$\begin{aligned} &|\sum _{\tau _{m}\subset \tau}(f_{\tau _{m}}^{m}(x)-f_{\tau _{m}}^{m+1}(x))| \\ &\quad\lesssim K^{-3}A^{-(N-m+1)}\frac{\alpha }{\beta }\sum _{\tau _{m}\subset \tau}|f_{ \tau _{m}}^{m+1}|^{2}*\omega _{\tau _{m}}(x)\sim K^{-3}A^{-(N-m+1)} \frac{\alpha }{\beta }g_{m}(x). \end{aligned}$$

At this point, choose \(A\) sufficiently large determined by the proof of Corollary 2.5 and so that if \(g_{m}(x)\le A^{N-m}\beta \), then the above inequality implies that

$$ |\sum _{\tau _{m}\subset \tau}(f_{\tau _{m}}^{m}(x)-f_{\tau _{m}}^{m+1}(x))| \le \varepsilon K^{-3}A^{-1/2}\alpha . $$

This finishes the proof since the number of terms in (13) is bounded by \(N\le \varepsilon ^{-1}\). The argument for the pruning on \(L\) is analogous. □

3 Geometry for the cone and the moment curve

We have seen in Corollary 2.5 that on \(\Omega _{k}\), \(g_{k}\) is high-dominated. We will now describe how to use the wave envelope estimate for the cone (Theorem 1.3 from [GWZ20]) to control the high part of \(g_{k}\).

Begin by describing the cone set-up using the rotated coordinate system from §5 of [GWZ20]. Define the truncated cone by \(\Gamma = \{r(1,\omega ,\frac{1}{2}\omega ^{2}):\frac{1}{2}\le r\le 1,\quad | \omega |\le 1\}\). We consider the neighborhood

$$ \Gamma (S^{2})=\{(\nu _{1},\nu _{2},\nu _{3}): \frac{1}{2}\le \nu _{1} \le 1,\,\, |\nu _{3}-\frac{1}{2}\frac{\nu _{2}^{2}}{\nu _{1}}|\le S^{-2} \} $$

where \(S\ge 1\) is dyadic. For \(|\omega |\le 1\), we will use the (not unit-normalized) frame

$$ \textstyle\begin{cases} {\mathbf{{c}}}(\omega )&=(1,\omega ,\frac{1}{2}\omega ^{2}) \\ {\mathbf{{b}}}(\omega )&=(-\omega ,1-\frac{1}{2}\omega ^{2},\omega ) \\ {\mathbf{{t}}}(\omega )&=(\frac{1}{2}\omega ^{2},-\omega ,1) \end{cases}\displaystyle . $$

Define a cone plank \(\tau \) of dimension \(1\times S^{-1}\times S^{-2}\) and centered at \((1,\omega ,\frac{1}{2}\omega ^{2})\in \Gamma \) by

$$ \tau =\{A{\mathbf{{c}}}(\omega )+B{\mathbf{{b}}}(\omega )+C{\mathbf{{t}}}(\omega ):\quad \frac{1}{2}\le A\le 1,\quad |B|\le S^{-1},\quad |C|\le S^{-2} \}. $$
(14)

Let \({\mathbf{{S}}}_{S^{-1}}\) denote a collection of \(1\times S^{-1}\times S^{-2}\) conical blocks \(\tau \) which approximately partition \(\Gamma (S^{2})\). The definitions and notation we use are compatible with those in §3 of [GWZ20].

Recall that our goal in this section is to bound the high part of \(g_{k}\). The summands of \(g_{k}\) have Fourier support in \(2\tau _{k}-2\tau _{k}\), where \(\tau _{k}\in{\mathbf{{S}}}(R_{k}^{-1/3})\) (noting that \({\mathbf{{S}}}(R_{k}^{-1/3})\) refers to moment curve blocks defined at the beginning of §2). Removing the low part cuts away a ball of radius \(R_{k+1}^{-1/3}\). In Proposition 3.1, we will essentially show that \((2\tau _{k}-2\tau _{k})\setminus B_{R_{k+1}^{-1/3}}\) may be identified with a conical plank \(\tau \in{\mathbf{{S}}}_{R_{k}^{-1/3}}\).

Let \(S\ge 1\) be a dyadic parameter that will be chosen to be sufficiently large in Proposition 3.1. Suppose that \(\tau \in{\mathbf{{S}}}(S^{-1})\) is the \(l\)th piece, meaning that

$$ \tau =\{(\xi _{1},\xi _{2},\xi _{3}): lS^{-1}\le \xi _{1}< (l+1)S^{-1}, \,|\xi _{2}-\xi _{1}^{2}|\le S^{-2},\,|\xi _{3}-3\xi _{1}\xi _{2}+2 \xi _{1}^{3}|\le S^{-3} \} $$

where \(l\in \{0,\ldots ,S-1\}\). Using the Frenet frame description from (10), the set \((10\tau -10\tau )\setminus B_{(4S)^{-1}}(0)\) is contained in

$$\begin{aligned} \begin{aligned} &\tilde{\tau}:=\{A{\mathbf{{T}}}(lS^{-1})+B{\mathbf{{N}}}(lS^{-1})+C{\mathbf{{B}}}(lS^{-1}):A \sim S^{-1},\\ & |B|\lesssim S^{-2},\quad |C|\lesssim S^{-3}\}. \end{aligned} \end{aligned}$$
(15)

Define the linear transformation \(T:\mathbb{R}^{3}\to \mathbb{R}^{3}\) by

$$ T(x,y,z):=(x,\frac{y}{2},\frac{z}{6}). $$
(16)

Proposition 3.1

Let \(S\in 2^{\mathbb{N}}\) be larger than some absolute constant. After dilating by \(S\), the sets \(T[(10\tau -10\tau )\setminus B_{(4S)^{-1}}(0)]\) with \(\tau \in{\mathbf{{S}}}(S^{-1})\), are comparable to the cone planks from \({\mathbf{{S}}}_{S^{-1}}\).

Proof

The image of the set (15) under \(T\) is

$$\begin{aligned} &T(\tilde{\tau})=\{A(1,lS^{-1},\frac{1}{2}(lS^{-1})^{2})+BT({\mathbf{{N}}}(lS^{-1}))+CT({ \mathbf{{B}}}(lS^{-1}): A\sim S^{-1},\\ &\quad |B|\lesssim S^{-2},\quad |C| \lesssim S^{-3}\}. \end{aligned}$$

Define \(\omega =lS^{-1}\). Since \(T({\mathbf{{T}}}(lS^{-1}))={\mathbf{{c}}}(lS^{-1})\) and \(T({\mathbf{{N}}}(lS^{-1}))\cdot{\mathbf{{t}}}(lS^{-1})=0\), it is easy to see that the set \(T(\tilde{\tau})\) is comparable to the \(S^{-1}\) dilation of (14). □

Next, we define moment curve wave envelopes, which are roughly the smallest convex sets containing wave packets from neighboring moment curve blocks.

Definition 9

Let \(1\le S^{3}\le R \). For \(\tau \in{\mathbf{{S}}}(S^{-1})\) which is the \(\ell \)th block, define the moment curve wave envelope \(V_{\tau ,R}\) to be

$$\begin{aligned} V_{\tau ,R}={}&\{A{\mathbf{{T}}}(lS^{-1})+B{\mathbf{{N}}}(lS^{-1})+C{\mathbf{{B}}}(lS^{-1}): |A|\le S^{-2}R,\\ &\quad |B|\le S^{-1}R,\quad |C|\le R \} . \end{aligned}$$

We will compare these with wave envelopes for the cone defined in §3 from [GWZ20]. Again let \(1\le S^{3}\le R\). Using our notation, if \(\tau '\in{\mathbf{{S}}}_{S^{-1}}\) is given by (14), then the cone wave envelope \(U_{\tau ',R^{2/3}}\) is defined by

$$\begin{aligned} \begin{aligned} U_{\tau ',R^{2/3}}={}&\{A{\mathbf{{c}}}(\omega )+B{\mathbf{{b}}}(\omega )+C{\mathbf{{t}}}(\omega ): |A| \le S^{-2}R^{2/3},\\ & |B|\le S^{-1}R^{2/3},\quad |C|\le R^{2/3} \}. \end{aligned} \end{aligned}$$

After applying a linear transformation, the \(R^{-1/3}\)-dilation of the moment curve wave envelope \(V_{\tau ,R}\) is comparable to a cone wave envelope \(U_{\tau ',R^{2/3}}\).

Proposition 3.2

Let \(T\) be the linear transformation (16) and let \(S\) be a sufficiently large dyadic number with \(1\le S^{3}\le R\). Then for each \(\tau \in{\mathbf{{S}}}(S^{-1})\), there is a \(\tau '\in{\mathbf{{S}}}_{S^{-1}}\) for which \(R^{-1/3}\cdot [(T^{t})^{-1}V_{\tau ,R}]\) is comparable to \(U_{\tau ',R^{2/3}}\).

Proof

Note that \((T^{t})^{-1}(x,y,z)=(x,2y,6z)\). The \(R^{-1/3}\)-dilation of the image of the wave envelope \(V_{\tau ,R}\) under \((T^{t})^{-1}\) is

$$\begin{aligned} &R^{-1/3}\cdot [(T^{t})^{-1}V_{\tau ,R}]\\ &\quad=\{A(1,4lS^{-1},18(lS^{-1})^{2})+B(-2lS^{-1}-9(lS^{-1})^{3},2-18(lS^{-1})^{4},\\ &\qquad 18lS^{-1}+36(lS^{-1})^{3}) \\ &\qquad +C(3(lS^{-1})^{2},-6lS^{-1}, 6): |A|\lesssim S^{-2}R^{2/3},\quad |B| \lesssim S^{-1}R^{2/3},\quad |C|\lesssim R^{2/3}\}. \end{aligned}$$

Applying a Gram-Schmidt process to the vectors, we see that the above set is comparable to

$$\begin{aligned} \{A(1,lS^{-1},&\frac{1}{2}(lS^{-1})^{2})+B(-lS^{-1},1-\frac{1}{2}(lS^{-1})^{2},lS^{-1}) \\ &+C(\frac{1}{2}(lS^{-1})^{2},-lS^{-1}, 1): |A|\lesssim S^{-2}R^{2/3},\\ &\quad |B|\lesssim S^{-1}R^{2/3},\quad |C|\lesssim R^{2/3}\}. \end{aligned}$$

Define \(\omega =lS^{-1}\). Then it is clear that the above set is comparable to \(U_{\tau ',R^{2/3}}\) where \(\tau '\in{\mathbf{{S}}}_{S^{-1}}\) is centered at \((1,\omega ,\frac{1}{2}\omega ^{2})\), as require by the proposition. □

3.1 High-frequency analysis

Now that we have identified moment curve blocks with cone planks and moment curve wave envelopes with cone wave envelopes, we are prepared to use Theorem 1.3 from [GWZ20] to control the high part of square functions. Recall the theorem statement.

Theorem 3.3

Theorem 1.3 from [GWZ20]

For each \(\delta >0\), there exists \(B_{\delta }\in (0,\infty )\) so that

$$ \int _{\mathbb{R}^{3}}|f|^{4}\le B_{\delta }R^{\delta }\sum _{R^{-1/2}\le \sigma \le 1} \sum _{\tau \in{\mathbf{{S}}}_{\sigma ^{-1}R^{-1/2}}}\sum _{U\|U_{\tau ,R}}|U|^{-1} \Big(\int _{U}\sum _{ \substack{\theta \subset \tau \\\theta \in{\mathbf{{S}}}_{R^{-1/2}}}}|f_{ \theta}|^{2}\Big)^{2} $$

for any Schwartz function \(f:\mathbb{R}^{3}\to \mathbb{C}\) with Fourier transform supported in \(\Gamma (R)\).

The initial sum on the right hand side is over dyadic \(\sigma \), \(R^{-1/2}\le \sigma \le 1\). Whenever we sum over an interval \((a,b)\), we always mean the numbers in \(2^{\mathbb{Z}}\cap (a,b)\).

Lemma 3.4

High lemma

For each \(\delta >0\), there is \(B_{\delta }\in (0,\infty )\) so that the following holds. For any \(k\), there is some dyadic scale \(R_{k+1}^{-1/3}\le s\le 10 R_{k}^{-1/3}\) for which

$$ \int |g_{k}^{h}|^{4}\le B_{\delta }R^{\delta }(\log R)\sum _{s\le \sigma \le 1} \sum _{\tau \in{\mathbf{{S}}}(\sigma ^{-1}s)}\sum _{V\|V_{\tau ,s^{-3}}}|V|^{-1} \Big(\int _{V}\sum _{ \substack{\tau _{s}\subset \tau \\\tau _{s}\in{\mathbf{{S}}}(s)}}|f_{\tau _{s}}^{k+1}|^{4} \Big)^{2} . $$

Proof

First describe the Fourier support of \(g_{k}^{h}\). By (3) of Lemma 2.1, the support of \(\widehat{|f_{\tau _{k}}^{k+1}|^{2}}\) is \(2(\tau _{k}-\tau _{k})\). The high-frequency cutoff removes a ball of radius \(R_{k+1}^{-1/3}\), so \(g_{k}^{h}\) is Fourier supported within the annulus \(R_{k+1}^{-1/3}\le |\xi |\le 10 R_{k}^{-1/3}\). By dyadic pigeonholing, there is some dyadic \(s\in [R_{k+1}^{-1/3},R_{k}^{-1/3}]\) for which

where \(\eta _{s}:\mathbb{R}^{3}\to [0,\infty )\) is a smooth function supported in the annulus \(s/4\le |\xi |\le s\) (in the case that \(s=R_{k}^{-1/3}\), let \(\eta _{s}\) be supported on \(s\le |\xi |\le 20R_{k}^{-1/3}\)). In the proof of Lemma 2.4, we showed the pointwise equality

(17)

where \(\tau _{s}\in{\mathbf{{S}}}(s)\) and \(\tau _{s}'\sim \tau _{s}\) means that \(\tau _{s}'\in{\mathbf{{S}}}(s)\) and \(\text{dist}(2\tau _{s},2\tau _{s}')\le 2s\). For each \(\tau _{s}\), the sub-sum on the right hand side has Fourier transform supported in \(10(\tau _{2s}-\tau _{2s})\setminus B_{s}(0)\) where \(\tau _{2s}\in{\mathbf{{S}}}(2s)\) contains \(\tau _{s}\). Now write

(18)

Perform the change of variables \(x\mapsto T^{t}x\) to get

By Proposition 3.1, we may view the sub-sums corresponding to each \(\tau _{k}\) on the right hand side of (17) as having Fourier support which is part of a tiling of the cone, after applying \(T\) and dilating by a factor of \(s^{-1}\). Therefore, we may apply the wave envelope estimate Theorem 3.3, dilated by a factor of \(s^{-1}\), to obtain

where \(\tau _{2s}``\subset "\tau '\) means that \((2s)^{-1}\cdot [ T(10\tau _{2s}-10\tau _{2s})\setminus B_{s}(0)] \subset \tau '\). It remains to undo the initial steps which allowed us to apply the wave envelope estimate for the cone. First multiply both sides of the above inequality by \(|\det T|^{-3}\). Then do the change of variables \(x\mapsto (T^{-1})^{t}x\) to obtain

where by Proposition 3.2, we identify \(\tau \in{\mathbf{{S}}}(\sigma ^{-1}s)\) with \(\tau '\in{\mathbf{{S}}}_{\sigma ^{-1} s}\) via \(s^{-1}T[(10\tau -10\tau )\setminus B_{\sigma ^{-1}s/4}(0)]\sim \tau '\) and identify \(s^{-1} T^{t}U\) with a \(V\|V_{\tau ,s^{-3}}\). Next, by Cauchy-Schwartz, since the number of \(\tau _{s}\subset \tau _{2s}\) and the number of \(\tau _{s}'\) satisfying \(\tau _{s}'\sim \tau _{s}\) is \(O(1)\), it suffices to replace the above integrals by

$$ |V|^{-1}\Big(\int _{V}\sum _{\tau _{s}\subset \tau}||{f_{\tau _{s}}^{k+1}}|^{2}* \rho (x)|^{2}dx\Big)^{2} $$

where . Note that \(\|\rho \|_{1}\sim 1\). Again, by Cauchy-Schwarz, the above integral is bounded by

$$ |V|^{-1}\Big(\int _{V}\sum _{\tau _{s}\subset \tau}|{f_{\tau _{s}}^{k+1}}|^{4}* \rho (x)dx\Big)^{2}. $$

Write \(\chi _{V_{0}}\) for the characteristic function of \(2V_{\tau ,s^{-3}}\). After summing the above integral over \(V\|V_{\tau ,s^{-3}}\), we have the bound

$$ \sum _{V\|V_{\tau ,s^{-3}}}|V|^{-1}\Big(\int _{V}\sum _{\tau _{s} \subset \tau}|{f_{\tau _{s}}^{k+1}}|^{4}*\rho (x)dx\Big)^{2}\lesssim |V|^{-2} \int \Big(\sum _{\tau _{s}\subset \tau}|{f_{\tau _{s}}^{k+1}}|^{4}* \rho *\chi _{V_{0}}\Big)^{2}. $$

By Cauchy-Schwarz and Young’s convolution inequality, the right hand side is bounded by

$$ |V|^{-2}\int \Big(\sum _{\tau _{s}\subset \tau}|{f_{\tau _{s}}^{k+1}}|^{4}* \chi _{V_{0}}\Big)^{2}\lesssim \sum _{V\|V_{\tau ,s^{-3}}}|V|^{-1} \Big(\int _{V}\sum _{\tau _{s}\subset \tau}|{f_{\tau _{s}}^{k+1}}|^{4} \Big)^{2}. $$

 □

4 Key iterations that unwind the pruning process

4.1 An \(L^{7/2}\) square function estimate for the parabola

Let \(\mathbb{P}^{1}=\{(t,t^{2}):0\le t\le 1\}\) and for \(r\ge 1\), let \(\mathcal {N}_{r^{-1}}(\mathbb{P}^{1})\) denote the \(r^{-1}\)-neighborhood of \(\mathbb{P}^{1}\) in \(\mathbb{R}^{2}\). Define the collection of canonical \(\sim r^{-1/2}\times r^{-1}\) parabola blocks as follows. Let \(s\in 2^{\mathbb{Z}}\) be the smallest number satisfying \(r^{-1}\le s^{2}\). Then write \(\mathcal {N}_{s^{2}}(\mathbb{P}^{1})\) as

$$ \bigsqcup _{1\le l\le s^{-1}-2} \{(\xi _{1},\xi _{2})\in \mathcal {N}_{s^{2}}( \mathbb{P}^{1}):ls\le \xi _{1}< (l+1)s \} $$

and the two end pieces

$$ \{(\xi _{1},\xi _{2})\in \mathcal {N}_{s^{2}}(\mathbb{P}^{1}):\xi _{1}< s\} \sqcup \{(\xi _{1},\xi _{2})\in \mathcal {N}_{s^{2}}(\mathbb{P}^{1}):1-s\le \xi _{1}\} $$

We use the notation \(\ell (\tau )=r^{-1/2}\) in two ways: (1) to describe \(\tau \) as one of the blocks from the above partition and (2) to index the set of \(\tau \) from the above partition.

Theorem 4.1

Cylindrical \(L^{7/2}\) square function estimate over \(\mathbb{P}^{1}\)

Let \(\mathbb{P}^{1}=\{(t,t^{2}):0\le t\le 1\}\) and for \(r\ge 1\), let \(\mathcal {N}_{r^{-1}}(\mathbb{P}^{1})\) denote the \(r^{-1}\)-neighborhood of \(\mathbb{P}^{1}\) in \(\mathbb{R}^{2}\). If \(h:\mathbb{R}^{3}\to \mathbb{C}\) is a Schwartz function with Fourier transform supported in \(\mathcal {N}_{r^{-1}}(\mathbb{P}^{1})\times \mathbb{R}\), then

$$ \int _{\mathbb{R}^{3}}|h|^{7/2}\lesssim _{\varepsilon }r^{\varepsilon }\int _{\mathbb{R}^{3}}(\sum _{ \zeta }|h_{\zeta }|^{2})^{7/4} $$

where the \(\zeta \) are products of approximate rectangles \(\theta \), \(\ell (\theta )={r^{-1/2}}\), with ℝ.

The local version of Theorem 4.1 is

Corollary 4.2

Let \(B_{r}\) be an \(r\)-ball in \(\mathbb{R}^{3}\). If \(h:\mathbb{R}^{3}\to \mathbb{C}\) is a Schwartz function with Fourier transform supported in \(\mathcal {N}_{r^{-1}}(\mathbb{P}^{1})\times \mathbb{R}\), then

$$ \int _{\mathbb{R}^{3}}|h|^{7/2}W_{B_{r}}\lesssim _{\varepsilon }r^{\varepsilon }\int _{\mathbb{R}^{3}}( \sum _{\zeta }|h_{\zeta }|^{2})^{7/4}W_{B_{r}} $$

where the \(\zeta \) are products of approximate rectangles \(\theta \), \(\ell (\theta )={r^{-1/2}}\), with ℝ.

We delay the proofs of Theorem 4.1 and Corollary 4.2 to §A.1.1 in Appendix A.

Lemma 4.3

Let \(R_{k-1}\le r\le R_{k}\). For each \(r^{-\frac{1}{3}}\le \sigma \le 1\), \(\tau \in{\mathbf{{S}}}({\sigma ^{-1}r^{-\frac{1}{3}}})\), and \(V\|V_{\tau ,r}\), we have

V τ τ τ S ( r 1 3 ) | f τ k | 7 / 2 E ε 7 R ε 7 | V | 1 τ τ τ S ( r 1 3 ) | τ τ τ S ( σ 1 2 r 1 3 ) | f τ m | 2 | 7 / 4 W V

where \(m\ge k\) satisfies \(R_{m}^{-\frac{1}{3}}\le \sigma ^{\frac{1}{2}}r^{-\frac{1}{3}}\le R_{m-1}^{- \frac{1}{3}}\).

The weight function \(W_{V}\) is defined by \(w(T^{-1}x)\), where \(T\) is an affine transformation mapping the unit cube to \(V\) and \(w(\cdot )\) is the function from Definition 5.

Proof of Lemma 4.3

This would be a straightforward consequence of Corollary 4.2 after a moment curve rescaling, except for the fact that the pruning process alters the Fourier support of \(f\). Let \(s=\min (r^{-\frac{1}{3}},R_{m-1}^{-\frac{1}{3}})\). First we will show that

V τ τ τ S ( r 1 3 ) | f τ k | 7 / 2 ε R ε 8 | V | 1 τ τ τ S ( r 1 3 ) | τ τ τ S ( s ) | f τ m | 2 | 7 / 4 W V ,
(19)

for \(R_{m}^{-\frac{1}{3}}\le \sigma ^{\frac{1}{2}}r^{-\frac{1}{3}}\le R_{m-1}^{- \frac{1}{3}}\). If \(r^{-\frac{1}{3}}\le R_{m-1}^{-\frac{1}{3}}\), then \(|f_{\tau '}^{k}|\le |f_{\tau '}^{m}|\) and (19) is trivially true, so assume that \(R_{m-1}^{-\frac{1}{3}}< r^{-\frac{1}{3}}\). This assumption also implies that \(R_{m-1}^{-\frac{1}{3}}\le R_{k}^{-\frac{1}{3}}\), so \(k< m\). We will perform an “unwinding the pruning” process using successive applications of Corollary 4.2.

Begin by performing a moment curve rescaling. Suppose that \(\tau \in{\mathbf{{S}}}(\sigma ^{-1}r^{-\frac{1}{3}})\) is the \(l\)th piece, meaning

$$\begin{aligned} &\{(\xi _{1},\xi _{2},\xi _{3}): l(\sigma ^{-1}r^{-\frac{1}{3}})\le \xi _{1}< (l+1)(\sigma ^{-1}r^{-\frac{1}{3}}),\,|\xi _{2}-\xi _{1}^{2}| \le (\sigma ^{-1}r^{-\frac{1}{3}})^{2},\,\\ &\quad |\xi _{3}-3\xi _{1}\xi _{2}+2 \xi _{1}^{3}|\le (\sigma ^{-1}r^{-\frac{1}{3}})^{3} \} \end{aligned}$$

where \(0\le l\le \sigma r^{\frac{1}{3}}-1\). Define the affine transformation \(T:\mathbb{R}^{3}\to \mathbb{R}^{3}\) by \(T(\xi _{1},\xi _{2}, \xi _{3})=(\xi _{1}',\xi _{2}',\xi _{3}')\) where

$$ \textstyle\begin{cases} \xi _{1}'&= \frac{\xi _{1}-l(\sigma ^{-1}r^{-\frac{1}{3}})}{(\sigma ^{-1}r^{-\frac{1}{3}})} \\ \xi _{2}'&= \frac{\xi _{2}-2(l\sigma ^{-1}r^{-\frac{1}{3}})\xi _{1}+(l\sigma ^{-1}r^{-\frac{1}{3}})^{2}}{(\sigma ^{-1}r^{-\frac{1}{3}})^{2}} \\ \xi _{3}'&= \frac{\xi _{3}-3(l\sigma ^{-1}r^{-\frac{1}{3}})\xi _{2}+3(l\sigma ^{-1}r^{-\frac{1}{3}})^{2}\xi _{1}-(l\sigma ^{-1}r^{-\frac{1}{3}})^{3}}{(\sigma ^{-1}r^{-\frac{1}{3}})^{3}}. \end{cases} $$

It is not difficult to verify that for each \(k\le l\le m-1\) and \(\tau _{l}\in{\mathbf{{S}}}(R_{l}^{-\frac{1}{3}})\), \(T(\tau _{l})\) is a canonical moment curve block in \({\mathbf{{S}}}(\sigma r^{\frac{1}{3}}R_{l}^{-\frac{1}{3}})\). For each \(\tau ''\in{\mathbf{{S}}}(\sigma ^{\frac{1}{2}}r^{-\frac{1}{3}})\), \(T(\tau '')\) is a canonical moment curve block in \({\mathbf{{S}}}(\sigma ^{\frac{3}{2}})\).

Let \(T(\xi )=A\xi +b\) where \(A:\mathbb{R}^{3}\to \mathbb{R}^{3}\) is a linear transformation and \(\xi ,b\in \mathbb{R}^{3}\). Perform the change of variables \(x\mapsto A^{T}x\), obtaining

V τ τ τ S ( r 1 3 ) | f τ k | 7 / 2 = | ( A T ) 1 ( V ) | 1 τ τ τ S ( r 1 3 ) | f τ k A T | 7 / 2 W V A T .

Write \(f_{\tau '}^{k}\circ A^{T}=g_{T(\tau ')}^{k}\) where \(g_{T(\tau ')}^{k}\) is the Fourier projection of a Schwartz function onto \(2T(\tau ')\) (the dilation by 2 reflects the fact that \(f_{\tau '}^{k}\) is supported in \(2\tau '\)). In fact, since \(f_{\tau '}^{k}\) is Fourier supported in \(\cup _{\tau _{k}\subset \tau '}2\tau _{k}\), \(g_{T(\tau ')}^{k}\) is Fourier supported in \(\cup _{\tau _{k}\subset \tau '}2T(\tau _{k})\).

Next, consider what the set \((A^{T})^{-1}(V)\) is. The matrix representations of \(A\) and \((A^{T})^{-1}\) are

$$\begin{aligned} &A= \begin{bmatrix} \frac{1}{\sigma ^{-1}r^{-\frac{1}{3}}} & 0 & 0 \\ -\frac{2l}{\sigma ^{-1}r^{-\frac{1}{3}}} & \frac{1}{\sigma ^{-2}r^{-\frac{2}{3}}} & 0 \\ \frac{3l^{2}}{\sigma ^{-1}r^{-\frac{1}{3}}} & - \frac{3l}{\sigma ^{-2}r^{-\frac{2}{3}}} & \frac{1}{\sigma ^{-3}r^{-1}} \end{bmatrix} \qquad \text{and}\\ & (A^{T})^{-1}= \begin{bmatrix} \sigma ^{-1}r^{-\frac{1}{3}} & 2l(\sigma ^{-1}r^{-\frac{1}{3}})^{2} & 3l^{2}( \sigma ^{-1}r^{-\frac{1}{3}})^{3} \\ 0 & (\sigma ^{-1}r^{-\frac{1}{3}})^{2} & 3l(\sigma ^{-1}r^{- \frac{1}{3}})^{3} \\ 0 & 0 & (\sigma ^{-1}r^{-\frac{1}{3}})^{3} \end{bmatrix} . \end{aligned}$$

The set \(V\) is a translation of \(V_{\tau ,r}\) which is comparable to the set

$$\begin{aligned} &\{A{\mathbf{{T}}}(l\sigma ^{-1}r^{-\frac{1}{3}})+B{\mathbf{{N}}}(l\sigma ^{-1}r^{- \frac{1}{3}})+C{\mathbf{{B}}}(l\sigma ^{-1}r^{-\frac{1}{3}}):|A|\le \sigma ^{-2}r^{ \frac{1}{3}} ,\\ &\quad |B|\le \sigma ^{-1}r^{\frac{2}{3}},\quad |C|\le r \} \end{aligned}$$

where the vectors \({\mathbf{{T}}},{\mathbf{{N}}},{\mathbf{{B}}}\) are defined in (10). Using these explicit expressions, we see that \((A^{T})^{-1}(V)\) is comparable to a \(\sigma ^{-3}\)-ball in \(\mathbb{R}^{3}\).

For our first step in showing (19), we would like to apply Corollary 4.2 to estimate \(g_{T(\tau ')}^{k}\) by a square function in \(T(\tau _{k})\subset T(\tau ')\). Since \(T(\tau _{k})\) are moment curve blocks in \({\mathbf{{S}}}(\sigma r^{\frac{1}{3}}R_{k}^{-\frac{1}{3}})\), Corollary 4.2 may be employed on balls of radius \(\sigma ^{-2}r^{-\frac{2}{3}}R_{k}^{\frac{2}{3}}\). The assumptions that \(\sigma ^{\frac{1}{2}}r^{-\frac{1}{3}}\le R_{m-1}^{-\frac{1}{3}}\le R_{k}^{- \frac{1}{3}}\) mean that \(r^{-\frac{2}{3}}R_{k}^{\frac{2}{3}}\le \sigma ^{-1}\). Since \((A^{T})^{-1}(V)\) is comparable to a \(\sigma ^{-3}\)-ball which contains \(\sigma ^{-2}r^{-\frac{2}{3}}R_{k}^{\frac{2}{3}}\)-balls, Corollary 4.2 applies, yielding

$$\begin{aligned} &\int \sum _{ \substack{\tau '\subset \tau \\\tau '\in{\mathbf{{S}}}(r^{-\frac{1}{3}})}}|f_{ \tau '}^{k}\circ A^{T}|^{7/2} W_{(A^{T})^{-1}(V)}\\ &\quad \le C_{\varepsilon }R^{\varepsilon ^{9}} \int \sum _{ \substack{\tau '\subset \tau \\\tau '\in{\mathbf{{S}}}(r^{-\frac{1}{3}})}}| \sum _{\tau _{k}\subset \tau '}|f_{\tau _{k}}^{k}\circ A^{T}|^{2}|^{7/4}W_{(A^{T})^{-1}(V)}. \end{aligned}$$

Then by (1) of Lemma 2.1, \(|f_{\tau _{k}}^{k}|\le |f_{\tau _{k}}^{k+1}|\), so

$$\begin{aligned} &\int \sum _{ \substack{\tau '\subset \tau \\\tau '\in{\mathbf{{S}}}(r^{-\frac{1}{3}})}}|f_{ \tau '}^{k}\circ A^{T}|^{7/2}W_{(A^{T})^{-1}(V)}\\ &\quad \le C_{\varepsilon }R^{\varepsilon ^{9}} \int \sum _{ \substack{\tau '\subset \tau \\\tau '\in{\mathbf{{S}}}(r^{-\frac{1}{3}})}}| \sum _{\tau _{k}\subset \tau '}|f_{\tau _{k}}^{k+1}\circ A^{T}|^{2}|^{7/4}W_{(A^{T})^{-1}(V)}. \end{aligned}$$

If \(k+1=m\), then we are done showing (19). If \(k+1< m\), then by Khintchine’s inequality, we may select signs \(c_{\tau _{k}}\in \{\pm 1\}\) so that

$$\begin{aligned} &\int \sum _{ \substack{\tau '\subset \tau \\\tau '\in{\mathbf{{S}}}(r^{-\frac{1}{3}})}}| \sum _{\tau _{k}\subset \tau '}|f_{\tau _{k}}^{k+1}\circ A^{T}|^{2}|^{7/4}W_{(A^{T})^{-1}(V)} \\ &\quad \sim \int \sum _{ \substack{\tau '\subset \tau \\\tau '\in{\mathbf{{S}}}(r^{-\frac{1}{3}})}}| \sum _{\tau _{k}\subset \tau '}c_{\tau _{k}}f_{\tau _{k}}^{k+1}\circ A^{T}|^{7/2}W_{(A^{T})^{-1}(V)}. \end{aligned}$$

The function \(\sum _{\tau _{k}\subset \tau '}c_{\tau _{k}}f_{\tau _{k}}^{k+1} \circ A^{T}=g_{T(\tau ')}^{k+1}\) where \(g_{T(\tau ')}^{k+1}\) has Fourier support in \(\cup _{\tau _{k+1}\subset \tau '}2T(\tau _{k+1})\). Then we may apply Corollary 4.2 again to obtain

$$\begin{aligned} &\int \sum _{ \substack{\tau '\subset \tau \\\tau '\in{\mathbf{{S}}}(r^{-\frac{1}{3}})}}| \sum _{\tau _{k}\subset \tau '}c_{\tau _{k}}f_{\tau _{k}}^{k+1}\circ A^{T}|^{7/2}W_{(A^{T})^{-1}(V)} \\ &\quad\le C_{\varepsilon }R^{\varepsilon ^{9}}\int \sum _{ \substack{\tau '\subset \tau \\\tau '\in{\mathbf{{S}}}(r^{-\frac{1}{3}})}}| \sum _{\tau _{k+1}\subset \tau '}|f_{\tau _{k+1}}^{k+1}\circ A^{T}|^{2}|^{7/4}W_{(A^{T})^{-1}(V)}. \end{aligned}$$

Again, use \(|f_{\tau _{k+1}}^{k+1}|\le |f_{\tau _{k+2}}^{k+2}|\) and halt if \(m=k+2\) or find signs \(c_{\tau _{k+1}}\in \{\pm 1\}\) for which

$$\begin{aligned} &\int \sum _{ \substack{\tau '\subset \tau \\\tau '\in{\mathbf{{S}}}(r^{-\frac{1}{3}})}}| \sum _{\tau _{k+1}\subset \tau '}|f_{\tau _{k+1}}^{k+2}\circ A^{T}|^{2}|^{7/4}W_{(A^{T})^{-1}(V)} \\ &\quad\sim \int \sum _{ \substack{\tau '\subset \tau \\\tau '\in{\mathbf{{S}}}(r^{-\frac{1}{3}})}}| \sum _{\tau _{k+1}\subset \tau '}c_{\tau _{k+1}}f_{\tau _{k+1}}^{k+1} \circ A^{T}|^{7/2}W_{(A^{T})^{-1}(V)}. \end{aligned}$$

Iterating this process and undoing the change of variables results in (19). Note that since \(m-k\le \varepsilon ^{-1}\), the accumulated constant satisfies \((C_{\varepsilon }R^{\varepsilon ^{9}})^{\varepsilon ^{-1}}\le C_{\varepsilon }^{\varepsilon ^{-1}}R^{\varepsilon ^{8}}\). For the final step, we perform the same argument as above, using a change of variables and Khintchine’s inequality to write

$$\begin{aligned} &|V|^{-1}\int \sum _{ \substack{\tau '\subset \tau \\\tau '\in{\mathbf{{S}}}(r^{-\frac{1}{3}})}}| \sum _{\substack{\tau ''\subset \tau '\\\tau ''\in{\mathbf{{S}}}(s)}}|f_{ \tau ''}^{m}|^{2}|^{7/4} W_{V}\\ &\quad \sim |{(A^{T})^{-1}(V)}|^{-1}\int \sum _{ \substack{\tau '\subset \tau \\\tau '\in{\mathbf{{S}}}(r^{-\frac{1}{3}})}}| \sum _{\substack{\tau ''\subset \tau '\\\tau ''\in{\mathbf{{S}}}(s)}}c_{ \tau ''}f_{\tau ''}^{m}\circ A^{T}|^{7/2}W_{(A^{T})^{-1}(V)}. \end{aligned}$$

Since for \(\tau ''\in{\mathbf{{S}}}(\sigma ^{\frac{1}{2}}r^{-\frac{1}{3}})\) with \(\tau ''\subset \tau \) we have \(T(\tau '')\in{\mathbf{{S}}}(\sigma ^{\frac{3}{2}})\), we may apply Corollary 4.2 one last time to get

$$\begin{aligned} &\int \sum _{ \substack{\tau '\subset \tau \\\tau '\in{\mathbf{{S}}}(r^{-\frac{1}{3}})}}| \sum _{\substack{\tau ''\subset \tau '\\\tau ''\in{\mathbf{{S}}}(s)}}c_{ \tau ''}f_{\tau ''}^{m}\circ A^{T}|^{7/2}W_{(A^{T})^{-1}(V)}\\ &\quad \le C_{ \varepsilon }R^{\varepsilon ^{8}} \int \sum _{ \substack{\tau '\subset \tau \\\tau '\in{\mathbf{{S}}}(r^{-\frac{1}{3}})}}| \sum _{ \substack{\tau ''\subset \tau '\\\tau ''\in{\mathbf{{S}}}(\sigma ^{\frac{1}{2}}r^{-\frac{1}{3}})}}|f_{ \tau ''}^{m}\circ A^{T}|^{2}|^{7/4}W_{(A^{T})^{-1}(V)}. \end{aligned}$$

Finally, undo the change of variables to get

$$\begin{aligned} &|{(A^{T})^{-1}(V)}|^{-1}\int \sum _{ \substack{\tau '\subset \tau \\\tau '\in{\mathbf{{S}}}(r^{-\frac{1}{3}})}}| \sum _{ \substack{\tau ''\subset \tau '\\\tau ''\in{\mathbf{{S}}}(\sigma ^{\frac{1}{2}}r^{-\frac{1}{3}})}}|f_{ \tau ''}^{m}\circ A^{T}|^{2}|^{7/4}W_{(A^{T})^{-1}(V)}\\ &\quad =|V|^{-1}\int \sum _{ \substack{\tau '\subset \tau \\\tau '\in{\mathbf{{S}}}(r^{-\frac{1}{3}})}}| \sum _{ \substack{\tau ''\subset \tau '\\\tau ''\in{\mathbf{{S}}}(\sigma ^{\frac{1}{2}}r^{-\frac{1}{3}})}}|f_{ \tau ''}^{m} |^{2}|^{7/4}W_{V}. \end{aligned}$$

 □

4.2 An \(\ell ^{7/4}\)-estimate for the cone

Let \(\Gamma =\{(\xi _{1},\xi _{2},\xi _{3})\in \mathbb{R}^{3}:\xi _{1}^{2}+\xi _{2}^{2}= \xi _{3}^{2},\quad \frac{1}{2}\le \xi _{3}\le 1\}\) be the truncated cone. In this section, let \({\mathbf{{S}}}_{r^{-1/2}}\) denote the collection of \(1\times r^{-1/2}\times r^{-1}\) blocks which tile \(\mathcal {N}_{r^{-1}}(\Gamma )\), as defined in §5 of [GM22a].

Proposition 4.4

For any Schwartz function \(h:\mathbb{R}^{3}\to \mathbb{C}\) with \(\widehat{h}\) supported in \(\mathcal {N}_{r^{-1}}(\Gamma )\), we have

$$ \int _{\mathbb{R}^{3}}|h|^{7/2}\lesssim _{\varepsilon }r^{\varepsilon }\sum _{r^{-1/2}\le \sigma \le 1}\sum _{\tau \in{\mathbf{{S}}}_{\sigma ^{-1}r^{-1/2}}}\sum _{U \|U_{\tau _{k},r}}|U|\Big(|U|^{-1}\int \sum _{ \substack{\theta \subset \tau \\\theta \in{\mathbf{{S}}}_{r^{-1/2}}}}|h_{ \theta}|^{7/4}W_{U}\Big)^{2} . $$

We delay the proof of Proposition 4.4 to §A.2.

Lemma 4.5

For \(10\le r_{1}\le r_{2}\) and \(R_{k-1}\le r_{2}\le R_{k}\), there exists some dyadic \(s\), \(R^{-\frac{1}{3}}\le s\le r_{2}^{-\frac{1}{3}}\) such that

τ 0 S ( r 1 1 3 ) R 3 ( τ τ 0 τ S ( r 2 1 3 ) | f τ k | 2 ) 7 / 2 ( C log R ) 2 ε 1 R 3 ( θ S ( R 1 / 3 ) | f θ | 2 ) 7 / 2 + ( C log R ) ε 1 B δ R δ τ 0 S ( r 1 1 3 ) s σ 1 τ S ( σ 1 s ) τ τ 0 V V τ , s 3 | V | ( V τ τ τ S ( s ) | f τ k m | 7 / 2 ) 2
(20)

where \(k_{m}\ge k\) satisfies \(R_{k_{m}}^{-\frac{1}{3}}\le s\le R_{k_{m}-1}^{-\frac{1}{3}}\).

Proof of Lemma 4.5

Fix a \(\tau _{0}\) and analyze each integral on the left hand side of (20). Suppose that there is some dyadic value \(s\) in the range \(R_{k}^{-1/3}\le s\le r_{2}^{-1/3}\) which satisfies

(21)

The next step repeats the argument from Lemma 3.4, which we provide a sketch for now. By the proof of Lemma 3.2, the integral on the right hand side above is equivalent to

where \(\tau _{s}'\sim \tau _{s}\) means that \(\text{dist}(\tau _{s}',\tau _{s})\lesssim s\). By Propositions 3.1 and 3.2, the Fourier support of the integrand may now be viewed as a canonical tiling of (an \(s\)-dilation of) the cone. Therefore, we may apply Proposition 4.4 to bound the previous displayed integral by

$$ C_{\delta }R^{\delta }\sum _{s\le \sigma \le 1}\sum _{ \substack{\tau \in{\mathbf{{S}}}(\sigma ^{-1}s)\\\tau \subset \tau _{0}}} \sum _{V\|V_{\tau ,s^{-3}}}|V|^{-1}\Big(\int \sum _{\tau _{s}\subset \tau}|f_{\tau _{s}}^{k}|^{7/2}W_{V}\Big)^{2} $$

where we are free to choose \(\delta >0\). Here, the iteration ends since we have proven the lemma.

In the case that (21) does not hold, we may assume that

(22)

Applying the argument from Lemma 2.4 to the integrand, we have

Since is \(L^{1}\)-normalized, we may ignore the by Young’s convolution inequality. Then by Lemma 2.1, we have the pointwise inequality \(|f_{\tau _{k}}^{k}|\le |f_{\tau _{k}}^{k+1}|\). The conclusion in this case is that

$$ \int _{\mathbb{R}^{3}}(\sum _{ \substack{\tau '\subset \tau _{0}\\\tau '\in{\mathbf{{S}}}(r_{2}^{-\frac{1}{3}})}}|f_{ \tau '}^{k}|^{2})^{7/2}\lesssim (\log R) \int _{\mathbb{R}^{3}}|\sum _{ \substack{\tau _{k}\subset \tau _{0}}}|f_{\tau _{k}}^{k}|^{2}|^{7/2}. $$

Next, we iterate this procedure, considering the two cases (21) and (22) applied to the integral on the right hand side above. The number of steps in the iteration is the same as the number of times (the appropriate version of) (22) holds. Each time (22) holds, we refine the scale of our square function by a factor of \(R^{\varepsilon /3}\) (moving from \(\tau _{k}\) to \(\tau _{k+1}\), say). Therefore, the total number of steps is bounded by \(\varepsilon ^{-1}\). In the case that (21) holds at one step, the iteration terminates with an accumulated constant of at most \((\log R)^{\varepsilon ^{-1}}B_{\delta }R^{\delta \varepsilon ^{-1}}\). Since we are free to choose \(\delta \), this proves the proposition. The final case is that (22) holds at all steps in the iteration. This leads to the first term from the upper bound in (20). □

4.3 Algorithm to fully unwind the pruning process

Recall some notation which was defined in §3.

  1. (1)

    For each dyadic \(\sigma \), \(R_{k}^{-\frac{1}{3}}\sigma \le 1\) and each \(\tau \in{\mathbf{{S}}}(\sigma ^{-1}R_{k}^{-\frac{1}{3}})\), the dual set \(\tau ^{*}\) is a \(\sigma R_{k}^{\frac{1}{3}}\times \sigma ^{2}R_{k}^{\frac{2}{3}} \times \sigma ^{3}R_{k}\) plank centered at the origin and comparable to \(\{x\in \mathbb{R}^{3}:|x\cdot \xi |\le 1\qquad \forall \xi \in \tau -\tau \}\). The wave envelope \(V_{\tau ,R_{k}}\) is an anisotropically dilated version of \(\tau ^{*}\) with dimensions \(\sigma ^{-2}R_{k}^{-\frac{1}{3}}\times \sigma ^{-1}R_{k}^{- \frac{2}{3}}\times R_{k}^{-1}\).

  2. (2)

    Let \(V\|V_{\tau ,R_{k}}\) denote an indexing set for \(V\) which are translates of \(V_{\tau ,R_{k}}\) and which tile \(\mathbb{R}^{3}\).

Proposition 4.6

For each \(k\), we have

$$\begin{aligned} &\sum _{R_{k}^{-\frac{1}{3}}\le \sigma \le 1}\sum _{\tau \in{\mathbf{{S}}}( \sigma ^{-1}R_{k}^{-\frac{1}{3}})}\sum _{V\|V_{\tau ,R_{k}}}|V|^{-1} \left (\int \sum _{\tau _{k}\subset \tau}|f_{\tau _{k}}^{k+1}|^{7/2}W_{V} \right )^{2} \\ &\quad \le C_{\varepsilon }R^{3\varepsilon ^{2}}\textit{S}_{1}(R^{\varepsilon })^{N-k+1}\int (\sum _{ \theta}|f_{\theta}|^{2})^{7/2}. \end{aligned}$$
(23)

Propositon 4.6 will follow from an algorithm which uses the Lemmas 4.3 and 4.5 as building blocks.

Proof of Proposition 4.6

Let \(C>0\) be an absolute constant that we specify later in the proof. We will define an algorithm which at intermediate step \(m\), produces an inequality

( L.H.S. of (23) ) ( C log R ) 4 ε 1 m ( B ε 7 R ε 7 ) m ( E ε 7 R ε 7 ) a ( R ε 3 S 1 ( R ε ) ) b × s m σ 1 τ S ( σ 1 s m ) V V τ , s m 3 | V | ( V τ τ τ S ( s m ) | f τ k m | 7 / 2 ) 2
(24)

in which \(a+b=m\) and \(R^{-\frac{1}{3}}\le s_{m}\le R_{k}^{-\frac{1}{3}}R^{- \frac{\varepsilon ^{3}}{2}a}R^{-\frac{\varepsilon }{3}b}\) and \(k_{m}\ge k+1\) satisfies \(R_{k_{m}}^{-\frac{1}{3}}\le s_{m}\le R_{k_{m}-1}^{-\frac{1}{3}}\). Notice that (24) clearly holds with \(m=0\), taking \(k_{m}=k+1\) and \(s_{m}=R_{k}^{-\frac{1}{3}}\). Assuming (24) holds for \(m-1\), we will show that either the algorithm terminates and the proposition is proved or (24) holds for \(m\ge 1\). We further suppose that \(R^{-\frac{1}{3}}R^{\frac{\varepsilon }{3}}\le s_{m-1}\), otherwise proceed to the final case in which the algorithm terminates below.

Step m: Let \(\sigma \in [s_{m-1},1]\) be a dyadic value satisfying

$$\begin{aligned} (\text{L.H.S. of (23)})\le ( \text{R.H.S. of (24) with $m-1$ and $\sigma $}). \end{aligned}$$
(25)

The analysis splits into two cases depending on whether \(\sigma \ge R^{-\varepsilon ^{3}}\) or \(\sigma < R^{-\varepsilon ^{3}}\).

Step m: \(\sigma \ge R^{-\varepsilon ^{3}}\). Using Cauchy-Schwarz and then Hölder’s inequality, for each \(\tau \in{\mathbf{{S}}}(\sigma ^{-1}s_{m-1})\),

V V τ , s m 1 3 | V | ( V τ τ τ S ( s m 1 ) | f τ k m 1 | 7 / 2 ) 2 V V τ , s m 1 3 | V | ( # τ τ ) τ τ τ S ( s m 1 ) ( V | f τ k m 1 | 7 / 2 ) 2 V V τ , s m 1 3 | V | ( R ε 3 ) τ τ τ S ( s m 1 ) V | f τ k m 1 | 7 R ε 3 τ τ τ S ( s m 1 ) R 3 | f τ k m 1 | 7 .

Define \(k_{m-1}^{*}\) to be \(k_{m-1}^{*}=k_{m-1}+1\) if \(s_{m-1}=R_{k_{m-1}}^{-\frac{1}{3}}\) and \(k_{m-1}^{*}=k_{m-1}\) if \(R_{k_{m-1}}^{-\frac{1}{3}}< s_{m-1}\). Note that by (1) of Lemma 2.1, for each \(\tau '\in{\mathbf{{S}}}(s_{m-1})\), \(|f_{\tau '}^{k_{m-1}}|\le |f_{\tau '}^{k_{m-1}^{*}}|\). By rescaling for the moment curve, for each \(\tau '\in{\mathbf{{S}}}(s_{m-1})\),

$$\begin{aligned} \int _{\mathbb{R}^{3}}|f_{\tau '}^{k_{m-1}^{*}}|^{7}&\le \text{S}_{2}(R^{\varepsilon }) \int _{\mathbb{R}^{3}}(\sum _{\tau _{k_{m-1}^{*}}\subset \tau '}|f_{\tau _{k_{m-1}^{*}}}^{k_{m-1}^{*}}|^{2})^{7/2} \\ &\lesssim \text{S}_{1}(R^{\varepsilon })\int _{\mathbb{R}^{3}}(\sum _{\tau _{k_{m-1}^{*}} \subset \tau '}|f_{\tau _{k_{m-1}^{*}}}^{k_{m-1}^{*}}|^{2})^{7/2} \end{aligned}$$

where we used the definitions of \(\text{S}_{1}(\cdot )\) and \(\text{S}_{2}(\cdot )\) and the relationship \(\text{S}_{2}(\cdot )\lesssim \text{S}_{1}(\cdot )\), which follows from Lemma 2.2. If \(k_{m-1}^{*}=N\), then the algorithm terminates with the inequality

$$\begin{aligned} &(\text{L.H.S. of (23)}) \\ &\quad \le (C\log R)^{4\varepsilon ^{-1}(m-1)+1}(B_{ \varepsilon ^{7}}R^{\varepsilon ^{7}})^{m-1}(E_{\varepsilon ^{7}}R^{\varepsilon ^{7}})^{a}(R^{\varepsilon ^{3}}\text{S}_{1}(R^{ \varepsilon }))^{b+1} \\ &\qquad{}\times \int _{\mathbb{R}^{3}}(\sum _{\theta}|f_{\theta}|^{2})^{7/2} \end{aligned}$$
(26)

in which \(a+b=m-1\) and \(R^{-\frac{1}{3}}\le R_{k}^{-\frac{1}{3}}R^{-\frac{\varepsilon ^{3}}{2}a}R^{- \frac{\varepsilon }{3}b}\). If \(k_{m-1}^{*}< N\), by Lemma 4.5, there are two further cases. One case is that

$$ \sum _{\tau '\in{\mathbf{{S}}}(s_{m-1})}\int _{\mathbb{R}^{3}}(\sum _{\tau _{k_{m-1}^{*}} \subset \tau '}|f_{\tau _{k_{m-1}^{*}}}^{k_{m-1}^{*}}|^{2})^{7/2}\le (C \log R)^{2\varepsilon ^{-1}}\int _{\mathbb{R}^{3}}(\sum _{\theta}|f_{\theta}|^{2})^{7/2} $$

and the algorithm terminates with the inequality

$$\begin{aligned} &(\text{L.H.S. of (23)}) \\ &\quad \le (C\log R)^{4\varepsilon ^{-1}(m-1)+2\varepsilon ^{-1}+1}(B_{ \varepsilon ^{7}}R^{\varepsilon ^{7}})^{m-1}(E_{\varepsilon ^{7}}R^{\varepsilon ^{7}})^{a}(R^{\varepsilon ^{3}}\text{S}_{1}(R^{ \varepsilon }))^{b+1} \\ &\qquad{}\times \int _{\mathbb{R}^{3}}(\sum _{\theta}|f_{\theta}|^{2})^{7/2} \end{aligned}$$
(27)

in which \(a+b=m-1\) and \(R^{-\frac{1}{3}}\le R_{k}^{-\frac{1}{3}}R^{-\frac{\varepsilon ^{3}}{2}a}R^{- \frac{\varepsilon }{3}b}\). We will verify further below that these termination conditions prove the proposition.

The other case from Lemma 4.5 is that there is some dyadic value \(s_{m}\), \(R^{-\frac{1}{3}}\le s_{m}\le R_{k_{m-1}^{*}}^{-\frac{1}{3}}\), which satisfies

τ S ( s m 1 ) R 3 ( τ k m 1 τ | f τ k m 1 k m 1 | 2 ) 7 / 2 ( C log R ) ε 1 B ε 7 R ε 7 s m σ 1 τ S ( σ 1 s m ) V V τ , s m 3 | V | ( V τ τ τ S ( s m ) | f τ k m | 7 / 2 ) 2

where \(k_{m}\ge k_{m-1}^{*}\) satisfies \(R_{k_{m}}^{-\frac{1}{3}}\le s_{m}\le R_{k_{m}-1}^{-\frac{1}{3}}\). By definition of \(k_{m-1}^{*}\), \(R_{k_{m-1}^{*}}^{-\frac{1}{3}}< s_{m-1}\). Then since \(s_{m}\le R_{k_{m-1}^{*}}^{-\frac{1}{3}}< s_{m-1}\le R_{k}^{- \frac{1}{3}}R^{-\frac{\varepsilon }{3}b}\), \(R_{k_{m-1}}=R^{k_{m-1}\varepsilon }\), and \(R_{k}R^{\varepsilon b}=R^{(k+b)\varepsilon }\), it follows that \(s_{m}\le R_{k}^{-\frac{1}{3}}R^{-\frac{\varepsilon }{3}(b+1)}\). We have shown in this case that

( L.H.S. of (23) ) ( C log R ) 4 ε 1 ( m + 1 ) ( B ε 7 R ε 7 ) m ( E ε 7 R ε 7 ) a ( R ε 3 S 1 ( R ε ) ) b + 1 × s m σ 0 1 τ S ( σ 0 1 s m ) V V τ , s m 3 | V | ( V τ τ τ S ( s m ) | f τ k m | 7 / 2 ) 2

in which \(a+(b+1)=m+1\) and \(R^{-\frac{1}{3}}\le s_{m}\le R_{k}^{-\frac{1}{3}}R^{- \frac{\varepsilon ^{3}}{2}a}R^{-\frac{\varepsilon }{3}(b+1)}\) and \(k_{m}\ge k+1\) satisfies \(R_{k_{m}}^{-\frac{1}{3}}\le s_{m}\le R_{k_{m}-1}^{-\frac{1}{3}}\), which verifies (24).

Step m: \(s_{m-1}\le \sigma \le R^{-\varepsilon ^{3}}\). Let \(\tilde{s}_{m-1}=\max (R^{-\frac{1}{3}},\sigma ^{\frac{1}{2}}s_{m-1})\). By Lemma 4.3, for each \(\tau \in{\mathbf{{S}}}(\sigma ^{-1}s_{m-1})\) and \(V\|V_{\tau ,s_{m-1}^{-3}}\),

V τ τ τ S ( s m 1 ) | f τ k m 1 | 7 / 2 E ε 7 R ε 7 | V | 1 τ τ τ S ( s m 1 ) | τ τ τ S ( s ˜ m 1 ) | f τ k m | 2 | 7 / 4 W V

where \(k_{m}'\ge k+1\) satisfies \(R_{k_{m}'}^{-\frac{1}{3}}\le \tilde{s}_{m-1}\le R_{k_{m}'-1}^{- \frac{1}{3}}\). Then using \(\|\cdot \|_{\ell ^{7/4}}\le \|\cdot \|_{\ell ^{1}}\) and Cauchy-Schwarz, we have

$$\begin{aligned} &\sum _{\substack{\tau \in{\mathbf{{S}}}(\sigma ^{-1}s_{m-1})}} \sum _{V\|V_{ \tau ,s_{m-1}^{-3}}}|V|^{-1}\Big(\int \sum _{ \substack{\tau '\subset \tau \\\tau '\in{\mathbf{{S}}}(s_{m-1})}}(\sum _{ \substack{\tau ''\subset \tau '\\\tau ''\in{\mathbf{{S}}}(\tilde{s}_{m-1})}}|f_{ \tau ''}^{k_{m}'}|^{2})^{7/4} W_{V}\Big)^{2} \\ &\quad\le \sum _{\substack{\tau \in{\mathbf{{S}}}(\sigma ^{-1}s_{m-1})}} \sum _{V \|V_{\tau ,s_{m-1}^{-3}}}|V|^{-1}\Big(\int (\sum _{ \substack{\tau ''\subset \tau \\\tau ''\in{\mathbf{{S}}}(\tilde{s}_{m-1})}}|f_{ \tau ''}^{k_{m}'}|^{2})^{7/4} W_{V}\Big)^{2} \\ &\quad\le \sum _{\substack{\tau \in{\mathbf{{S}}}(\sigma ^{-1}s_{m-1})}} \int _{ \mathbb{R}^{3}}(\sum _{ \substack{\tau ''\subset \tau \\\tau ''\in{\mathbf{{S}}}(\tilde{s}_{m-1})}}|f_{ \tau ''}^{k_{m}'}|^{2})^{7/2} . \end{aligned}$$

If \(\tilde{s}_{m-1}=R^{-\frac{1}{3}}\), then the algorithm terminates with the inequality

$$\begin{aligned} (\text{L.H.S. of (23)})\le{}& (C\log R)^{4\varepsilon ^{-1}(m-1)+2\varepsilon ^{-1}+1}(B_{ \varepsilon ^{7}}R^{\varepsilon ^{7}})^{m-1} \\ &{}\times (E_{\varepsilon ^{7}}R^{\varepsilon ^{7}})^{a+1}(R^{\varepsilon ^{3}} \text{S}_{1}(R^{\varepsilon }))^{b} \int _{\mathbb{R}^{3}}(\sum _{\theta}|f_{\theta}|^{2})^{7/2} \end{aligned}$$
(28)

in which \(a+b=m-1\) and \(R^{-\frac{1}{3}} \le R_{k}^{-\frac{1}{3}}R^{-\frac{\varepsilon ^{3}}{2}a}R^{- \frac{\varepsilon }{3}b}\). Otherwise, suppose that \(\tilde{s}_{m-1}>R^{-\frac{1}{3}}\) and apply Lemma 4.5, again leading to two cases. In the case that

$$ \sum _{\substack{\tau \in{\mathbf{{S}}}(\sigma ^{-1}s_{m-1})}} \int _{\mathbb{R}^{3}}( \sum _{ \substack{\tau ''\subset \tau \\\tau ''\in{\mathbf{{S}}}(\sigma ^{\frac{1}{2}}s_{m-1})}}|f_{ \tau ''}^{k_{m}'}|^{2})^{7/2}\le (C\log R)^{2\varepsilon ^{-1}}\int _{\mathbb{R}^{3}}( \sum _{\theta}|f_{\theta}|^{2})^{7/2} $$

then the algorithm terminates with the same inequality as is recorded in (28).

The other case from Lemma 4.5 is that there is some dyadic value \(s_{m}\), \(R^{-\frac{1}{3}}\le s_{m}\le \sigma ^{\frac{1}{2}}s_{m-1}\), which satisfies

τ S ( σ 1 s m 1 ) R 3 ( τ τ τ S ( σ 1 2 s m 1 ) | f τ k m | 2 ) 7 / 2 ( C log R ) ε 1 B ε 7 R ε 7 × s m σ 0 1 τ S ( σ 0 1 s m ) V V τ , s m 3 | V | ( V τ τ τ S ( s m ) | f τ k m | 7 / 2 ) 2

where \(k_{m}\ge k_{m-1}^{*}\) satisfies \(R_{k_{m}}^{-\frac{1}{3}}\le s_{m}\le R_{k_{m}-1}^{-\frac{1}{3}}\). Since \(s_{m-1}\le R_{k}^{-\frac{1}{3}}R^{-\frac{\varepsilon ^{3}}{2}a}R^{- \frac{\varepsilon }{3}b}\) and \(s_{m}\le \sigma ^{\frac{1}{2}}s_{m-1}\), we have \(s_{m}\le R_{k}^{-\frac{1}{3}}R^{-\frac{\varepsilon ^{3}}{2}(a+1)}R^{- \frac{\varepsilon }{3}b}\). We have shown in this case that

( L.H.S. of (23) ) ( C log R ) 4 ε 1 ( m + 1 ) ( B ε 7 R ε 7 ) m ( E ε 7 R ε 7 ) a + 1 ( R ε 3 S 1 ( R ε ) ) b × s m σ 0 1 τ S ( σ 0 1 s m ) V V τ , s m 3 | V | ( V τ τ τ S ( s m ) | f τ k m | 7 / 2 ) 2

in which \((a+1)+b=m+1\) and \(R^{-\frac{1}{3}}\le s_{m}\le R_{k}^{-\frac{1}{3}}R^{- \frac{\varepsilon ^{3}}{2}(a+1)}R^{-\frac{\varepsilon }{3}b}\) and \(k_{m}\ge k+1\) satisfies \(R_{k_{m}}^{-\frac{1}{3}}\le s_{m}\le R_{k_{m}-1}^{-\frac{1}{3}}\), which verifies (24).

Termination criteria. It remains to check that the criteria for termination from both of the above cases implies Proposition 4.6. The first case in which the algorithm terminates is (27). Then \(R^{-\frac{1}{3}}\le R_{k}^{-\frac{1}{3}}R^{-\frac{\varepsilon ^{3}}{2}a}R^{- \frac{\varepsilon }{3}b})\) implies that \(a\le \frac{2}{3}\varepsilon ^{-3}(N-k)\) and \(b\le N-k\), which implies that \(m-1=a+b\le \varepsilon ^{-3}(N-k)\). Then the constants in the upper bound from (27) are bounded by

$$\begin{aligned} (C\log R)^{4\varepsilon ^{-1}(m-1)+2\varepsilon ^{-1}+1}&(B_{\varepsilon ^{7}}R^{\varepsilon ^{7}})^{m-1}(E_{ \varepsilon ^{7}}R^{\varepsilon ^{7}})^{a}(R^{\varepsilon ^{3}}\text{S}_{1}(R^{\varepsilon }))^{b+1} \\ &\le (C\varepsilon ^{-7}R^{\varepsilon ^{7}})^{\varepsilon ^{-5}}B_{\varepsilon ^{7}}^{\varepsilon ^{-4}}R^{\varepsilon ^{3}}E_{ \varepsilon ^{7}}^{\varepsilon ^{-4}}R^{\varepsilon ^{3}}R^{\varepsilon ^{2}}\text{S}_{1}(R^{\varepsilon })^{N-k+1}\\ &\le C_{ \varepsilon }R^{3\varepsilon ^{2}}\text{S}_{1}(R^{\varepsilon })^{N-k+1} . \end{aligned}$$

The second case in which the algorithm terminates with (28) proves the proposition by an analogous argument. □

5 Bounding the broad part of \(U_{\alpha ,\beta }\)

For three canonical blocks \(\tau ^{1},\tau ^{2},\tau ^{3}\) (with dimensions \(\sim R^{-\varepsilon /3}\times R^{-2\varepsilon /3}\times R^{-\varepsilon }\)) which are pairwise \(\ge R^{-\varepsilon /3}\)-separated, define the broad part of \(U_{\alpha ,\beta }\) to be

$$ \text{Br}_{\alpha ,\beta }^{K}=\{x\in U_{\alpha ,\beta }: \alpha \le K|f_{\tau ^{1}}(x)f_{ \tau ^{2}}(x)f_{\tau ^{3}}(x)|^{\frac{1}{3}},\quad \max _{\tau ^{i}}|f_{ \tau ^{i}}(x)|\le \alpha \}. $$

We bound the broad part of \(U_{\alpha ,\beta }\) in the following proposition. Recall that the parameter \(N_{0}\) was used in the definition of the sets \(\Omega _{k}\) and \(L\).

Proposition 5.1

Let \(R,K\ge 1\). Suppose that \(\|f_{\theta}\|_{L^{\infty}(\mathbb{R}^{3})}\le 2\) for all \(\theta \in{\mathbf{{S}}}(R^{-\frac{1}{3}})\). Then

$$ \alpha ^{7}|\mathrm{Br}_{\alpha ,\beta }^{K}|\le \big[CR^{10 \varepsilon N_{0}}A^{\varepsilon ^{-1}} + K^{50}R^{4\varepsilon ^{2}+10\varepsilon }A^{\varepsilon ^{-1}} \mathrm{S}_{1}(R^{\varepsilon })^{\varepsilon ^{-1}-N_{0}} \big]\int \Big|\sum _{\theta}|f_{\theta}|^{2}*\omega _{\theta}\Big|^{ \frac{7}{2}} . $$

We will use the following version of a local trilinear restriction inequality for the moment curve, which was proved in Proposition 6 of [GM22b]. The weight function \(W_{B_{r}}\) in the following theorem decays by a factor of 10 off of the ball \(B_{r}\). It is defined right after Definition 5.

Theorem 5.2

Let \(s\ge 10r\ge 10\). Suppose that \(\tau ^{1},\tau ^{2},\tau ^{3}\in{\mathbf{{S}}}(R^{-\varepsilon /3})\) satisfy \(\textit{dist}(\tau ^{i},\tau ^{j})\ge s^{-1}\) for \(i\neq j\). Then

$$ \int _{B_{r}}|f_{\tau ^{1}}f_{\tau ^{2}}f_{\tau ^{3}}|^{2}\lesssim R^{ \varepsilon }s^{3}|B_{r}|^{-2}\big(\int |f_{\tau ^{1}}|^{2}W_{B_{r}}\big)\big( \int |f_{\tau ^{2}}|^{2}W_{B_{r}}\big)\big(\int |f_{\tau ^{3}}|^{2}W_{B_{r}} \big) $$

for any Schwartz function \(f:\mathbb{R}^{3}\to \mathbb{C}\) with Fourier transform supported in \(\mathcal {M}(r^{3})\).

Proof of Proposition 5.1

Note that

$$ \text{Br}_{\alpha ,\beta }^{K}=(L\cap \text{Br}_{\alpha ,\beta }^{K})\cup ( \sqcup _{k=N_{0}}^{N-1} \Omega _{k}\cap \text{Br}_{\alpha ,\beta }^{K}) $$

We bound each of the sets \(\text{Br}_{\alpha ,\beta }^{K}\cap \Omega _{k}\) and \(\text{Br}_{\alpha ,\beta }^{K}\cap L\) in separate cases. It suffices to consider the case that \(R\) is at least some constant depending on \(\varepsilon \) since if \(R\le C_{\varepsilon }\), we may prove the proposition using trivial inequalities.

Case 1: bounding \(| \text{Br}_{\alpha ,\beta }^{K}\cap \Omega _{k}|\). By Lemma 2.6,

$$\begin{aligned} |\text{Br}_{\alpha ,\beta }^{K}\cap \Omega _{k}|\le {}&|\{x\in U_{\alpha ,\beta }\cap \Omega _{k}:\alpha \lesssim K|f_{\tau ^{1}}^{k+1}(x)f_{\tau ^{2}}^{k+1}(x)f_{ \tau ^{3}}^{k+1}(x)|^{\frac{1}{3}},\\ & \max _{\tau ^{i}}|f_{\tau ^{i}}(x)| \le \alpha \}. \end{aligned}$$

By Lemma 2.1, the Fourier supports of \(f_{\tau ^{1}}^{k+1},f_{\tau ^{2}}^{k+1},f_{\tau ^{3}}^{k+1}\) are contained in \(2\tau ^{1},2 \tau ^{2}, 2\tau ^{3}\) respectively, which are \(\ge 5 R^{-\frac{\varepsilon }{3}}\)-separated blocks of the moment curve. Let \(\{B_{R_{k}^{\frac{1}{3}}}\}\) be a finitely overlapping cover of \(\text{Br}_{\alpha ,\beta }^{K}\cap \Omega _{k}\) by \(R_{k}^{\frac{1}{3}}\)-balls. For \(R\) large enough depending on \(\varepsilon \), apply Theorem 5.2 to get

$$\begin{aligned} &\int _{B_{R_{k}^{\frac{1}{3}}}}|f_{\tau ^{1}}^{k+1}f_{\tau ^{2}}^{k+1}f_{ \tau ^{3}}^{k+1}|^{2}\\ &\quad\lesssim _{\varepsilon }R^{\varepsilon } |B_{R_{k}^{\frac{1}{3}}}|^{-2} \Big(\int |f_{\tau ^{1}}^{k+1}|^{2}W_{B_{R_{k}^{\frac{1}{3}}}}\Big) \Big(\int |f_{\tau ^{2}}^{k+1}|^{2}W_{B_{R_{k}^{\frac{1}{3}}}}\Big) \Big(\int |f_{\tau ^{3}}^{k+1}|^{2}W_{B_{R_{k}^{\frac{1}{3}}}}\Big). \end{aligned}$$

Using local \(L^{2}\)-orthogonality (Lemma 2.3), each integral on the right hand side above is bounded by

$$ \lesssim \int \sum _{\tau _{k}}|f_{\tau _{k}}^{k+1}|^{2}W_{B_{R_{k}^{ \frac{1}{3}}}}. $$

If \(x\in \text{Br}_{\alpha ,\beta }^{K}\cap \Omega _{k}\cap B_{R_{k}^{\frac{1}{3}}}\), then the above integral is bounded by

$$ \lesssim \int \sum _{\tau _{k}}|f_{\tau _{k}}^{k+1}|^{2}*\omega _{\tau _{k}}W_{B_{R_{k}^{ \frac{1}{3}}}}\lesssim C |B_{R_{k}^{\frac{1}{3}}}| \sum _{\tau _{k}}|f_{ \tau _{k}}^{k+1}|^{2}*\omega _{\tau _{k}}(x) $$

by the locally constant property (Lemma 2.2) and properties of the weight functions. The summary of the inequalities so far is that

$$ \alpha ^{6}|\text{Br}_{\alpha ,\beta }^{K}\cap \Omega _{k}\cap B_{R_{k}^{ \frac{1}{3}}}|\lesssim _{\varepsilon }K^{6}\int _{B_{R_{k}^{\frac{1}{3}}}}|f_{ \tau ^{1}}^{k+1}f_{\tau ^{2}}^{k+1}f_{\tau ^{3}}^{k+1}|^{2}\lesssim _{ \varepsilon }R^{\varepsilon }K^{6} |B_{R_{k}^{\frac{1}{3}}}|g_{k}(x)^{3} $$

where \(x\in \text{Br}_{\alpha ,\beta }^{K}\cap \Omega _{k}\cap B_{R_{k}^{\frac{1}{3}}}\).

Recall that since \(x\in \Omega _{k}\), we have the lower bound \(A^{M-k}\beta \le g_{k}(x)\) (where \(A\) is from Definition 8), which leads to the inequality

$$ \alpha ^{6}|\text{Br}_{\alpha ,\beta }^{K}\cap \Omega _{k}\cap B_{R_{k}^{ \frac{1}{3}}}|\lesssim _{\varepsilon }K^{6} R^{\varepsilon } \frac{1}{A^{M-k}\beta }|B_{R_{k}^{ \frac{1}{3}}}|g_{k}(x)^{3+1} . $$

By Corollary 2.5, we also have the upper bound \(|g_{k}(x)|\le 2|g_{k}^{h}(x)|\), so that

$$ \alpha ^{6}|\text{Br}_{\alpha ,\beta }^{K}\cap \Omega _{k}\cap B_{R_{k}^{ \frac{1}{3}}}|\lesssim _{\varepsilon }K^{6} R^{\varepsilon } \frac{1}{A^{M-k}\beta }|B_{R_{k}^{ \frac{1}{3}}}||g_{k}^{h}(x)|^{4} . $$

By the locally constant property applied to \(g_{k}^{h}\), \(|g_{k}^{h}|^{4}\lesssim _{\varepsilon }|g_{k}^{h}*w_{ B_{R_{k}^{\frac{1}{3}}}}|^{4}\) and by Cauchy-Schwarz, \(|g_{k}^{h}*w_{ B_{R_{k}^{\frac{1}{3}}}}|^{4}\lesssim |g_{k}^{h}|^{4}*w_{B_{R_{k}^{ \frac{1}{3}}}}\). Combine this with the previous displayed inequality to get

$$ \alpha ^{6}|\text{Br}_{\alpha ,\beta }^{K}\cap \Omega _{k}\cap B_{R_{k}^{ \frac{1}{3}}}|\lesssim _{\varepsilon }K^{6} R^{\varepsilon } \frac{1}{A^{M-k}\beta }\int |g_{k}^{h}|^{4}W_{ B_{R_{k}^{\frac{1}{3}}}} . $$

Summing over the balls \(B_{R_{k}^{\frac{1}{3}}}\) in our finitely-overlapping cover of \(\text{Br}_{\alpha ,\beta }^{K}\cap \Omega _{k}\), we conclude that

$$ \alpha ^{6}|\text{Br}_{\alpha ,\beta }^{K}\cap \Omega _{k}|\lesssim _{\varepsilon }K^{6} R^{ \varepsilon } \frac{1}{A^{M-k}\beta }\int _{\mathbb{R}^{3}}|g_{k}^{h}|^{4} .$$
(29)

We are done using the properties of the set \(\text{Br}_{\alpha ,\beta }^{K}\cap \Omega _{k}\), which is why we now integrate over all of \(\mathbb{R}^{3}\) on the right hand side. We will now use Lemma 3.4 to analyze the high part \(g_{k}^{h}\). In particular, Lemma 3.4 gives

| g k h | 4 ε R ε R k 1 / 3 σ 1 τ S ( σ 1 R k 1 3 ) V V τ , R k |V| ( V τ k τ | f τ k k + 1 | 4 ) 2 .
(30)

Next use (2) from Lemma 2.1 to note that \(\|f_{\tau _{k}}^{k+1}\|_{\infty}\le \sum _{\tau _{k+1}\subset \tau _{k}} \|f_{\tau _{k+1}}^{k+1}\|_{\infty}\lesssim R^{\varepsilon }A^{\varepsilon ^{-1}}K^{3} \frac{\beta }{\alpha }\): for each \(R_{k}^{-1/3}\le \sigma \le 1\), \(\tau \in{\mathbf{{S}}}(\sigma ^{-1}R_{k}^{-\frac{1}{3}})\) and \(V\|V_{\tau ,R_{k}}\),

|V| ( V τ k τ | f τ k k + 1 | 4 ) 2 ( A ε 1 R ε K 3 β α )|V| ( V τ k τ | f τ k k + 1 | 7 / 2 ) 2 .

Using this and applying Proposition 4.6 gives the upper bound

R k 1 / 3 σ 1 τ S ( σ 1 R k 1 3 ) V V τ , R k | V | ( V τ k τ | f τ k k + 1 | 4 ) 2 ε ( A ε 1 R ε K 3 β α ) R 3 ε 2 S 1 ( R ε ) N k + 1 ( θ S ( R 1 / 3 ) | f θ | 2 ) 7 / 2 .

Combining this with (29) and (30), the summary of the argument from this case is

$$ \alpha ^{7}|U_{\alpha ,\beta }|\lesssim _{\varepsilon }K^{6} R^{2\varepsilon }(A^{\varepsilon ^{-1}}R^{\varepsilon }K^{3}) \text{S}_{1}(R^{\varepsilon })^{N-k+1}\int (\sum _{\theta \in{\mathbf{{S}}}(R^{-1/3})}|f_{ \theta}|^{2})^{7/2}. $$

Since \(k>N_{0}\), this upper bound has the desired form.

Case 2: bounding \(|U_{\alpha ,\beta }\cap L|\). Begin by using Lemma 2.6 to bound

$$ \alpha ^{7}|\text{Br}_{\alpha ,\beta }^{K}\cap L|\lesssim K^{7} \int _{U_{\alpha ,\beta } \cap L}|f^{N_{0}+1}|^{7}. $$

Then use Cauchy-Schwarz and the locally constant property for \(g_{N_{0}}\):

$$\begin{aligned} \int _{U_{\alpha ,\beta }\cap L}|f^{N_{0}+1}|^{7}&\lesssim (R^{ N_{0}\varepsilon /3})^{7/2} \int _{U_{\alpha ,\beta }\cap L}(\sum _{\tau _{N_{0}}}|f_{\tau _{N_{0}}}^{N_{0}+1}|^{2})^{7/2} \\ &\lesssim (R^{ N_{0}\varepsilon /3})^{7/2} \int _{U_{\alpha ,\beta }\cap L}(g_{N_{0}})^{7/2}. \end{aligned}$$

Using the definition the definition of \(L\), we bound the factors of \(g_{N_{0}}\) by

$$ \int _{U_{\alpha ,\beta }\cap L} (A^{\varepsilon ^{-1}}\beta )^{7/2}. $$

Finally, by the definition of \(U_{\alpha ,\beta }\), conclude that

$$ \alpha ^{7}|\text{Br}_{\alpha ,\beta }^{K}\cap L|\lesssim _{\varepsilon }K^{7}R^{2N_{0}\varepsilon }A^{ \varepsilon ^{-1}}\int _{\mathbb{R}^{3}}|\sum _{\theta}|f_{\theta}|^{2}*\omega _{\theta}|^{7/2}. $$

 □

6 Proof of Theorem 1 from Proposition 5.1

First, we prove Proposition 6.1 below, which is that \(\text{S}_{1}(R)\lesssim _{\varepsilon }R^{\varepsilon }\). This follows using various reductions from pigeonholing, a broad-narrow argument, and the broad estimate Proposition 5.1. Then, in §6.3, we use induction to show that Proposition (6.1) implies that \(\text{S}_{2}(R)\lesssim _{\varepsilon }R^{\varepsilon }\), which is equivalent to Theorem 1.

Proposition 6.1

For any \(\varepsilon >0\) and \(R\ge 1\),

$$ \mathrm{S}_{1}(R)\lesssim _{\varepsilon }R^{\varepsilon }. $$

In order to make use of Proposition 5.1, we need to reduce to the case that our function \(f\) is localized to a ball, its wave packets have been pigeonholed so that \(\|f_{\theta}\|_{\infty}\lesssim 1\) for all \(\theta \in{\mathbf{{S}}}(R^{-1/3})\), and we have approximated \(\|f\|_{7}\) by an expression involving a superlevel set. This is the content of the following subsection.

6.1 Wave packet decomposition and pigeonholing

Begin with the spatial localization.

Lemma 6.2

For any \(R\)-ball \(B_{R}\subset \mathbb{R}^{3}\), suppose that

$$ \|f\|_{L^{7}(B_{R})}^{7}\lesssim _{\varepsilon }R^{\varepsilon }\int \big|\sum _{\theta \in{\mathbf{{S}}}(R^{-1/3})}|f_{\theta}|^{2}*\omega _{\theta}\big|^{\frac{7}{2}} $$

for any Schwartz function \(f:\mathbb{R}^{3}\to \mathbb{C}\) with Fourier transform supported in \(\mathcal {M}^{3}(R)\). Then Proposition 6.1is true.

Proof

If \(\phi _{B_{R}}\) is a Schwartz function that rapidly decays away from \(B_{R}\), has Fourier transform supported in \(B_{R^{-1}}(0)\), and satisfies \(\phi _{B_{R}}\gtrsim 1\) on \(B_{R}\), then we may apply the hypothesis to the function \(\phi _{B_{R}}f\) to obtain

$$\begin{aligned} \sum _{B_{R}}\int _{B_{R}}|f|^{7}\lesssim _{\varepsilon }R^{\varepsilon }\sum _{B_{R}} \int |\sum _{\theta \in{\mathbf{{S}}}_{R^{-1/3}}}|\phi _{B_{R}}f_{\theta}|^{2}* \omega _{\theta}|^{\frac{7}{2}} \end{aligned}$$

where the sum is over a finitely overlapping cover of \(\mathbb{R}^{3}\) by \(R\)-balls. Then, since \(\|\cdot \|_{\ell ^{7/2}}\le \|\cdot \|_{\ell ^{1}}\),

$$ \sum _{B_{r}}\int |\sum _{\theta \in{\mathbf{{S}}}(R^{-1/3})}|\phi _{B_{r}}f_{ \theta}|^{2}*\omega _{\theta}|^{\frac{7}{2}}\le \int |\sum _{\theta \in{ \mathbf{{S}}}(R^{-1/3})}\sum _{B_{r}}|\phi _{B_{r}}f_{\theta}|^{2}*\omega _{ \theta}|^{\frac{7}{2}}. $$

It remains to note that \(\sum _{B_{r}}|\phi _{B_{r}}|^{2}\lesssim 1\). □

It further suffices to prove a weak, level-set version of Proposition 6.1.

Lemma 6.3

For each \(B_{R}\) and Schwartz function \(f:\mathbb{R}^{3}\to \mathbb{C}\) with Fourier transform supported in \(\mathcal {M}^{3}(R)\), there exists \(\alpha >0\) such that

$$ \|f\|_{L^{7}(B_{R})}^{7}\lesssim (\log R)\alpha ^{7}|\{x\in B_{R}:\alpha \le |f(x)| \}|+R^{-500}\int |\sum _{\theta \in{\mathbf{{S}}}(R^{-1/3})}|f_{\theta}|^{2}* \omega _{\theta}|^{7/2}. $$

Proof

Split the integral as follows:

$$\begin{aligned} \int _{B_{R}}|f|^{7}={}&\sum _{R^{-1000}\le \lambda \le 1}\int _{\{x\in B_{R}: \lambda \|f\|_{L^{\infty}(B_{R})}\le |f(x)|\le 2\lambda \|f\|_{L^{ \infty}(B_{R})}\}}|f|^{7}\\ &{}+\int _{\{x\in B_{R}:|f(x)|\le R^{-1000}\|f \|_{L^{\infty}(B_{R})}\}}|f|^{7} \end{aligned}$$

in which \(\lambda \) varies over dyadic values in the range \([R^{-1000},1]\). If one of the \(\lesssim \log R\) many terms in the first sum dominates, then we are done. Suppose instead that the second expression dominates:

$$ \int _{B_{R}}|f|^{7}\le 2\int _{\{x\in B_{R}:|f(x)|\le R^{-1000}\|f\|_{L^{ \infty}(B_{R})}\}}|f|^{7}. $$

We have

$$\begin{aligned} \int _{\{x\in B_{R}:|f(x)|\le R^{-1000}\|f\|_{L^{\infty}(B_{R})}\}}|f|^{7}& \le R^{-1000}|B_{R}|\|f\|_{L^{\infty}(B_{R})}^{7} \\ (\text{Cauchy-Schwarz})\qquad &\lesssim R^{-1000}|B_{R}|R^{2}\|\sum _{ \theta}|f_{\theta}|^{2}\|_{L^{\infty}(B_{R})}^{7/2} \\ (\text{loc. const. for $\sum _{\theta}|f_{\theta}|^{2}$})\qquad & \lesssim R^{-1000}|B_{R}|R^{2}\|\sum _{\theta}|f_{\theta}|^{2}*w_{R^{1/3}} \|_{L^{\infty}(B_{R})}^{7/2} \\ (\text{loc. const. for each $|f_{\theta}|^{2}$})\qquad &\lesssim R^{-995} \|\sum _{\theta}|f_{\theta}|^{2}*\omega _{\theta}*w_{R^{1/3}}\|_{L^{ \infty}(B_{R})}^{7/2} \\ (\text{H\"{o}lder's inequality})\qquad &\lesssim R^{-995}\||\sum _{ \theta}|f_{\theta}|^{2}*\omega _{\theta}|^{7/2}*w_{R^{1/3}}\|_{L^{\infty}(B_{R})} \\ &\lesssim R^{-995}\int |\sum _{\theta}|f_{\theta}|^{2}*\omega _{\theta}|^{7/2}. \end{aligned}$$

 □

Continue to use the notation

$$ U_{\alpha }=\{x\in B_{R}:\alpha \le |f(x)|\}. $$

We will show that to estimate the size of \(U_{\alpha }\), it suffices to replace \(f\) with a version whose wave packets at scale \(\theta \) have been pigeonholed. Write

$$\begin{aligned} f=\sum _{\theta}\sum _{T\in \mathbb{T}_{\theta}} \psi _{T}f_{\theta } \end{aligned}$$
(31)

where for each \(\theta \in{\mathbf{{S}}}(R^{-1/3})\), \(\{\psi _{T}\}_{T\in \mathbb{T}_{\theta}}\) is the partition of unity from §2.1. If \(\alpha \le C_{\varepsilon }R^{-100}\max _{\theta}\|f_{\theta}\|_{\infty}\), then using a similar argument that bounds the second expression in the proof of Lemma 6.3, the inequality

$$ \alpha ^{7}|U_{\alpha }|\lesssim _{\varepsilon }R^{\varepsilon }\int |\sum _{\theta}|f_{\theta}|^{2}* \omega _{\theta}|^{7/2} $$

is trivial.

Proposition 6.4

Wave packet decomposition

Let \({\alpha }>C_{\varepsilon }R^{-100}\max _{\theta}\|f_{\theta}\|_{L^{\infty}(\mathbb{R}^{3})}\). There exist subsets \(\tilde{\mathbb{T}}_{\theta}\subset \mathbb{T}_{\theta}\), as well as a constant \(A>0\) with the following properties:

$$\begin{aligned} |U_{\alpha }|\lesssim (\log R)|\{x\in U_{\alpha }:\,\,{{\alpha }}&\lesssim |\sum _{ \theta \in{\mathbf{{S}}}(R^{-1/3})}\sum _{T\in \tilde{\mathbb{T}}_{\theta}} \psi _{T}(x)f_{ \theta }(x)|\,\,\}|, \end{aligned}$$
(32)
$$\begin{aligned} R^{\varepsilon }T\cap U_{\alpha }\neq\emptyset \qquad &\textit{for all}\quad \theta \in{\mathbf{{S}}}(R^{-1/3}),\quad T\in \tilde{\mathbb{T}}_{\theta} \end{aligned}$$
(33)
$$\begin{aligned} A\lesssim \|\sum _{T\in \tilde{\mathbb{T}}_{\theta}} \psi _{T}f_{\theta}\|_{L^{ \infty}(\mathbb{R}^{3})}&\lesssim R^{3\varepsilon } A\qquad \textit{for all}\quad \theta \in{\mathbf{{S}}}(R^{-1/3}) \end{aligned}$$
(34)
$$\begin{aligned} \|\psi _{T}f_{\theta}\|_{L^{\infty}(\mathbb{R}^{3})}&\sim A\qquad \textit{for all}\quad \theta \in{\mathbf{{S}}}(R^{-1/3}),\quad T\in \tilde{\mathbb{T}}_{ \theta}. \end{aligned}$$
(35)

Proof

Split the sum (31) into

$$ f=\sum _{\theta}\sum _{T\in \mathbb{T}_{\theta}^{c}}\psi _{T}f_{\theta}+\sum _{ \theta}\sum _{T\in \mathbb{T}_{\theta}^{f}}\psi _{T}f_{\theta}$$
(36)

where the close set is

$$ \mathbb{T}_{\theta}^{c}:=\{T\in \mathbb{T}_{\theta}:R^{\varepsilon }T\cap U_{ {\alpha }}\neq \emptyset \} $$

and the far set is

$$ \mathbb{T}_{\theta}^{f}:=\{T\in \mathbb{T}_{\theta}:R^{\varepsilon }T\cap U_{ {\alpha }}= \emptyset \} . $$

Using the rapid decay of the partition of unity, for each \(x\in U_{ {\alpha }}\),

$$ |\sum _{\theta}\sum _{T\in \mathbb{T}_{\theta}^{f}}\psi _{T}(x)f_{\theta}(x)| \lesssim _{\varepsilon }R^{-1000}\max _{\theta}\|f_{\theta}\|_{L^{\infty}(B_{R})}. $$

Therefore, using the assumption that \({{\alpha }}\) is at least \(C_{\varepsilon }R^{-100}\max _{\theta}\|f_{\theta}\|_{L^{\infty}(\mathbb{R}^{3})}\),

$$ |U_{ {\alpha }}|\le 2|\{x\in U_{ {\alpha }}:\,\,{ {\alpha }}\le 2 |\sum _{\theta} \sum _{T\in \tilde{\mathbb{T}}_{\theta}^{c}}\psi _{T}(x)f_{\theta}(x)|\,\,\}|. $$

Now we sort the close wave packets according to amplitude. Let

$$ M=\max _{\theta}\max _{T\in \mathbb{T}_{\theta}^{c}}\|\psi _{T}f_{\theta}\|_{L^{ \infty}(\mathbb{R}^{3})}. $$
(37)

Split the remaining wave packets into

$$ \sum _{\theta}\sum _{T\in \mathbb{T}_{\theta}^{c}}\psi _{T}f_{\theta}=\sum _{ \theta}\sum _{R^{-10^{3}}\le \lambda \le 1}\sum _{T\in \mathbb{T}_{\theta , \lambda}^{c}}\psi _{T}f_{\theta}+\sum _{\theta}\sum _{T\in \mathbb{T}_{\theta ,s}^{c}} \psi _{T}f_{\theta }$$
(38)

where \(\lambda \) is a dyadic number in the range \([R^{-10^{3}},1]\),

$$ \mathbb{T}_{\theta ,\lambda}^{c}:=\{T\in \mathbb{T}_{\theta}^{c}:\|\psi _{T}f_{\theta} \|_{L^{\infty}(\mathbb{R}^{3})}\sim \lambda M \}, $$

and

$$ \mathbb{T}_{\theta ,s}^{c}:= \{T\in \mathbb{T}_{\theta}^{c}:\|\psi _{T}f_{\theta}\|_{L^{ \infty}(\mathbb{R}^{3})}\le R^{-1000}M \} . $$

Again using the lower bound for \({\alpha }\) (and the fact that the number of \(T\in \mathbb{T}_{\theta}^{c}\) is bounded by \(R^{4}\)), the small wave packets cannot dominate and we have

$$ |U_{ {\alpha }}|\le 4|\{x\in U_{ {\alpha }}:\,\,{ {\alpha }}\le 4|\sum _{\theta } \sum _{R^{-10^{3}}\le \lambda \le 1}\sum _{T\in \mathbb{T}_{\theta ,\lambda}^{c}} \psi _{T}(x)f_{\theta}(x)|\,\,\}|. $$

By dyadic pigeonholing, for some \(\lambda \in [R^{-1000},1]\),

$$ |U_{ {\alpha }}|\lesssim (\log R)|\{x\in U_{ {\alpha }}:\,\,{ {\alpha }}\lesssim ( \log R)|\sum _{\theta }\sum _{T\in \mathbb{T}_{\theta ,\lambda}^{c}}\psi _{T}(x)f_{ \theta}(x)|\,\,\}|. $$

Note that we have the pointwise inequality

$$\begin{aligned} |\sum _{T\in \mathbb{T}_{\theta ,\lambda}^{c}}\psi _{T}(x)f_{\theta}(x)|&= | \sum _{\substack{T\in \mathbb{T}_{\theta ,\lambda}^{c}\\ x\in R^{\varepsilon }T}}\psi _{T}(x)f_{ \theta}(x)|+|\sum _{ \substack{T\in \mathbb{T}_{\theta ,\lambda}^{c}\\ x\notin R^{\varepsilon }T}}\psi _{T}(x)f_{ \theta}(x)| \\ &\le |\sum _{ \substack{T\in \mathbb{T}_{\theta ,\lambda}^{c}\\ x\in R^{\varepsilon }T}}\psi _{T}(x)f_{ \theta}(x)|+C_{\varepsilon }R^{-1000}|f_{\theta}(x)| . \end{aligned}$$

We know that \(\lambda M\ge C_{\varepsilon }R^{-1000}\max _{\theta}\|f_{\theta}\|_{L^{\infty}( \mathbb{R}^{3})}\) since if this did not hold, we would violate the lower bound for \(\alpha \). It follows that

$$ \lambda M\le \|\sum _{T\in \mathbb{T}_{\theta ,\lambda}^{c}}\psi _{T}f_{\theta} \|_{L^{\infty}(\mathbb{R}^{3})}\le 3R^{3\varepsilon }\lambda M. $$

The statement of the lemma is now satisfied with \(A=\lambda M\) and \(\tilde{\mathbb{T}}_{\theta}=\mathbb{T}_{\theta ,\lambda}^{c}\). □

Corollary 6.5

Let \(f\), \(\alpha \), \(\tilde{\mathbb{T}}_{\theta}\) and \(A>0\) be as in Proposition 6.4. Then for each \(x\in U_{\alpha }\),

$$ \alpha \le R^{103\varepsilon }\frac{1}{A} \sum _{\theta \in \mathcal {S}}|\sum _{T\in \tilde{\mathbb{T}}_{\theta}} \psi _{T} f_{\theta}|^{2}*\omega _{\theta}(x). $$

Proof

Let \(x\in U_{\alpha }\). Then using the rapid decay of \(\psi _{T}\) off of \(T\),

$$ \alpha \lesssim \big|\sum _{\theta}\sum _{ \substack{T\in \tilde{\mathbb{T}}_{\theta}\\ x\in R^{\varepsilon }T}}\psi _{T}(x)f_{ \theta}(x)\big|+C_{\varepsilon }R^{-1000}\max _{\theta}\|f_{\theta}\|_{L^{ \infty}(\mathbb{R}^{3})} . $$

Then the lower bound for \(\alpha \) and (35) imply that

$$ \alpha \lesssim \#\{\theta :\exists T\in \tilde{\mathbb{T}}_{\theta} \text{ satisfying } x\in R^{\varepsilon }T\}A. $$

By the locally constant property, if \(\eta _{\theta}\) is a standard bump function equal to 1 on \(\theta \), then

for some \(z_{\theta}\in \mathbb{R}^{3}\). Note that if either \(y\notin \frac{1}{2}R^{\varepsilon }T\) or \(z_{\theta}-y\notin \frac{1}{2}R^{\varepsilon }\theta ^{*}\), then . It follows from the lower bound on \(\alpha \) that

It remains to note that for each \(\theta \) counted on the right hand side above, and for each \(y\in R^{\varepsilon }T\), . □

Lemma 6.6

For each \(\alpha >0\), \(B_{R}\), and Schwartz function \(f:\mathbb{R}^{3}\to \mathbb{C}\) with Fourier transform supported in \(\mathcal {M}^{3}(R)\), there exists \(\beta >0\) such that \(\alpha ^{7}|\{x\in B_{R}:\alpha \le |f(x)|\}|\) is bounded by

$$\begin{aligned} &C(\log R)\alpha ^{7}|\{x\in B_{R}:\alpha \le |f(x)|,\quad \beta /2\le \sum _{ \theta}|f_{\theta}|^{2}*\omega _{\theta}(x)\le \beta \}|\\ &\quad {}+R^{-500}\int |\sum _{ \theta}|f_{\theta}|^{2}*\omega _{\theta}|^{7/2} . \end{aligned}$$

Proof

First fix the notation

$$ U_{\alpha }=\{x\in B_{R}:\alpha \le |f(x)|\} $$

and \(g=\sum _{\theta \in{\mathbf{{S}}}(R^{-1/3})}|f_{\theta}|^{2}*\omega _{\theta}\). Then write

$$\begin{aligned} |U_{\alpha }|&=\sum _{R^{-1000}\le \lambda \le 1}|\{x\in U_{\alpha }: g(x)\sim \lambda \|g\|_{L^{\infty}(B_{R})}\}|+|\{x\in U_{\alpha }: g(x)\\ &\le R^{-1000} \|g\|_{L^{\infty}(B_{R})}\}| \end{aligned}$$

where \(\lambda \) takes dyadic values and \(g(x)\sim \lambda \|g\|_{L^{\infty}(B_{R})}\) means \(\lambda \|g\|_{L^{\infty}(B_{R})}/2\le g(x)\le \lambda \|g\|_{L^{ \infty}(B_{R})}\). If one of the first \(\le C(\log R)\) terms dominates the sum, then the lemma is proved. Suppose instead that the last term dominates, so that

$$ |U_{\alpha }|\lesssim |\{x\in U_{\alpha }: g(x)\le R^{-1000}\|g\|_{L^{\infty}(B_{R})} \}|. $$

Then

$$\begin{aligned} \alpha ^{7}|U_{\alpha }|&\lesssim \int _{\{x\in U_{\alpha }: g(x)\le R^{-1000}\|g\|_{L^{ \infty}(B_{R})}\}}|f|^{7} \\ (\text{Cauchy-Schwarz})\qquad &\lesssim R^{2}\int _{\{x\in U_{\alpha }: g(x) \le R^{-1000}\|g\|_{L^{\infty}(B_{R})}\}}|\sum _{\theta}|f_{\theta}|^{2}|^{7/2} \\ (\text{loc. const. for each $|f_{\theta}|^{2}$})\qquad &\lesssim R^{2} \int _{\{x\in U_{\alpha }: g(x)\le R^{-1000}\|g\|_{L^{\infty}(B_{R})}\}}| \sum _{\theta}|f_{\theta}|^{2}*\omega _{\theta}|^{7/2} \\ &\lesssim R^{2}|B_{R}|R^{-1000}\|\sum _{\theta}|f_{\theta}|^{2}*\omega _{ \theta}\|_{L^{\infty}(B_{R})}^{7/2} \\ (\text{loc. const. for $\sum _{\theta}|f_{\theta}|^{2}$})\qquad & \lesssim R^{-995}\|\sum _{\theta}|f_{\theta}|^{2}*\omega _{\theta}*w_{R^{1/3}} \|_{L^{\infty}(B_{R})}^{7/2} \\ (\text{H\"{o}lder's inequality})\qquad &\lesssim R^{-995}\||\sum _{ \theta}|f_{\theta}|^{2}*\omega _{\theta}|^{7/2}*w_{R^{1/3}}\|_{L^{\infty}(B_{R})} \\ &\lesssim R^{-995}\int |\sum _{\theta}|f_{\theta}|^{2}*\omega _{\theta}|^{7/2}. \end{aligned}$$

 □

6.2 A multi-scale inequality for \(\text{S}_{1}(R)\) implying Proposition 6.1

First we use a broad/narrow analysis to prove a multi-scale inequality for \(\text{S}_{1}(R)\).

Lemma 6.7

For any \(1\le K^{3}\le R\) and \(1\le N_{0}\le \varepsilon ^{-1}\),

$$\begin{aligned} &\mathrm{S}_{1}(R)\\ &\quad \lesssim (\log R)^{2}\left ( K^{53}\big[R^{10 \varepsilon N_{0}}A^{ \varepsilon ^{-1}} + R^{4\varepsilon ^{2}+200\varepsilon }A^{\varepsilon ^{-1}} \mathrm{S}_{1}(R^{\varepsilon })^{\varepsilon ^{-1}-N_{0}} \big]+\mathrm{S}_{1}(R/K^{3})\right ) . \end{aligned}$$

Proposition 5.1 implies Lemma 6.7

Let \(f:\mathbb{R}^{3}\to \mathbb{C}\) be a Schwartz function with Fourier transform supported in \(\mathcal {M}^{3}(R)\). By Lemma 6.2, it suffices to bound \(\|f\|_{L^{7}(B_{R})}^{7}\) instead of \(\|f\|_{L^{7}(\mathbb{R}^{3})}^{7}\). By Lemma 6.3, we may fix \(\alpha >0\) so that \(\|f\|_{L^{7}(B_{R})}^{7}\lesssim (\log R)^{2}\alpha ^{7}|U_{\alpha }|\). By Proposition 6.4, we may replace \(\alpha \) by \(\alpha /A\) and replace \(f\) by \(\tilde{f}=\frac{1}{A}\sum _{\theta \in{\mathbf{{S}}}(R^{-1/3})}\sum _{T \in \tilde{\mathbb{T}}_{\theta}} \psi _{T}f_{\theta}\) where \(\tilde{\mathbb{T}}_{\theta}\) satisfies the properties in that proposition. From here, we will take \(f\) to mean \(\tilde{f}\). By Lemma 6.6, we may fix \(\beta >0\) so that \(\alpha ^{7}|U_{\alpha }|\lesssim (\log R)\alpha ^{7}|U_{\alpha ,\beta }|\). Finally, by Corollary 6.5, we have \(\alpha \lesssim R^{103}\).

Write \(f=\sum _{\tau \in{\mathbf{{S}}}(K^{-1})}f_{\tau}\). The broad-narrow inequality is

$$\begin{aligned} |f(x)|&\le 6\max _{\tau \in{\mathbf{{S}}}(K^{-1})}|f_{\tau}(x)|+K^{3}\max _{ \substack{d(\tau ^{i},\tau ^{j})\ge K^{-1}\\\tau ^{i}\in{\mathbf{{S}}}(K^{-1})}}|f_{ \tau ^{1}}(x)f_{\tau ^{2}}(x)f_{\tau ^{3}}(x)|^{\frac{1}{3}} . \end{aligned}$$
(39)

Indeed, suppose that the set \(\{\tau \in{\mathbf{{S}}}(K^{-1}):|f_{\tau}(x)|\ge K^{-1}\max _{\tau '\in{ \mathbf{{S}}}(K^{-1})}|f_{\tau '}(x)|\}\) has at least 5 elements. Then we can find three \(\tau ^{1},\tau ^{2},\tau ^{3}\) which are pairwise \(\ge K^{-1}\)-separated and satisfy \(|f(x)|\le K^{3}|f_{\tau ^{1}}(x)f_{\tau ^{2}}(x)f_{\tau ^{3}}(x)|^{ \frac{1}{3}}\). If there are fewer than 5 elements, then \(|f(x)|\le 6 \max _{\tau \in{\mathbf{{S}}}(K^{-1})}|f_{\tau}(x)|\).

The broad-narrow inequality leads to two possibilities. In one case, we have

$$ |U_{\alpha ,\beta }|\lesssim |\{x\in U_{\alpha ,\beta }:|f(x)|\le 6\max _{\tau \in{ \mathbf{{S}}}(K^{-1})}|f_{\tau}(x)|\}| . $$
(40)

Then the summary of inequalities from this case is

$$\begin{aligned} \int _{B_{R}}|f|^{7}\lesssim (\log R)^{2}\alpha ^{7}|U_{\alpha ,\beta }|\lesssim ( \log R)^{2}\sum _{\tau \in{\mathbf{{S}}}(K^{-1})}\int _{\mathbb{R}^{3}}|f_{\tau \in{\mathbf{{S}}}(K^{-1})}|^{7} . \end{aligned}$$

By rescaling for the moment curve and the definition of \(\text{S}_{1}(\cdot )\), we may bound each integral in the final upper bound by

$$ \int _{\mathbb{R}^{3}}|f_{\tau}|^{7}\le \text{S}_{1}(R/K^{3})\int _{\mathbb{R}^{3}}| \sum _{\theta \subset \tau}|f_{\theta}|^{2}*\omega _{\theta}|^{7/2}. $$

Noting that \(\sum _{\tau}\int _{\mathbb{R}^{3}}|\sum _{\theta \subset \tau}|f_{\theta}|^{2}* \omega _{\theta}|^{7/2}\le \int _{\mathbb{R}^{3}}|\sum _{\theta}|f_{\theta}|^{2}* \omega _{\theta}|^{7/2} \) finishes this case.

The remaining case from the broad-narrow inequality is that

$$ |U_{\alpha ,\beta }|\lesssim |\{x\in U_{\alpha ,\beta }:|f(x)|\le K^{3} \max _{ \substack{d(\tau ^{i},\tau ^{j})\ge K^{-1}\\\tau ^{i}\in{\mathbf{{S}}}(K^{-1})}}|f_{ \tau ^{1}}(x)f_{\tau ^{2}}(x)f_{\tau ^{3}}(x)|^{1/3}\}| . $$

We may further assume that

$$\begin{aligned} |U_{\alpha ,\beta }|&\lesssim |\{x\in U_{\alpha ,\beta }:|f(x)|\\ &\le K^{3} \max _{ \substack{d(\tau ^{i},\tau ^{j})\ge K^{-1}\\\tau ^{i}\in{\mathbf{{S}}}(K^{-1})}}|f_{ \tau ^{1}}(x)f_{\tau ^{2}}(x)f_{\tau ^{3}}(x)|^{1/3},\quad \max _{ \tau \in{\mathbf{{S}}}(K^{-1})}|f_{\tau}(x)|\le \alpha \}| \end{aligned}$$

since otherwise, we would be in the first case (40). The size of the set above is now bounded by a sum over pairwise \(K^{-1}\)-separated 3-tuples \((\tau ^{1},\tau ^{2},\tau ^{3})\) of \(|\text{Br}_{\alpha ,\beta }^{K}|\) from §5. Using Proposition 5.1 to bound \(|\text{Br}_{\alpha ,\beta }^{K}|\), the summary of the inequalities from this case is

$$\begin{aligned} &\int _{B_{R}}|f|^{7}\\ &\quad \lesssim (\log R)^{2} K^{53}\big[R^{10 \varepsilon N_{0}}A^{ \varepsilon ^{-1}} + R^{4\varepsilon ^{2}}A^{\varepsilon ^{-1}} \mathrm{S}_{1}(R^{\varepsilon })^{\varepsilon ^{-1}-N_{0}} \big]\int |\sum _{\theta}|f_{\theta}|^{2}*\omega _{\theta}|^{7/2}, \end{aligned}$$

which finishes the proof. □

With Lemma 6.7 in hand, we may now prove Proposition 6.1.

Proof of Proposition 6.1

Let \(\eta \) be the infimum of the set

$$ \mathcal {S}=\{\delta \ge 0:\sup _{R\ge 1}\frac{\text{S}_{1}(R)}{R^{\delta }}< \infty \}. $$

Suppose that \(\eta >0\). Let \(\varepsilon _{1}\), \(\eta >\varepsilon _{1}>0\), be a parameter we will specify later. By Lemma 6.7, we have

$$\begin{aligned} \sup _{R\ge 1}\frac{\text{S}_{1}(R)}{R^{\eta -\varepsilon _{1}}}\lesssim _{\varepsilon }{}& \sup _{R\ge 1}\frac{1}{R^{\eta -\varepsilon _{1}}}\Big[(\log R)^{2}\Big( K^{53} \big[R^{10 \varepsilon N_{0}} + R^{4\varepsilon ^{2}+10\varepsilon } \text{S}_{1}(R^{\varepsilon })^{\varepsilon ^{-1}-N_{0}} \big]\\ &{}+\text{S}_{1}(R/K^{3})\Big) \Big] \end{aligned}$$

where we are free to choose \(\varepsilon >0\), \(1\le N_{0}\le \varepsilon ^{-1}\), and \(1\le K^{3}\le R\). Continue to bound the expression on the right hand side by

$$\begin{aligned} &\sup _{R\ge 1}(\log R)^{2}\Big( K^{53} \frac{R^{10 \varepsilon N_{0}}}{R^{\eta -\varepsilon _{1}}} + K^{53} \frac{R^{4\varepsilon ^{2}+10\varepsilon }}{R^{N_{0}\varepsilon (\eta +\varepsilon _{1})-2\varepsilon _{1}}} \big[ \frac{\text{S}_{1}(R^{\varepsilon })}{R^{\varepsilon (\eta +\varepsilon _{1})}}\big]^{\varepsilon ^{-1}-N_{0}}\\ &\quad {}+ \frac{1}{K^{3(\eta +\varepsilon _{1})}R^{-2\varepsilon _{1}}} \frac{{\text{S}_{1}(R/K^{3})}}{(R/K^{3})^{\eta +\varepsilon _{1}}} \Big). \end{aligned}$$

By definition of \(\eta \),

$$ \sup _{R\ge 1}\frac{\text{S}_{1}(R^{\varepsilon })}{R^{\varepsilon (\eta +\varepsilon _{1})}}+\sup _{R \ge 1}\frac{{\text{S}_{1}(R/K^{3})}}{(R/K^{3})^{\eta +\varepsilon _{1}}} < \infty , $$

so it suffices to check that

$$\begin{aligned} \sup _{R\ge 1}(\log R)^{2}\Big( K^{53} \frac{R^{10 \varepsilon N_{0}}}{R^{\eta -\varepsilon _{1}}} +& K^{53} \frac{R^{4\varepsilon ^{2}+10\varepsilon }}{R^{N_{0}\varepsilon (\eta +\varepsilon _{1})-2\varepsilon _{1}}} + \frac{1}{K^{3(\eta +\varepsilon _{1})}R^{-2\varepsilon _{1}}} \Big)< \infty \end{aligned}$$

to obtain a contradiction. From here, choose \(N_{0}=\varepsilon ^{-1/2}\) and \(K=R^{\varepsilon _{1}}\) so that it suffices to check

$$\begin{aligned} \sup _{R\ge 1}(\log R)^{2}\Big( \frac{1}{R^{\eta -10\varepsilon ^{1/2}-54\varepsilon _{1}}} +& \frac{1}{R^{\varepsilon ^{1/2}\eta -4\varepsilon ^{2}-10\varepsilon -55\varepsilon _{1}}} + \frac{1}{R^{\varepsilon _{1}}} \Big)< \infty . \end{aligned}$$

This is clearly true if we choose \(\varepsilon >0\) to satisfy \(\min (\eta -10\varepsilon ^{1/2},\eta -4\varepsilon ^{3/2}-10\varepsilon ^{1/2})>\eta /2\) and then choose \(\varepsilon _{1}\) to be smaller than \(\frac{1}{55}\varepsilon ^{1/2}\eta /4\). Our reasoning has shown that \(\eta -\varepsilon _{1}\in \mathcal {S}\), which is a contradiction. Conclude that \(\eta =0\), as desired. □

6.3 Proof of Theorem 1

We will show that \(\text{S}_{1}(R)\lesssim _{\varepsilon }R^{\varepsilon }\) implies \(\text{S}_{2}(R)\lesssim _{\varepsilon }R^{\varepsilon }\), which proves Theorem 1. See Definitions 1 and 2 in §1.2 for the definitions of \(\text{S}_{1}(R)\) and \(\text{S}_{2}(R)\).

The following is a multi-scale inequality relating \(\text{S}_{1}(R)\) to \(\text{S}_{2}(\cdot )\) and \(\text{S}_{1}(\cdot )\) evaluated at parameters smaller than \(R\).

Proposition 6.8

For \(R\ge 10\),

$$ \mathrm{S}_{2}(R)\lesssim R^{\varepsilon }\mathrm{S}_{1}(R^{1/3})\max _{1\le \lambda \le R^{2/3}}\mathrm{S}_{2}(\lambda ). $$

Granting this proposition, we now prove Theorem 1.

Proof of Theorem 1

Let \(\eta \) be the infimum of the set

$$ \mathcal {S}=\{\delta \ge 0:\sup _{R\ge 1}\frac{\text{S}_{2}(R)}{R^{\delta }}< \infty \}. $$

Suppose that \(\eta >0\). Let \(\varepsilon _{1}>0\) satisfy \((1/3)\eta -\varepsilon _{1}>0\). By Proposition 6.8, we have

$$\begin{aligned} \sup _{R\ge 1}\frac{\text{S}_{2}(R)}{R^{\eta -\varepsilon _{1}}}&\le C_{\varepsilon }\sup _{R \ge 1}\Big[\frac{\text{S}_{1}(R)R^{\varepsilon }}{R^{\eta -\varepsilon _{1}}}\max _{1\le \lambda \le R^{2/3}}\text{S}_{2}(\lambda ) \Big] \\ &=C_{\varepsilon }\sup _{R\ge 1} \frac{\text{S}_{1}(R)R^{\varepsilon }}{R^{(1/3)\eta -\varepsilon _{1}-(2/3)\delta }}\max _{1\le \lambda \le R^{2/3}}\frac{\text{S}_{2}(\lambda )}{\lambda ^{\eta +\delta }} \frac{\lambda ^{\eta +\delta }}{R^{(2/3)(\eta +\delta )}} \end{aligned}$$

where \(\delta \) is any positive number in the last line. By definition of \(\eta \),

$$ \max _{1\le \lambda}\frac{S_{2}(\lambda )}{\lambda ^{\eta +\delta }}\le C_{ \delta }< \infty . $$

Therefore,

$$\begin{aligned} \sup _{R\ge 1}\frac{\text{S}_{1}(R)R^{\varepsilon }}{R^{(1/3)\eta -\varepsilon _{1}-(2/3)\delta }} \max _{1\le \lambda \le R^{2/3}} \frac{\text{S}_{2}(\lambda )}{\lambda ^{\eta +\delta }} \frac{\lambda ^{\eta +\delta }}{R^{(2/3)(\eta +\delta )}} &\lesssim _{\varepsilon ,\delta } \sup _{R\ge 1}\frac{\text{S}_{1}(R)R^{\varepsilon }}{R^{(1/3)\eta -\varepsilon _{1}-(2/3)\delta }}. \end{aligned}$$

Choose \(\delta >0\) so that \((1/3)\eta -\varepsilon _{1}-(2/3)\delta >0\). Then choose \(\varepsilon >0\) so that \((1/3)\eta -\varepsilon _{1}-(2/3)\delta -\varepsilon >0\). Since by Proposition 6.1, we know that \(\text{S}_{1}(R)\) grows slower than any power in \(R\), conclude that

$$ \sup _{R\ge 1}\frac{\text{S}_{1}(R)}{R^{(1/3)\eta -\varepsilon _{1}-(2/3)\delta -\varepsilon }}< \infty . $$

The sequence of inequalities shows that \(\eta -\varepsilon _{1}\in{ \mathcal {S}}\), which contradicts the fact that \(\eta \) is the infimum. Conclude that \(\eta =0\). □

It remains to prove Proposition 6.8, which we do presently.

Proof of Proposition 6.8

Let \(f:\mathbb{R}^{3}\to \mathbb{C}\) be a Schwartz function with Fourier transform supported in \(\mathcal {M}^{3}(R)\). Begin with the defining inequality for \(\text{S}_{1}(R^{1/3})\):

$$ \int _{\mathbb{R}^{3}}|f|^{7}\le S_{1}(R^{1/3})\int _{\mathbb{R}^{3}}|\sum _{\tau \in{\mathbf{{S}}}(R^{-1/9})}|f_{\tau}|^{2}*\omega _{\tau}|^{7/2}. $$
(41)

We choose the scale \(R^{1/3}\) because each \(\omega _{\tau}\) is localized to an \(R^{1/9}\times R^{2/9}\times R^{1/3}\) plank, which is contained in an \(R^{1/3}\) ball. The square function we are aiming for, \(\sum _{\theta \in{\mathbf{{S}}}(R^{-1/3})}|f_{\theta}|^{2}\), is locally constant on \(R^{1/3}\) balls, so we will be able to eliminate the weights and therefore obtain a bound for \(\text{S}_{2}(R)\). The idea for going from the right hand side of (41) to our desired right hand side is to perform an algorithm similar to Proposition 4.6. We may vastly simplify the process since here we have not performed any pruning steps which give \(k\)th versions of \(f\) with altered Fourier support.

Begin with the assumption that

(42)

Indeed, if this does not hold, then we may assume

Then we have reached our termination criterion since by local \(L^{2}\)-orthogonality, for each \(\tau \in{\mathbf{{S}}}(R^{-1/9})\),

Finally, simply note that and by Young’s convolution inequality,

$$ \int _{\mathbb{R}^{3}}|\sum _{\theta}|f_{\theta}|^{2}*w_{R^{1/3}}|^{7/2} \lesssim \int _{\mathbb{R}^{3}}|\sum _{\theta}|f_{\theta}|^{2}|^{7/2}. $$

From here on, assume that (42) holds.

Now we describe the simplified algorithm. Let \(\delta >0\) be a constant that we specify later in the proof. At intermediate step \(m\), we have the inequality

$$\begin{aligned} \text{(R.H.S. of (42))} \le{}& (C_{\delta }R^{2\delta }(\log R)^{10})^{m} \sum _{s_{m}\le \sigma \le 1}\sum _{ \substack{\tau \in{\mathbf{{S}}}(\sigma ^{-1}s_{m})}} \sum _{V\|V_{\tau ,s_{m}^{-3}}}|V| \\ &{}\times \Big(|V|^{-1}\int \sum _{ \substack{\tau '\subset \tau \\\tau '\in{\mathbf{{S}}}(s_{m})}}|f_{\tau '}|^{7/2} W_{V} \Big)^{2} \end{aligned}$$
(43)

in which \(R^{-\frac{1}{3}}\le s_{m}\le R^{-\frac{1}{9}}R^{-\frac{\varepsilon ^{2}}{2}m}\). We begin by showing that (43) holds with \(m=1\).

Step 1: The Fourier support of is in the annulus \(\{R^{-1/3}\le |\xi |\le 2R^{-1/9}\}\). Since \(\sum _{\substack{R^{-1/3}\le s\le 3R^{-1/9}}}\eta _{s}\equiv 1\) (where \(s\) takes dyadic values) on this annulus, there is some dyadic \(s_{1}\), \(R^{-1/3}\le s_{1}\le 3R^{-1/9}\) such that

By pointwise local \(L^{2}\)-orthogonality (or the proof of Lemma 2.4),

where \(\tau '\sim \tau ''\) means \(d(\tau ',\tau '')\le 3 s_{1}\). For each \(\tau \in{\mathbf{{S}}}(R^{-1/9})\) and \(\tau '\subset \tau \), \(\tau '\in{\mathbf{{S}}}(s_{1})\), the Fourier support of

is contained in \((10\tau '-10\tau ')\setminus B_{s_{1}}\), which, after dilating by a factor of \(s_{1}^{-1}\), we may identify with a conical cap as we did in §3. Therefore, we may apply Proposition 4.4 (with space rescaled by a factor of \(s_{1}^{-1}\)) to obtain

For each \(s_{1}\le \sigma \le 1\), \(\underline {\tau}\in{\mathbf{{S}}}(\sigma ^{-1}s_{1})\), and \(V\|V_{\underline {\tau},s_{1}^{-3}}\), by Cauchy-Schwarz and Hölder’s inequality,

Note that since \(\tau '\subset \tau \), we have \(\tau ^{*}\subset 2(\tau ')^{*}\) and each \((\tau ')^{*}\subset 2V_{\tau ,s_{1}^{-3}}\). Also note that \(B_{s_{1}}(0)\subset V_{\tau ,s_{1}^{-3}}\). By properties of weight functions, it follows that

This concludes Step 1.

From here, we assume that for \(m>1\), Step \(m-1\) holds. We will show that either the algorithm terminates with the desired inequality or Step \(m\) (i.e. (43)) holds.

Step m: Let \(\sigma \in [s_{m-1},1]\) be a dyadic value satisfying

$$\begin{aligned} \text{(R.H.S. of (42))}\le ( \text{R.H.S. of (43) with $m-1$ and $\sigma $}). \end{aligned}$$
(44)

The analysis splits into two cases depending on whether \(\sigma \ge R^{-\varepsilon ^{2}}\) or \(\sigma < R^{-\varepsilon ^{2}}\).

Step m: \(\sigma \ge R^{-\varepsilon ^{2}}\). Using Cauchy-Schwarz and then Hölder’s inequality, for each \(\tau \in{\mathbf{{S}}}(\sigma ^{-1}s_{m-1})\),

V V τ , s m 1 3 | V | ( V τ τ τ S ( s m 1 ) | f τ | 7 / 2 ) 2 V V τ , s m 1 3 | V | ( # τ τ ) τ τ τ S ( s m 1 ) ( V | f τ | 7 / 2 ) 2 V V τ , s m 1 3 | V | ( R ε 2 ) τ τ τ S ( s m 1 ) V | f τ | 7 R ε 2 τ τ τ S ( s m 1 ) R 3 | f τ | 7 .

By rescaling for the moment curve, for each \(\tau '\in{\mathbf{{S}}}(s_{m-1})\),

$$ \int _{\mathbb{R}^{3}}|f_{\tau '}|^{7}\le \text{S}_{2}(s_{m-1}^{3}R)\int _{\mathbb{R}^{3}}( \sum _{\theta \subset \tau '}|f_{\theta}|^{2})^{7/2}. $$

Since \(\sum _{\tau '\in{\mathbf{{S}}}(\sigma ^{-1}s_{m-1})}\int _{\mathbb{R}^{3}}(\sum _{ \theta \subset \tau '}|f_{\theta}|^{2})^{7/2}\le \int _{\mathbb{R}^{3}}( \sum _{\theta}|f_{\theta}|^{2})^{7/2}\), the algorithm terminates in this case with the inequality

$$\begin{aligned} (\text{L.H.S. of (23)})\le (C_{\delta }R^{2\delta }\log &R)^{10m}R^{ \varepsilon ^{2}}\text{S}_{1}(s_{m-1}^{3}R)\int _{\mathbb{R}^{3}}(\sum _{\theta \in{\mathbf{{S}}}(R^{-1/3})}|f_{ \theta}|^{2})^{7/2}. \end{aligned}$$
(45)

Step m: \(s_{m-1}\le \sigma \le R^{-\varepsilon ^{2}}\). Let \(\tilde{s}_{m-1}=\max (R^{-\frac{1}{3}},\sigma ^{\frac{1}{2}}s_{m-1})\). By Lemma 4.3 (which applies equally well to functions which have not been pruned), for each \(\tau \in{\mathbf{{S}}}(\sigma ^{-1}s_{m-1})\) and \(V\|V_{\tau ,s_{m-1}^{-3}}\),

V τ τ τ S ( s m 1 ) | f τ | 7 / 2 C δ R δ | V | 1 τ τ τ S ( s m 1 ) | τ τ τ S ( s ˜ m 1 ) | f τ | 2 | 7 / 4 W V .

Then using \(\|\cdot \|_{\ell ^{7/4}}\le \|\cdot \|_{\ell ^{1}}\) and Cauchy-Schwarz, we have

$$\begin{aligned} &\sum _{\substack{\tau \in{\mathbf{{S}}}(\sigma ^{-1}s_{m-1})}} \sum _{V\|V_{ \tau ,s_{m-1}^{-3}}}|V|\Big(|V|^{-1}\int \sum _{ \substack{\tau '\subset \tau \\\tau '\in{\mathbf{{S}}}(s_{m-1})}}(\sum _{ \substack{\tau ''\subset \tau '\\\tau ''\in{\mathbf{{S}}}(\tilde{s}_{m-1})}}|f_{ \tau ''}|^{2})^{7/4}W_{V} \Big)^{2} \\ &\quad\le \sum _{\substack{\tau \in{\mathbf{{S}}}(\sigma ^{-1}s_{m-1})}} \sum _{V \|V_{\tau ,s_{m-1}^{-3}}}|V|^{-1}\Big(\int (\sum _{ \substack{\tau ''\subset \tau \\\tau ''\in{\mathbf{{S}}}(\tilde{s}_{m-1})}}|f_{ \tau ''}|^{2})^{7/4}W_{V} \Big)^{2} \\ &\quad\le \sum _{\substack{\tau \in{\mathbf{{S}}}(\sigma ^{-1}s_{m-1})}} \int _{ \mathbb{R}^{3}}(\sum _{ \substack{\tau ''\subset \tau \\\tau ''\in{\mathbf{{S}}}(\tilde{s}_{m-1})}}|f_{ \tau ''}|^{2})^{7/2} . \end{aligned}$$

If \(\tilde{s}_{m-1}=R^{-\frac{1}{3}}\), then the algorithm terminates with the inequality

$$\begin{aligned} (\text{R.H.S. of (42)})\le (C_{\delta }R^{2\delta }(\log &R)^{10})^{ m} \int _{\mathbb{R}^{3}}(\sum _{\theta}|f_{\theta}|^{2})^{7/2} . \end{aligned}$$
(46)

Otherwise, suppose that \(\tilde{s}_{m-1}>R^{-\frac{1}{3}}\). Then the inequality so far reads

$$\begin{aligned} (\text{L.H.S. of (23)})\le (C_{\delta }R^{2\delta }(\log &R)^{10})^{(m-1)}(C_{ \delta }R^{\delta }\log R)\int _{\mathbb{R}^{3}}(\sum _{\tau ''\in{\mathbf{{S}}}(\sigma ^{ \frac{1}{2}}s_{m-1})}|f_{\tau ''}|^{2})^{7/4}. \end{aligned}$$

By the same reasoning as in Step 1, either we are in a termination case analogous to when (42) does not hold, or there is some \(s_{m}\le \sigma ^{\frac{1}{2}}s_{m-1}\) for which

R 3 ( τ S ( σ 1 2 s m 1 ) | f τ | 2 ) 7 / 4 ( C δ R δ log R ) s m σ 1 τ S ( σ 1 s m ) V V τ , s m 3 | V | ( V τ τ τ S ( s m ) | f τ | 7 / 2 ) 2 .

By the hypothesis that \(s_{m-1}\le R^{-\frac{1}{9}}R^{\frac{-\varepsilon ^{2}}{2}(m-1)}\) and \(\sigma \le R^{-\varepsilon ^{2}}\), we have \(s_{m}\le R^{-\frac{1}{9}}R^{\frac{-\varepsilon ^{2}}{2}m}\). This completes the justification of Step \(m\).

Termination criteria. It remains to summarize the termination criteria. In each of the termination scenarios, if \(\text{S}_{2}(\cdot )\) appears, then it is multiplied by one factor of \(R^{\varepsilon ^{2}}\) and is evaluated at some parameter \(\lambda \) between 1 and \(R^{2/3}\). Finally, consider the accumulated constant

$$ (C_{\delta }R^{\delta }(\log R)^{10})^{M} $$

after \(M\) steps of the algorithm. Since \(R^{-1/3}\le R^{-1/9}R^{-\varepsilon ^{2} M/2}\), we have \(M\le \varepsilon ^{-2}/2\). Therefore, taking \(\delta =\varepsilon ^{4}\) gives

$$ (C_{\delta }R^{\delta }(\log R)^{10})^{M}\le \tilde{C}_{\varepsilon }R^{\varepsilon ^{3/2}}, $$

which suffices to prove the proposition. □