On Superorthogonality

Pierce, Lillian B.

doi:10.1007/s12220-021-00606-3

On Superorthogonality

Published: 17 March 2021

Volume 31, pages 7096–7183, (2021)
Cite this article

The Journal of Geometric Analysis Aims and scope Submit manuscript

Lillian B. Pierce ORCID: orcid.org/0000-0002-0194-0083¹

295 Accesses
7 Citations
Explore all metrics

Abstract

In this survey, we explore how superorthogonality amongst functions in a sequence $f_1,f_2,f_3,\ldots $ results in direct or converse inequalities for an associated square function. We distinguish between three main types of superorthogonality, which we demonstrate arise in a wide array of settings in harmonic analysis and number theory. This perspective gives clean proofs of central results, and unifies topics including Khintchine’s inequality, Walsh–Paley series, discrete operators, decoupling, counting solutions to systems of Diophantine equations, multicorrelation of trace functions, and the Burgess bound for short character sums.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

On polynomials in primes, ergodic averages and monothetic groups

Article 17 February 2024

On generalized powers of operators

Article 29 September 2022

Analytic Number Theory in the Last Decade

Article 01 July 2022

Notes

ETH Zürich, Rämistrasse 101, 8092 Zürich, Switzerland. Email: kowalski@math.ethz.ch.
To be more precise, this applies exactly in this way only when all $x\in {\mathbb {Z}}/q{\mathbb {Z}}$ are “unramified” for $\varrho $; since exceptions to this are rare for the cases that interest us, and since there is in any case a similar (but slightly more complicated) description even when x is ramified, we do not dwell on this issue.

References

Apostol, T.M.: Introduction to Analytic Number Theory, vol. I. Springer, New York (1976)
Book MATH Google Scholar
Bednorz, W.: A note on the Menchov–Rademacher inequality. Bull. Pol. Acad. 54(1), 26–30 (2006)
MathSciNet Google Scholar
Billard, P.: Sur la convergence presque partout des séries de Fourier–Walsh des fonctions de l’espace $L^{2}\, (0,\,1)$. Studia Math. 28, 363–388 (1967)
Article MATH Google Scholar
Bourgain, J., Chang, M.-C.: On a multilinear character sum of Burgess. C. R. Acad. Sci. Paris I(348), 115–120 (2010)
Article MathSciNet MATH Google Scholar
Bourgain, J., Demeter, C.: A study guide for the $l^2$ decoupling theorem. Chin. Ann. Math. B 38(1), 173–200 (2017)
Article MathSciNet MATH Google Scholar
Bourgain, J., Demeter, C., Guth, L.: Proof of the main conjecture in Vinogradov’s mean value theorem for degrees higher than three. Ann. Math. (2) 184(2), 633–682 (2016)
Article MathSciNet MATH Google Scholar
Bourgain, J.: An approach to pointwise ergodic theorems. In: Geometric Aspects of Functional Analysis (1986/87), Lecture Notes in Mathematics, vol. 1317, pp. 204–223. Springer, Berlin (1988)
Bourgain, J.: On the maximal ergodic theorem for certain subsets of the integers. Isr. J. Math. 61, 39–72 (1988)
Article MathSciNet MATH Google Scholar
Bourgain, J.: On the pointwise ergodic theorem on ${L}^p$ for arithmetic sets. Isr. J. Math 61, 73–84 (1988)
Article MATH Google Scholar
Bourgain, J.: Pointwise ergodic theorems for arithmetic sets. Inst. Hautes Études Sci. Publ. Math. (69):5–45 (with an Appendix by the author, Harry Furstenberg. Yitzhak Katznelson and Donald S, Ornstein) (1989)
Burgess, D.A.: The distribution of quadratic residues and non-residues. Mathematika 4, 106–112 (1957)
Article MathSciNet MATH Google Scholar
Burgess, D.A.: On character sums and ${L}$-series. J. Reine Angew. Math. 3(12), 193–206 (1962)
MathSciNet MATH Google Scholar
Burgess, D.A.: On character sums and ${L}$-series II. Proc. Lond. Math. Soc. 3(13), 524–536 (1963)
Article MathSciNet MATH Google Scholar
Burgess, D.A.: The character sum estimate with $r =3$. J. Lond. Math. Soc. 2(33), 219–226 (1986)
Article MATH Google Scholar
Chang, M.-C.: On a question of Davenport and Lewis and new character sum bounds in finite fields. Duke Math. J. 145(3), 409–442 (2008)
Article MathSciNet MATH Google Scholar
Chang, M.-C.: Burgess inequality in ${\mathbb{F}}_{p^2}$. Geom. Funct. Anal. 19, 1001–1016 (2009)
Article MathSciNet Google Scholar
Chang, M.-C.: An estimate of incomplete mixed character sums. In: An Irregular Mind, Bolyai Society Mathematics Studies, vol. 21, pp. 243–250. János Bolyai Mathematics Society, Budapest (2010)
Córdoba, A.: A note on Bochner–Riesz operators. Duke Math. J. 46(3), 505–511 (1979)
Article MathSciNet MATH Google Scholar
Davenport, H., Erdős, P.: The distribution of quadratic and higher residues. Publ. Math. Debr. 2, 252–265 (1952)
Article MathSciNet MATH Google Scholar
Davenport, H., Lewis, D.J.: Character sums and primitive roots in finite fields. Rend. Circ. Mat. Palermo Ser. II Tomo XII Anno(1963) XII, 129–136 (1963)
Article MathSciNet MATH Google Scholar
Deligne, P.: La conjecture de Weil II. Inst. Hautes Études Sc. Publ. Math. No. 52, 137–252 (1980)
Article MathSciNet MATH Google Scholar
Doob, J.L.: Stochastic Processes. Wiley, New York (1953)
MATH Google Scholar
Fine, N.J.: On the Walsh functions. Trans. Am. Math. Soc. 65, 372–414 (1949)
Article MathSciNet MATH Google Scholar
Fouvry, É., Ganguly, S., Kowalski, E., Michel, P.: Gaussian distribution for the divisor function and Hecke eigenvalues in arithmetic progressions. Comment. Math. Helv. 89(4), 979–1014 (2014)
Article MathSciNet MATH Google Scholar
Fouvry, É., Kowalski, E., Michel, P.: A study in sums of products. Philos. Trans. R. Soc. A 373(2040), 1–26 (2015)
Article MathSciNet MATH Google Scholar
Fouvry, É., Kowalski, E., Michel, P., Raju, C.S., Rivat, J., Soundararajan, K.: On short sums of trace functions. Ann. Inst. Fourier (Grenoble) 167(1), 423–449 (2017)
Article MathSciNet MATH Google Scholar
Fouvry, É., Kowalski, E., Michel, P., Sawin, W.: Lectures on applied $\ell $-adic cohomology. Contemp. Math. 740, 113–195 (2019)
Article MathSciNet MATH Google Scholar
Friedlander, J.B., Iwaniec, H., Mazur, B., Rubin, K.: The spin of prime ideals. Invent. Math. 193(3), 697–749 (2013)
Article MathSciNet MATH Google Scholar
Gallagher, P.X., Montgomery, H.L.: A note on Burgess’s estimate. Math. Notes 88, 321–329 (2010)
Article MathSciNet MATH Google Scholar
Gressman, P.T., Guo, S., Pierce, L.B., Roos, J., Yung, P.-L.: Reversing a philosophy: from counting to square functions and decoupling. J. Geom. Analysis (2020). arXiv:1906.05877
Gressman, P.T.: Geometric averaging operators and nonconcentration inequalities (2019). arXiv:1906.04599
Haagerup, U.: The best constants in the Khintchine inequality. Studia Math. 70(3), 231–283 (1981)
Article MathSciNet MATH Google Scholar
Harper, A., Nikeghbali, A., Radziwiłł, M.: A note on Helson’s conjecture on moments of random multiplicative functions. In: Analytic Number Theory. Springer, Cham (2015)
Heap, W., Lindqvist, S.: Moments of random multiplicative functions and truncated characteristic polynomials. Q. J. Math. 67(4), 683–714 (2015)
MathSciNet MATH Google Scholar
Heath-Brown, D.R.: Burgess’s bounds for character sums. Proc. Math. Stat. 43, 199–213 (2012)
MathSciNet MATH Google Scholar
Heath-Brown, D.R.: Small solutions of quadratic congruences, and character sums with binary quadratic forms. Mathematika 62, 551–571 (2016)
Article MathSciNet MATH Google Scholar
Heath-Brown, D.R., Pierce, L.B.: Burgess bounds for short mixed character sums. J. Lond. Math. Soc. (2) 91(3), 693–708 (2015)
Article MathSciNet MATH Google Scholar
Ionescu, A.D., Wainger, S.: ${L}^p$ boundedness of discrete singular Radon transforms. J. Am. Math. Soc. 19(2), 357–383 (2005)
Article MATH Google Scholar
Iwaniec, H., Kowalski, E.: Analytic Number Theory, vol. 53. Amer. Math. Soc. Colloquium Publications, Providence RI (2004)
MATH Google Scholar
Kac, M.: Statistical Independence in Probability, Analysis, and Number Theory. The Mathematical Association of America, Washington, DC (1964)
Google Scholar
Kaczmarz, S.: Über ein Orthogonalsystem. In: Comptes rendus du premier congrès des math. des pays slaves (Varsovie), pp. 189–192 (1929)
Kaczmarz, S., Steinhaus, H.: Le systéme orthogonal de M. Rademacher. Studia Math. 2(1), 231–247 (1930)
Article MATH Google Scholar
Kaczmarz, S., Steinhaus, H.: Theorie der Orthogonalreihen. Instytut Matematyczny Polskiej Akademi Nauk, Warszawa-Lwów (1936)
MATH Google Scholar
Katz, N.M.: Exponential Sums and Differential Equations. Annals of Mathematics Studies, vol. 124. Princeton University Press, Princeton (1990)
Book MATH Google Scholar
Kowalski, E., Ricotta, G.: Fourier coefficients of $GL(N)$ automorphic forms in arithmetic progressions. Geom. Funct. Anal. 24(4), 1229–1297 (2014)
Article MathSciNet MATH Google Scholar
Magyar, A., Stein, E.M., Wainger, S.: Discrete analogues in harmonic analysis: spherical averages. Ann. Math. 155, 189–208 (2002)
Article MathSciNet MATH Google Scholar
Menchov, D.: Sur les séries de fonctions orthogonales. Fundam. Math. 1, 82–105 (1923)
Article Google Scholar
Mirek, M., Stein, E.M., Zorin-Kranich, P.: Jump inequalities for translation-invariant operators of Radon type on ${\mathbb{Z}}^d$ (2018). arXiv:1809.03803
Paley, R.E.A.C.: A remarkable series of orthogonal functions (I). Proc. Lond. Math. Soc. (2) 34(4), 241–264 (1932)
Article MathSciNet MATH Google Scholar
Paley, R.E.A.C., Zygmund, A.: On some series of functions. Math. Proc. Camb. Philos. Soc. 26(3), 337–357 (1930)
Article MATH Google Scholar
Petrow, I., Young, M.P.: The fourth moment of Dirichlet $L$-functions along a coset and the Weyl bound (2019). arXiv:1908.10346
Pierce, L.B.: Burgess bounds for multi-dimensional short mixed character sums. J. Number Theory 163, 172–210 (2016)
Article MathSciNet MATH Google Scholar
Pierce, L.B.: The Vinogradov mean value theorem [after Wooley, and Bourgain, Demeter and Guth]. Number 407, Exp. No. 1134. In: Séminaire Bourbaki, vol. 2016/2017. Exposés 1120–1135, pp. 479–564 (2019)
Pierce, L.B.: Burgess bounds for short character sums evaluated at forms II: the mixed case. Riv. Mat. Univ. di Parma (2020). arXiv:2002.03435
Pierce, L.B., Xu, J.: Burgess bounds for short character sums evaluated at forms. Algebra Number Theory 14, 1911–1951 (2020)
Article MathSciNet MATH Google Scholar
Pólya, G.: Über die Verteilung der quadratischen Reste und Nichtreste, pp. 21–29. Göttinger Nachrichten (1918)
Rademacher, H.: Einige Sätze über Reihen von allgemeinen Orthogonal-Funktionen. Math. Ann. 87, 112–138 (1922)
Article MathSciNet MATH Google Scholar
Rubio de Francia, J.L.: A Littlewood–Paley inequality for arbitrary intervals. Rev. Mat. Iberoam. 1(2), 1–14 (1985)
Article MathSciNet MATH Google Scholar
Sjölin, P.: An inequality of Paley and convergence ae of Walsh–Fourier series. Ark. Mat. 7, 551–570 (1969)
Article MathSciNet MATH Google Scholar
Stein, E.M.: On limits of seqences of operators. Ann. Math. 2(74), 140–170 (1961)
Article MathSciNet Google Scholar
Stein, E.M.: Singular Integrals and Differentiability Properties of Functions. Princeton University Press, Princeton (1970)
MATH Google Scholar
Stein, E.M.: Harmonic Analysis: Real-Variable Methods, Orthogonality, and Oscillatory Integrals. Princeton University Press, Princeton (1993)
MATH Google Scholar
Thiele, C.M.: Time–frequency analysis in the discrete phase plane. ProQuest LLC, Ann Arbor, MI, 1995. Thesis (PhD), Yale University
Vinogradov, I.M.: Sur la distribution des résidus et des nonrésidus des puissances. J. Soc. Phys. Math. Soc. Univ. Permi 1, 94–96 (1918)
Google Scholar
Vinogradov, I.M.: On a general theorem concerning the distribution of the residues and non-residues of powers. Trans. Am. Math. Soc. 29, 209–217 (1927)
Article MathSciNet MATH Google Scholar
Walsh, J.L.: A closed set of normal orthogonal functions. Am. J. Math. 45, 5–24 (1923)
Article MathSciNet MATH Google Scholar
Weil, A.: On the Riemann hypothesis in function fields. Proc. Nat. Acad. Sci. USA 27, 345–347 (1941)
Article MATH Google Scholar
Weil, A.: Sur les courbes algébriques et les variétés qui s’en déduisent. Actualités Math. et Sci., 1041(Deuxième partie,):§${\rm IV}$, (1945)
Wintner, A.: Random factorizations and Riemann’s hypothesis. Duke Math. J. 11, 267–275 (1944)
Article MathSciNet MATH Google Scholar
Wolff, T.H.: In: Łaba, I., Shubin, C. (eds) Lectures on Harmonic Analysis, University Lecture Series, vol. 29. AMS, Providence (2003) (with a foreword by C. Fefferman and preface by I. Łaba)
Zygmund, A.: Trigonometric Series, Volumes I and II, 3rd edn. Cambridge University Press, Cambridge (2002)

Download references

Acknowledgements

Pierce is partially supported by NSF CAREER Grant DMS-1652173, a Sloan Research Fellowship, and the AMS Joan and Joseph Birman Fellowship.

Author information

Authors and Affiliations

Department of Mathematics, Duke University, 120 Science Drive, Durham, NC, 27708, USA
Lillian B. Pierce

Authors

Lillian B. Pierce
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Lillian B. Pierce.

Additional information

Dedicated to the memory of Elias M. Stein.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

With an “Appendix” by Emmanuel Kowalski.

Appendices

Appendix A: Further Remarks on Walsh–Paley Series

We deferred a few details on the direct and converse inequalities in the setting of Walsh–Paley series in Sect. 4. Here, we first remark on the limiting argument to obtain (4.4) for $p=2r$ from the truncated version (4.21). Second, we remark on deducing the cases for $1<p<\infty $ from the cases with p an even integer; this illustrates a further application of Khintchine’s inequality. Third, we show how to deduce the operator bound (4.5) from the dyadic direct and converse inequalities (4.4).

1.1 A.1. Limiting Arguments for Direct and Converse Inequalities

Fix $p=2r$. In the main text we showed that uniformly in N,

$$\begin{aligned} \left\| \sum _{n=0}^N f_n \right\| _{L^p} \le c_p \left\| \left( \sum _{n=0}^N f_n^2\right) ^{1/2} \right\| _{L^p} \le c_p \left\| \left( \sum _{n=0}^\infty f_n^2\right) ^{1/2} \right\| _{L^p}. \end{aligned}$$

The same method of proof used to obtain this shows that for any $N_1< N_2$,

$$\begin{aligned} \Vert S_{2^{N_2}}f - S_{2^{N_1}} f \Vert _{L^p}\le c_p \left\| \left( \sum _{n=N_1+1}^{N_2} f_n^2\right) ^{1/2} \right\| _{L^p}. \end{aligned}$$

If f is such that the right-hand side of the direct inequality converges, then this tail must vanish as $N_1, N_2 \rightarrow \infty $, so that as $N \rightarrow \infty $, $S_{2^N}f$ converges in $L^p$ norm to some function, say F, which satisfies $ \Vert F\Vert _{L^p} \le c_p\left\| \left( \sum _{n=0}^\infty f_n^2\right) ^{1/2}\right\| _{L^p} .$ By the Dominated Convergence Theorem, for each m

$$\begin{aligned} c_m(F) = \int _0^1 F(\theta ) w_m(\theta ) \mathrm{d}\theta = \int _0^1 f(\theta )w_m(\theta ) \mathrm{d}\theta = c_m(f), \end{aligned}$$

and since $\{w_m\}$ is a complete orthonormal system on [0, 1], we conclude $F=f$, verifying the direct inequality. For the converse inequality, we apply the maximal bound (4.20) to see that $\left\| \left( \sum _{n=0}^N f_n^2\right) ^{1/2} \right\| _{L^p} \le c_p' \left\| \sum _{n=0}^N f_n \right\| _{L^p} \ll _p \Vert f\Vert _{L^p}$ uniformly in N, which suffices.

1.2 A.2 Linearization

We have verified the direct and converse inequalities (4.4) in $L^p$ for each even integer $p \ge 2$. To conclude the results for all $1< p <\infty $, we recall Paley’s arguments (now standard), in which the Rademacher functions again make an appearance, via Khintchine’s inequality.

One would like to interpolate either the direct inequality (or the converse inequality, respectively), but one must first linearize. For any fixed $1<p<\infty $, the truth for all $f \in L^p$ of the direct and converse inequalities

$$\begin{aligned} \left\| \left( \sum _{n=0}^\infty f_n^2\right) ^{1/2} \right\| _{L^p} \ll _p \Vert f\Vert _{L^p} \ll _p \left\| \left( \sum _{n=0}^\infty f_n^2\right) ^{1/2} \right\| _{L^p} \end{aligned}$$

(A.1)

is equivalent to the truth of the statement that

$$\begin{aligned} \Vert f^*\Vert _{L^p}\ll _p \Vert f\Vert _{L^p} \ll _p \Vert f^*\Vert _{L^p} \end{aligned}$$

(A.2)

holds for all $f \in L^p$, uniformly for all choices of $\varepsilon _n \in \{ \pm 1\}$, where

$$\begin{aligned} f^*(t) = \sum _{n=0}^\infty \varepsilon _n f_n(t). \end{aligned}$$

The advantage of (A.2) is that the expressions in this inequality are linear, and thus well-suited to interpolation.

Let us verify the equivalence. If (A.2) holds, to deduce (A.1), we use the Rademacher functions. Given f and its associated sequence $\{f_n\}$ we define an auxiliary function $F(t,\theta ) = \sum _{n=0}^\infty r_n(\theta ) f_n(t)$ for each $\theta \in [0,1]$. By assumption of (A.2), for each fixed $\theta $,

$$\begin{aligned} \int _0^1 | F(t,\theta )|^p \mathrm{d}t \ll _p \int _0^1 |f(t)|^p \mathrm{d}t \ll _p \int _0^1 | F(t,\theta )|^p \mathrm{d}t. \end{aligned}$$

We integrate this over $\theta \in [0,1]$ to conclude by Fubini’s theorem that

$$\begin{aligned} \int _0^1 \int _0^1 \left| \sum _{n=0}^\infty r_n(\theta ) f_n(t)\right| ^p \mathrm{d}\theta \mathrm{d}t \ll _p \int _0^1 |f(t)|^p \mathrm{d}t \ll _p \int _0^1 \int _0^1 \left| \sum _{n=0}^\infty r_n(\theta ) f_n(t)\right| ^p\mathrm{d}\theta \mathrm{d}t. \end{aligned}$$

Now for each fixed t we apply Khintchine’s inequality (2.5), and this proves that (A.1) holds, as desired.

The converse is more elementary. Given $f \in L^p$, and any choice of $\{\varepsilon _n\}$, $f^*$ is the function with associated expansion $\sum _{n=0}^\infty g_n$ with $g_n=\varepsilon _n f_n$, so that applying the direct inequality followed by the converse inequality assumed in (A.1) shows that

$$\begin{aligned} \Vert f^* \Vert _{L^p} \ll _p \left\| \left( \sum _n g_n^2\right) ^{1/2} \right\| _{L^p} =\left\| \left( \sum _n f_n^2\right) ^{1/2} \right\| _{L^p} \ll _p \Vert f\Vert _{L^p}. \end{aligned}$$

One obtains $\Vert f\Vert _{L^p} \ll _p \Vert f^*\Vert _{L^p}$ in an analogous fashion.

1.3 A.3 Remarks for $2 \le p < \infty $

We know that (A.1) and hence (A.2) holds for each $p=2r$ with $r \ge 1$ an integer. We fix a sequence $\{ \varepsilon _n\}_n$ with $\varepsilon _n \in \{\pm 1\}$ and consider a truncation $(S_{2^N}f)^* (t) = \sum _{0 \le n \le N} \varepsilon _n f_n(t)$. Then applying the left-hand side of (A.2), for every even integer $p \ge 2$,

$$\begin{aligned} \Vert (S_{2^N}f)^* \Vert _{L^p} \ll _p \Vert S_{2^N}f\Vert _{L^p} \ll _p\Vert f\Vert _{L^p}, \end{aligned}$$

in which the last inequality holds uniformly in N, by the maximal theorem in (4.20). By Riesz–Thorin interpolation between $p=2$ and any even integer, we conclude that this inequality holds for all $2 \le p < \infty $. For a fixed $p \ge 2$, we can then deduce that $(S_{2^N}f)^*$ converges in $L^p$ norm to a limit function, say $F^*$. By the Dominated Convergence Theorem, the coefficients $c_m(F^*)$ agree with those of $f^*$, and since the Walsh functions form a complete system, we learn that $F^*=f^*$. We conclude that $\Vert f^*\Vert _{L^p} \ll _p \Vert f\Vert _{L^p}$, obtaining the left-hand inequality of (A.2) for each $2\le p < \infty $. For the other inequality, we simply observe that given f and a fixed sequence $\{\varepsilon _n\}$, then $(f^*)^*=f$, so the right-hand inequality of (A.2) follows.

1.4 A.4 Remarks for $1< p \le 2$

One again uses the linearized inequalities (A.2) in order to apply duality. Fix $1<p \le 2$, and fix a sequence of $\varepsilon _n \in \{\pm 1\}$, and accordingly define $f_N^* = \sum _{0 \le n \le N} \varepsilon _n f_n$. By duality, to show that $\Vert f_N^*\Vert _{L^p} \ll _p \Vert f\Vert _{L^p}$ it suffices to show that for all $g \in L^{p'}$ with $1/p + 1/p'=1$, $\Vert f_N^* g\Vert _{L^1} \ll _p \Vert g\Vert _{L^{p'}} \Vert f\Vert _{L^p}.$ Precisely,

$$\begin{aligned} \left\| \left( \sum _{n=0}^{N} \varepsilon _n f_n\right) g \right\| _{L^1} = \left\| \left( \sum _{n=0}^{N} \varepsilon _n g_n\right) f \right\| _{L^1} \le \left\| \sum _{n=0}^{N} \varepsilon _n g_n\right\| _{L^{p'}} \left\| {f}\right\| _{L^p}, \end{aligned}$$

with the last inequality due to Hölder’s inequality. We apply the known case for $p' \ge 2$, so that $\left\| \sum _{n=0}^{N} \varepsilon _n g_n\right\| _{L^{p'}}\ll _p \Vert g\Vert _{L^{p'}}$, uniformly in the choice of signs $\{ \varepsilon _n\}$. We conclude that $\Vert f_N^*\Vert _{L^p} \ll _p \Vert f\Vert _{L^p}$ uniformly in N, and uniformly in the choice of $\{\varepsilon _n\}$. Thus we may argue as before that $f_N^*$ converges in $L^p$ norm to a function, which we may check is indeed $f^* = \sum \varepsilon _n f_n$, and this verifies that $\Vert f^*\Vert _{L^p} \ll _p \Vert f\Vert _{L^p}$ holds. For the other inequality, we again note that for each fixed choice of signs, $(f^*)^*=f$, and thus we obtain $\Vert f\Vert _{L^p} \ll _p \Vert f^*\Vert _{L^p}$, concluding the proof.

1.5 A.5 Combining the Direct and Converse Inequalities

Fix $1<p<\infty $ and $n \ge 1$. To combine the direct and converse inequalities for the dyadic differences $f_n = S_{2^{n}}f - S_{2^{n-1}}f$ in order to bound $S_n f$ on $L^p$, we must be able to express the partial sum $S_nf$ in terms of dyadic differences. Paley employs an identity of the following flavor. Write the binary expansion $n=2^{n_1} + \cdots + 2^{n_s}$ with $n_1> \cdots > n_s$. We claim

$$\begin{aligned} w_n(t) w_n(\theta ) \sum _{m=0}^{n-1} w_m(t)w_m(\theta )= & {} \sum _{m \in [2^{n_1}, 2^{n_1+1})} w_m(t)w_m(\theta ) \nonumber \\+ & {} \cdots +\sum _{m \in [2^{n_s}, 2^{n_s+1})} w_m(t)w_m(\theta ). \end{aligned}$$

(A.3)

Once we have verified this, the deduction is simple. Recall

$$\begin{aligned} S_n f (t) = \sum _{m=0}^{n-1} c_m(f) w_m(t) = \int _0^1 f(\theta ) \sum _{m=0}^{n-1} w_m(\theta ) w_m(t) \mathrm{d}\theta . \end{aligned}$$

To introduce the extraneous factor $w_n$ which is critical to the identity (A.3), given any $f \in L^p[0,1]$ we define the function $g(\theta ) = f(\theta ) w_n(\theta )$ with identical $L^p$ norm; we will also use the notation $g_m = S_{2^{m}} g - S_{2^{m-1}} g.$ Then using $w_n(\theta )^2 \equiv 1$ followed by (A.3),

$$\begin{aligned} w_n(t) (S_n f)(t) = \int _0^1 g(\theta ) w_n(\theta ) w_n(t) \sum _{m=0}^{n-1} w_m(\theta ) w_m(t) \mathrm{d}\theta = g_{n_1+1}(t) + \cdots + g_{n_s+1}(t). \end{aligned}$$

Now applying first the direct inequality and then the converse inequality for the functions $\{g_n\}$ we obtain the desired result:

$$\begin{aligned} \Vert S_nf \Vert _{L^p}= & {} \Vert \sum _{j=1}^s g_{n_j+1} \Vert _{L^p} \ll _p \left\| \left( \sum _{j=1}^s g_{n_j+1}^2\right) ^{1/2} \right\| _{L^p}\\\le & {} \left\| \left( \sum _{n=0}^\infty g_n^2\right) ^{1/2} \right\| _{L^p} \ll _p \Vert g \Vert _{L^p} = \Vert f \Vert _{L^p}. \end{aligned}$$

To verify (A.3), it suffices to observe an equivalent identity about sets of numbers written in binary (also expressible in terms of properties of the Walsh group or “dyadic group,” see [23, §2] or [3]). Precisely, fix n and $m \le n$ and suppose $n=2^{n_1} + \cdots + 2^{n_s}$ (with $n_1> \cdots > n_s$) and $m=2^{m_1} + \cdots + 2^{m_r}$ (with $m_1> \cdots > m_r$), and let the $(n_1+1)$-digit representation of n and m in binary be ${\underline{n}}, {\underline{m}}$, respectively. Then $w_n w_m = w_u$ where ${\underline{u}} = {\underline{n}} \oplus {\underline{m}}$; here $\oplus $ denotes exclusive-or summation. (Since the square of any Rademacher function is identically one, if any exponent occurs in both the binary expansion of n and of m, then it does not appear as an exponent in the binary expansion of u for the function $w_u$ such that $w_u = w_nw_m$.)

Consequently, (A.3) is equivalent to the following identity on sets of distinct binary numbers:

$$\begin{aligned} \{ {\underline{n}} \oplus {\underline{m}} : 0 \le m< n \} = \bigsqcup _{j=1}^s \{ {\underline{m}} : 2^{n_j} \le m < 2^{n_j+1}\}. \end{aligned}$$

We can first verify that for $j=1$, $ \{ {\underline{n}} \oplus {\underline{m}} : 0 \le m< 2^{n_1} \} = \{ {\underline{m}} : 2^{n_1} \le m < 2^{n_1+1}\}.$ This is because the map acting on $ 0 \le m < 2^{n_1}$ by $m \mapsto {\underline{n}} \oplus {\underline{m}} $ is injective and maps into $\{ {\underline{m}} : 2^{n_1} \le m < 2^{n_1+1}\}$; since the cardinalities match, it is a bijection. Similarly, one can see that for each $2 \le j \le s$,

$$\begin{aligned} \{ {\underline{n}} \oplus {\underline{m}} : 2^{n_1} + \cdots + 2^{n_{j-1}} \le m< 2^{n_1} + \cdots + 2^{n_{j-1}} + 2^{n_j} \} = \{ {\underline{m}} : 2^{n_j} \le m < 2^{n_j+1}\}, \end{aligned}$$

and the claim holds.

In Remark 4.3, we claimed that while the functions $\{w_n\}$ are orthogonal, they do not themselves possess superorthogonality properties for 2r-tuples with $r\ge 2$. This referred to the fact that for any $r \ge 2$, we can pick 2r functions $w_n$ with 2r distinct values of n (so the tuple $(n_1,\ldots , n_{2r})$ satisfies the hypothesis of Type I or Type II or Type III) such that $\int w_{n_1} \cdots w_{n_{2r}} = 1$. Using the notation introduced above, this follows from the fact that we can choose 2r pairwise distinct integers $n_1, \ldots , n_{2r}$ such that when written in binary, ${\underline{n}}_1 \oplus \cdots \oplus {\underline{n}}_{2r}=0$.

Appendix B: The Source of Quasi-superorthogonality for Trace Functions

Appendix by Emmanuel Kowalski ^{Footnote 1}

This short note will attempt to explain the source of the quasi-superorthogonality of trace functions that appears in Sect. 7, and in particular it will highlight that it arises from “exact” superorthogonality (of the corresponding type) for other functions, combined with Deligne’s very deep work on the Riemann Hypothesis over finite fields. We then explain briefly the source of the exact superorthogonality in the type of examples considered in the survey [25] of Fouvry, Kowalski and Michel.

Remark

The presentation is not fully rigorous, since we did not want to obscure the key conceptual point with technical aspects, such as the need to work with continuous $\ell $-adic representations, etc.

Let q be a prime number. The key data is a certain compact topological group $\varPi _q$ associated to q, with a normal subgroup $\varPi _q^g$ (both are algebraic variants of the classical fundamental group of topology, but mainly viewed as classifying coverings of the space, instead of groups of homotopy classes of loops). Moreover, for every $x\in {\mathbb {Z}}/q{\mathbb {Z}}$, there exists a conjugacy class $\theta _{q}(x)$ in $\varPi _q$ (called the Frobenius conjugacy class at x), and $\varPi _q^g$ is big enough that it and a single Frobenius conjugacy class generate $\varPi _q$ topologically.

A trace function F modulo q always has the following form: there exists a finite-dimensional vector space V on which $\varPi _q$ acts linearly (i.e., a finite-dimensional representation of the group) in such a way that

$$\begin{aligned} F(x)={{\,\mathrm{tr}\,}}(\theta _{q}(x)\mid V), \end{aligned}$$

(B.1)

the trace of the endomorphism of V associated to the Frobenius conjugacy class at x. This is well-defined, since the trace is invariant under conjugation.

We view the action as a homomorphism $\varrho :\varPi _q\rightarrow \mathrm {GL}(V)$. Then the formula (B.1) shows that a trace function is the restriction of the character of a representation to a certain subset of conjugacy classes of that group.^{Footnote 2}

The Grothendieck–Lefschetz trace formula combined with Deligne’s Riemann Hypothesis can then be shown to imply (for suitable trace functions) the statement that

$$\begin{aligned} \sum _{x\in {\mathbb {Z}}/q{\mathbb {Z}}} F(x)=\Bigl (\int _{\varPi _q^g}{{\,\mathrm{tr}\,}}(\varrho (y))\mathrm{d}y\Bigr )cq+O(\sqrt{q}), \end{aligned}$$

(B.2)

for some complex number c with $|c|\le 1$, where the integral is with respect to the probability Haar measure on the compact group $\varPi _q^g$ and the implied constant in the $O(\cdot )$ symbol depends only on “local” invariants of $\varrho $ which are usually easy to bound.

Remark

In many cases of interest, one deals with an action of $\varPi _q$ which has the property that $\varrho (\varPi _q^g)=\varrho (\varPi _q)$. Then (B.2) holds with $c=1$, and thus it indicates that the discrete sum of the trace of $\varrho $ over the finitely many Frobenius classes $\theta _q(x)$ is close to the integral over the whole group [note that $\varrho (\theta _q(x))\in \varPi ^g_q$ because of the assumption on $\varrho $]. However, the formula (B.2) holds in general in the stated form.

We can now explain how this, together with algebraic properties of certain compact Lie groups, leads to quasi-superorthogonality.

Suppose we have finitely many trace functions $F_1$, ..., $F_{2r}$, each associated to a representation $\varrho _i$ (on the space $V_i$), satisfying suitable conditions. We want to understand the sum

$$\begin{aligned} \sum _{x\in {\mathbb {Z}}/q{\mathbb {Z}}} F_1(x)\overline{F_2(x)}\cdots F_{2r-1}(x)\overline{F_{2r}(x)}. \end{aligned}$$

Part of the unspecified properties required of $\varrho _i$ imply that the contragradient or dual representation $D(\varrho _i)$ of $\varrho _i$ satisfies

$$\begin{aligned} {{\,\mathrm{tr}\,}}(D(\varrho _i)(y))=\overline{{{\,\mathrm{tr}\,}}(\varrho _i(y))}. \end{aligned}$$

So, according to (B.2), applied to the representation

$$\begin{aligned} \varrho =\varrho _1\otimes D(\varrho _2)\otimes \cdots \otimes \varrho _{2r-1}\otimes D(\varrho _{2r}), \end{aligned}$$

we get

$$\begin{aligned}&\sum _{x\in {\mathbb {Z}}/q{\mathbb {Z}}} F_1(x)\overline{F_2(x)}\cdots F_{2r-1}(x)\overline{F_{2r}(x)} \\&\quad = \Bigl (\int _{\varPi _q^g}{{\,\mathrm{tr}\,}}(\varrho _1(y)) \overline{{{\,\mathrm{tr}\,}}(\varrho _2(y))}\cdots {{\,\mathrm{tr}\,}}(\varrho _{2r-1}(y)) \overline{{{\,\mathrm{tr}\,}}(\varrho _{2r}(y))} dy\Bigr )c'q+O(\sqrt{q}), \end{aligned}$$

for some complex number $c'$ with $|c'|\le 1$.

Thus, we will obtain quasi-superorthogonality, of any type, for the trace functions, provided the characters ${{\,\mathrm{tr}\,}}(\varrho _i)$ of the $\varrho _i$ (restricted to the subgroup $\varPi _q^g$) satisfy exact superorthogonality of the same type.

We present now one source of such superorthogonality that lies behind many examples (but not all—for Dirichlet characters, such as in the inequality (7.9), the mechanism is a bit different).

In fact, at this point, we can replace $\varPi ^g_q$ by any fixed compact group G, with the $\varrho _i$ being unitary (continuous) finite-dimensional representations of G.

According to the character theory of compact groups the integral

$$\begin{aligned} \int _{G}{{\,\mathrm{tr}\,}}(\varrho _1(y)) \overline{{{\,\mathrm{tr}\,}}(\varrho _2(y))}\cdots {{\,\mathrm{tr}\,}}(\varrho _{2r-1}(y)) \overline{{{\,\mathrm{tr}\,}}(\varrho _{2r}(y))} \mathrm{d}y \end{aligned}$$

(B.3)

is equal to the dimension of the space of invariant vectors in the tensor product representation $\varrho $. Now suppose that each $V_i$ has dimension at least 2 and that the image of each $\varrho _i$, which is a subgroup of the unitary group of the space $V_i$, happens to be the special unitary group ${{\,\mathrm{SU}\,}}(V_i)$. Consider the map

$$\begin{aligned} y\mapsto (\varrho _1(y),\ldots ,\varrho _{2r}(y)) \end{aligned}$$

from G to

$$\begin{aligned} {{\,\mathrm{SU}\,}}(V_1)\times \cdots \times {{\,\mathrm{SU}\,}}(V_{2r}). \end{aligned}$$

Let H be its image. It is again a compact group, and it has the property that the projection of H to each factor ${{\,\mathrm{SU}\,}}(V_i)$ is surjective. Now a special case of what Katz [44, §1.8, Prop. 1.8.2] has called the Goursat–Kolchin–Ribet property is that such a subgroup H is equal to the product

$$\begin{aligned} {{\,\mathrm{SU}\,}}(V_1)\times \cdots \times {{\,\mathrm{SU}\,}}(V_{2r}), \end{aligned}$$

unless at least two of the representations are equivalent, in which case at least two of the characters ${{\,\mathrm{tr}\,}}(\varrho _i)$ are the same functions. (To see that this may be the case, consider the special case where all $V_i$ have different dimensions; then the groups ${{\,\mathrm{SU}\,}}(V_i)$ are pairwise non-isomorphic “almost” simple groups, and the projection assumption implies that the group H has to contain all of them as “Jordan–Hölder factors”, which is only possible if H is the full product.)

Thus, if no two of the characters are equal, then we have a splitting of the integral

$$\begin{aligned}&\int _{G}{{\,\mathrm{tr}\,}}(\varrho _1(y)) \overline{{{\,\mathrm{tr}\,}}(\varrho _2(y))}\cdots {{\,\mathrm{tr}\,}}(\varrho _{2r-1}(y)) \overline{{{\,\mathrm{tr}\,}}(\varrho _{2r}(y))} \mathrm{d}y \\&\quad = \int _H {{\,\mathrm{tr}\,}}(y_1,y_2^*,\ldots ,y_{2r-1},y_{2r}^*)\mathrm{d}y_1\cdots \mathrm{d}y_{2r} \\&\quad = \Bigl (\int _{{{\,\mathrm{SU}\,}}(V_1)}{{\,\mathrm{tr}\,}}(y_1)\mathrm{d}y_1\Bigr )\cdots \Bigl (\int _{{{\,\mathrm{SU}\,}}(V_{2r})}\overline{{{\,\mathrm{tr}\,}}(y_{2r})}\mathrm{d}y_{2r}\Bigr ), \end{aligned}$$

which vanishes. In other words, in these conditions, we obtain superorthogonality of Type II, and in fact really in the same way suggested at the beginning of the paper, i.e., from independent random variables, these being the different characters $y\mapsto {{\,\mathrm{tr}\,}}(\varrho _i(y))$.

One can be more precise about conditions on the representations $\varrho _i$ that lead to vanishing of the integral (B.3), but we hope that this sketch has given some idea of how this may arise.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Pierce, L.B. On Superorthogonality. J Geom Anal 31, 7096–7183 (2021). https://doi.org/10.1007/s12220-021-00606-3

Download citation

Received: 21 July 2020
Accepted: 06 January 2021
Published: 17 March 2021
Issue Date: July 2021
DOI: https://doi.org/10.1007/s12220-021-00606-3

Keywords

Mathematics Subject Classification

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

On Superorthogonality

Abstract

Access this article

Similar content being viewed by others

On polynomials in primes, ergodic averages and monothetic groups

On generalized powers of operators

Analytic Number Theory in the Last Decade

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Appendices

Appendix A: Further Remarks on Walsh–Paley Series

1.1 A.1. Limiting Arguments for Direct and Converse Inequalities

1.2 A.2 Linearization

1.3 A.3 Remarks for \(2 \le p < \infty \)

1.4 A.4 Remarks for \(1< p \le 2\)

1.5 A.5 Combining the Direct and Converse Inequalities

Appendix B: The Source of Quasi-superorthogonality for Trace Functions

Remark

Remark

Rights and permissions

About this article

Cite this article

Keywords

Mathematics Subject Classification

Navigation

On Superorthogonality

Abstract

Access this article

Similar content being viewed by others

On polynomials in primes, ergodic averages and monothetic groups

On generalized powers of operators

Analytic Number Theory in the Last Decade

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Appendices

Appendix A: Further Remarks on Walsh–Paley Series

1.1 A.1. Limiting Arguments for Direct and Converse Inequalities

1.2 A.2 Linearization

1.3 A.3 Remarks for \(2 \le p < \infty \)

1.4 A.4 Remarks for \(1< p \le 2\)

1.5 A.5 Combining the Direct and Converse Inequalities

Appendix B: The Source of Quasi-superorthogonality for Trace Functions

Remark

Remark

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Mathematics Subject Classification

Search

Navigation