1 Introduction and Main Results

This paper is a sequel to [5]. As there, we consider a conditioned Galton–Watson tree \(\mathcal {T}_n\) of size n, and the random variables

$$\begin{aligned} X_n(\alpha )&:=F_\alpha (\mathcal {T}_n):=\sum _{v\in \mathcal {T}_n}|\mathcal {T}_{n,v}|^\alpha , \end{aligned}$$
(1.1)

where \(\mathcal {T}_{n,v}\) is the fringe subtree of \(\mathcal {T}_n\) rooted at a vertex \(v\in \mathcal {T}_n\), i.e., the subtree consisting of v and all its descendants. This is a special case of what is known as an additive functional: a functional associated with a rooted tree T that can be expressed in the form

$$\begin{aligned} F(T) = \sum _{v\in T} f(T_v) \end{aligned}$$
(1.2)

for a certain toll function f. Thus, \(F_\alpha \) is the additive functional on rooted trees defined by the toll function \(f_\alpha (T):=|T|^\alpha \). As in [5], we allow the parameter \(\alpha \) to be any complex number; this is advantageous, even for the study of real \(\alpha \), since it allows us to use powerful results from the theory of analytic functions in the proofs, and it also yields new phenomena for non-real \(\alpha \), for example, Theorem 1.4 below for purely imaginary \(\alpha \). (See further Sect. 2 for the notation used here and below.)
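To make the definitions (1.1)–(1.2) concrete, here is a minimal computational sketch (our own illustration, not from [5]; the representation of a tree by its preorder sequence of offspring counts and all function names are our choices). It computes the fringe subtree sizes \(|\mathcal {T}_{n,v}|\) in one backward pass over the preorder sequence and then evaluates an additive functional whose toll function depends only on the subtree size, as \(f_\alpha \) does.

```python
import math

def subtree_sizes(deg):
    """Fringe subtree sizes |T_v| for a rooted ordered tree given by its
    preorder (depth-first) sequence of offspring counts."""
    size, stack = [1] * len(deg), []
    for i in range(len(deg) - 1, -1, -1):  # vertices in reverse preorder
        for _ in range(deg[i]):            # pop one entry per child of vertex i
            size[i] += size[stack.pop()]
        stack.append(i)
    return size

def additive_functional(deg, toll):
    """F(T) = sum of toll(|T_v|) over all vertices v, cf. (1.1)-(1.2)."""
    return sum(toll(s) for s in subtree_sizes(deg))

# Tiny example: a path root -- a, where a has two leaf children;
# the subtree sizes in preorder are 4, 3, 1, 1.
deg = [1, 2, 0, 0]
print(additive_functional(deg, lambda s: s**-1.0))  # X_4(-1) = 1/4 + 1/3 + 2
print(additive_functional(deg, math.log))           # the shape functional (1.6) below
```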

In [5], it is assumed that the conditioned Galton–Watson tree \(\mathcal {T}_n\) is defined by some offspring distribution \(\xi \) with \({\mathbb E{}}\xi =1\) and \(0< \sigma ^2:= {\text {Var}}\xi < \infty \). The main results are limit theorems showing that then the random variables \(X_n(\alpha )\) converge in distribution after suitable normalization. The results differ for the two cases \({\text {Re}}\alpha <0\) and \({\text {Re}}\alpha >0\): Typical results are the following (here somewhat simplified), where

$$\begin{aligned} {\widetilde{X}}_n(\alpha ):=X_n(\alpha )-{\mathbb E{}}X_n(\alpha ). \end{aligned}$$
(1.3)

For further related results, and references to previous work, see [5].

Theorem 1.1

([5, Theorem 1.1]) If \({\text {Re}}\alpha <0\), then

$$\begin{aligned} n^{-1/2}{\widetilde{X}}_n(\alpha )\overset{\textrm{d}}{\longrightarrow }\widehat{X}(\alpha ), \end{aligned}$$
(1.4)

where \(\widehat{X}(\alpha )\) is a centered complex normal random variable with distribution depending on the offspring distribution \(\xi \).

Theorem 1.2

([5, Theorem 1.2]) If \({\text {Re}}\alpha >0\), then

$$\begin{aligned} n^{-\alpha -\frac{1}{2}}{\widetilde{X}}_n(\alpha )\overset{\textrm{d}}{\longrightarrow }\sigma ^{-1}{\widetilde{Y}}(\alpha ), \end{aligned}$$
(1.5)

where \({\widetilde{Y}}(\alpha )\) is a centered random variable with a (non-normal) distribution that depends on \(\alpha \) but does not depend on the offspring distribution \(\xi \).

Note the three differences between the two cases:

(i) the normalization is by different powers of n, with the exponent constant for \({\text {Re}}\alpha <0\) but depending on \(\alpha \) for \({\text {Re}}\alpha >0\);

(ii) the limit is normal for \({\text {Re}}\alpha <0\) but not for \({\text {Re}}\alpha >0\);

(iii) the limit distribution is universal for \({\text {Re}}\alpha >0\) in the sense that it depends on \(\xi \) only through the scale factor \(\sigma ^{-1}\), but for \({\text {Re}}\alpha <0\), the distribution seems to depend on the offspring distribution \(\xi \) in a more complicated way. (In the latter case, the distribution is complex normal, so it is determined by the covariance matrix of \(\bigl ({\text {Re}}\widehat{X}(\alpha ),{\text {Im}}\widehat{X}(\alpha )\bigr )\); a complicated formula for the covariances is given in [5, Remark 5.1], but we do not know how to evaluate it for concrete examples, not even when \(\alpha <0\) is real and, thus, \(\widehat{X}(\alpha )\) is a real random variable.)

The results above leave a gap: the case \({\text {Re}}\alpha =0\). The main purpose of the present paper is to fill this gap and to compare the results with the cases above. The case \(\alpha =0\) is trivial, since \(X_n(0)=n\) is non-random. (If \(\alpha =0\), then each vertex v of the tree contributes \(|\mathcal {T}_{n,v}|^\alpha = 1\) to (1.1).) However, in this case we instead study the derivative

$$\begin{aligned} X_n'(0) = \sum _{v\in \mathcal {T}_n}\log |\mathcal {T}_{n,v}| =\log \prod _{v\in \mathcal {T}_n}|\mathcal {T}_{n,v}|, \end{aligned}$$
(1.6)

which is known as the shape functional. This functional was introduced by Fill [3] in the (different) context of binary search trees under the random permutation model, where he argued that it serves as a crude measure of the “shape” of a random tree; it has since been studied in some special cases of simply generated trees in e.g. [1, 4, 7, 16, 19], see Sect. 3.

Another gap in [5] is that moment convergence was proved for \({\text {Re}}\alpha >0\) (Theorem 1.2) but not for \({\text {Re}}\alpha <0\) (Theorem 1.1). We fill that gap too.

For technical convenience, we assume throughout the paper the weak extra moment condition

$$\begin{aligned} {\mathbb E{}}\xi ^{2+\delta }<\infty , \end{aligned}$$
(1.7)

for some \(\delta >0\); we also continue to assume \({\mathbb E{}}\xi =1\). We let \(\mathcal {T}\) be an unconditioned Galton–Watson tree with offspring distribution \(\xi \), and define, for complex \(\alpha \) with \({\text {Re}}\alpha <\frac{1}{2}\),

$$\begin{aligned} \mu (\alpha )&:= {\mathbb E{}}|\mathcal {T}|^\alpha = \sum _{n=1}^\infty n^\alpha {\mathbb P{}}(|\mathcal {T}|=n), \end{aligned}$$
(1.8)
$$\begin{aligned} \mu '&:=\mu '(0) ={\mathbb E{}}\log |\mathcal {T}| = \sum _{n=1}^\infty {\mathbb P{}}(|\mathcal {T}|=n)\log n .\end{aligned}$$
(1.9)

(The sum (1.8) converges for \({\text {Re}}\alpha <\frac{1}{2}\), since \({\mathbb P{}}(|\mathcal {T}|=n)=O(n^{-3/2})\); see (2.25).)

Our main results are the following. Note that \(X_n'(0)\) is a real random variable, while \(X_n(\textrm{i}t)\) and \(X_n(\alpha )\) for \(\alpha \notin \mathbb R\) are non-real except in trivial cases. As said above, special cases of Theorem 1.3 have been proved by Pittel [19], Fill and Kapur [7], and Caracciolo et al. [1].

Theorem 1.3

Assume (1.7) with \(\delta >0\). Then,

$$\begin{aligned} \frac{ X_n'(0)-\mu 'n}{\sqrt{n\log n}}\ \overset{\textrm{d}}{\longrightarrow }N\bigl (0,4(1-\log 2)\sigma ^{-2}\bigr ) \end{aligned}$$
(1.10)

together with convergence of all moments.

Theorem 1.4

Assume (1.7) with \(\delta >0\). Then, for any real \(t\ne 0\),

$$\begin{aligned} \frac{X_n(\textrm{i}t)-\mu (\textrm{i}t)n}{\sqrt{n\log n}}\ \overset{\textrm{d}}{\longrightarrow }\zeta _{\textrm{i}t}\end{aligned}$$
(1.11)

together with convergence of all moments, where \(\zeta _{\textrm{i}t}\) is a symmetric complex normal variable with variance

$$\begin{aligned} {\mathbb E{}}|\zeta _{\textrm{i}t}|^2= \frac{1}{\sqrt{\pi }}{\text {Re}}\frac{\Gamma (\textrm{i}t-\tfrac{1}{2})}{\Gamma (\textrm{i}t)}\sigma ^{-2}>0 .\end{aligned}$$
(1.12)

Theorem 1.5

Assume (1.7) with \(\delta >0\). Then, for any complex \(\alpha \) with \({\text {Re}}\alpha <0\),

$$\begin{aligned} \frac{X_n(\alpha )-\mu (\alpha )n}{\sqrt{n}} \overset{\textrm{d}}{\longrightarrow }\widehat{X}(\alpha ) \end{aligned}$$
(1.13)

together with convergence of all moments, where \(\widehat{X}(\alpha )\) is a centered complex normal random variable with positive variance and distribution depending on the offspring distribution \(\xi \). Hence, (1.4) holds with convergence of all moments.

Remark 1.6

By “convergence of all moments,” we mean, in the case of complex variables \(Z_n\), convergence of all mixed moments of \(Z_n\) and \(\overline{Z_n}\); this is equivalent to convergence of all mixed moments of \({\text {Re}}Z_n\) and \({\text {Im}}Z_n\). Since we have convergence in distribution, a standard argument using uniform integrability shows that this is also equivalent to convergence of all absolute moments.

Note that, conversely, by the method of moments applied to \(({\text {Re}}Z_n,{\text {Im}}Z_n)\), this implies convergence in distribution of \(Z_n\), provided, as is the case here, the limit distribution is determined by its moments. Thus, our proof of moment convergence provides a new proof of Theorem 1.1, very different from the proof in [5]. \(\square \)

Remark 1.7

Since the statements include convergence of the first moments (to 0), we may in Theorems 1.3–1.5 replace \(\mu 'n\), \(\mu (\textrm{i}t)n\), and \(\mu (\alpha )n\) by the expectations \({\mathbb E{}}X_n'(0)\), \({\mathbb E{}}X_n(\textrm{i}t)\), and \({\mathbb E{}}X_n(\alpha )\), respectively; in particular, this gives the last sentence in Theorem 1.5. More precise estimates of the expectations are given in (3.11), (3.20), (4.14), and (5.10). \(\square \)

Theorems 1.3 and 1.4 combine some of the features found for \({\text {Re}}\alpha <0\) and \({\text {Re}}\alpha >0\) in (i)-(iii) above. First, the variances in Theorems 1.3 and 1.4 are of order \(n\log n\). This might be a surprise since it is not what a naive extrapolation from either \({\text {Re}}\alpha <0\) in Theorem 1.1 or \({\text {Re}}\alpha >0\) in Theorem 1.2 would yield, where the variances are of order n (\({\text {Re}}\alpha <0\)) and \(n^{1+2{\text {Re}}\alpha }\) (\({\text {Re}}\alpha >0\)); however, it is not surprising that a logarithmic factor appears when the two different expressions meet. (Compare for instance the result on the mean of \(X_n(\alpha )\) in [5, Theorem 1.7], or the discussion of the binary search tree recurrence in [9, Example VI.15, pp. 428–429], where the emergence of similar logarithmic factors is observed.) Second, the limits are normal, as heuristically would be expected by “continuity” from the left, see (ii). Third, the limits are universal and depend only on \(\sigma \) as a scale factor, as heuristically would be expected by “continuity” from the right, see (iii).

The proofs in [5] use two different methods, which are combined to yield the full results: (1) methods using complex analysis and the fact that \(X_n(\alpha )\) is an analytic function of \(\alpha \), and (2) analysis of moments for a fixed \(\alpha \) using singularity analysis of generating functions based on results of Fill et al. [4], also presented in [9, Section VI.10]. In the present paper, we will use only the second method. We follow the proofs in [5] with some variations (see also [6] and [7]). However, some new leading terms will appear in the singular expansions of the generating functions, which will dominate the terms that are leading in [5]; this explains both the logarithmic factors in the variance (and in higher moments) in Theorems 1.3 and 1.4, and the fact that these theorems yield normal limits while Theorem 1.2 does not.

After some preliminaries in Sect. 2, we first study the shape functional and prove Theorem 1.3 in Sect. 3; we then study the case of imaginary exponents and prove Theorem 1.4 in Sect. 4; after that, we consider the case \({\text {Re}}\alpha <0\) and prove Theorem 1.5 in Sect. 5. These three sections use the same method (from [7] and [5]) and are, thus, quite similar, but some details differ. The differences arise partly because \(X_n'(0)\) is real, while \(X_n(\textrm{i}t)\) and (in general) \(X_n(\alpha )\) are not; we will also see that the logarithmic factors in the first two cases appear in the moments in somewhat different ways, and that there is a cancellation of some leading terms in our induction for the first and third cases, but not for \(X_n(\textrm{i}t)\). For this reason, we give complete arguments for all three cases, and we encourage the reader to compare them and see both similarities and differences.

In Sect. 6, we show how the centering functions (1.8) and (1.9) can be compared across variation in the offspring distribution when (real) \(\alpha \) satisfies \(\alpha < \frac{1}{2}\).

Remark 1.8

The results in [5] show also joint convergence for different \(\alpha \) in Theorems 1.1 and 1.2, with limits \(\widehat{X}(\alpha )\) and \({\widetilde{Y}}(\alpha )\) that are analytic, and in particular continuous, random functions of the parameter \(\alpha \) in the half-planes \({\text {Re}}\alpha <0\) and \({\text {Re}}\alpha >0\), respectively. This does not extend to the imaginary axis \({\text {Re}}\alpha =0\); we will see in Theorem 4.2 that \(X_n(\alpha )\) for different imaginary \(\alpha \) are asymptotically independent (for \({\text {Im}}\alpha >0\)), and thus, it is not possible to have joint convergence to a continuous random function. \(\square \)

Remark 1.9

Let \(\alpha =s+\textrm{i}t\), where t is real and fixed, and let \(s\searrow 0\). (Thus, \(s>0\) is real.) It is shown in [5, Appendix D] that if \(t\ne 0\), then the limit \({\widetilde{Y}}(s+\textrm{i}t)\) diverges (in probability, say) as \(s\searrow 0\), and that \(s^{1/2}{\widetilde{Y}}(s+\textrm{i}t)\overset{\textrm{d}}{\longrightarrow }\zeta \), where \(\zeta \) is a symmetric complex normal variable with

$$\begin{aligned} {\mathbb E{}}|\zeta |^2 =\frac{1}{2\sqrt{\pi }} {\text {Re}}\frac{\Gamma (\textrm{i}t-\frac{1}{2})}{\Gamma (\textrm{i}t)}>0. \end{aligned}$$
(1.14)

(However, unfortunately there is a typographical error in [5, (D.2)], see the corrigendum to [5] for a correction.) Similarly, it is shown in [5, Appendix C] that \(s^{-1/2}{\widetilde{Y}}(s)\overset{\textrm{d}}{\longrightarrow }N(0,2(1-\log 2))\) as \(s\searrow 0\); in particular \(s^{-1}{\widetilde{Y}}(s)\) diverges.

These results may be compared to Theorems 1.3–1.4; note that the limits are the same, except that the variances in both cases differ by a factor 1/2 (which of course depends on the chosen normalizations). Both sets of results can be regarded as iterated limits of \({\widetilde{X}}_n(s+\textrm{i}t)\), taking \({n\rightarrow \infty }\) and \(s\searrow 0\) in different orders. The divergence of \({\widetilde{Y}}(s+\textrm{i}t)\) as \(s\searrow 0\) (for fixed \(t\ne 0\)) thus seems to be related to the fact that the asymptotic variance in Theorem 1.4 is of greater order than n, and similarly the divergence as \(s\searrow 0\) of \(s^{-1}{\widetilde{Y}}(s)\) (which loosely might be regarded as an approximation of \(n^{-1/2}{\widetilde{X}}_n'(0)\)) seems related to Theorem 1.3. However, we do not see why the factors \(s^{\pm \frac{1}{2}}\) in these limits should correspond to the factor \((\log n)^{1/2}\) in Theorems 1.3 and 1.4 [or more precisely to the factor \((2 \log n)^{1/2}\), to get exactly the same limit distributions]. \(\square \)

We end with some problems suggested by the results and comments above.

Problem 1.10

Is there a simple explanation of the equality discussed in Remark 1.9 of iterated limits in different orders, but with different normalizations? Is this an instance of some general phenomenon? What happens if \(s\searrow 0\) and \({n\rightarrow \infty }\) simultaneously, i.e., for \({\widetilde{X}}_n(s_n+\textrm{i}t)\) where \(s_n\searrow 0\) at some appropriate rate?

The asymptotic independence of \(X_n(\textrm{i}t)\) for different \(t>0\) mentioned in Remark 1.8 suggests informally that the stochastic process \(({\widetilde{X}}_n(\textrm{i}t): t\geqslant 0)\) asymptotically looks something like white noise. This might be investigated further, for example as follows.

Problem 1.11

Consider the integrated process \(\int _0^t {\widetilde{X}}_n(\textrm{i}u)\,\textrm{d}u\). What is the order of its variance? Does this process after normalization converge to a process with paths that are continuous in t?

The moment assumption (1.7) is used repeatedly to control error terms, but it seems convenient rather than necessary.

Problem 1.12

We conjecture that Theorems 1.3–1.5 hold also without the assumption (1.7). Prove (or disprove) this!

2 Notation and Preliminaries

2.1 General Notation

As said above, \(\mathcal {T}\) is a Galton–Watson tree defined by an offspring distribution \(\xi \) with mean \({\mathbb E{}}\xi =1\) and finite non-zero variance \(\sigma ^2:={\text {Var}}\xi <\infty \), and we assume \( {\mathbb E{}}\xi ^{2+\delta }<\infty \) for some \(\delta >0\). Furthermore, the conditioned Galton–Watson tree \(\mathcal {T}_n\) is defined as \(\mathcal {T}\) conditioned on \(|\mathcal {T}|=n\). We assume for simplicity that \(\xi \) has span 1; the general case follows by standard (and minor) modifications. (Recall that the span of an integer-valued random variable \(\xi \), denoted \({\text {span}}(\xi )\), is the largest integer h such that \(\xi \in a+h\mathbb Z\) a.s. for some \(a\in \mathbb Z\); we consider only \(\xi \) with \({\mathbb P{}}(\xi =0)>0\) and then the span is the largest integer h such that \(\xi /h\in \mathbb Z\) a.s., i.e., the greatest common divisor of \(\{n:{\mathbb P{}}(\xi =n)>0\}\).)

In the sequel, \(\Gamma (z)\) denotes the Gamma function, \(\psi (z):=\Gamma '(z)/\Gamma (z)\) is its logarithmic derivative, and \(\gamma =-\psi (1)\) is Euler’s constant.

A random variable \(\zeta \) has a complex normal distribution if it takes values in \(\mathbb C\) and \(({\text {Re}}\zeta ,{\text {Im}}\zeta )\) has a 2-dimensional normal distribution (with arbitrary covariance matrix). In particular, \(\zeta \) is symmetric complex normal if further \({\mathbb E{}}\zeta =0\) and \(({\text {Re}}\zeta ,{\text {Im}}\zeta )\) has covariance matrix \(\left( {\begin{matrix}\varsigma ^2/2&{}0\\ 0&{}\varsigma ^2/2\end{matrix}}\right) \) for some \(\varsigma ^2={\mathbb E{}}|\zeta |^2\), which is called the variance; equivalently, \({\mathbb E{}}\zeta =0\), \({\mathbb E{}}\zeta ^2=0\), and \({\mathbb E{}}|\zeta |^2=\varsigma ^2\). (See e.g. [11, Proposition 1.31].) A symmetric complex normal distribution with variance \(\varsigma ^2\) is determined by the mixed moments of \(\zeta \) and \(\overline{\zeta }\), which are given by (see [11, p. 14])

$$\begin{aligned} {\mathbb E{}}\bigl [\zeta ^\ell \,\overline{\zeta }^r\bigr ]= {\left\{ \begin{array}{ll} \varsigma ^{2\ell }\ell !, &{} \ell =r, \\ 0,&{}\ell \ne r. \end{array}\right. } \end{aligned}$$
(2.1)
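As a quick numerical sanity check of (2.1) (our own sketch, assuming NumPy), one can simulate a symmetric complex normal variable and compare empirical mixed moments with \(\varsigma ^{2\ell }\ell !\):

```python
import math
import numpy as np

rng = np.random.default_rng(1)
var = 2.0                                   # varsigma^2 = E|zeta|^2
m = 10**6
# Re(zeta) and Im(zeta) are i.i.d. N(0, varsigma^2/2):
zeta = (rng.normal(0, math.sqrt(var / 2), m)
        + 1j * rng.normal(0, math.sqrt(var / 2), m))
for ell in range(3):
    for r in range(3):
        emp = (zeta**ell * np.conj(zeta)**r).mean()
        exact = var**ell * math.factorial(ell) if ell == r else 0.0
        print(ell, r, np.round(emp, 2), exact)   # emp should be close to exact
```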

Unspecified limits are as \({n\rightarrow \infty }\). We let \(\overset{\textrm{d}}{\longrightarrow }\) denote convergence in distribution.

For real x and y, we denote \(\min (x,y)\) by \(x\wedge y\).

The semifactorial \(\ell !!\) is defined for odd integers \(\ell \) (the only case that we use) by

$$\begin{aligned} \ell !! := 1\times 3 \times \, \cdots \, \times \ell =2^{(\ell +1)/2} \Gamma \left( \tfrac{\ell }{2}+1\right) /\sqrt{\pi }. \end{aligned}$$
(2.2)

Note that \((-1)!!=1!!=1\).
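The identity (2.2) is easy to check numerically for small odd \(\ell \) (a throwaway sketch of ours):

```python
import math

def semifactorial(ell):
    """1 * 3 * ... * ell for odd ell; the empty product for ell = -1."""
    return math.prod(range(1, ell + 1, 2))

for ell in [-1, 1, 3, 5, 7, 9]:
    rhs = 2**((ell + 1) / 2) * math.gamma(ell / 2 + 1) / math.sqrt(math.pi)
    print(ell, semifactorial(ell), round(rhs, 8))   # the two values agree
```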

Throughout, \(\varepsilon \) denotes an arbitrarily small fixed number with \(\varepsilon >0\). (We will tacitly assume that \(\varepsilon \) is sufficiently small when necessary.)

We let C and c denote unimportant positive constants, possibly different each time; these may depend on the parameter \(\alpha \) (or \(\alpha _1,\alpha _2\) below). We sometimes use c with subscripts; these keep the same value within the same section.

2.2 \(\Delta \)-domains and Singularity Analysis

A \(\Delta \)-domain is a complex domain of the type

$$\begin{aligned} \{z:|z|<R,\, z\ne 1,\,|\arg (z-1)|>\theta \} \end{aligned}$$
(2.3)

where \(R>1\) and \(0<\theta <\pi /2\), see [9, Section VI.3]. A function is \(\Delta \)-analytic if it is analytic in some \(\Delta \)-domain (or can be analytically continued to such a domain).

Our proofs are based on singularity analysis of various generating functions (see [9, Chapter VI]), using estimates as \(z\rightarrow 1\) in a suitable \(\Delta \)-domain; the domain may be different each time. In particular, we use repeatedly [9, Theorem VI.3, p. 390] to estimate error terms when we identify coefficients. All estimates below of analytic functions tacitly are valid in some \(\Delta \)-domains (possibly different ones for different functions), even when that is not said explicitly.

2.3 Polylogarithms

\({\text {Li}}_\alpha (z)\) and \({\text {Li}}_{\alpha ,r}(z)\) denote polylogarithms and generalized polylogarithms, respectively; they are defined for \(\alpha \in \mathbb C\) and \(r=0,1,\dots \) by the power series

$$\begin{aligned} {\text {Li}}_\alpha (z)&:=\sum _{n=1}^\infty n^{-\alpha }z^n, \end{aligned}$$
(2.4)
$$\begin{aligned} {\text {Li}}_{\alpha ,r}(z)&:=\sum _{n=1}^\infty (\log n)^r\frac{z^n}{n^\alpha } \end{aligned}$$
(2.5)

for \(|z|<1\), and then extended analytically to \(\mathbb C\setminus [0,\infty )\) (in particular they are \(\Delta \)-analytic); see e.g. [9, Section VI.8]. Note that \({\text {Li}}_{\alpha ,0}(z)={\text {Li}}_\alpha (z)\). We will also use the notation

$$\begin{aligned} L(z):=-\log (1-z) =\sum _{n=1}^\infty \frac{z^n}{n} ={\text {Li}}_{1}(z) .\end{aligned}$$
(2.6)

We will use singular expansions of polylogarithms and generalized polylogarithms into powers of \(1-z\), possibly including powers of L(z). Infinite singular expansions of polylogarithms and generalized polylogarithms are given by Flajolet [8, Theorem 1] (also [9, Theorem VI.7]); we will mainly use the following simple versions, which keep only the main terms.

For any real a, let \(\mathcal {P}_a\) be the set of all polynomials in z of degree \(<a\). In particular, if \(a\leqslant 0\), then \(\mathcal {P}_a=\{0\}\). If \(0\leqslant a\leqslant 1\), then every polynomial in \(\mathcal {P}_a\) is constant. These simple cases are the ones of most interest to us.

We then have, for each \(\alpha \notin \{1,2,\dots \}\),

$$\begin{aligned} {\text {Li}}_{\alpha }(z) = \Gamma (1-\alpha )(1-z)^{\alpha -1} +P(z)+ O\bigl (|1-z|^{{{\text {Re}}\alpha }}\bigr ), \end{aligned}$$
(2.7)

for some \( P(z)\in \mathcal {P}_{{{\text {Re}}\alpha }}\).
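The expansion (2.7) is easy to probe numerically. The following sketch (ours, using mpmath, whose polylog agrees with the definition (2.4)) takes \(\alpha =\frac{1}{2}\); then \({\text {Re}}\alpha <1\), so P(z) is a constant, and the difference below converges to it (here \(\zeta (\frac{1}{2})\approx -1.4604\)) as \(z\rightarrow 1\):

```python
import mpmath as mp

alpha = mp.mpf('0.5')                 # any alpha not in {1, 2, ...} would do
for eps in ['0.1', '0.01', '0.0001']:
    z = 1 - mp.mpf(eps)
    main = mp.gamma(1 - alpha) * (1 - z)**(alpha - 1)
    # Li_alpha(z) minus the leading singular term tends to the constant P:
    print(eps, mp.nstr(mp.polylog(alpha, z) - main, 8))
```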

Moreover, in our proofs, we will often go back and forth between expansions in powers of \(1-z\) (including powers of L(z)) and expansions in (generalized) polylogarithms, using the following simple consequence of the singular expansions of generalized polylogarithms, proved in [7]. (Here slightly simplified.)

Lemma 2.1

([7, Lemmas 2.5 and 2.6]) Suppose that \({\text {Re}}\alpha <1\). Then, for each \(r\geqslant 0\), in any fixed \(\Delta \)-domain and for any \(\varepsilon >0\),

$$\begin{aligned} {\text {Li}}_{\alpha ,r}(z)&= \sum _{j=0}^r \rho _{r,j}(\alpha ) (1-z)^{\alpha -1} L(z)^j + c_r(\alpha ) +O\bigl (|1-z|^{{\text {Re}}\alpha -\varepsilon }\bigr ), \end{aligned}$$
(2.8)

for some coefficients \(\rho _{r,j}(\alpha )\) and \(c_r(\alpha )\), with leading coefficient

$$\begin{aligned} \rho _{r,r}(\alpha ) =\Gamma (1-\alpha ). \end{aligned}$$
(2.9)

Conversely,

$$\begin{aligned} (1-z)^{\alpha -1} L(z)^r&= \sum _{j=0}^r \hat{\rho }_{r,j}(\alpha ) {\text {Li}}_{\alpha ,j}(z) +\hat{c}_r(\alpha ) +O\bigl (|1-z|^{{\text {Re}}\alpha -\varepsilon }\bigr ), \end{aligned}$$
(2.10)

for some coefficients \(\hat{\rho }_{r,j}(\alpha )\) and \(\hat{c}_r(\alpha )\), with

$$\begin{aligned} \hat{\rho }_{r,r}(\alpha ) =\rho _{r,r}(\alpha )^{-1}=\Gamma (1-\alpha )^{-1}. \end{aligned}$$
(2.11)

Remark 2.2

The lemmas in [7] are stated for real \(\alpha \), but the proofs hold also for complex \(\alpha \). Moreover, the results extend to \(\alpha \) with \({\text {Re}}\alpha \geqslant 1\), assuming \(\alpha \notin \{1,2,\dots \}\), provided the error terms \(O\bigl (|1-z|^{{\text {Re}}\alpha -\varepsilon }\bigr )\) are replaced by \(O(|1-z|)\) when \({\text {Re}}\alpha >1\). \(\square \)

2.4 Hadamard Products

Recall that the Hadamard product \(A(z)\odot B(z)\) of two power series \(A(z)=\sum _{n=0}^\infty a_n z^n\) and \(B(z)=\sum _{n=0}^\infty b_n z^n\) is defined by

$$\begin{aligned} A(z)\odot B(z) := \sum _{n=0}^\infty a_n b_n z^n. \end{aligned}$$
(2.12)

(See e.g. [4] or [9].) As a simple example, for any complex \(\alpha \) and \(\beta \),

$$\begin{aligned} {\text {Li}}_\alpha (z)\odot {\text {Li}}_\beta (z)={\text {Li}}_{\alpha +\beta }(z), \end{aligned}$$
(2.13)

and, more generally, by (2.5),

$$\begin{aligned} {\text {Li}}_{\alpha ,r}(z)\odot {\text {Li}}_{\beta ,s}(z)={\text {Li}}_{\alpha +\beta ,r+s}(z). \end{aligned}$$
(2.14)

We note also, for any constant c and power series \(A(z)=\sum _{n=0}^\infty a_nz^n\), the trivial result

$$\begin{aligned} c\odot A(z) = ca_0. \end{aligned}$$
(2.15)
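Since the Hadamard product is just coefficientwise multiplication, identities such as (2.13) can be checked directly on truncated series; a small sketch (ours, assuming NumPy):

```python
import numpy as np

N = 10
n = np.arange(1, N + 1)

def li_coeffs(s):
    """Coefficients of z^1, ..., z^N in Li_s(z), cf. (2.4)."""
    return n**(-s)

alpha, beta = 0.5, 1.5
lhs = li_coeffs(alpha) * li_coeffs(beta)           # Hadamard product (2.12)
print(np.allclose(lhs, li_coeffs(alpha + beta)))   # (2.13): prints True
```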

For our error terms, we will use the following lemma; it is part of [5, Lemma 12.2] and taken from [4, Propositions 9 and 10(i)] and [9, Theorem VI.11, p. 423]. (Further related results are given in [4], [9, Section VI.10.2], and [5].)

Lemma 2.3

([4, 9]) If g(z) and h(z) are \(\Delta \)-analytic, then \(g(z)\odot h(z)\) is \(\Delta \)-analytic. Moreover, suppose that \(g(z)=O(|1-z|^a)\) and \(h(z)=O(|1-z|^b)\), where a and b are real with \(a+b+1\notin \{0,1,2,\dots \}\); then, as \(z\rightarrow 1\) in a suitable \(\Delta \)-domain,

$$\begin{aligned} g(z)\odot h(z)=P(z)+O\bigl (|1-z|^{a+b+1}\bigr ), \end{aligned}$$
(2.16)

for some \( P(z)\in \mathcal {P}_{a+b+1}\).

2.5 Generating Functions for Galton–Watson trees

Let \(p_k:={\mathbb P{}}(\xi =k)\) denote the values of the probability mass function for the offspring distribution \(\xi \), and let \(\Phi \) be its probability generating function:

$$\begin{aligned} \Phi (z):={\mathbb E{}}z^\xi =\sum _{k=0}^\infty p_k z^k. \end{aligned}$$
(2.17)

Similarly, let \(q_n:={\mathbb P{}}(|\mathcal {T}|=n)\), and let y denote the corresponding probability generating function:

$$\begin{aligned} y(z):={\mathbb E{}}z^{|\mathcal {T}|} = \sum _{n=1}^\infty {\mathbb P{}}\bigl (|\mathcal {T}|=n\bigr )z^n =\sum _{n=1}^\infty q_n z^n. \end{aligned}$$
(2.18)

As is well known, we then have

$$\begin{aligned} y(z)&=z\Phi \bigl (y(z)\bigr ).\end{aligned}$$
(2.19)

Under our assumptions \({\mathbb E{}}\xi =1\) and \(0<{\text {Var}}\xi <\infty \), the generating function y(z) extends analytically to a \(\Delta \)-domain and is, thus, \(\Delta \)-analytic; see [12, Lemma A.2] and [5, §12.1] (and under stronger assumptions [9, Theorem VI.6, p. 404]). Furthermore, see again [12, Lemma A.2], there exists a \(\Delta \)-domain where \(|y(z)|<1\), and thus, \(\Phi (y(z))\) is \(\Delta \)-analytic, as well as \(\Phi ^{(m)}\bigl (y(z)\bigr )\) for every \(m\geqslant 1\).

We note some useful consequences of our extra moment assumption (1.7); we may without loss of generality assume \(\delta \leqslant 1\). (Compare [5, (12.5), (12.30), and (12.31)] without the assumption (1.7) but with weaker error terms, and [9, Theorem VI.6] with stronger results under stronger assumptions.)

Lemma 2.4

If (1.7) holds with \(0<\delta \leqslant 1\), then, for z in some \(\Delta \)-domain,

$$\begin{aligned} y(z)&=1-\sqrt{2} \sigma ^{-1}(1-z)^{1/2}+O\bigl (|1-z|^{\frac{1}{2}+\frac{\delta }{2}}\bigr ), \end{aligned}$$
(2.20)
$$\begin{aligned} y'(z)&=2^{-1/2}\sigma ^{-1}(1-z)^{-1/2}+O\bigl (|1-z|^{-\frac{1}{2}+\frac{\delta }{2}}\bigr ), \end{aligned}$$
(2.21)
$$\begin{aligned} \frac{zy'(z)}{y(z)}&=2^{-1/2}\sigma ^{-1}(1-z)^{-1/2}+O\bigl (|1-z|^{-\frac{1}{2}+\frac{\delta }{2}}\bigr ). \end{aligned}$$
(2.22)

In particular, all three functions are \(\Delta \)-analytic.

Proof

That y(z) is \(\Delta \)-analytic was noted above, and the estimate (2.20) was shown in [5, Lemma 12.15]. A differentiation then yields (2.21) in a smaller \(\Delta \)-domain, using Cauchy’s estimates for a disk with radius \(c|1-z|\) centered at z (see [9, Theorem VI.8, p. 419]).

Note that \(zy'(z)/y(z)\) is analytic in any domain where y is defined and analytic with \(|y(z)|<1\), since then (2.19) holds in the domain and implies that \(y(z)\ne 0\) for \(z\ne 0\), and also that z/y(z) is analytic at \(z=0\). Hence, also \(zy'(z)/y(z)\) is \(\Delta \)-analytic. Finally, (2.22) follows from (2.20) and (2.21). \(\square \)

By (2.7), and using \(\Gamma (-1/2)=-2\sqrt{\pi }\), we can rewrite (2.20) as

$$\begin{aligned} y(z)&=-\frac{\sqrt{2}}{\Gamma (-\frac{1}{2}) \sigma }{\text {Li}}_{3/2}(z)+c+O\bigl (|1-z|^{\frac{1}{2}+\frac{\delta }{2}}\bigr ) \nonumber \\&=\frac{1}{\sqrt{2\pi }\sigma }{\text {Li}}_{3/2}(z)+c+O\bigl (|1-z|^{\frac{1}{2}+\frac{\delta }{2}}\bigr ), \end{aligned}$$
(2.23)

where (although we do not need it) \(c=1-\zeta (3/2)/\sqrt{2\pi \sigma ^2}\). Furthermore, by (2.4) and singularity analysis [9, Theorem VI.3, p. 390], (2.23) implies

$$\begin{aligned} q_n={\mathbb P{}}(|\mathcal {T}|=n)= \frac{1}{\sqrt{2\pi }\sigma }n^{-3/2}+O\bigl (n^{-\frac{3}{2}-\frac{\delta }{2}}\bigr ) = \frac{1+O\bigl (n^{-\delta / 2}\bigr )}{\sqrt{2\pi }\sigma }n^{-3/2} .\end{aligned}$$
(2.24)

Remark 2.5

It is well known that the asymptotic formula

$$\begin{aligned} q_n={\mathbb P{}}(|\mathcal {T}|=n)= \frac{1+o(1)}{\sqrt{2\pi }\sigma }n^{-3/2} \qquad \text {as}~{n\rightarrow \infty }\end{aligned}$$
(2.25)

holds with a weaker error bound than (2.24), assuming only \({\text {Var}}\xi <\infty \) (and \({\mathbb E{}}\xi =1\)); see e.g. [18] (assuming an exponential moment), [15, Lemma 2.1.4], or [13, Theorem 18.11] (with \(\tau =\Phi (\tau )=1\)) and the further references given there. \(\square \)
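As a concrete illustration of (2.24)–(2.25) (our addition; it is not needed for the proofs): for \(\xi \sim {\text {Po}}(1)\), so that \(\sigma ^2=1\), the size \(|\mathcal {T}|\) has the Borel distribution \(q_n=e^{-n}n^{n-1}/n!\), and the ratio to the asymptotic formula is easily checked numerically (using mpmath for the large factorials):

```python
import mpmath as mp

def q_po1(n):
    """Borel distribution: P(|T| = n) for Po(1) offspring."""
    return mp.exp(-n) * mp.mpf(n)**(n - 1) / mp.factorial(n)

for n in [10, 100, 1000, 10000]:
    asym = n**mp.mpf('-1.5') / mp.sqrt(2 * mp.pi)   # (2.25) with sigma = 1
    print(n, mp.nstr(q_po1(n) / asym, 8))           # ratio tends to 1
```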

Lemma 2.6

Assume (1.7) with \(0<\delta \leqslant 1\). Then, for z in some \(\Delta \)-domain,

$$\begin{aligned} \Phi \bigl (y(z)\bigr )&=1 + O\bigl (|1-z|^{\frac{1}{2}}\bigr ), \end{aligned}$$
(2.26)
$$\begin{aligned} \Phi '\bigl (y(z)\bigr )&=1 + O\bigl (|1-z|^{\frac{1}{2}}\bigr ), \end{aligned}$$
(2.27)
$$\begin{aligned} \Phi ''\bigl (y(z)\bigr )&= \sigma ^2+ O\bigl (|1-z|^{\frac{\delta }{2}}\bigr ), \end{aligned}$$
(2.28)

and, for each fixed \(m\geqslant 3\),

$$\begin{aligned} \Phi ^{(m)}\bigl (y(z)\bigr )&= O\bigl (|1-z|^{\frac{\delta }{2}+1-\frac{m}{2}}\bigr ) . \end{aligned}$$
(2.29)

Proof

The assumption (1.7) implies the estimate, see e.g. [5, Lemma 12.14],

$$\begin{aligned} \Phi (z)=z+\tfrac{1}{2}\sigma ^2(1-z)^2 + O\bigl (|1-z|^{2+\delta }\bigr ), \qquad |z|\leqslant 1. \end{aligned}$$
(2.30)

By differentiation of (2.30), for the remainder term using Cauchy’s estimates for a disk with radius \((1-|z|)/2\) centered at z, we obtain for all z with \(|z|<1\), and each fixed \(m\geqslant 3\),

$$\begin{aligned} \Phi '(z)&=1 - \sigma ^2(1-z) + O\bigl (|1-z|^{2+\delta }/(1-|z|)\bigr ), \end{aligned}$$
(2.31)
$$\begin{aligned} \Phi ''(z)&= \sigma ^2+ O\bigl (|1-z|^{2+\delta }/(1-|z|)^2\bigr ), \end{aligned}$$
(2.32)
$$\begin{aligned} \Phi ^{(m)}(z)&= O\bigl (|1-z|^{2+\delta }/(1-|z|)^m\bigr ). \end{aligned}$$
(2.33)

For z in a suitable \(\Delta \)-domain we have (2.20), and as a consequence, if \(|1-z|\) is small enough,

$$\begin{aligned} c|1-z|^{1/2}\leqslant 1-|y(z)| \leqslant |1-y(z)| \leqslant C|1-z|^{1/2}. \end{aligned}$$
(2.34)

The result follows by (2.30)–(2.34). \(\square \)

Remark 2.7

In fact, (2.27) holds without the extra assumption (1.7), assuming only \({\mathbb E{}}\xi ^2<\infty \), because then \(\Phi \) is twice continuously differentiable in the closed unit disk with \(\Phi '(1)=1\), and \(y(z)=1-\sqrt{2}\sigma ^{-1}(1-z)^{1/2}+o(|1-z|^{1/2})\) as is shown in [12, Lemma A.2], see also [5, (12.5)]. \(\square \)

3 The Shape Functional

We consider here the shape functional \(X_n'(0)\). Asymptotics for the mean and variance of this functional were found by Fill [3] in the case of binary search trees under the random permutation model; these are not simply generated trees. Asymptotics for the mean and variance were found by Meir and Moon [16] for simply generated trees under a condition equivalent to our conditioned Galton–Watson trees with \(\xi \) having a finite exponential moment \({\mathbb E{}}e^{r\xi }<\infty \) for some \(r>0\). Pittel [19] showed asymptotic normality in the case of uniform labelled trees [the case \(\xi \sim {\text {Po}}(1)\)] by estimating cumulants. Fill and Kapur [7] considered uniform binary trees [\(\xi \sim {\text {Bi}}(2,\frac{1}{2})\)] and showed asymptotic normality by estimating moments by singularity analysis, see also Fill et al. [4]. Asymptotic normality has recently been shown, by similar methods, also for uniformly random ordered trees [the case \(\xi \sim {\text {Ge}}(\tfrac{1}{2})\)] by Caracciolo et al. [1], who further [personal communication] have extended the results to arbitrary offspring distributions \(\xi \) (with \({\mathbb E{}}\xi =1\) as here), at least provided that \(\xi \) has a finite exponential moment \({\mathbb E{}}e^{r\xi }<\infty \) for some \(r>0\).

We will here extend these results to any offspring distribution \(\xi \) satisfying the standard condition \({\mathbb E{}}\xi =1\) and the weak moment condition (1.7) for some \(\delta >0\). We assume without loss of generality that \(0<\delta \leqslant 1\). We will use singularity analysis to estimate moments, in the same way as [1, 4, 7].

In this section, we define (corresponding to [5, (12.46)])

$$\begin{aligned} b_n:=\log n -\mu ', \qquad n\geqslant 1, \end{aligned}$$
(3.1)

where \(\mu '={\mathbb E{}}\log |\mathcal {T}| = \sum _{n=1}^\infty q_n\log n \) as in (1.9), and we let F be the additive functional defined by the toll function \(f(T):=b_{|T|}\). Thus, by (1.6),

$$\begin{aligned} F(\mathcal {T}_n)=X_n'(0)-\mu 'n .\end{aligned}$$
(3.2)

The generating function of \(b_n\) is, by (2.4)–(2.5) and noting \({\text {Li}}_0(z)=z/(1-z)\),

$$\begin{aligned} B(z) = \sum _{n=1}^\infty (\log n-\mu ')z^n = {\text {Li}}_{0,1}(z)-\mu '{\text {Li}}_0(z) .\end{aligned}$$
(3.3)

Hence, by Lemma 2.1 (or [9, Figure VI.11, p. 410] with more terms),

$$\begin{aligned} B(z)&= (1-z)^{-1}L(z) -c(1-z)^{-1}+O\bigl (|1-z|^{-\varepsilon }\bigr ) \end{aligned}$$
(3.4)
$$\begin{aligned}&=O\bigl (|1-z|^{-1-\varepsilon }\bigr ). \end{aligned}$$
(3.5)

We define the generating functions, for \(\ell \geqslant 1\),

$$\begin{aligned} M_\ell (z):={\mathbb E{}}\bigl [F(\mathcal {T})^\ell z^{|\mathcal {T}|}\bigr ] =\sum _{n=1}^\infty q_n {\mathbb E{}}[F(\mathcal {T}_n)^\ell ] z^n .\end{aligned}$$
(3.6)

These generating functions can be calculated recursively by the following formula (valid for any sequence \(b_n\)) from [5], where \(A(z)^{\odot \ell }\) denotes the \(\ell ^\textrm{th}\) Hadamard power of a power series A(z).

Lemma 3.1

([5, Lemma 12.4]) For every \(\ell \geqslant 1\),

$$\begin{aligned} M_\ell (z) = \frac{z y'(z)}{y(z)} \sum _{m=0}^\ell \frac{1}{m!}\mathop {\mathrm {\sum \nolimits ^{**}}}\limits \left( {\begin{array}{c}\ell \\ \ell _0,\dots ,\ell _m\end{array}}\right) B(z)^{\odot \ell _0} \odot \bigl [zM_{\ell _1}(z)\cdots M_{\ell _m}(z)\Phi ^{(m)}\bigl (y(z)\bigr )\bigr ], \end{aligned}$$
(3.7)

where \(\mathop {\mathrm {\sum \nolimits ^{**}}}\limits \) is the sum over all \((m+1)\)-tuples \((\ell _0,\dots ,\ell _m)\) of non-negative integers summing to \(\ell \) such that \(1\leqslant \ell _1,\dots ,\ell _m<\ell \).

Note that B(z) is \(\Delta \)-analytic by (3.3); furthermore, \(zy'(z)/y(z)\) and \(\Phi ^{(m)}\bigl (y(z)\bigr )\) are also \(\Delta \)-analytic, see Sect. 2.5. Hence, (3.7) and induction using Lemma 2.3 show that every \(M_\ell (z)\) is \(\Delta \)-analytic.

It will be convenient to denote the sum in (3.7) by \(R_\ell (z)\). Thus,

$$\begin{aligned} M_\ell (z)=\frac{zy'(z)}{y(z)}R_\ell (z). \end{aligned}$$
(3.8)

3.1 The Mean

We begin with the mean \({\mathbb E{}}X'_n(0)\) and the corresponding generating function \(M_1(z)\). The following result includes earlier results for special cases in [1, 3, 4, 7, 16, 19], but our error term is weaker [since we have the weaker moment assumption (1.7)]. Recall that \(\psi (z):=\Gamma '(z)/\Gamma (z)\), and note that

$$\begin{aligned} \psi (-\tfrac{1}{2})=\psi (\tfrac{1}{2})+2=-\gamma -2\log 2+2, \end{aligned}$$
(3.9)

see [17, 5.5.2 and 5.4.13].

Lemma 3.2

Assume (1.7) with \(0<\delta \leqslant 1\). Then, for any \(\varepsilon >0\),

$$\begin{aligned} M_1(z) = -\sigma ^{-2}L(z) +\frac{{\mu '-\psi (-\frac{1}{2})}}{\sigma ^2} + O\bigl (|1-z|^{\frac{\delta }{2}-\varepsilon }\bigr ) \end{aligned}$$
(3.10)

and

$$\begin{aligned} {\mathbb E{}}X'_n(0)= \mu 'n - \frac{\sqrt{2\pi }}{\sigma } n^{1/2}+ O\bigl (n^{\frac{1}{2}-\frac{\delta }{2}+\varepsilon }\bigr ). \end{aligned}$$
(3.11)

Proof

For \(\ell =1\), the sums in (3.7) reduce to a single term with \(m=0\) and \(\ell _0=1\), and thus, as in [5, (12.29)], using (2.19),

$$\begin{aligned} M_1(z) =\frac{zy'(z)}{y(z)}\cdot \bigl ( B(z)\odot z\Phi \bigl (y(z)\bigr )\bigr ) =\frac{zy'(z)}{y(z)}\cdot \bigl ( B(z)\odot y(z)\bigr ) .\end{aligned}$$
(3.12)

By (2.14), (2.15), (3.3), and (2.23), we obtain, using Lemma 2.3 and (3.5) for the error term,

$$\begin{aligned} B(z)\odot y(z) = \frac{1}{\sqrt{2\pi }\sigma }{\text {Li}}_{3/2,1}(z) -\frac{\mu '}{\sqrt{2\pi }\sigma }{\text {Li}}_{3/2}(z) +{}c_{1}+O\bigl (|1-z|^{\frac{1}{2}+\frac{\delta }{2}-\varepsilon }\bigr ). \end{aligned}$$
(3.13)

Further, by our choice (1.9) of \(\mu '\),

$$\begin{aligned} (B\odot y)(1) = \sum _{n=1}^\infty b_n q_n =\sum _{n=1}^\infty q_n(\log n -\mu ') = \mu '-\mu '=0. \end{aligned}$$
(3.14)

By (2.7), we have

$$\begin{aligned} {\text {Li}}_{3/2}(z) = \Gamma (-\tfrac{1}{2})(1-z)^{1/2}+ {}c_{2}+ O\bigl (|1-z|\bigr ). \end{aligned}$$
(3.15)

Moreover, by [9, Theorem VI.7, p. 408] (or [8, Theorem 1]),

$$\begin{aligned} {\text {Li}}_{3/2,1}(z) = \Gamma (-\tfrac{1}{2})(1-z)^{1/2}L(z) + \Gamma '(-\tfrac{1}{2})(1-z)^{1/2}+ {}c_{3}+ O\bigl (|1-z|\bigr ) .\end{aligned}$$
(3.16)

Hence, (3.13) and (3.15)–(3.16) yield, using (3.14) to see that the constant terms cancel,

$$\begin{aligned} B(z)\odot y(z)&= \frac{\Gamma (-\frac{1}{2})}{\sqrt{2\pi }\sigma }(1-z)^{1/2}L(z) + \frac{\Gamma '(-\frac{1}{2})-\mu '\Gamma (-\frac{1}{2})}{\sqrt{2\pi }\sigma }(1-z)^{1/2}+ O\bigl (|1-z|^{\frac{1}{2}+\frac{\delta }{2}-\varepsilon }\bigr ) \nonumber \\ {}&=-\frac{\sqrt{2}}{\sigma }(1-z)^{1/2}L(z) +\frac{\sqrt{2}\bigl (\mu '-\psi (-\frac{1}{2})\bigr )}{\sigma } (1-z)^{1/2}+ O\bigl (|1-z|^{\frac{1}{2}+\frac{\delta }{2}-\varepsilon }\bigr ) .\end{aligned}$$
(3.17)

Finally, (3.12), (2.22), and (3.17) yield (3.10).

Since \(L(z)=\sum _{n=1}^\infty z^n/n\), (3.10) yields by standard singularity analysis, recalling the definition (3.6),

$$\begin{aligned} q_n {\mathbb E{}}F(\mathcal {T}_n) =-\sigma ^{-2}n^{-1}+O\bigl (n^{-1-\frac{\delta }{2}+\varepsilon }\bigr ). \end{aligned}$$
(3.18)

Hence, using also (2.24),

$$\begin{aligned} {\mathbb E{}}F(\mathcal {T}_n)=-\frac{\sqrt{2\pi }}{\sigma }n^{1/2}+O\bigl (n^{\frac{1}{2}-\frac{\delta }{2}+\varepsilon }\bigr ) \end{aligned}$$
(3.19)

and (3.11) follows by (3.2). \(\square \)

Remark 3.3

Under stronger moment conditions on the offspring distribution \(\xi \), we may in the same way obtain an expansion of the mean \({\mathbb E{}}X_n'(0)\) with further terms. For example, if \({\mathbb E{}}\xi ^{3+\delta }<\infty \), then the same argument yields

$$\begin{aligned} {\mathbb E{}}X'_n(0)= \mu 'n - \frac{\sqrt{2\pi }}{\sigma } n^{1/2}+ \frac{{\mathbb E{}}[\xi (\xi -1)(\xi -2)]}{3\sigma ^4} \log n +O(1). \end{aligned}$$
(3.20)

In the special case of binary trees, this was given in [7, (4.2)]. Note that the coefficient of \(\log n\) in (3.20) vanishes for binary trees, but not in general. \(\square \)

3.2 The Second Moment

Lemma 3.4

Assume (1.7) with \(0<\delta \leqslant 1\). Then, for any \(\varepsilon >0\),

$$\begin{aligned} M_2(z)&= 2^{3/2}(1-\log 2)\sigma ^{-3} (1-z)^{-1/2}L(z) +{}c_{4}(1-z)^{-1/2}+ O\bigl (|1-z|^{-\frac{1}{2}+\frac{\delta }{2}-\varepsilon }\bigr ) .\end{aligned}$$
(3.21)

Proof

We use Lemma 3.1 with the notation \(R_\ell (z)\) as in (3.8). For \(\ell =2\), Lemma 3.1 shows using (2.19) that

$$\begin{aligned} R_2(z)&= B(z)^{\odot 2}\odot y(z) +2B(z)\odot \bigl [zM_1(z)\Phi '(y(z))\bigr ] +zM_1(z)^2\Phi ''\bigl (y(z)\bigr ). \end{aligned}$$
(3.22)

We consider the three terms separately.

First, by (3.5), (2.20), and Lemma 2.3 (twice), we have

$$\begin{aligned} B(z)^{\odot 2}\odot y(z) = B(z)^{\odot 2}\odot \bigl (y(z)-1\bigr ) ={}c_{5}+ O\bigl (|1-z|^{\frac{1}{2}-2\varepsilon }\bigr ). \end{aligned}$$
(3.23)

For the remaining two terms, we have to be more careful, since it will turn out that their main terms cancel.

For the second term, we note first that (3.10) implies \(M_1(z)=O\bigl (|1-z|^{-\varepsilon }\bigr )\), and thus, (2.27) yields

$$\begin{aligned} zM_1(z)\Phi '(y(z))=M_1(z)+O\bigl (|1-z|^{\frac{1}{2}-\varepsilon }\bigr ). \end{aligned}$$
(3.24)

Hence, (3.5) and Lemma 2.3 yield

$$\begin{aligned} B(z)\odot \bigl [ zM_1(z)\Phi '(y(z))\bigr ]&=B(z)\odot M_1(z) +{}c_{6}+ O\bigl (|1-z|^{\frac{1}{2}-2\varepsilon }\bigr ) .\end{aligned}$$
(3.25)

This implies, using (3.5), (3.10), and Lemma 2.3 again, followed by (3.3), and recalling \({\text {Li}}_{0,1}\odot L(z)={\text {Li}}_{0,1}\odot {\text {Li}}_{1,0}(z)={\text {Li}}_{1,1}(z)\),

$$\begin{aligned} B(z)\odot \bigl [zM_1(z)\Phi '(y(z))\bigr ]&=-\sigma ^{-2}B(z)\odot L(z)+{}c_{7}+O\bigl (|1-z|^{\frac{\delta }{2}-2\varepsilon }\bigr ) \nonumber \\ {}&=-\sigma ^{-2}\bigl ({\text {Li}}_{0,1}(z)\odot L(z) -\mu 'L(z)\bigr )+c_{7}+O\bigl (|1-z|^{\frac{\delta }{2}-2\varepsilon }\bigr ) \nonumber \\ {}&=-\sigma ^{-2}{\text {Li}}_{1,1}(z)+\sigma ^{-2}\mu 'L(z)+c_{7}+O\bigl (|1-z|^{\frac{\delta }{2}-2\varepsilon }\bigr ) .\end{aligned}$$
(3.26)

We use the singular expansion of \({\text {Li}}_{1,1}(z)\):

$$\begin{aligned} {\text {Li}}_{1,1}(z) = \tfrac{1}{2} L^2(z)-\gamma L(z)+{}c_{8}+O\bigl (|1-z|^{1-\varepsilon }\bigr ), \end{aligned}$$
(3.27)

which follows from [8, p. 380] and is given in [7, p. 96], except that the error term there should be \(O(|(1 - z) L(z)|)\), not \(O(|1 - z|)\). Consequently, (3.26) yields

$$\begin{aligned} B(z)\odot \bigl [zM_1(z)\Phi '(y(z))\bigr ] =-\tfrac{1}{2}\sigma ^{-2}L^2(z)+\sigma ^{-2}(\gamma +\mu ') L(z)+{}c_{9}+O\bigl (|1-z|^{\frac{\delta }{2}-2\varepsilon }\bigr ) .\end{aligned}$$
(3.28)

For the third term in (3.22), we have by (2.28) and (3.10), again using \(M_1(z)=O\bigl (|1-z|^{-\varepsilon }\bigr )\),

$$\begin{aligned} zM_1(z)^2\Phi ''\bigl (y(z)\bigr )&= \sigma ^2M_1(z)^2 + O\bigl (|1-z|^{\frac{\delta }{2}-2\varepsilon }\bigr ) \nonumber \\ {}&= \sigma ^{-2}L^2(z) -2\frac{{\mu '-\psi (-\frac{1}{2})}}{\sigma ^2} L(z) +{}c_{10}+ O\bigl (|1-z|^{\frac{\delta }{2}-2\varepsilon }\bigr ). \end{aligned}$$
(3.29)

Finally, (3.22) yields, by summing (3.23), (3.28) (twice), and (3.29), recalling (3.9),

$$\begin{aligned} R_2(z)&= 2\frac{{\gamma +\psi (-\frac{1}{2})}}{\sigma ^2} L(z) +{}c_{11}+ O\bigl (|1-z|^{\frac{\delta }{2}-2\varepsilon }\bigr ) \nonumber \\ {}&= 4(1-\log 2)\sigma ^{-2}L(z) +c_{11}+ O\bigl (|1-z|^{\frac{\delta }{2}-2\varepsilon }\bigr ) .\end{aligned}$$
(3.30)

The result (3.21) now follows by (3.30), (3.8), and (2.22), and replacing \(\varepsilon \) by \(\varepsilon /2\) (as we may because \(\varepsilon \) is arbitrary). \(\square \)

This gives the asymptotics for the second moment of the shape functional. Again, the result includes earlier results for special cases in [1, 3, 7, 16, 19]. Recall from (3.2) that \(F(\mathcal {T}_n)=X_n'(0)-\mu 'n\).

Lemma 3.5

Assume (1.7) with \(\delta >0\). Then

$$\begin{aligned} {\mathbb E{}}\bigl [\bigl (X'_n(0)-\mu 'n\bigr )^2 \bigr ]&= {\mathbb E{}}\bigl [F(\mathcal {T}_n)^2\bigr ] = 4(1-\log 2)\sigma ^{-2} n\log n + O\bigl (n\bigr ), \end{aligned}$$
(3.31)

and thus,

$$\begin{aligned} {\text {Var}}X_n'(0)&= {\text {Var}}{F(\mathcal {T}_n)} = 4(1-\log 2)\sigma ^{-2} n\log n + O\bigl (n\bigr ) .\end{aligned}$$
(3.32)

Proof

We may assume \(\delta \leqslant 1\). The definition (3.6) and the singular expansion (3.21) yield by standard singularity analysis (using (2.10)–(2.11) or [9, Figure VI.5, p. 388])

$$\begin{aligned} q_n{\mathbb E{}}\bigl [F(\mathcal {T}_n)^2\bigr ] = \frac{2^{3/2}(1-\log 2)}{\sqrt{\pi }}\sigma ^{-3} n^{-1/2}\log n + O\bigl (n^{-\frac{1}{2}}\bigr ) .\end{aligned}$$
(3.33)

Hence, (3.31) follows by (2.24). Finally, (3.32) follows by (3.31) and (3.19). \(\square \)
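Lemma 3.5 can be illustrated by simulation. The sketch below is our own and only a rough check, since the O(n) term in (3.32) is smaller than the main term only by a factor \(\log n\): it samples \(\mathcal {T}_n\) for \(\xi \sim {\text {Po}}(1)\) (so \(\sigma ^2=1\)) by rejection on the total progeny followed by the cycle lemma, computes the shape functional from the preorder offspring counts, and compares the empirical variance with \(4(1-\log 2)\,n\log n\):

```python
import math
import numpy as np

rng = np.random.default_rng(0)

def sample_deg(n):
    """Preorder offspring counts of T_n for Po(1) offspring, via rejection
    on the total progeny and the cycle lemma (Dvoretzky-Motzkin)."""
    while True:
        d = rng.poisson(1.0, n)
        if d.sum() == n - 1:
            break
    walk = np.cumsum(d - 1)              # Lukasiewicz walk S_1, ..., S_n
    k = int(np.argmin(walk)) + 1         # start just after the first minimum
    return np.concatenate((d[k:], d[:k]))

def shape_functional(deg):
    """X_n'(0) = sum_v log|T_{n,v}|, cf. (1.6), from preorder offspring counts."""
    size, stack = [1] * len(deg), []
    for i in range(len(deg) - 1, -1, -1):
        for _ in range(deg[i]):
            size[i] += size[stack.pop()]
        stack.append(i)
    return sum(math.log(s) for s in size)

n, reps = 1000, 500
x = [shape_functional(sample_deg(n)) for _ in range(reps)]
# The ratio approaches 4(1 - log 2) ~ 1.227, with relative error O(1/log n)
# plus Monte Carlo noise:
print(np.var(x) / (n * math.log(n)))
```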

3.3 Higher Moments

We extend the results above to higher moments, using the method employed earlier for special cases in [1, 7]; see also [19] for a different method (in another special case).

We prove the following analogue of [5, Lemma 12.8]. Note that (3.34) is not true for \(\ell =1\), since the leading power of L(z) in that case is \(L(z)^1\) by (3.10). (Also (3.35) fails for \(\ell =1\) in general.)

Lemma 3.6

Assume (1.7) with \(0<\delta \leqslant 1\). Then, for every \(\ell \geqslant 2\), \(M_\ell (z)\) is \(\Delta \)-analytic, and, for any \(\varepsilon >0\),

$$\begin{aligned} M_\ell (z)&= \sigma ^{-\ell -1}(1-z)^{(1-\ell )/2}\sum _{j=0}^{\lfloor \ell /2\rfloor }\kappa _{\ell ,j}L(z)^j +O\bigl (|1-z|^{-\frac{1}{2}\ell +\frac{1}{2}+\frac{\delta }{2}-\varepsilon }\bigr ) \end{aligned}$$
(3.34)
$$\begin{aligned}&=\sigma ^{-\ell -1}\sum _{j=0}^{\lfloor \ell /2\rfloor }\widehat{\kappa }_{\ell ,j}{\text {Li}}_{(3-\ell )/2,j}(z) +O\bigl (|1-z|^{-\frac{1}{2}\ell +\frac{1}{2}+\frac{\delta }{2}-\varepsilon }\bigr ) , \end{aligned}$$
(3.35)

for some coefficients \(\kappa _{\ell ,j}\) and \(\widehat{\kappa }_{\ell ,j}\). The leading coefficients \(\kappa ^*_{2k}:=\kappa _{2k,k}\) in the case that \(\ell = 2 k\) is even are given by the recursion

$$\begin{aligned} \kappa ^*_2&=2^{3/2}(1-\log 2), \end{aligned}$$
(3.36)
$$\begin{aligned} \kappa ^*_{2k}&=2^{-3/2}\sum _{i=1}^{k-1}\left( {\begin{array}{c}2k\\ 2i\end{array}}\right) \kappa ^*_{2i}\kappa ^*_{2(k-i)}, \qquad k\geqslant 2 .\end{aligned}$$
(3.37)

Furthermore,

$$\begin{aligned} \widehat{\kappa }_{2k,k}=\Gamma \bigl (k-\tfrac{1}{2}\bigr )^{-1}\kappa _{2k,k} =\Gamma \bigl (k-\tfrac{1}{2}\bigr )^{-1}\kappa ^*_{2k} .\end{aligned}$$
(3.38)

Proof

Note first that (3.34) and (3.35) are equivalent by Lemma 2.1, and that (3.38) follows using (2.11).

We use induction on \(\ell \). The base case \(\ell =2\) (including (3.36)) is Lemma 3.4, so we assume \(\ell \geqslant 3\). We follow the proof of [5, Lemma 12.8], mutatis mutandis.

We first note that \(L(z)=O\bigl (|1-z|^{-\varepsilon }\bigr )\). Hence, for every \(\ell '<\ell \), the induction hypothesis and (for the case \(\ell '=1\)) Lemma 3.2 show that

$$\begin{aligned} M_{\ell '}(z) = O\bigl (|1-z|^{-\frac{1}{2}\ell '+\frac{1}{2}-\varepsilon }\bigr ). \end{aligned}$$
(3.39)

(Here and in the sequel we replace without further comment, as we may, \(c\varepsilon \) by \(\varepsilon \), for any constant c possibly depending on \(\ell \).) Hence, using Lemma 2.6, for a typical term in (3.7) (with \(m\geqslant 0\)),

$$\begin{aligned} zM_{\ell _1}(z)\cdots M_{\ell _m}(z)\Phi ^{(m)}\bigl (y(z)\bigr )&= O\bigl (|1-z|^{-\frac{1}{2}\sum _{i=1}^m\ell _i+\frac{1}{2}m-\varepsilon }\Phi ^{(m)}\bigl (y(z)\bigr )\bigr ) \nonumber \\&= {\left\{ \begin{array}{ll} O\bigl (|1-z|^{-\frac{1}{2}(\ell -\ell _0)+\frac{1}{2}m-\varepsilon }\bigr ), &{} m\leqslant 2, \\ O\bigl (|1-z|^{-\frac{1}{2}(\ell -\ell _0)+1+\frac{\delta }{2}-\varepsilon }\bigr ), &{} m\geqslant 3. \end{array}\right. } \end{aligned}$$
(3.40)

Since \(\ell - \ell _0 \geqslant m\), the exponent here is \(<0\). Hence, (3.5) and Lemma 2.3 applied \(\ell _0\) times yield

$$\begin{aligned}&B(z)^{\odot \ell _0} \odot \bigl [zM_{\ell _1}(z)\cdots M_{\ell _m}(z)\Phi ^{(m)}\bigl (y(z)\bigr )\bigr ] \nonumber \\&\quad = {\left\{ \begin{array}{ll} O\bigl (|1-z|^{-\frac{1}{2}\ell +\frac{1}{2}\ell _0+\frac{1}{2}m-\varepsilon }\bigr ), &{} m\leqslant 2, \\ O\bigl (|1-z|^{-\frac{1}{2}\ell +\frac{1}{2}\ell _0+1+\frac{\delta }{2}-\varepsilon }\bigr ), &{} m\geqslant 3. \end{array}\right. } \end{aligned}$$
(3.41)

If \(m=0\), then \(\ell _0=\ell \geqslant 3\), and if \(m=1\), then \(\ell _1<\ell \) and thus, \(\ell _0=\ell -\ell _1\geqslant 1\). Hence, except in the two cases (1) \(m=1\) and \(\ell _0=1\) and (2) \(m=2\) and \(\ell _0=0\), we have \(m+\ell _0\geqslant 3\), and then the exponent in (3.41) is \(\geqslant -\frac{1}{2}\ell +1+\frac{\delta }{2}-\varepsilon \). Consequently, by (3.7)–(3.8),

$$\begin{aligned} R_\ell (z) = \ell B(z) \odot \bigl [z M_{\ell -1}(z) \Phi '\bigl (y(z)\bigr )\bigr ]&+ \frac{1}{2}\sum _{j=1}^{\ell -1}\left( {\begin{array}{c}\ell \\ j\end{array}}\right) z M_j(z)M_{\ell -j}(z) \Phi ''\bigl (y(z)\bigr )\nonumber \\&+O\bigl (|1-z|^{-\frac{1}{2}\ell +1+\frac{\delta }{2}-\varepsilon }\bigr ) .\end{aligned}$$
(3.42)

By (2.27), (3.39), (3.5), and Lemma 2.3, we have, similarly to (3.25),

$$\begin{aligned} B(z)\odot \bigl [ zM_{\ell -1}(z)\Phi '(y(z))\bigr ]&=B(z)\odot M_{\ell -1}(z) +O\bigl (|1-z|^{-\frac{1}{2}\ell +\frac{3}{2}-\varepsilon }\bigr ) .\end{aligned}$$
(3.43)

Hence, using also (2.28) and (again) (3.39), we can simplify (3.42) to

$$\begin{aligned} R_\ell (z) = \ell B(z) \odot M_{\ell -1}(z)&+ \frac{\sigma ^2}{2}\sum _{j=1}^{\ell -1}\left( {\begin{array}{c}\ell \\ j\end{array}}\right) M_j(z)M_{\ell -j}(z) +O\bigl (|1-z|^{-\frac{1}{2}\ell +1+\frac{\delta }{2}-\varepsilon }\bigr ) .\end{aligned}$$
(3.44)

In the remaining estimates, we have to be more careful, in particular since there will be important cancellations. (This is as in the case \(\ell =2\) treated earlier, but somewhat different.)

Consider first the Hadamard product in (3.44) (the case \(m=1\) and \(\ell _0=1\) above). We now use the induction hypothesis in the form (3.35) and obtain by (2.14) and (3.3), using again (3.5) and Lemma 2.3 for the error term, and finally rewriting by (2.8),

$$\begin{aligned}&B(z)\odot M_{\ell -1}(z) \nonumber \\&\quad =\sigma ^{-\ell } \sum _{j=0}^{\lfloor (\ell -1)/2\rfloor }\widehat{\kappa }_{\ell -1,j} \bigl ({\text {Li}}_{(4-\ell )/2,j+1}(z)-\mu '{\text {Li}}_{(4-\ell )/2,j}(z)\bigr ) +O\bigl (|1-z|^{-\frac{1}{2}\ell +1+\frac{\delta }{2}-\varepsilon }\bigr ) \nonumber \\&\quad =\sigma ^{-\ell } \sum _{k=0}^{\lfloor (\ell +1)/2\rfloor }c^{(1)}_{\ell ,k}{\text {Li}}_{(4-\ell )/2,k}(z) +O\bigl (|1-z|^{-\frac{1}{2}\ell +1+\frac{\delta }{2}-\varepsilon }\bigr ) \nonumber \\&\quad = \sigma ^{-\ell } (1-z)^{-\frac{1}{2}\ell +1} \sum _{k=0}^{\lfloor (\ell +1)/2\rfloor } c^{(2)}_{\ell ,k} L(z)^k +O\bigl (|1-z|^{-\frac{1}{2}\ell +1+\frac{\delta }{2}-\varepsilon }\bigr ) ,\end{aligned}$$
(3.45)

where the leading coefficient in the sum is, using (2.9) and (2.11),

$$\begin{aligned} c^{(2)}_{\ell ,\lfloor (\ell +1)/2\rfloor } = \Gamma (\ell /2-1) c^{(1)}_{\ell ,\lfloor (\ell +1)/2\rfloor } = \Gamma (\ell /2-1) \widehat{\kappa }_{\ell -1,\lfloor (\ell -1)/2\rfloor } =\kappa _{\ell -1,\lfloor (\ell -1)/2\rfloor } .\end{aligned}$$
(3.46)

The leading term in (3.45) is, thus,

$$\begin{aligned} \sigma ^{-\ell }\kappa _{\ell -1,\lfloor (\ell -1)/2\rfloor } (1-z)^{-\frac{1}{2}\ell +1} L(z)^{\lfloor (\ell +1)/2\rfloor } .\end{aligned}$$
(3.47)

Consider now the terms with \(j=1\) and \(j=\ell -1\) in the sum in (3.44). By Lemma 3.2 and the induction hypothesis, we have

$$\begin{aligned} \sigma ^2M_1(z)M_{\ell -1}(z)&=\sigma ^{-\ell } (1-z)^{-\frac{1}{2}\ell +1} \sum _{j=0}^{\lfloor (\ell -1)/2\rfloor }\kappa _{\ell -1,j} \bigl [-L(z)^{j+1}+cL(z)^j\bigr ] \nonumber \\&\quad +O\bigl (|1-z|^{-\frac{1}{2}\ell +1+\frac{\delta }{2}-\varepsilon }\bigr ) .\end{aligned}$$
(3.48)

Note that the leading term in (3.48) cancels (3.47). Consequently, (3.45)–(3.48) yield

$$\begin{aligned}&\ell B(z) \odot M_{\ell -1}(z) + \frac{\sigma ^2}{2}\cdot 2\cdot \left( {\begin{array}{c}\ell \\ 1\end{array}}\right) M_1(z)M_{\ell -1}(z) \nonumber \\ {}&\quad = (1-z)^{-\frac{1}{2}\ell +1} \sum _{k=0}^{\lfloor (\ell -1)/2\rfloor } c^{(3)}_{\ell ,k}L(z)^k +O\bigl (|1-z|^{-\frac{1}{2}\ell +1+\frac{\delta }{2}-\varepsilon }\bigr ) .\end{aligned}$$
(3.49)

The remaining terms in (3.44) yield immediately, by the induction hypothesis,

$$\begin{aligned}&\frac{\sigma ^2}{2}\sum _{j=2}^{\ell -2}\left( {\begin{array}{c}\ell \\ j\end{array}}\right) M_j(z)M_{\ell -j}(z) = (1-z)^{-\frac{1}{2}\ell +1} \sum _{k=0}^{\lfloor \ell /2\rfloor } c^{(4)}_{\ell ,k}L(z)^k +O\bigl (|1-z|^{-\frac{1}{2}\ell +1+\frac{\delta }{2}-\varepsilon }\bigr ) .\end{aligned}$$
(3.50)

Finally, (3.44) and (3.49)–(3.50) yield

$$\begin{aligned} R_\ell (z) = (1-z)^{-\frac{1}{2}\ell +1} \sum _{j=0}^{\lfloor \ell /2\rfloor }c^{(5)}_{\ell ,j}L(z)^j +O\bigl (|1-z|^{-\frac{1}{2}\ell +1+\frac{\delta }{2}-\varepsilon }\bigr ) , \end{aligned}$$
(3.51)

and (3.34) follows by (3.8) and (2.22), which completes the induction step.

It remains only to show the recursion (3.37) for the leading coefficients. If \(\ell =2k\) is even, with \(\ell \geqslant 4\), then (3.49) does not contribute to \(c^{(5)}_{2k,k}\) nor, thus, to \(\kappa _{2k,k}\), and neither do the terms in (3.50) with j odd. Hence, the argument above yields

$$\begin{aligned} c^{(5)}_{2k,k}=\frac{1}{2}\sum _{i=1}^{k-1}\left( {\begin{array}{c}2k\\ 2i\end{array}}\right) \sigma ^{-2k}\kappa _{2i,i}\kappa _{2k-2i,k-i} \end{aligned}$$
(3.52)

and, thus, recalling again (2.22),

$$\begin{aligned} \kappa _{2k,k}=2^{-3/2}\sum _{i=1}^{k-1}\left( {\begin{array}{c}2k\\ 2i\end{array}}\right) \kappa _{2i,i}\kappa _{2k-2i,k-i}, \end{aligned}$$
(3.53)

which is (3.37). \(\square \)

The recursion (3.37) is the same as [5, (C.35)] and, thus, has the same solution [5, (C.40)], i.e.

$$\begin{aligned} \kappa ^*_{2k}=2^{3/2}\frac{(2k)!\,(2k-2)!}{(k-1)!\,k!} d_1^k, \qquad k\geqslant 1, \end{aligned}$$
(3.54)

with, see [5, (C.36)] and (3.36),

$$\begin{aligned} d_1:=2^{-3/2}\kappa ^*_2/2 =\tfrac{1}{2}(1-\log 2) .\end{aligned}$$
(3.55)
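It is straightforward to verify numerically that (3.54)–(3.55) satisfy the recursion (3.36)–(3.37); a few lines (our sketch) suffice:

```python
from math import comb, factorial, isclose, log

d1 = (1 - log(2)) / 2                    # (3.55)

def kappa_star(k):
    """Closed form (3.54) for the leading coefficients kappa*_{2k}."""
    return (2**1.5 * factorial(2 * k) * factorial(2 * k - 2)
            / (factorial(k - 1) * factorial(k)) * d1**k)

print(isclose(kappa_star(1), 2**1.5 * (1 - log(2))))     # (3.36): True
for k in range(2, 8):                                    # (3.37): all True
    rhs = 2**-1.5 * sum(comb(2 * k, 2 * i) * kappa_star(i) * kappa_star(k - i)
                        for i in range(1, k))
    print(k, isclose(kappa_star(k), rhs))
```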

This is what we need to complete the proof of the asymptotic normality of \(F(\mathcal {T}_n)\).

Proof of Theorem 1.3

If \(\ell \geqslant 2\), then (3.6), the expansion (3.35), (2.5), and standard singularity analysis yield

$$\begin{aligned} q_n{\mathbb E{}}\bigl [F(\mathcal {T}_n)^\ell \bigr ] =\sigma ^{-\ell -1} \widehat{\kappa }_{\ell ,\lfloor \ell /2\rfloor } n^{(\ell -3)/2}(\log n)^{\lfloor \ell /2\rfloor } +O\bigl (n^{(\ell -3)/2}(\log n)^{\lfloor \ell /2\rfloor -1}\bigr ) .\end{aligned}$$
(3.56)

Hence, using (2.24),

$$\begin{aligned} {\mathbb E{}}\bigl [F(\mathcal {T}_n)^\ell \bigr ] =\sigma ^{-\ell } \sqrt{2\pi }\widehat{\kappa }_{\ell ,\lfloor \ell /2\rfloor } n^{\ell /2}(\log n)^{\lfloor \ell /2\rfloor } +O\bigl (n^{\ell /2}(\log n)^{\lfloor \ell /2\rfloor -1}\bigr ) .\end{aligned}$$
(3.57)

Consequently,

$$\begin{aligned} \frac{{\mathbb E{}}\bigl [F(\mathcal {T}_n)^\ell \bigr ] }{(n\log n)^{\ell /2}} \rightarrow {\left\{ \begin{array}{ll} 0,&{} \ell =2k+1\geqslant 3, \\ \sigma ^{-2k} \sqrt{2\pi }\widehat{\kappa }_{2k,k}, &{}\ell =2k\geqslant 2. \end{array}\right. } \end{aligned}$$
(3.58)

Furthermore, (3.58) holds also for \(\ell =1\) (with limit 0) by (3.19).

For even \(\ell =2k\), the limit in (3.58) is by (3.38), (3.54), and (3.55), cf. [5, (C.41)],

$$\begin{aligned} \sigma ^{-2k}\frac{\sqrt{2\pi }}{\Gamma (k-\frac{1}{2})}\kappa ^*_{2k}&= \sigma ^{-2k}\frac{4\sqrt{\pi }}{\Gamma (k-\frac{1}{2})} \frac{(2k)!\,(2k-2)!}{(k-1)!\,k!} d_1^k =\sigma ^{-2k}2^{2k}\frac{(2k)!}{k!}d_1^k \nonumber \\ {}&=\bigl (8d_1\sigma ^{-2}\bigr )^k\cdot (2k-1)!! = \bigl (4(1-\log 2)\sigma ^{-2}\bigr )^k \cdot (2k-1)!! .\end{aligned}$$
(3.59)

Consequently, the limits appearing in (3.58) are the moments of a normal distribution \(N\bigl (0,4(1-\log 2)\sigma ^{-2}\bigr )\), and thus, (1.10) follows by the method of moments. (Recall that \(F(\mathcal {T}_n)=X_n'(0)-\mu 'n\) by (3.2).) \(\square \)

4 Imaginary Powers

In this section, we consider \(X_n(\alpha )\) in (1.1) when the exponent \(\alpha \) is purely imaginary, i.e., \({\text {Re}}\alpha =0\). We exclude the trivial case \(\alpha =0\), when \(X_n(\alpha )=n\) is non-random. We assume throughout the section that \(0<\delta <1\) and that (1.7) holds. As above, \(\varepsilon \) is an arbitrarily small positive number, and we replace \(c\varepsilon \) by \(\varepsilon \) without comment.

We follow rather closely the argument for the case \(0<{\text {Re}}\alpha <1/2\) in [5, §§12.4–6], but we will see new terms appearing that will lead to the dominating terms with logarithmic factors for the moments; this is very similar to the argument in Sect. 3, but we will see some differences. (Notably, there are no cancellations of leading terms like those in Sect. 3.)

As in [5, §12.4], we define

$$\begin{aligned} b_n:=n^\alpha -\mu (\alpha ), \end{aligned}$$
(4.1)

with the following generating function (cf. [5, (12.44)] and (2.7), and note \({\text {Li}}_0(z)=z(1-z)^{-1}\)):

$$\begin{aligned} B(z)&= B_\alpha (z):=\sum _{n=1}^\infty b_nz^n ={\text {Li}}_{-\alpha }(z)-\mu (\alpha ){\text {Li}}_0(z) \end{aligned}$$
(4.2)
$$\begin{aligned}&= \Gamma (1+\alpha )(1-z)^{-\alpha -1}-\mu (\alpha )(1-z)^{-1}+O(1) \end{aligned}$$
(4.3)
$$\begin{aligned}&= O\bigl (|1-z|^{-1}\bigr ) .\end{aligned}$$
(4.4)

Let now \(F(T)=F_\alpha (T)\) denote the additive functional defined by the toll function \(f_\alpha (T):=b_{|T|}\). Thus,

$$\begin{aligned} F_\alpha (\mathcal {T}_n)=X_n(\alpha )-n\mu (\alpha ). \end{aligned}$$
(4.5)

4.1 The Mean

For the mean, we define the generating function

$$\begin{aligned} M_\alpha (z):={\mathbb E{}}\bigl [F_\alpha (\mathcal {T}) z^{|\mathcal {T}|}\bigr ] =\sum _{n=1}^\infty q_n {\mathbb E{}}[F_\alpha (\mathcal {T}_n)] z^n .\end{aligned}$$
(4.6)

We then have, as in (3.12) and [5, (12.29)],

$$\begin{aligned} M_\alpha (z) =\frac{zy'(z)}{y(z)}\cdot \bigl ( B_\alpha (z)\odot y(z)\bigr ) .\end{aligned}$$
(4.7)

Thus, \(M_\alpha (z)\) is \(\Delta \)-analytic. Further, we have by (2.13), (4.2), and (2.23), using (4.4) and Lemma 2.3 for the error term in (2.23), and then, for the second line, using (2.7) and \(\Gamma (-\frac{1}{2})=-2\sqrt{\pi }\),

$$\begin{aligned} B_\alpha (z)&\odot y(z ) = \frac{1}{\sqrt{2\pi }\sigma }{\text {Li}}_{3/2-\alpha }(z) -\frac{\mu (\alpha )}{\sqrt{2\pi }\sigma }{\text {Li}}_{3/2}(z) +c_1+O\bigl (|1-z|^{\frac{1}{2}+\frac{\delta }{2}}\bigr ) \nonumber \\ {}&= \frac{\Gamma (\alpha -\tfrac{1}{2})}{\sqrt{2\pi }\sigma } (1-z)^{\tfrac{1}{2}-\alpha } +2^{1/2}\sigma ^{-1}\mu (\alpha )(1-z)^{1/2}+c_2+O\bigl (|1-z|^{\frac{1}{2}+\frac{\delta }{2}}\bigr ). \end{aligned}$$
(4.8)

Further, similarly to (3.14),

$$\begin{aligned} (B_\alpha \odot y)(1) = \sum _{n=1}^\infty b_n q_n =\sum _{n=1}^\infty q_n [n^\alpha -\mu (\alpha )] ={\mathbb E{}}|\mathcal {T}|^\alpha - \mu (\alpha )=0. \end{aligned}$$
(4.9)

Thus, letting \(z\rightarrow 1\) in (4.8) shows that \(c_2=(B_\alpha \odot y)(1)=0\).

Finally, (4.7), (2.22), and (4.8) yield, using (2.7) again,

$$\begin{aligned} M_\alpha (z)&= \frac{\Gamma (\alpha -\tfrac{1}{2})}{2\sqrt{\pi }\sigma ^2}(1-z)^{-\alpha } +\sigma ^{-2}\mu (\alpha ) +O\bigl (|1-z|^{\frac{\delta }{2}}\bigr ) \end{aligned}$$
(4.10)
$$\begin{aligned}&=\frac{\Gamma (\alpha -\tfrac{1}{2})}{2\sqrt{\pi }\sigma ^2\Gamma (\alpha )}{\text {Li}}_{1-\alpha }(z) +c_3 +O\bigl (|1-z|^{\frac{\delta }{2}}\bigr ) .\end{aligned}$$
(4.11)

Singularity analysis now yields, from (4.6) and (4.11),

$$\begin{aligned} q_n {\mathbb E{}}[F_\alpha (\mathcal {T}_n)] = \frac{\Gamma (\alpha -\tfrac{1}{2})}{2\sqrt{\pi }\sigma ^2\Gamma (\alpha )}n^{\alpha -1} +O\bigl (n^{-1-\frac{\delta }{2}}\bigr ) \end{aligned}$$
(4.12)

and, thus, by (2.24),

$$\begin{aligned} {\mathbb E{}}[F_\alpha (\mathcal {T}_n)] = \frac{\Gamma (\alpha -\tfrac{1}{2})}{\sqrt{2}\sigma \Gamma (\alpha )}n^{\frac{1}{2}+\alpha } +O\bigl (n^{\frac{1}{2}-\frac{\delta }{2}}\bigr ). \end{aligned}$$
(4.13)

Hence, recalling (4.5),

$$\begin{aligned} {\mathbb E{}}X_n(\alpha )=\mu (\alpha )n +\frac{\Gamma (\alpha -\tfrac{1}{2})}{\sqrt{2}\sigma \Gamma (\alpha )}n^{\frac{1}{2}+\alpha } +O\bigl (n^{\frac{1}{2}-\frac{\delta }{2}}\bigr ) .\end{aligned}$$
(4.14)

This agrees with [5, Theorem 1.7(ii)] (proved without (1.7), and by different methods), except that the error estimate here is smaller.

4.2 Higher Moments

For higher moments, we need mixed moments for \(\alpha \) and \(\overline{\alpha }=-\alpha \). Thus, somewhat more generally, fix \(\alpha _1\) and \(\alpha _2\) with \({\text {Re}}\alpha _1={\text {Re}}\alpha _2=0\) but \(\alpha _1\ne 0\ne \alpha _2\). We define, for integers \(\ell _1,\ell _2\geqslant 0\), the generating function

$$\begin{aligned} M_{\ell _1,\ell _2}(z)&:={\mathbb E{}}\bigl [F_{\alpha _1}(\mathcal {T})^{\ell _1}F_{\alpha _2}(\mathcal {T})^{\ell _2} z^{|\mathcal {T}|}\bigr ] =\sum _{n=1}^\infty q_n {\mathbb E{}}\bigl [F_{\alpha _1}(\mathcal {T}_n)^{\ell _1}F_{\alpha _2}(\mathcal {T}_n)^{\ell _2}\bigr ] z^n .\end{aligned}$$
(4.15)

Thus, \(M_{1,0}=M_{\alpha _1}\) and \(M_{0,1}=M_{\alpha _2}\) are given by (4.7). The functions \(M_{\ell ,r}\) can then be found by the following recursion, given in [5, (12.75)], for every \(\ell ,r\geqslant 0\) with \(\ell +r\geqslant 1\):

$$\begin{aligned}{} & {} M_{\ell ,r}(z) = \frac{z y'(z)}{y(z)} \sum _{m=0}^{\ell +r} \frac{1}{m!}\mathop {\mathrm {\sum \nolimits ^{**}}}\limits \left( {\begin{array}{c}\ell \\ \ell _0,\dots ,\ell _m\end{array}}\right) \left( {\begin{array}{c}r\\ r_0,\dots ,r_m\end{array}}\right) B_{\alpha _1}(z)^{\odot \ell _0} \nonumber \\{} & {} \quad \odot B_{\alpha _2}(z)^{\odot r_0} \odot \bigl [zM_{\ell _1,r_1}(z)\cdots M_{\ell _m,r_m}(z)\Phi ^{(m)}\bigl (y(z)\bigr )\bigr ], \end{aligned}$$
(4.16)

where \(\mathop {\mathrm {\sum \nolimits ^{**}}}\limits \) is the sum over all pairs of \((m+1)\)-tuples \((\ell _0,\dots ,\ell _m)\) and \((r_0,\dots ,r_m)\) of non-negative integers that sum to \(\ell \) and r, respectively, such that \(1\leqslant \ell _i+r_i<\ell + r\) for every \(i\geqslant 1\). (Note that there are two typographical errors in [5]: the lower summation limit should be \(m=0\), and the final qualification “\(i\geqslant 1\)” is missing there.) It follows by induction that every \(M_{\ell ,r}\) is \(\Delta \)-analytic.

We define for convenience \(R_{\ell ,r}(z)\) as the sum in (4.16); thus,

$$\begin{aligned} M_{\ell ,r}(z) = \frac{z y'(z)}{y(z)}R_{\ell ,r}(z). \end{aligned}$$
(4.17)

Let us first consider second moments. Taking \(\ell =r=1\) in (4.16) yields, recalling (2.19),

$$\begin{aligned} R_{1,1}(z)&= B_{\alpha _1}(z)\odot B_{\alpha _2}(z)\odot y(z) + B_{\alpha _1}(z)\odot [zM_{0,1}(z)\Phi '(y(z))] \nonumber \\ {}&\qquad + B_{\alpha _2}(z)\odot [zM_{1,0}(z)\Phi '(y(z))] + zM_{1,0}(z)M_{0,1}(z)\Phi ''(y(z)). \end{aligned}$$
(4.18)

The first term is, by (4.4) and (4.8) (where \(c_2=0\) by (4.9)) together with Lemma 2.3,

$$\begin{aligned} O\bigl (|1-z|^{-1}\bigr )\odot O\bigl (|1-z|^{1/2}\bigr )=c_4+O\bigl (|1-z|^{1/2}\bigr ). \end{aligned}$$
(4.19)

For the other terms in (4.18), we first note from (4.10) that \(M_{1,0}(z)=M_{\alpha _1}(z)=O(1)\) and \(M_{0,1}(z)=M_{\alpha _2}(z)=O(1)\). Thus, using also (2.27)–(2.28), (4.4), and Lemma 2.3, we may simplify to

$$\begin{aligned} R_{1,1}(z)&=c_5 + B_{\alpha _1}(z)\odot M_{0,1}(z) + B_{\alpha _2}(z)\odot M_{1,0}(z) + M_{1,0}(z)M_{0,1}(z)\sigma ^2\nonumber \\ {}&\qquad +O\bigl (|1-z|^{\delta /2}\bigr ). \end{aligned}$$
(4.20)

Furthermore, (4.10) yields

$$\begin{aligned} M_{1,0}(z)M_{0,1}(z)&=c_6(1-z)^{-\alpha _1}+c_7(1-z)^{-\alpha _2}+c_8(1-z)^{-\alpha _1-\alpha _2}\nonumber \\&\quad +c_9+O\bigl (|1-z|^{\frac{\delta }{2}}\bigr ) .\end{aligned}$$
(4.21)

We compute the Hadamard products in (4.20) by (2.13), (4.2), and (4.11), using again (4.4) and Lemma 2.3 for the error term. Together with (4.21), this yields from (4.20) a result that we write, using (2.7), as

$$\begin{aligned} R_{1,1}(z)&= \Bigl (\frac{\Gamma (\alpha _2-\tfrac{1}{2})}{2\sqrt{\pi }\sigma ^2\Gamma (\alpha _2)} +\frac{\Gamma (\alpha _1-\tfrac{1}{2})}{2\sqrt{\pi }\sigma ^2\Gamma (\alpha _1)}\Bigr ) {\text {Li}}_{1-\alpha _1-\alpha _2}(z) \nonumber \\ {}&\quad +c_{10}(1-z)^{-\alpha _1} +c_{11}(1-z)^{-\alpha _2} +c_8(1-z)^{-\alpha _1-\alpha _2} +c_{12} \nonumber \\ {}&\quad +O\bigl (|1-z|^{\delta /2}\bigr ). \end{aligned}$$
(4.22)

If \(\alpha _1+\alpha _2\ne 0\), we use (2.7) also on the first term and obtain

$$\begin{aligned} R_{1,1}(z)&= c_{13}(1-z)^{-\alpha _1-\alpha _2} +c_{10}(1-z)^{-\alpha _1} +c_{11}(1-z)^{-\alpha _2} +c_{14} \nonumber \\ {}&\quad +O\bigl (|1-z|^{\delta /2}\bigr ). \end{aligned}$$
(4.23)

On the other hand, if \(\alpha _1+\alpha _2=0\), we recall that \({\text {Li}}_1(z)=L(z)\), and thus, (4.22) yields

$$\begin{aligned} R_{1,1}(z)&= \frac{1}{\sqrt{\pi }\sigma ^2}{\text {Re}}\frac{\Gamma (\alpha _1-\tfrac{1}{2})}{\Gamma (\alpha _1)}\cdot L(z) +c_{10}(1-z)^{-\alpha _1} +c_{11}(1-z)^{-\alpha _2} +c_{15} \nonumber \\ {}&\quad +O\bigl (|1-z|^{\delta /2}\bigr ). \end{aligned}$$
(4.24)

We can now obtain \(M_{1,1}(z)\) from (4.23)–(4.24) by (4.17) and (2.22). We do not state the result separately, but proceed immediately to a general formula.

Lemma 4.1

Let \(\alpha \ne 0\) with \({\text {Re}}\alpha =0\), and take \(\alpha _1=\alpha \) and \(\alpha _2=\overline{\alpha }=-\alpha \). Then, for each pair of integers \(\ell ,r\geqslant 0\) with \(\ell +r\geqslant 2\), \(M_{\ell ,r}(z)\) is \(\Delta \)-analytic and we have, for some coefficients \(\varkappa _{\ell ,r;j,k}\) and \(\widehat{\varkappa }_{\ell ,r;j,k}\), and every \(\varepsilon >0\),

$$\begin{aligned} M_{\ell ,r}(z)&= \sum _{j,k}\varkappa _{\ell ,r;j,k} (1-z)^{(1-\ell -r)/2+j\alpha } L(z)^k +O\bigl (|1-z|^{\frac{1}{2}(1-\ell -r)+\frac{\delta }{2}-\varepsilon }\bigr ) \end{aligned}$$
(4.25)
$$\begin{aligned}&= \sum _{j,k}\widehat{\varkappa }_{\ell ,r;j,k}{\text {Li}}_{(3-\ell -r)/2+j\alpha ,k}(z) +O\bigl (|1-z|^{\frac{1}{2}(1-\ell -r)+\frac{\delta }{2}-\varepsilon }\bigr ) ,\end{aligned}$$
(4.26)

where the sums are over integers j and k with \(-\ell \leqslant j\leqslant r\) and \(0\leqslant k\leqslant \ell \wedge r\).

Furthermore, if \(\ell +r=1\), then (4.25) holds (but not (4.26)).

If \(\ell =r\), then the only non-zero coefficients with \(k=\ell =r\) are

$$\begin{aligned} \varkappa _{\ell ,\ell ;0,\ell }&=\sigma ^{-2\ell -1}\varkappa ^*_\ell , \end{aligned}$$
(4.27)
$$\begin{aligned} \widehat{\varkappa }_{\ell ,\ell ;0,\ell }&= \Gamma \bigl (\ell -\tfrac{1}{2}\bigr )^{-1}\varkappa _{\ell ,\ell ;0,\ell } =\frac{\sigma ^{-2\ell -1}}{\Gamma \bigl (\ell -\frac{1}{2}\bigr )}\varkappa ^*_\ell , \end{aligned}$$
(4.28)

where \(\varkappa ^*_\ell \) is given by the recursion

$$\begin{aligned} \varkappa ^*_1&= \frac{1}{\sqrt{2\pi }}{\text {Re}}\frac{\Gamma (\alpha -\tfrac{1}{2})}{\Gamma (\alpha )}, \end{aligned}$$
(4.29)
$$\begin{aligned} \varkappa ^*_\ell&=2^{-3/2}\sum _{i=1}^{\ell -1}\left( {\begin{array}{c}\ell \\ i\end{array}}\right) ^2\varkappa ^*_{i}\varkappa ^*_{\ell -i}, \qquad \ell \geqslant 2. \end{aligned}$$
(4.30)

Proof

Note first that for \(\ell +r=1\), (4.25) follows from (4.10). (We see also from (4.11) that (4.26) would hold if we add a constant term; the problem is that \({\text {Li}}_1(z)\) is L(z) and not a constant.)

Assume in the rest of the proof that \(\ell +r\geqslant 2\). Then the expansions (4.25) and (4.26) are equivalent by Lemma 2.1; furthermore, for the leading terms, (4.27) and (4.28) are equivalent by (2.11).

Consider next the case \(\ell +r=2\). If \((\ell ,r)=(2,0)\) or (0, 2), we can obtain the functions \(M_{2,0}(z)\) and \(M_{0,2}(z)\) as special cases of \(M_{1,1}(z)\) with \(\alpha _1 = \alpha _2 = \pm \alpha \); thus (4.23) applies, and we obtain (4.25) by (4.17) and (2.22). (Now only terms with \(k=0\) appear.)

If \(\ell =r=1\), we similarly use (4.24), (4.17), and (2.22) and obtain (4.25) including a single term with \(k=1\), viz. \(\varkappa _{1,1;0,1}L(z)(1-z)^{-1/2}\) with \(\varkappa _{1,1;0,1}\) given by (4.27) and (4.29).

For \(\ell +r\geqslant 3\), we use induction on \(\ell +r\). By the induction hypothesis (4.25) (including the case \(\ell +r=1\) just proved by (4.10)), we have for every \((\ell ',r')\) with \(1\leqslant \ell '+r'<\ell +r\),

$$\begin{aligned} M_{\ell ',r'}(z) = O\bigl (|1-z|^{-\frac{1}{2}(\ell '+r')+\frac{1}{2}-\varepsilon }\bigr ). \end{aligned}$$
(4.31)

Consequently, for a typical term in (4.16), as in (3.40) and using again Lemma 2.6,

$$\begin{aligned}&zM_{\ell _1,r_1}(z)\cdots M_{\ell _m,r_m}(z)\Phi ^{(m)}\bigl (y(z)\bigr )= O\bigl (|1-z|^{-\frac{1}{2}\sum _{i=1}^m(\ell _i+r_i)+\frac{1}{2}m-\varepsilon }\Phi ^{(m)}\bigl (y(z)\bigr )\bigr ) \nonumber \\&\quad = {\left\{ \begin{array}{ll} O\bigl (|1-z|^{-\frac{1}{2}(\ell +r-\ell _0-r_0)+\frac{1}{2}m-\varepsilon }\bigr ), &{} m\leqslant 2, \\ O\bigl (|1-z|^{-\frac{1}{2}(\ell +r-\ell _0-r_0)+1+\frac{\delta }{2}-\varepsilon }\bigr ), &{} m\geqslant 3. \end{array}\right. } \end{aligned}$$
(4.32)

Again the exponent here is \(<0\), and it follows by (4.4) and Lemma 2.3 that

$$\begin{aligned}&B_{\alpha _1}(z)^{\odot \ell _0} \odot B_{\alpha _2}(z)^{\odot r_0} \odot \bigl [zM_{\ell _1,r_1}(z)\cdots M_{\ell _m,r_m}(z)\Phi ^{(m)}\bigl (y(z)\bigr )\bigr ] \nonumber \\&\quad = {\left\{ \begin{array}{ll} O\bigl (|1-z|^{-\frac{1}{2}(\ell +r)+\frac{1}{2}(\ell _0+r_0)+\frac{1}{2}m-\varepsilon }\bigr ), &{} m\leqslant 2, \\ O\bigl (|1-z|^{-\frac{1}{2}(\ell +r)+\frac{1}{2}(\ell _0+r_0)+1+\frac{\delta }{2}-\varepsilon }\bigr ), &{} m\geqslant 3. \end{array}\right. } \end{aligned}$$
(4.33)

As in the proof of Lemma 3.6, except in the two cases (1) \(m=1\) and \(\ell _0+r_0=1\), and (2) \(m=2\) and \(\ell _0=r_0=0\), we have \(m+\ell _0+r_0\geqslant 3\), and then the exponent in (4.33) is \(\geqslant -\frac{1}{2}(\ell +r)+1+\frac{\delta }{2}-\varepsilon \). Consequently, by (4.16)–(4.17),

$$\begin{aligned} R_{\ell ,r}(z)&= \ell B_{\alpha _1}(z) \odot \bigl [z M_{\ell -1,r}(z) \Phi '\bigl (y(z)\bigr )\bigr ] +r B_{\alpha _2}(z) \odot \bigl [z M_{\ell ,r-1}(z) \Phi '\bigl (y(z)\bigr )\bigr ] \nonumber \\ {}&\qquad + \frac{1}{2}\mathop {\mathrm {\sum \sum }}\limits _{0<i+j<\ell +r} \left( {\begin{array}{c}\ell \\ i\end{array}}\right) \left( {\begin{array}{c}r\\ j\end{array}}\right) z M_{i,j}(z)M_{\ell -i,r-j}(z) \Phi ''\bigl (y(z)\bigr )\nonumber \\&\qquad +O\bigl (|1-z|^{-\frac{1}{2}(\ell +r)+1+\frac{\delta }{2}-\varepsilon }\bigr ) .\end{aligned}$$
(4.34)

As in (3.42)–(3.44) and (4.18)–(4.20), this can be simplified, using (2.27)–(2.28), (4.31), (4.4), and Lemma 2.3, and we obtain

$$\begin{aligned} R_{\ell ,r}(z)&= \ell B_{\alpha _1}(z) \odot M_{\ell -1,r}(z) +r B_{\alpha _2}(z) \odot M_{\ell ,r-1}(z) \nonumber \\ {}&\quad + \frac{\sigma ^2}{2}\mathop {\mathrm {\sum \sum }}\limits _{0<i+j<\ell +r} \left( {\begin{array}{c}\ell \\ i\end{array}}\right) \left( {\begin{array}{c}r\\ j\end{array}}\right) M_{i,j}(z)M_{\ell -i,r-j}(z) +O\bigl (|1-z|^{-\frac{1}{2}(\ell +r)+1+\frac{\delta }{2}-\varepsilon }\bigr ). \end{aligned}$$
(4.35)

By the induction hypothesis in the form (4.26) and (4.2), using as always Lemma 2.3 for the error term, we have

$$\begin{aligned} B_{\alpha _1}(z) \odot M_{\ell -1,r}(z)&= \sum _{j,k}\widehat{\varkappa }_{\ell -1,r;j,k} {\text {Li}}_{(4-\ell -r)/2+j\alpha ,k}(z) \odot \bigl ({\text {Li}}_{-\alpha }(z)-\mu (\alpha ){\text {Li}}_0(z)\bigr ) \nonumber \\ {}&\quad +O\bigl (|1-z|^{-\frac{1}{2}(\ell +r)+1+\frac{\delta }{2}-\varepsilon }\bigr ) \end{aligned}$$
(4.36)

summing over \(-(\ell -1)\leqslant j\leqslant r\) and \(0\leqslant k\leqslant (\ell -1)\wedge r\). By (2.14), this can be rearranged as follows:

$$\begin{aligned} \sum _{j,k}c^{(1)}_{\ell ,r;j,k} {\text {Li}}_{(4-\ell -r)/2+j\alpha ,k}(z) +O\bigl (|1-z|^{-\frac{1}{2}(\ell +r)+1+\frac{\delta }{2}-\varepsilon }\bigr ), \end{aligned}$$
(4.37)

now summing over \(-\ell \leqslant j\leqslant r\) and \(0\leqslant k\leqslant (\ell -1)\wedge r\). By Lemma 2.1, this can also be written

$$\begin{aligned} \sum _{j,k}c^{(2)}_{\ell ,r;j,k} (1-z)^{(2-\ell -r)/2+j\alpha } L(z)^k +O\bigl (|1-z|^{-\frac{1}{2}(\ell +r)+1+\frac{\delta }{2}-\varepsilon }\bigr ), \end{aligned}$$
(4.38)

still summing over \(-\ell \leqslant j\leqslant r\) and \(0\leqslant k\leqslant (\ell -1)\wedge r\).

By symmetry, \(B_{\alpha _2}(z) \odot M_{\ell ,r-1}(z) \) can also be written as (4.38) (with different coefficients \(c^{(2)}_{\ell ,r;j,k}\)), now summing over \(-\ell \leqslant j\leqslant r\) and \(0\leqslant k\leqslant \ell \wedge (r-1)\).

Finally, the double sum in (4.35) can by the induction hypothesis (4.25) also be written as (4.38), summing over \(-\ell \leqslant j\leqslant r\) and \(0\leqslant k\leqslant \ell \wedge r\).

Consequently, (4.35) yields

$$\begin{aligned} R_{\ell ,r}(z) = \sum _{j,k}c^{(3)}_{\ell ,r;j,k} (1-z)^{(2-\ell -r)/2+j\alpha } L(z)^k +O\bigl (|1-z|^{-\frac{1}{2}(\ell +r)+1+\frac{\delta }{2}-\varepsilon }\bigr ), \end{aligned}$$
(4.39)

summing over \(-\ell \leqslant j\leqslant r\) and \(0\leqslant k\leqslant \ell \wedge r\). By (4.17) and (2.22), this implies (4.25), which completes the induction proof of (4.25)–(4.26).

Now consider the case \(\ell =r\geqslant 2\). We see that then the only terms above with \(k=\ell =r\) come from the double sum in (4.35); moreover, they appear only for terms there with \(i=j\), and we obtain by induction that the only non-zero coefficient in (4.39) with \(k=\ell \) is, using (4.27),

$$\begin{aligned} c^{(3)}_{\ell ,\ell ;0,\ell } =\frac{\sigma ^2}{2}\sum _{i=1}^{\ell -1}\left( {\begin{array}{c}\ell \\ i\end{array}}\right) ^2\varkappa _{i,i;0,i}\varkappa _{\ell -i,\ell -i;0,\ell -i} =\frac{1}{2}\sigma ^{-2\ell }\sum _{i=1}^{\ell -1}\left( {\begin{array}{c}\ell \\ i\end{array}}\right) ^2\varkappa ^*_{i}\varkappa ^*_{\ell -i}. \end{aligned}$$
(4.40)

Hence, when deriving (4.25) from (4.39) by (4.17) and (2.22), we also find that the only non-zero coefficient with \(k=\ell \) is

$$\begin{aligned} \varkappa _{\ell ,\ell ;0,\ell } =2^{-1/2}\sigma ^{-1}c^{(3)}_{\ell ,\ell ;0,\ell } =2^{-3/2}\sigma ^{-2\ell -1}\sum _{i=1}^{\ell -1}\left( {\begin{array}{c}\ell \\ i\end{array}}\right) ^2\varkappa ^*_{i}\varkappa ^*_{\ell -i}.\end{aligned}$$
(4.41)

This proves (4.27) and (4.30). \(\square \)

The recursion (4.30) is the same as [5, (D.6)], and thus has the same solution [5, (D.10)]

$$\begin{aligned} \varkappa ^*_\ell = 2^{3/2} \frac{\ell !\,(2\ell -2)!}{(\ell -1)!}d_1^\ell , \end{aligned}$$
(4.42)

with, by [5, (D.9)] and (4.29),

$$\begin{aligned} d_1&:=2^{-3/2}\varkappa ^*_{1} =\frac{1}{4\sqrt{\pi }}{\text {Re}}\frac{\Gamma (\alpha -\tfrac{1}{2})}{\Gamma (\alpha )}. \end{aligned}$$
(4.43)
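
That (4.42)–(4.43) indeed solve the recursion (4.29)–(4.30) is easy to confirm numerically; in the following Python sketch, \(d_1\) is an arbitrary nonzero test value (the verification does not depend on it):

from math import comb, factorial

d1 = 0.7                                   # arbitrary nonzero test value
kappa = {1: 2**1.5 * d1}                   # (4.29), rewritten via d1 = 2^{-3/2} kappa*_1
for l in range(2, 10):                     # the recursion (4.30)
    kappa[l] = 2**-1.5 * sum(comb(l, i)**2 * kappa[i] * kappa[l - i]
                             for i in range(1, l))
for l in range(1, 10):                     # compare with the closed form (4.42)
    closed = 2**1.5 * factorial(l) * factorial(2*l - 2) / factorial(l - 1) * d1**l
    assert abs(kappa[l] - closed) < 1e-9 * abs(closed)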

Proof of Theorem 1.4

We have \(\alpha =\textrm{i}t\). If \(\ell +r\geqslant 2\), then (4.15), (4.26), (2.5), and singularity analysis yield

$$\begin{aligned} q_n{\mathbb E{}}\bigl [F_{\alpha }(\mathcal {T}_n)^\ell \,\overline{F_{\alpha }(\mathcal {T}_n)}^r\bigr ] = q_n{\mathbb E{}}\bigl [F_{\alpha }(\mathcal {T}_n)^\ell F_{\overline{\alpha }}(\mathcal {T}_n)^r\bigr ] = O\bigl ( n^{(\ell +r-3)/2}(\log n)^{\ell \wedge r}\bigr ) .\end{aligned}$$
(4.44)

When \(\ell =r\), we find more precisely

$$\begin{aligned} q_n{\mathbb E{}}\bigl [F_{\alpha }(\mathcal {T}_n)^\ell \,\overline{F_{\alpha }(\mathcal {T}_n)}^\ell \bigr ] = \widehat{\varkappa }_{\ell ,\ell ;0,\ell } n^{(2\ell -3)/2}(\log n)^{\ell } +O\bigl (n^{(2 \ell -3)/2}(\log n)^{\ell -1}\bigr ) .\end{aligned}$$
(4.45)

Hence, using (2.24) and (4.28),

$$\begin{aligned} {\mathbb E{}}\bigl [F_{\alpha }(\mathcal {T}_n)^\ell \,\overline{F_{\alpha }(\mathcal {T}_n)}^r\bigr ] = {\left\{ \begin{array}{ll} O\bigl (n^{(\ell +r)/2}(\log n)^{\ell \wedge r}\bigr ), &{} \ell \ne r, \\ \sigma ^{-2\ell }\frac{\sqrt{2\pi }}{\Gamma (\ell -\frac{1}{2})}\varkappa ^*_\ell n^{\ell }(\log n)^{\ell } + O\bigl (n^{\ell }(\log n)^{\ell -1}\bigr ), &{} \ell =r. \end{array}\right. } \end{aligned}$$
(4.46)

Consequently,

$$\begin{aligned} \frac{{\mathbb E{}}\bigl [F_{\alpha }(\mathcal {T}_n)^\ell \,\overline{F_{\alpha }(\mathcal {T}_n)}^r\bigr ]}{(n\log n)^{(\ell +r)/2}} \rightarrow {\left\{ \begin{array}{ll} 0,&{} \ell \ne r, \\ \sigma ^{-2\ell }\frac{\sqrt{2\pi }}{\Gamma (\ell -\frac{1}{2})}\varkappa ^*_\ell , &{}\ell =r\geqslant 1. \end{array}\right. } \end{aligned}$$
(4.47)

Furthermore, (4.47) holds also for \(\ell +r=1\) by (4.13).

For \(\ell =r\), the limit in (4.47) is, by (4.42) and (4.43) (cf. [5, (D.11)]),

$$\begin{aligned} \sigma ^{-2\ell }\frac{\sqrt{2\pi }}{\Gamma (\ell -\frac{1}{2})}\varkappa ^*_{\ell }&= \sigma ^{-2\ell }\frac{4\sqrt{\pi }}{\Gamma (\ell -\frac{1}{2})} \frac{\ell !\,(2\ell -2)!}{(\ell -1)!} d_1^\ell =\sigma ^{-2\ell }2^{2\ell }\ell !\,d_1^\ell \nonumber \\ {}&=\bigl (4d_1\sigma ^{-2}\bigr )^\ell \cdot \ell ! = \Bigl ( \frac{1}{\sqrt{\pi }\sigma ^2}{\text {Re}}\frac{\Gamma (\alpha -\tfrac{1}{2})}{\Gamma (\alpha )}\Bigr )^\ell \cdot \ell ! .\end{aligned}$$
(4.48)

Consequently, by (2.1), the limits in (4.47) are the moments of a symmetric complex normal distribution with variance (1.12), and thus, (1.11) follows by the method of moments. (Recall that \(F_\alpha (\mathcal {T}_n)=X_n(\alpha )-\mu (\alpha )n\) by (4.5).)
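
For a concrete illustration of the moment identity used here: if \(\zeta \) is symmetric complex normal with \({\mathbb E{}}|\zeta |^2 = v\), then \({\mathbb E{}}|\zeta |^{2\ell } = \ell !\, v^\ell \), matching (4.48). A Monte Carlo sketch in Python (the variance v, the seed, and the sample size are arbitrary choices; the estimates agree with \(\ell !\, v^\ell \) up to sampling error):

import math, random

random.seed(2024)
v, N = 1.8, 400_000
s = math.sqrt(v / 2)                       # zeta = X + iY with X, Y ~ N(0, v/2) i.i.d.
mom = [0.0] * 4
for _ in range(N):
    m2 = random.gauss(0, s)**2 + random.gauss(0, s)**2    # |zeta|^2
    p = 1.0
    for l in range(4):
        p *= m2
        mom[l] += p
for l in range(1, 5):
    print(l, mom[l - 1] / N, math.factorial(l) * v**l)    # estimate vs exact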

Finally, the claim in (1.12) that the variance is non-zero follows from the same claim in (1.14) (where the variance is the same up to a factor \(\sigma ^2 / 2\)), which is shown in [5, Theorem D.1, as corrected in the corrigendum]. \(\square \)

4.3 Joint Distributions

We can extend the arguments above to joint distributions of several \(X_n(\alpha )\) with different imaginary \(\alpha \). Since we have \(X_n(\overline{\alpha })=\overline{X_n(\alpha )}\), it suffices to consider the case \({\text {Im}}\alpha >0\). In this case, different \(X_n(\alpha )\) are asymptotically independent, as is stated more precisely in the following theorem.

Theorem 4.2

For any finite set \(t_1,\dots ,t_r\) of distinct positive numbers, the complex random variables \(\bigl (X_n(\textrm{i}t_k)-\mu (\textrm{i}t_k)n\bigr )/\sqrt{n\log n}\) converge, as \({n\rightarrow \infty }\), jointly in distribution to independent symmetric complex normal variables \(\zeta _{\textrm{i}t_k}\) with variances given by (1.12).

This can be interpreted as joint convergence (in the product topology) of the entire family \(\{X_n(\textrm{i}t):t>0\}\) of random variables, after normalization, to an (uncountable) family of independent symmetric complex normal variables \(\zeta _{\textrm{i}t}\). As noted in Remark 1.8, this behavior is strikingly different from the cases \({\text {Re}}\alpha <0\) and \({\text {Re}}\alpha >0\), where we have joint convergence to analytic random functions of \(\alpha \).

Proof

We argue as above, using the method of moments and singularity analysis of generating functions, with mainly notational differences. We give only a sketch, leaving further details to the reader.

For a sequence of arbitrary non-zero imaginary numbers \(\alpha _1,\dots ,\alpha _\ell \) (allowing repetitions), define the generating function

$$\begin{aligned} M_{\alpha _1,\dots ,\alpha _\ell }(z)&:={\mathbb E{}}\bigl [F_{\alpha _1}(\mathcal {T})\cdots F_{\alpha _\ell }(\mathcal {T}) z^{|\mathcal {T}|}\bigr ] =\sum _{n=1}^\infty q_n {\mathbb E{}}\bigl [F_{\alpha _1}(\mathcal {T}_n)\cdots F_{\alpha _\ell }(\mathcal {T}_n)\bigr ] z^n .\end{aligned}$$
(4.49)

For \(\ell =1\) and \(\ell =2\), these coincide with \(M_{\alpha _1}(z)\) and \(M_{1,1}(z)\), respectively, in the notation used above. The recursion (4.16) extends as follows. We write again

$$\begin{aligned} M_{\alpha _1,\dots ,\alpha _\ell }(z) =\frac{zy'(z)}{y(z)} R_{\alpha _1,\dots ,\alpha _\ell }(z). \end{aligned}$$
(4.50)

Then, by a straightforward extension of the proof of [5, Lemma 12.4], cf. (4.16),

$$\begin{aligned} R_{\alpha _1,\dots ,\alpha _\ell }(z) =\sum _{m=0}^\ell \frac{1}{m!} \sum B_{\alpha _{i_1}}(z)\odot \cdots \odot B_{\alpha _{i_{q}}}(z)\odot \bigl [zM_{A_1}(z)\cdots M_{A_m}(z)\Phi ^{(m)}\bigl (y(z)\bigr )\bigr ] \end{aligned}$$
(4.51)

where we sum over all partitions of \([\ell ]:=\{1,\dots ,\ell \}\) into an ordered sequence of \(m+1\) sets \(I_0,\dots ,I_m\) with \(I_1,\dots ,I_m\) neither empty nor equal to the full set \([\ell ]\) (while \(I_0\) may be empty or equal to \([\ell ]\)), and \(i_j\) are defined by \(I_0=\{i_1,\dots ,i_q\}\) and, for \(1\leqslant j\leqslant m\), \(A_j\) is the sequence \((\alpha _i:i\in I_j)\).

As in Lemma 4.1, it follows by induction that for any sequence \(A=(\alpha _1,\dots ,\alpha _\ell )\) of length \(|A|=\ell \geqslant 2\),

$$\begin{aligned} M_{A}(z)&= \sum _{\beta ,k}\varkappa _{A;\beta ,k} (1-z)^{(1-\ell )/2+\beta } L(z)^k +O\bigl (|1-z|^{\frac{1}{2}(1-\ell )+\frac{\delta }{2}-\varepsilon }\bigr ) \end{aligned}$$
(4.52)
$$\begin{aligned}&= \sum _{\beta ,k}\widehat{\varkappa }_{A;\beta ,k}{\text {Li}}_{(3-\ell )/2+\beta ,k}(z) +O\bigl (|1-z|^{\frac{1}{2}(1-\ell )+\frac{\delta }{2}-\varepsilon }\bigr ) ,\end{aligned}$$
(4.53)

where we sum over \(0\leqslant k\leqslant \ell /2\) and all \(\beta \) such that \(-\beta \) equals the sum of some subsequence of A. (The two expansions are equivalent by Lemma 2.1.) The base case \(\ell =2\) follows from (4.23)–(4.24) by (4.17) and (2.22); note also that (4.52) (but not (4.53)) holds for \(\ell =1\) by (4.10).

The induction then proceeds for \(\ell \geqslant 3\) as in the proof of Lemma 4.1; note that the notation has changed slightly: \(\ell \) here corresponds to \(\ell +r\) there (so r should now be ignored), and \(q=\ell _0\). With these and other notational changes, (4.31)–(4.33) still hold, and as there the only significant contributions in (4.51) come from the cases (1) \(m=1\) and \(q=1\), and (2) \(m=2\) and \(q=0\); it follows as in (4.36)–(4.39) that (4.52) and (4.53) hold for all \(\ell \). Moreover, we see from (4.36) that for the terms with \(m=1\) and \(q=1\), and thus \(|A_1|=\ell -1\), the index (exponent) k is not increased; thus, by induction, these terms only contribute to \(k\leqslant (\ell -1)/2<\ell /2\). Hence, terms with \(k=\ell /2\) come only from the case \(m=2\) and \(q=0\) in (4.51), when the sequence A (regarded as a multiset) is partitioned into two nonempty parts \(A_1\) and \(A_2\); each such partition contributes \(\frac{1}{2}M_{A_1}(z)M_{A_2}(z)\) plus lower-order terms to (4.51).

Furthermore, it follows that contributions to \(\varkappa _{A;\beta ,k}\) with \(k=\ell /2\) come only from \(\varkappa _{A_j;\beta _j,k_j}\) (with \(j=1,2\)) where \(k_j=|A_j|/2\). This is obviously possible only when both \(\ell _j:=|A_j|\) are even, and an induction shows, again using (4.23)–(4.24) for the base case \(\ell =2\), that the contribution is non-zero only if A is balanced in the sense that it can be partitioned into \(\ell /2\) pairs \(\{\alpha _i,-\alpha _i\}\); moreover, we must have \(\beta =0\). We now write \(\varkappa ^*_A:=\varkappa _{A;0,k}\) if A is balanced with \(|A|=2k\). (We let \(\varkappa ^*_A:=0\) if A is not balanced.) For \(|A|\geqslant 4\), we thus obtain the recurrence, from the case \(m=2\) and \(q=0\) in (4.51), and recalling (4.50) and (2.22),

$$\begin{aligned} \varkappa ^*_A = 2^{-3/2}\sigma \sum \varkappa ^*_{A_1}\varkappa ^*_{A_2}, \end{aligned}$$
(4.54)

summing over all partitions of A into two nonempty sets \(A_1\) and \(A_2\) that both are balanced.

It follows by induction from (4.54) that if \(|A|=2k\geqslant 2\), then \(\varkappa ^*_A\) can be written as a sum

$$\begin{aligned} \varkappa ^*_A= \bigl (2^{-3/2}\sigma \bigr )^{k-1} \sum \prod _{j=1}^k\varkappa ^*_{A_j}, \end{aligned}$$
(4.55)

where we sum over full binary trees with k leaves in which each leaf is labelled by a pair \(I_j\) of indices such that \(I_1,\dots ,I_k\) form a partition of [2k], and furthermore the corresponding sets \(A_j\) are balanced, i.e., \(\alpha _i+\alpha _{i'}=0\) if \(I_j=\{i,i'\}\).

Let \(A=(\alpha _1,\dots ,\alpha _{2k})\) consist of the numbers \(\textrm{i}t_j\) and \(-\textrm{i}t_j\) repeated \(k_j\) times each, for \(j=1,\dots ,r\), where \(t_1,\dots ,t_r\) are distinct and positive; thus, \(|A|=2k\) with \(k=\sum _j k_j\). Then there are \(\prod _j k_j!\) ways to partition A into balanced pairs, and for each binary tree with k leaves, these pairs can be assigned to the k leaves in k! ways. Each tree and each assignment of balanced pairs \(A_i\) gives the same contribution to the sum (4.55), and we obtain, since there are \(C_{k-1}=(2k-2)!/(k!(k-1)!)\) full binary trees with k leaves,

$$\begin{aligned} \varkappa ^*_A= \bigl (2^{-3/2}\sigma \bigr )^{k-1} \frac{(2k-2)!}{(k-1)!} \prod _{j=1}^r{\left[ (\varkappa ^*_{\{\pm \textrm{i}t_j\}})^{k_j}k_j! \right] }. \end{aligned}$$
(4.56)
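
The tree count used here, \(C_{k-1}=(2k-2)!/(k!\,(k-1)!)\), can be confirmed by direct enumeration; a short Python sketch:

from functools import lru_cache
from math import factorial

@lru_cache(maxsize=None)
def full_binary(k):
    # number of (ordered) full binary trees with k leaves
    if k == 1:
        return 1
    return sum(full_binary(i) * full_binary(k - i) for i in range(1, k))

for k in range(1, 12):
    catalan = factorial(2*k - 2) // (factorial(k) * factorial(k - 1))
    assert full_binary(k) == catalan       # C_{k-1}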

Let \(\sigma ^2_{\textrm{i}t}\) be the variance of \(\zeta _{\textrm{i}t}\) in (1.12). For the case \(A=\{\textrm{i}t,-\textrm{i}t\}\), Lemma 4.1 applies and we have by (4.27) and (4.29), in the present notation,

$$\begin{aligned} \varkappa ^*_{\{\pm \textrm{i}t\}}=2^{-1/2}\sigma ^{-1}\sigma ^2_{\textrm{i}t}. \end{aligned}$$
(4.57)

Hence, (4.56) yields

$$\begin{aligned} \varkappa ^*_A= 2^{-2k+\frac{3}{2}}\sigma ^{-1} \frac{(2k-2)!}{(k-1)!} \prod _{j=1}^r \left( \sigma _{\textrm{i}t_j}^{2k_j} k_j! \right) . \end{aligned}$$
(4.58)

Since \(\widehat{\varkappa }_{A;0,k}=\Gamma (k-\frac{1}{2})^{-1}\varkappa ^*_A\), we finally obtain from (4.53), using (2.24), that

$$\begin{aligned} (n\log n)^{-k} {\mathbb E{}}\bigl [F_{\alpha _1}(\mathcal {T}_n)\cdots F_{\alpha _{2k}}(\mathcal {T}_n)\bigr ]&\rightarrow 2^{-2k+2}\sqrt{\pi }\frac{(2k-2)!}{\Gamma (k-\frac{1}{2})(k-1)!} \prod _{j=1}^r \left( \sigma _{\textrm{i}t_j}^{2k_j} k_j! \right) \nonumber \\ {}&=\prod _{j=1}^r \left( \sigma _{\textrm{i}t_j}^{2k_j}k_j! \right) , \end{aligned}$$
(4.59)

which equals the corresponding mixed moment \({\mathbb E{}}\bigl (\zeta _{\alpha _1}\cdots \zeta _{\alpha _{2k}}\bigr ) =\prod _j{\mathbb E{}}|\zeta _{\textrm{i}t_j}|^{2k_j}\), see (2.1). Similarly, all mixed moments with unbalanced indices converge after normalization to 0. Hence, the result follows by the method of moments. \(\square \)

Note that the combinatorial argument in the final part of the proof (restricted to the case \(r=1\)) yields an alternative proof that the recursion (4.41) is solved by (4.42)–(4.43). Conversely, the argument above, without the detailed counting of possibilities, shows that the left-hand side of (4.59) converges to \(c_k\) times the right-hand side, for some combinatorial constant \(c_k\) not depending on \(k_1,\dots ,k_r\). Since (4.47) shows that the formula is correct for \(r=1\), we must have \(c_k=1\), and thus, (4.59) holds.

5 Negative Real Part

In this section, we consider the case that \(\alpha \) in (1.1) has negative real part. Applying the same approach as in previous sections, we prove convergence of all moments for the normalized random variable. As before, we assume throughout the section that (1.7) holds with \(0<\delta <1\). Again, we set

$$\begin{aligned} b_n:=n^\alpha -\mu (\alpha ), \end{aligned}$$
(5.1)

with the generating function

$$\begin{aligned} B(z)&= B_\alpha (z):=\sum _{n=1}^\infty b_nz^n ={\text {Li}}_{-\alpha }(z)-\mu (\alpha ){\text {Li}}_0(z). \end{aligned}$$
(5.2)

In contrast to Sect. 4, the term \(\mu (\alpha ){\text {Li}}_0(z) = \mu (\alpha )z(1-z)^{-1}\) now dominates. For later convenience, we let \(\eta := \min (-{\text {Re}}\alpha ,\,\delta /2)\), and note that \(0<\eta <\frac{1}{2}\) (assuming again \(\delta <1\) as we may). Then (2.7) implies

$$\begin{aligned} B(z) = -\mu (\alpha ) (1-z)^{-1} + O\bigl (|1-z|^{-1+\eta }\bigr ). \end{aligned}$$
(5.3)

This holds even for \(\alpha \in \{-1,-2,\dots \}\), where logarithmic terms occur in the asymptotic expansion of \({\text {Li}}_{-\alpha }\); these terms are absorbed by the error term since \(\eta < \frac{1}{2}\).

Once again, we let \(F(T)=F_\alpha (T)\) denote the additive functional defined by the toll function \(f_\alpha (T):=b_{|T|}\), so that

$$\begin{aligned} F_\alpha (\mathcal {T}_n)=X_n(\alpha )-n\mu (\alpha ). \end{aligned}$$
(5.4)

5.1 The Mean

We use the same notation for the generating function of the mean as in Sect. 4, i.e.,

$$\begin{aligned} M_\alpha (z):={\mathbb E{}}\bigl [F_\alpha (\mathcal {T}) z^{|\mathcal {T}|}\bigr ] =\sum _{n=1}^\infty q_n {\mathbb E{}}[F_\alpha (\mathcal {T}_n)] z^n, \end{aligned}$$
(5.5)

and note that (4.7) still holds:

$$\begin{aligned} M_\alpha (z) =\frac{zy'(z)}{y(z)}\cdot \bigl ( B_\alpha (z)\odot y(z)\bigr ). \end{aligned}$$
(5.6)

Thus, \(M_\alpha (z)\) is still \(\Delta \)-analytic. In analogy with (4.8), we now have

$$\begin{aligned} B_\alpha (z) \odot y(z )&= \frac{1}{\sqrt{2\pi }\sigma }{\text {Li}}_{3/2-\alpha }(z) -\frac{\mu (\alpha )}{\sqrt{2\pi }\sigma }{\text {Li}}_{3/2}(z) +c_1+O\bigl (|1-z|^{\frac{1}{2}+\frac{\delta }{2}}\bigr ) \nonumber \\ {}&= 2^{1/2}\sigma ^{-1}\mu (\alpha )(1-z)^{1/2}+c_{2}+O\bigl (|1-z|^{\frac{1}{2}+\eta }\bigr ). \end{aligned}$$
(5.7)

Moreover, (4.9) still holds, so \(c_{2} = 0\). Combining this with (2.22) now yields

$$\begin{aligned} M_\alpha (z) = \sigma ^{-2}\mu (\alpha ) + O\bigl (|1-z|^{\eta }\bigr ). \end{aligned}$$
(5.8)

Applying singularity analysis and (2.24), we find that

$$\begin{aligned} {\mathbb E{}}[F_\alpha (\mathcal {T}_n)] = O\bigl (n^{\frac{1}{2}-\eta }\bigr ) \end{aligned}$$
(5.9)

or equivalently

$$\begin{aligned} {\mathbb E{}}X_n(\alpha )=\mu (\alpha )n + O\bigl (n^{\frac{1}{2}-\eta }\bigr ). \end{aligned}$$
(5.10)

5.2 Higher Moments

As in Sect. 4.2, we consider the mixed moments of \(F_{\alpha _1}(\mathcal {T}_n)\) and \(F_{\alpha _2}(\mathcal {T}_n)\) for two complex numbers \(\alpha _1\) and \(\alpha _2\) that are now both assumed to have negative real part. In particular, this includes the special case that \(\alpha _2 = \overline{\alpha }_1\). We are, thus, interested in the generating function

$$\begin{aligned} M_{\ell _1,\ell _2}(z)&:={\mathbb E{}}\bigl [F_{\alpha _1}(\mathcal {T})^{\ell _1}F_{\alpha _2}(\mathcal {T})^{\ell _2} z^{|\mathcal {T}|}\bigr ] \end{aligned}$$
(5.11)

for integers \(\ell _1,\ell _2 \geqslant 0\), cf. (4.15). In particular, we have \(M_{1,0}=M_{\alpha _1}\) and \(M_{0,1}=M_{\alpha _2}\). Set \(\eta := \min (-{\text {Re}}\alpha _1,-{\text {Re}}\alpha _2,\,\delta /2)\) (again noting that \(\eta < \frac{1}{2}\)). Then by (5.8) we have

$$\begin{aligned} M_{1,0}(z) = \sigma ^{-2}\mu (\alpha _1) + O\bigl (|1-z|^{\eta }\bigr ) \text { and } M_{0,1}(z) = \sigma ^{-2}\mu (\alpha _2) + O\bigl (|1-z|^{\eta }\bigr ). \end{aligned}$$
(5.12)

In order to deal with higher moments, we make use of the recursion (4.16). Let us start with second-order moments: here, we obtain

$$\begin{aligned} M_{1,1}(z)&= \frac{z y'(z)}{y(z)} \left[ B_{\alpha _1}(z)\odot B_{\alpha _2}(z)\odot y(z) + B_{\alpha _1}(z)\odot (zM_{0,1}(z)\Phi '(y(z))) \right. \nonumber \\ {}&\qquad \left. + B_{\alpha _2}(z)\odot (zM_{1,0}(z)\Phi '(y(z))) + zM_{1,0}(z)M_{0,1}(z)\Phi ''(y(z)) \right] . \end{aligned}$$
(5.13)

In view of (5.8), (2.20), (2.27), and (2.28), the functions y(z), \(zM_{0,1}(z)\Phi '\bigl (y(z)\bigr )\), \(zM_{1,0}(z)\Phi '\bigl (y(z)\bigr )\), and \(z M_{1,0}(z)M_{0,1}(z) \Phi ''\bigl (y(z)\bigr )\) are all of the form \(c + O\bigl (|1-z|^{\eta }\bigr )\), and taking the Hadamard product with \(B_{\alpha _1}(z)\) or \(B_{\alpha _2}(z)\) does not change this property. Combining this with (2.22) we conclude that there is a constant \(\varkappa _{1,1}\) such that

$$\begin{aligned} M_{1,1}(z) = 2^{-1/2}\sigma ^{-1}\varkappa _{1,1} (1-z)^{-1/2} + O\bigl (|1-z|^{-\frac{1}{2}+\eta }\bigr ), \end{aligned}$$
(5.14)

which implies by virtue of singularity analysis and (2.24) that

$$\begin{aligned} {\mathbb E{}}[F_{\alpha _1}(\mathcal {T}_n)F_{\alpha _2}(\mathcal {T}_n)] = \varkappa _{1,1}\,n + O\bigl (n^{1 - \eta }\bigr ). \end{aligned}$$
(5.15)

We can obtain the functions \(M_{2,0}(z)\) and \(M_{0,2}(z)\) as special cases of \(M_{1,1}(z)\) where \(\alpha _1 = \alpha _2\). Hence there are also constants \(\varkappa _{2,0}\) and \(\varkappa _{0,2}\) such that

$$\begin{aligned} M_{2,0}(z) = 2^{-1/2}\sigma ^{-1}\varkappa _{2,0} (1-z)^{-1/2} + O\bigl (|1-z|^{-\frac{1}{2}+\eta }\bigr ) \end{aligned}$$
(5.16)

and

$$\begin{aligned} M_{0,2}(z) = 2^{-1/2}\sigma ^{-1}\varkappa _{0,2} (1-z)^{-1/2} + O\bigl (|1-z|^{-\frac{1}{2}+\eta }\bigr ), \end{aligned}$$
(5.17)

and, thus,

$$\begin{aligned} {\mathbb E{}}[F_{\alpha _1}(\mathcal {T}_n)^2] = \varkappa _{2,0}\,n + O\bigl (n^{1 - \eta }\bigr ) \text { and } {\mathbb E{}}[F_{\alpha _2}(\mathcal {T}_n)^2] = \varkappa _{0,2}\,n + O\bigl (n^{1 - \eta }\bigr ). \end{aligned}$$
(5.18)

We will use these as the base case of an inductive proof of the following lemma.

Lemma 5.1

Suppose that \({\text {Re}}\alpha _1 < 0\) and \({\text {Re}}\alpha _2 < 0\), and let

$$\begin{aligned} \eta = \min (-{\text {Re}}\alpha _1,-{\text {Re}}\alpha _2,\,\delta /2) \end{aligned}$$
(5.19)

be as above. Then, for all non-negative integers \(\ell \) and r with \(s = \ell +r \geqslant 1\), the function \(M_{\ell ,r}(z)\) is \(\Delta \)-analytic and we have

$$\begin{aligned} M_{\ell ,r}(z) = \widehat{\varkappa }_{\ell ,r} (1-z)^{(1-s)/2} + O\bigl (|1-z|^{(1-s)/2+\eta }\bigr ), \end{aligned}$$
(5.20)

where \(\widehat{\varkappa }_{1,0}=\sigma ^{-2}\mu (\alpha _1)\), \(\widehat{\varkappa }_{0,1}=\sigma ^{-2}\mu (\alpha _2)\), and, for \(s\geqslant 2\),

$$\begin{aligned} \widehat{\varkappa }_{\ell ,r} = \frac{(s-3)!!}{\sigma 2^{(s-1)/2}} \sum _{\begin{array}{c} j=0 \\ j \equiv \ell \bmod 2 \end{array}}^{\ell \wedge r} \left( {\begin{array}{c}\ell \\ j\end{array}}\right) \left( {\begin{array}{c}r\\ j\end{array}}\right) j!\, (\ell -j-1)!!\,(r-j-1)!!\,\varkappa _{1,1}^{j} \varkappa _{2,0}^{(\ell -j)/2}\varkappa _{0,2}^{(r-j)/2} \end{aligned}$$
(5.21)

if s is even, and \(\widehat{\varkappa }_{\ell ,r} = 0\) otherwise.

Proof

We prove the statement by induction on \(s = \ell + r\). Note that (5.12) as well as (5.14), (5.16), and (5.17) are precisely the cases \(s=1\) and \(s = 2\), respectively.

For the induction step, we take \(s \geqslant 3\) and use recursion (4.16). It follows immediately from this recursion that all \(M_{\ell ,r}\) are \(\Delta \)-analytic, so we focus on the asymptotic behavior at 1. Let us first consider the product

$$\begin{aligned} zM_{\ell _1,r_1}(z)\cdots M_{\ell _m,r_m}(z)\Phi ^{(m)}\bigl (y(z)\bigr ), \end{aligned}$$
(5.22)

where all \(\ell _i\) and \(r_i\) are non-negative integers, \(1 \leqslant \ell _i + r_i < s\) for every \(i \geqslant 1\), \(\ell _0 + \ell _1 + \cdots + \ell _m = \ell \), and \(r_0 + r_1 + \cdots + r_m = r\). By the induction hypothesis, \(M_{\ell _i,r_i}(z) = O\bigl (|1-z|^{(1-\ell _i-r_i)/2}\bigr )\) for all \(i \geqslant 1\), which can be improved to \(M_{\ell _i,r_i}(z) = O\bigl (|1-z|^{(1-\ell _i-r_i)/2 + \eta }\bigr )\) if \(\ell _i + r_i\) is odd and greater than 1. Combining with (2.29), we obtain

$$\begin{aligned} zM_{\ell _1,r_1}(z)\cdots M_{\ell _m,r_m}(z)\Phi ^{(m)}\bigl (y(z)\bigr )&= O\bigl (|1-z|^{(m-\ell _1 - \cdots - \ell _m-r_1-\cdots -r_m)/2 + \frac{\delta }{2}+ 1 - m/2}\bigr )\nonumber \\&= O\bigl (|1-z|^{(\ell _0 + r_0 - \ell - r)/2 + 1 + \eta }\bigr ) \end{aligned}$$
(5.23)

for \(m \geqslant 3\). By (5.3) and repeated use of Lemma 2.3, this estimate continues to hold after taking the Hadamard product with \(B_{\alpha _1}(z)^{\odot \ell _0} \odot B_{\alpha _2}(z)^{\odot r_0}\), and the factor \(\frac{z y'(z)}{y(z)}\) in (4.16) contributes \(-\frac{1}{2}\) to the exponent by (2.22). Since \(\ell _0\) and \(r_0\) are non-negative, it follows that the total contribution of all terms with \(m \geqslant 3\) is \(O\bigl (|1-z|^{(1- s)/2 + \eta }\bigr )\) and, thus, negligible. We can, therefore, focus on the cases \(m = 0\), \(m=1\), and \(m=2\). Here, \(\Phi ^{(m)}\bigl (y(z)\bigr )\) is O(1) in all cases by (2.26)–(2.28), and we obtain

$$\begin{aligned} zM_{\ell _1,r_1}(z)\cdots M_{\ell _m,r_m}(z)\Phi ^{(m)}\bigl (y(z)\bigr )&= O\bigl (|1-z|^{(m-\ell _1 - \cdots - \ell _m-r_1-\cdots -r_m)/2}\bigr ) \nonumber \\&= O\bigl (|1-z|^{(m + \ell _0 + r_0 - \ell - r)/2}\bigr ). \end{aligned}$$
(5.24)

Terms with \(m + \ell _0 + r_0 \geqslant 3\) are negligible for the same reason as before. Likewise, terms with \(m + \ell _0 + r_0 = 2\) are negligible if at least one of the sums \(\ell _i + r_i\) with \(i \geqslant 1\) is odd and greater than 1, as we can then improve the bound on \(M_{\ell _i,r_i}(z)\). Let us determine all remaining possibilities:

  • \(m = 0\) implies \(m + \ell _0 + r_0 = \ell + r = s \geqslant 3\), so we have already accounted for this negligible case.

  • \(m = 1\) gives us \(\ell _0 + \ell _1 = \ell \) and \(r_0 + r_1 = r\) with \(1 \leqslant \ell _1 + r_1 < \ell + r\), thus, \(\ell _0 + r_0 \geqslant 1\). So we have \((\ell _0,\ell _1,r_0,r_1) = (1,\ell -1,0,r)\) and \((\ell _0,\ell _1,r_0,r_1) = (0,\ell ,1,r-1)\) as the only two relevant possibilities in this case.

  • Finally, if \(m = 2\), we must have \(\ell _0 = r_0 = 0\) and \(\ell _1 + \ell _2 = \ell \) and \(r_1 + r_2 = r\).

Now we divide the argument into two subcases, according as \(s = \ell + r\) is even or odd.

Odd \(s \geqslant 3\). If \(m = 2\), \(\ell _0 = r_0 = 0\), and \(\ell _1 + \ell _2 + r_1 + r_2 = \ell + r = s\), then either \(\ell _1 + r_1\) or \(\ell _2 + r_2\) is odd. Thus, the corresponding term is asymptotically negligible unless \(\ell _1 + r_1 = 1\) or \(\ell _2 + r_2 = 1\). So in this case, there are only four terms that might be asymptotically relevant:

$$\begin{aligned} (\ell _1,\ell _2,r_1,r_2) \in \{(1,\ell -1,0,r), (\ell -1,1,r,0), (0,\ell ,1,r-1), (\ell ,0,r-1,1)\}. \end{aligned}$$
(5.25)

In addition, \(m = 1\) contributes with two terms as mentioned above. Thus, we obtain

$$\begin{aligned} M_{\ell ,r}(z)&= \frac{z y'(z)}{y(z)} \left[ \ell B_{\alpha _1}(z) \odot \left( zM_{\ell -1,r}(z) \Phi '\bigl (y(z)\bigr )\right) + r B_{\alpha _2}(z) \odot \left( zM_{\ell ,r-1}(z) \Phi '\bigl (y(z)\bigr )\right) \right. \nonumber \\&\qquad \left. + \ell z M_{1,0}(z) M_{\ell -1,r}(z) \Phi ''\bigl (y(z)\bigr ) + r z M_{0,1}(z) M_{\ell ,r-1}(z) \Phi ''\bigl (y(z)\bigr ) \right] \nonumber \\&\quad + O\bigl (|1-z|^{(1-s)/2+\eta }\bigr ). \end{aligned}$$
(5.26)

By the induction hypothesis, \(M_{\ell -1,r}(z) = \widehat{\varkappa }_{\ell -1,r} (1-z)^{1-\frac{s}{2}} + O\bigl (|1-z|^{1-\frac{s}{2}+\eta }\bigr )\). Consequently, using (5.8), (2.27), and (2.28), we get

$$\begin{aligned} zM_{\ell -1,r}(z) \Phi '\bigl (y(z)\bigr )&= \widehat{\varkappa }_{\ell -1,r} (1-z)^{1-\frac{s}{2}} + O\bigl (|1-z|^{1-\frac{s}{2}+\eta }\bigr ), \end{aligned}$$
(5.27)
$$\begin{aligned} z M_{1,0}(z) M_{\ell -1,r}(z) \Phi ''\bigl (y(z)\bigr )&= \mu (\alpha _1) \widehat{\varkappa }_{\ell -1,r} (1-z)^{1-\frac{s}{2}} + O\bigl (|1-z|^{1-\frac{s}{2}+\eta }\bigr ). \end{aligned}$$
(5.28)

Recall from (5.2) that \(B_{\alpha _1}(z) = {\text {Li}}_{-\alpha _1}(z)-\mu (\alpha _1){\text {Li}}_0(z)\). Applying the Hadamard product gives us, using (2.7), (2.13), and Lemma 2.3,

$$\begin{aligned} B_{\alpha _1}(z) \odot \left( zM_{\ell -1,r}(z) \Phi '\bigl (y(z)\bigr )\right) = -\mu (\alpha _1) \widehat{\varkappa }_{\ell -1,r} (1-z)^{1-\frac{s}{2}} + O\bigl (|1-z|^{1-\frac{s}{2}+\eta }\bigr ), \end{aligned}$$
(5.29)

so the first and third terms in (5.26) effectively cancel, and the same argument applies to the second and fourth terms. Hence we have proven the desired statement in the case that s is odd.

Even \(s \geqslant 4\). In this case, we can neglect the terms with \(m = 1\) and \(\ell _1 + r_1 = \ell + r -1 = s-1\), since \(s-1\) is odd and greater than 1. Thus, only terms with \(m = 2\) and \(\ell _0 = r_0 = 0\) matter. For the same reason, we can ignore all terms where \(\ell _1+r_1\) and \(\ell _2+r_2\) are odd: at least one of them has to be greater than 1, making all such terms asymptotically negligible. Hence we obtain

$$\begin{aligned} M_{\ell ,r}(z)&= \frac{z y'(z)}{y(z)} \cdot \frac{1}{2} \sum _{\begin{array}{c} \ell _1,\ell _2,r_1,r_2 \\ \ell _1+\ell _2 = \ell ,\,r_1+r_2 = r \\ \ell _i + r_i \text { even and }> 0 \end{array}} \left( {\begin{array}{c}\ell \\ \ell _1\end{array}}\right) \left( {\begin{array}{c}r\\ r_1\end{array}}\right) z M_{\ell _1,r_1}(z) M_{\ell _2,r_2}(z) \Phi ''\bigl (y(z)\bigr ) \nonumber \\&\qquad + O\bigl (|1-z|^{(1-s)/2+\eta }\bigr ). \end{aligned}$$
(5.30)

Let us write \(\mathop {\mathrm {\sum \nolimits ^{\circ }}}\limits \) for the sum in (5.30). Plugging in (2.22), (2.28), and the induction hypothesis, we obtain

$$\begin{aligned} M_{\ell ,r}(z) = 2^{-3/2} \sigma \mathop {\mathrm {\sum \nolimits ^{\circ }}}\limits \left( {\begin{array}{c}\ell \\ \ell _1\end{array}}\right) \left( {\begin{array}{c}r\\ r_1\end{array}}\right) \widehat{\varkappa }_{\ell _1,r_1}\widehat{\varkappa }_{\ell _2,r_2} (1-z)^{(1-s)/2} + O\bigl (|1-z|^{(1-s)/2+\eta }\bigr ). \end{aligned}$$
(5.31)

Thus, we have completed the induction for (5.20) with

$$\begin{aligned} \widehat{\varkappa }_{\ell ,r} = 2^{-3/2} \sigma \mathop {\mathrm {\sum \nolimits ^{\circ }}}\limits \left( {\begin{array}{c}\ell \\ \ell _1\end{array}}\right) \left( {\begin{array}{c}r\\ r_1\end{array}}\right) \widehat{\varkappa }_{\ell _1,r_1}\widehat{\varkappa }_{\ell _2,r_2}. \end{aligned}$$
(5.32)

In order to verify the formula (5.21) for \(\widehat{\varkappa }_{\ell ,r}\) given in the statement of the lemma, in light of (5.14), (5.16), and (5.17) we need only show that \(\widehat{\varkappa }_{\ell ,r}\) as defined in (5.21) satisfies the recursion (5.32). This is easy to achieve by means of generating functions, as follows. Set

$$\begin{aligned} K(x,y)&:= \sum _{\begin{array}{c} s \geqslant 2 \\ s \text { even} \end{array}} \sum _{\ell +r = s} \widehat{\varkappa }_{\ell ,r} \frac{x^{\ell }}{\ell !} \frac{y^r}{r!} \nonumber \\&= \sum _{\begin{array}{c} s \geqslant 2 \\ s \text { even} \end{array}} \sum _{\ell +r = s} \frac{(s-3)!!}{\sigma 2^{(s-1)/2}} \sum _{\begin{array}{c} j=0 \\ j \equiv \ell \bmod 2 \end{array}}^{\ell \wedge r} \left( {\begin{array}{c}\ell \\ j\end{array}}\right) \left( {\begin{array}{c}r\\ j\end{array}}\right) j!\, (\ell -j-1)!!\,(r-j-1)!! \nonumber \\&\qquad \qquad \qquad \cdot \varkappa _{1,1}^{j} \varkappa _{2,0}^{(\ell -j)/2}\varkappa _{0,2}^{(r-j)/2} \frac{x^{\ell }}{\ell !} \frac{y^r}{r!}. \end{aligned}$$
(5.33)

Setting \(\ell -j = 2a\) and \(r-j = 2b\), this can be rewritten as

$$\begin{aligned} K(x,y)&= \sum _{\begin{array}{c} s \geqslant 2 \\ s \text { even} \end{array}} \frac{(s-3)!!}{\sigma 2^{(s-1)/2}} \sum _{\begin{array}{c} a,b,j \geqslant 0: \\ a+b+j = s/2 \end{array}} \frac{\varkappa _{1,1}^{j} \varkappa _{2,0}^{a}\varkappa _{0,2}^{b} x^{j+2a} y^{j+2b}}{j!\,a!\,b!\,2^{a+b}} \nonumber \\&= \sum _{\begin{array}{c} s \geqslant 2 \\ s \text { even} \end{array}} \frac{(s-3)!!}{\sigma 2^{(s-1)/2}(s/2)!} \left( \frac{\varkappa _{2,0}\,x^2}{2} + \varkappa _{1,1}\,xy + \frac{\varkappa _{0,2}\,y^2}{2} \right) ^{s/2} \nonumber \\&= \frac{\sqrt{2}}{\sigma } \sum _{t \geqslant 1} \frac{(2t-3)!!}{t!\, 2^{2t}} \left( \varkappa _{2,0}\,x^2 + 2\varkappa _{1,1}\,xy + \varkappa _{0,2}\,y^2 \right) ^t \nonumber \\&= \frac{\sqrt{2}}{\sigma } - \frac{1}{\sigma } \sqrt{2 - \left( \varkappa _{2,0}\,x^2 + 2\varkappa _{1,1}\,xy + \varkappa _{0,2}\,y^2 \right) }. \end{aligned}$$
(5.34)

The recursion (5.32) now follows by comparing coefficients of \(x^{\ell }y^r\) in the identity

$$\begin{aligned} K(x,y) = 2^{-3/2} \sigma K(x,y)^2 + \frac{\varkappa _{2,0}\,x^2 + 2\varkappa _{1,1}\,xy + \varkappa _{0,2}\,y^2}{2^{3/2} \sigma }. \end{aligned}$$
(5.35)

This completes the proof of the lemma. \(\square \)
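
The coefficient comparison in (5.35) can also be carried out mechanically: the following Python sketch verifies, for small \(\ell \) and r and arbitrary test values of \(\varkappa _{2,0}\), \(\varkappa _{1,1}\), \(\varkappa _{0,2}\), and \(\sigma \), that the closed form (5.21) satisfies the recursion (5.32):

from math import comb, factorial

k20, k11, k02, sigma = 0.9, 0.3, 1.7, 1.1      # arbitrary test values

def dfac(n):                                    # double factorial, with (-1)!! = 1
    return 1 if n <= 0 else n * dfac(n - 2)

def khat(l, r):                                 # closed form (5.21); zero for odd l + r
    s = l + r
    if s % 2 or s < 2:
        return 0.0
    tot = sum(comb(l, j) * comb(r, j) * factorial(j)
              * dfac(l - j - 1) * dfac(r - j - 1)
              * k11**j * k20**((l - j) // 2) * k02**((r - j) // 2)
              for j in range(l % 2, min(l, r) + 1, 2))
    return dfac(s - 3) / (sigma * 2**((s - 1) / 2)) * tot

def rhs(l, r):                                  # right-hand side of (5.32)
    tot = 0.0
    for l1 in range(l + 1):
        for r1 in range(r + 1):
            s1 = l1 + r1
            if s1 % 2 or s1 == 0 or s1 == l + r:
                continue                        # need l_i + r_i even and > 0
            tot += comb(l, l1) * comb(r, r1) * khat(l1, r1) * khat(l - l1, r - r1)
    return 2**-1.5 * sigma * tot

for l in range(6):
    for r in range(6):
        if (l + r) % 2 == 0 and l + r >= 4:
            assert abs(khat(l, r) - rhs(l, r)) < 1e-9 * max(1.0, abs(khat(l, r)))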

So the functions \(M_{\ell ,r}(z)\) are amenable to singularity analysis, and we obtain the following theorem as an immediate application.

Theorem 5.2

Suppose that \({\text {Re}}\alpha _1 < 0\) and \({\text {Re}}\alpha _2 < 0\). Then there exist constants \(\varkappa _{2,0}\), \(\varkappa _{1,1}\), and \(\varkappa _{0,2}\) such that, for all non-negative integers \(\ell \) and r,

$$\begin{aligned}&\frac{{\mathbb E{}}[F_{\alpha _1}(\mathcal {T}_n)^{\ell }F_{\alpha _2}(\mathcal {T}_n)^{r}]}{n^{(\ell +r)/2}} \nonumber \\&\qquad \rightarrow \sum _{\begin{array}{c} j=0 \\ j \equiv \ell \bmod 2 \end{array}}^{\ell \wedge r} \left( {\begin{array}{c}\ell \\ j\end{array}}\right) \left( {\begin{array}{c}r\\ j\end{array}}\right) j!\, (\ell -j-1)!!\,(r-j-1)!!\, \varkappa _{1,1}^{j} \varkappa _{2,0}^{(\ell -j)/2}\varkappa _{0,2}^{(r-j)/2} \end{aligned}$$
(5.36)

as \(n \rightarrow \infty \) if \(\ell +r\) is even, and \(\frac{{\mathbb E{}}[F_{\alpha _1}(\mathcal {T}_n)^{\ell }F_{\alpha _2}(\mathcal {T}_n)^{r}]}{n^{(\ell +r)/2}} \rightarrow 0\) otherwise.

Proof

In view of Lemma 5.1, singularity analysis gives us

$$\begin{aligned}{}[z^n] M_{\ell ,r}(z) = \frac{\widehat{\varkappa }_{\ell ,r}}{\Gamma ((s-1)/2)} n^{(s-3)/2} + O\bigl (n^{(s-3)/2-\eta }\bigr ) \end{aligned}$$
(5.37)

for \(s = \ell + r \geqslant 2\), so, using (2.24),

$$\begin{aligned} {\mathbb E{}}[F_{\alpha _1}(\mathcal {T}_n)^{\ell }F_{\alpha _2}(\mathcal {T}_n)^{r}] = \frac{[z^n] M_{\ell ,r}(z)}{q_n} = \frac{\sqrt{2\pi }\sigma \widehat{\varkappa }_{\ell ,r}}{\Gamma ((s-1)/2)} n^{s/2} + O\bigl (n^{s/2-\eta }\bigr ). \end{aligned}$$
(5.38)

Since \(\Gamma ((s-1)/2) =2^{1-(s/2)}\sqrt{\pi }(s-3)!!\) for even s (recall (2.2)), the statement follows immediately from the formula for \(\widehat{\varkappa }_{\ell ,r}\) in Lemma 5.1 for all \(s \geqslant 2\) and from (5.9) for \(s = 1\). \(\square \)
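
The evaluation \(\Gamma ((s-1)/2) =2^{1-(s/2)}\sqrt{\pi }(s-3)!!\) used in this last step is easily checked numerically; a small Python sketch:

import math

def dfac(n):                      # double factorial, with (-1)!! = 1
    return 1 if n <= 0 else n * dfac(n - 2)

for s in range(2, 22, 2):         # even s
    lhs = math.gamma((s - 1) / 2)
    rhs = 2**(1 - s / 2) * math.sqrt(math.pi) * dfac(s - 3)
    assert abs(lhs - rhs) < 1e-9 * rhs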

The following lemma will be used in the proof of Theorem 1.5 to establish that the limiting variance is positive. Recall the notation (1.1) and \(q_k = {\mathbb P{}}(|\mathcal {T}| = k)\).

Lemma 5.3

Consider any complex \(\alpha \) with \({\text {Re}}\alpha \ne 0\). Then there exists k such that \(q_k > 0\) and \(F_{\alpha }(\mathcal {T}_k)\) is not deterministic.

Proof

We know that \(p_0 > 0\) and that \(p_j > 0\) for some \(j \geqslant 2\). Fix such a value j. Let \(k = 3 j + 1 \geqslant 7\). Consider two realizations of the random tree \(\mathcal {T}_k\), each of which has positive probability. Tree 1 has j children of the root, and precisely two of those j children have j children each; the other \(j - 2\) have no children. Tree 2 also has j children of the root; precisely one of those j children (call it child 1) has j children, while the other \(j - 1\) have no children; precisely one of the children of child 1 has j children, while the others have no children.

Then the values of \(F_{\alpha }\) for Tree 1 and Tree 2 are, respectively,

$$\begin{aligned} 3 j - 2 + 2 (j + 1)^{\alpha } + (3 j + 1)^{\alpha } \end{aligned}$$
(5.39)

and

$$\begin{aligned} 3 j - 2 + (j + 1)^{\alpha } + (2 j + 1)^{\alpha } + (3 j + 1)^{\alpha }. \end{aligned}$$
(5.40)

These values cannot be equal, because otherwise we would have \((j + 1)^{\alpha } = (2 j + 1)^{\alpha }\); but since \({\text {Re}}\alpha \ne 0\), these two numbers have unequal absolute values \((j+1)^{{\text {Re}}\alpha } \ne (2j+1)^{{\text {Re}}\alpha }\). \(\square \)
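
For illustration, the values (5.39) and (5.40) can be compared numerically; in the following Python sketch, j and \(\alpha \) are arbitrary test choices with \({\text {Re}}\alpha \ne 0\):

j, alpha = 2, complex(-0.5, 3.0)       # test values; Re(alpha) != 0
tree1 = 3*j - 2 + 2*(j + 1)**alpha + (3*j + 1)**alpha                    # (5.39)
tree2 = 3*j - 2 + (j + 1)**alpha + (2*j + 1)**alpha + (3*j + 1)**alpha   # (5.40)
print(abs((j + 1)**alpha), abs((2*j + 1)**alpha))   # (j+1)^{Re a} vs (2j+1)^{Re a}
print(tree1 != tree2)                               # True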

Proof of Theorem 1.5

 The limit in (5.36) equals the mixed moment \({\mathbb E{}}\bigl [\zeta _1^\ell \zeta _2^r\bigr ]\), where \(\zeta _1\) and \(\zeta _2\) have a joint complex normal distribution and \({\mathbb E{}}\zeta _1^2=\varkappa _{2,0}\), \({\mathbb E{}}\zeta _1\zeta _2=\varkappa _{1,1}\), and \({\mathbb E{}}\zeta _2^2=\varkappa _{0,2}\); this follows by Wick’s theorem [11, Theorem 1.28 or Theorem 1.36] by noting that the factor \(\left( {\begin{array}{c}\ell \\ j\end{array}}\right) \left( {\begin{array}{c}r\\ j\end{array}}\right) j!\, (\ell -j-1)!!\,(r-j-1)!!\) in (5.36) is the number of perfect matchings of \(\ell \) (labelled) copies of \(\zeta _1\) and r copies of \(\zeta _2\) such that there are j pairs \((\zeta _1,\zeta _2)\).
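
The matching count invoked here is easy to confirm by exhaustive enumeration for small \(\ell \) and r; a brute-force Python sketch:

from collections import Counter
from math import comb, factorial

def dfac(n):                           # double factorial, with (-1)!! = 1
    return 1 if n <= 0 else n * dfac(n - 2)

def matchings(idx):
    # enumerate all perfect matchings of the list idx
    if not idx:
        yield []
        return
    first, rest = idx[0], idx[1:]
    for i in range(len(rest)):
        for m in matchings(rest[:i] + rest[i + 1:]):
            yield [(first, rest[i])] + m

for l in range(5):
    for r in range(5):
        if (l + r) % 2:
            continue
        types = ['z1'] * l + ['z2'] * r
        counts = Counter(sum(types[a] != types[b] for a, b in m)
                         for m in matchings(list(range(l + r))))
        for j in range(l % 2, min(l, r) + 1, 2):
            predicted = (comb(l, j) * comb(r, j) * factorial(j)
                         * dfac(l - j - 1) * dfac(r - j - 1))
            assert counts.get(j, 0) == predicted   # j mixed (zeta_1, zeta_2) pairs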

Hence, Theorem 1.5, except for the assertion of positive variance addressed next, follows by the method of moments, taking \(\alpha _1:=\alpha \) and \(\alpha _2:=\overline{\alpha }\), cf. Remark 1.6.

We already know from Theorem 5.2 that \({\text {Var}}F_{\alpha }(\mathcal {T}_n) = \gamma ^2\,n + o(n)\) for some \(\gamma \geqslant 0\); we need only show that \(\gamma > 0\). Fix k as in Lemma 5.3. Write \(v_k > 0\) for the variance of \(F_{\alpha }(\mathcal {T}_k)\). Let \(N_{n, k}\) denote the number of fringe subtrees of size k in \(\mathcal {T}_n\). It follows from [14, Theorem 1.5(i)] that

$$\begin{aligned} {\mathbb E{}}N_{n, k} \sim q_k n \end{aligned}$$
(5.41)

as \(n \rightarrow \infty \). If for \(\mathcal {T}_n\) we condition on \(N_{n, k} = m\) and all of \(\mathcal {T}_n\) except for fringe subtrees of size k, then the conditional variance of \(F_{\alpha }(\mathcal {T}_n)\) is the variance of the sum of m independent copies of \(F_{\alpha }(\mathcal {T}_k)\), namely, \(m v_k\). Thus,

$$\begin{aligned} {\text {Var}}F_{\alpha }(\mathcal {T}_n) \geqslant v_k {\mathbb E{}}N_{n, k} \geqslant (1 + o(1)) v_k q_k n, \end{aligned}$$
(5.42)

so the constant \(\gamma ^2\) mentioned at the start of this paragraph satisfies \(\gamma ^2 \geqslant v_k q_k > 0\). \(\square \)

Remark 5.4

The same idea used at the end of the proof of Theorem 1.5 can be used to give an answer to a question raised in [14, Remark 1.7], in the special case that the toll function f depends only on tree size. Indeed, the same proof shows (with no other conditions on f) that if F is not deterministic for all fixed tree sizes (more precisely, if \(q_k > 0\) and \(v_k:= {\text {Var}}F(\mathcal {T}_k) > 0\) for some k), then

$$\begin{aligned} \liminf _n \frac{{\text {Var}}F(\mathcal {T}_n)}{n} \in [v_k q_k, \infty ] \subseteq (0, \infty ]. \end{aligned}$$
(5.43)

\(\square \)

Remark 5.5

(a) It is by no means immediately clear that the constant \(\varkappa _{1,1}\) appearing in (5.14)–(5.15) agrees with the value produced in [5, Remark 5.1]. Appendix D provides a reconciliation.

(b) Using the results of Appendix D, in Appendix E we discuss, for real \(\alpha < 0\), the calculation of the variance in Theorem 1.5. \(\square \)

Remark 5.6

We recall that asymptotic normality of \(X_n(\alpha )\), or equivalently of \(F_{\alpha }(\mathcal {T}_n)\), is already proven in [5, Theorem 1.1]. Furthermore, [5, Section 5] shows joint asymptotic normality for several \(\alpha \) with \({\text {Re}}\alpha <0\), which for the case of two values \(\alpha _1\) and \(\alpha _2\) is consistent with (5.36) (by the argument in the proof of Theorem 1.5 above). It would certainly be possible to generalize the moment convergence results in this section to convergence of mixed moments for combinations of several \(\alpha _i\), similarly to Sect. 4.3, including also the possibility \({\text {Re}}\alpha _i\geqslant 0\) for some values of i. However, this would require a lengthy case distinction (depending on the signs of the values \({\text {Re}}\alpha _i\)), so we did not perform these calculations explicitly. Instead we just note that if we consider only the case \({\text {Re}}\alpha _i<0\), then convergence of all mixed moments follows from the joint convergence in (1.4) for several \(\alpha _i\) shown in [5, Section 5] together with the uniform integrability of \(|n^{-1/2}[X_n(\alpha ) - \mu (\alpha )n]|^r\) for arbitrary \(r>0\) that follows from Theorem 1.5 (see Remark 1.6). \(\square \)

6 Fractional Moments (Mainly of Negative Order) of Tree Size: Comparisons Across Offspring Distributions

Recall from [5, Theorem 1.7] that the \(\alpha \)th moment \(\mu (\alpha ) = {\mathbb E{}}|\mathcal {T}|^{\alpha }\) of tree size defined at (1.8) is the slope in the leading-order linear approximation \(\mu (\alpha ) n\) of \({\mathbb E{}}X_n(\alpha )\) whenever \({\text {Re}}\alpha < \tfrac{1}{2}\); and from Theorem 1.5 that this linear approximation suffices as a centering for \(X_n(\alpha )\) in order to obtain a normal limit distribution when \({\text {Re}}\alpha < 0\). (See also Remark 1.7.) It is, therefore, of interest to compute \(\mu (\alpha )\) and, similarly, the constant \(\mu ' = {\mathbb E{}}\log |\mathcal {T}|\) defined at (1.9), which serves as the centering slope in Theorem 1.3.

In [5, Appendix A] it is noted that although \(\mu (\alpha )\) can be evaluated numerically, no exact values for important examples of Galton–Watson trees are known in any simple form except in the case that \(\alpha \) is a negative integer. This section is motivated by our having noticed that, for all such values \(\mu (-k)\) (for small k) reported for four examples in that appendix, \(\mu (-k)\) is smallest for binary trees [5, Example A.3], second smallest for labelled trees [5, Example A.1], second largest for full binary trees [5, Example A.4], and largest for ordered trees [5, Example A.2]. We wanted to understand why this ordering occurs and whether any such ordering could be predicted for the values \(\mu '\) defined at (1.9).

In Sect. 6.1 we give a sufficient condition ((6.25) in Theorem 6.8) for such (strict) orderings that is fairly easy to check. In Sect. 6.2 we give a class of examples extending the four in [5, Appendix A] where this condition is met. In Sect. 6.3 we discuss numerical computation of \(\mu '\), which we carry out for the four examples in [5, Appendix A] and some additional examples.

The results of this Sect. 6 do not require (1.7).

6.1 Comparison Theory

The main results of this section are in Theorem 6.8. Working toward those results, we begin by recalling from [5, (A.6)] (where y is called “g” and (1.7) is not required) that for \({\text {Re}}\alpha < \tfrac{1}{2}\) we have

$$\begin{aligned} \mu (\alpha ) = \frac{1}{\Gamma (1 - \alpha )} \int _0^1\! (\log \tfrac{1}{t})^{-\alpha } y'(t) \,\textrm{d}t. \end{aligned}$$
(6.1)

To utilize (6.1) directly, even merely to obtain inequalities across models for real \(\alpha \), one needs to compute the derivative of the tree-size probability generating function y, or at least to compare the functions \(y'\) for the models being compared. This is nontrivial, since explicit computation of \(y'\) (or y) is difficult or even infeasible in examples such as m-ary trees and full m-ary trees when \(m > 2\). Fortunately, by (6.3) below (and similarly (6.4) in regard to \(\mu '\)), one need only treat the simpler offspring probability generating function(s) \(\Phi \).

Before proceeding to our main results, we present a simple lemma, a recasting of (6.1), and a definition.

Lemma 6.1

The function \(t \mapsto t / \Phi (t)\) is the inverse function of \(y:[0,1]\rightarrow [0,1]\), and it increases strictly from 0 to 1 for \(t \in [0, 1]\).

Proof

It is obvious from (2.18) that y(z) is continuous and strictly increasing for \(z\in [0,1]\) with \(y(0)=0\) and \(y(1)=1\). Hence its inverse is also strictly increasing from 0 to 1 on \([0,1]\). Finally, (2.19) shows that the inverse is \(t \mapsto t / \Phi (t)\). \(\square \)
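
For instance, for ordered trees (\(\xi \sim {\text {Ge}}(\tfrac{1}{2})\), so \(\Phi (t)=1/(2-t)\)) one has \(y(z)=1-\sqrt{1-z}\), and the inversion can be checked directly; a small Python sketch:

import math

def Phi(t):
    return 1.0 / (2.0 - t)              # offspring pgf for Ge(1/2)

def y(z):
    return 1.0 - math.sqrt(1.0 - z)     # tree-size pgf: solves y = z * Phi(y)

for i in range(1, 10):
    t = i / 10
    assert abs(y(t / Phi(t)) - t) < 1e-12    # t / Phi(t) inverts y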

We will henceforth write

$$\begin{aligned} R(\eta ) := \frac{1}{y^{-1}(\eta )} = \frac{\Phi (\eta )}{\eta } \in [1, \infty ), \quad \eta \in (0, 1]; \end{aligned}$$
(6.2)

this strictly decreasing function R will appear on several occasions in the sequel, especially in Appendix A.2.

It follows from (6.1), Lemma 6.1, a change of variables from t to \(\eta = y(t)\), and (2.19) that

$$\begin{aligned} \mu (\alpha ) = \frac{1}{\Gamma (1 - \alpha )} \int _0^1 [\log R(\eta )]^{-\alpha } \,\textrm{d}\eta . \end{aligned}$$
(6.3)

Further, differentiation with respect to \(\alpha \) at \(\alpha = 0\) gives

$$\begin{aligned} \mu ' = - \gamma - \int _0^1[\log \log R(\eta )] \,\textrm{d}\eta . \end{aligned}$$
(6.4)
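
Formulas (6.2)–(6.4) involve only the offspring generating function \(\Phi \), so they are directly usable for numerical work. The following is a minimal numerical sketch (in Python, using scipy; the helper names log_R, mu_alpha and mu_prime are ours, not from the paper). The integrands have integrable endpoint singularities, which adaptive quadrature copes with at moderate accuracy; where more robustness is wanted, the integrand can be rewritten with log1p, as in a later sketch.

```python
import numpy as np
from scipy.integrate import quad
from scipy.special import gamma

def log_R(eta, Phi):
    # log R(eta) = log(Phi(eta)/eta), with R the strictly decreasing function of (6.2)
    return np.log(Phi(eta) / eta)

def mu_alpha(alpha, Phi):
    # mu(alpha) via (6.3), for real alpha < 1/2
    val, _ = quad(lambda e: log_R(e, Phi) ** (-alpha), 0.0, 1.0)
    return val / gamma(1.0 - alpha)

def mu_prime(Phi):
    # mu' = E log|T| via (6.4)
    val, _ = quad(lambda e: np.log(log_R(e, Phi)), 0.0, 1.0)
    return -np.euler_gamma - val

# Example: ordered trees (offspring Ge(1/2)), with Phi(t) = 1/(2 - t):
print(mu_alpha(-1.0, lambda t: 1.0 / (2.0 - t)))  # mu(-1) = E(1/|T|)
```

We return to \(\mu '\) computations of this kind in Sect. 6.3.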

For the remainder of Sect. 6 we focus on real \(\alpha \) and utilize the following notation.

Definition 6.2

For two real-valued functions \(g_1\) and \(g_2\) defined on (0, 1), write \(g_1 \leqslant g_2\) to mean that \(g_1(t) \leqslant g_2(t)\) for all \(t \in (0, 1)\); write \(g_1 < g_2\) to mean that \(g_1 \leqslant g_2\) but \(g_2 \not \leqslant g_1\) (equivalently, that \(g_1(t) \leqslant g_2(t)\) for all \(t \in (0, 1)\), with strict inequality for at least one value of t); and write \(g_1 \prec g_2\) to mean that \(g_1(t) < g_2(t)\) for all \(t \in (0, 1)\).

Consider two Galton–Watson trees, \(\mathcal {T}^{(1)}\) and \(\mathcal {T}^{(2)}\), with respective offspring distributions \(\xi _1\) and \(\xi _2\). Denote the trees’ respective \(\Phi \)-functions by \(\Phi _1\) and \(\Phi _2\), and use similarly subscripted notation for other functions associated with the trees.

We note in passing the following simple consequence of Lemma 6.1, whose easy proof is left to the reader:

$$\begin{aligned} \Phi _1 \leqslant \Phi _2 \quad \text{ if } \text{ and } \text{ only } \text{ if } \quad y_1 \leqslant y_2, \end{aligned}$$
(6.5)

and hence also

$$\begin{aligned} \Phi _1< \Phi _2 \quad \text{ if } \text{ and } \text{ only } \text{ if } \quad y_1 < y_2. \end{aligned}$$
(6.6)

The result (6.5) is perhaps of some independent interest but is used in the sequel mainly in the proof of Theorem 6.5.

Theorem 6.3

Consider two Galton–Watson trees, \(\mathcal {T}^{(1)}\) and \(\mathcal {T}^{(2)}\). Suppose

$$\begin{aligned} \Phi _1 \leqslant \Phi _2. \end{aligned}$$
(6.7)
  1. (i)

    If \(\alpha < 0\), then

    $$\begin{aligned} \mu _1(\alpha ) \leqslant \mu _2(\alpha ). \end{aligned}$$
    (6.8)
  2. (ii)

    If \(0< \alpha < \frac{1}{2}\), then

    $$\begin{aligned} \mu _1(\alpha ) \geqslant \mu _2(\alpha ). \end{aligned}$$
    (6.9)
  3. (iii)

    The centering constants for the corresponding shape functionals satisfy

    $$\begin{aligned} \mu _1' \geqslant \mu _2'. \end{aligned}$$
    (6.10)

Proof

This is immediate from (6.3) and (6.4). \(\square \)

Note that, by considering difference quotients at \(\alpha = 0\) (where \(\mu _1(0) = \mu _2(0) = 1\)), each of conclusions (i) and (ii) in Theorem 6.3 implies conclusion (iii) there; the stronger hypothesis (6.7) is not needed for these implications.

Remark 6.4

The conclusions in Theorem 6.3 do not always extend from \(\mu (\alpha )\) to \({\mathbb E{}}X_n(\alpha )\) for finite n. A counterexample with \(n = 3\) is provided by taking \(\xi _1 \sim 2 {\text {Bi}}(1, \tfrac{1}{2})\) (corresponding to uniform full binary trees, with \(X_3(\alpha )\) concentrated at \(2 + 3^{\alpha }\)) and \(\xi _2 \sim {\text {Ge}}(\tfrac{1}{2})\) (corresponding to ordered trees, with \({\mathbb E{}}X_3(\alpha ) = \frac{3}{2} + \tfrac{1}{2}2^{\alpha } + 3^{\alpha }\)). As shown in Lemma 6.11, we have \(\Phi _1 \leqslant \Phi _2\); but \({\mathbb E{}}X^{(2)}_3(\alpha ) - {\mathbb E{}}X^{(1)}_3(\alpha ) = \tfrac{1}{2}(2^{\alpha } - 1)\) is negative for \(\alpha < 0\) and positive for \(\alpha > 0\), so Theorem 6.3(i)–(ii) with \({\mathbb E{}}X_3(\alpha )\) in place of \(\mu (\alpha )\) fails for every value of \(\alpha \ne 0\), and (differentiating at \(\alpha = 0\)) the analogue of (6.10) fails as well. \(\square \)

The converse to Theorem 6.3 fails; that is, the conclusions of Theorem 6.3(i)–(ii) do not imply the hypothesis (6.7). A counterexample is provided in Appendix A.2. However, as the next theorem shows, (6.7) has for \(\alpha < 0\) a stronger consequence than Theorem 6.3(i), and this stronger consequence does yield a converse result:

Theorem 6.5

We have

$$\begin{aligned} {\mathbb E{}}(|\mathcal {T}_1| - 1 + t)^{\alpha } \leqslant {\mathbb E{}}(|\mathcal {T}_2| - 1 + t)^{\alpha } \text{ for } \text{ all } \text{ integers } \alpha < 0 \text{ and } \text{ all } t \in (0, \infty ) \end{aligned}$$
(6.11)

if and only if (6.7) holds, in which case the inequality in (6.11) also holds for all real \(\alpha < 0\) and all \(t \in (0, \infty )\).

Proof

Setting \(\alpha = -k\) in (6.3), multiplying by \(z^k\), and summing over integers \(k \geqslant 0\), for complex z in the open unit disk we define the function H as at [5, (A.7)]:

$$\begin{aligned} H(z) := {\mathbb E{}}\Bigl (1 - \frac{z}{|\mathcal {T}|}\Bigr )^{-1} = \sum _{k = 0}^{\infty } \mu (-k) z^k = \int _0^1\!\exp \left[ z \log R(\eta ) \right] \,\textrm{d}\eta . \end{aligned}$$
(6.12)

Changing variables (back) from \(\eta \) to \(t = y^{-1}(\eta ) = 1 / R(\eta )\), we then find

$$\begin{aligned} H(z) = \int _0^1\!t^{-z} y'(t) \,\textrm{d}t = 1 + z \int _0^1\!t^{- z - 1} y(t) \,\textrm{d}t, \end{aligned}$$
(6.13)

with the last equality, resulting from integration by parts, as noted at [5, (A.9)]; thus,

$$\begin{aligned} {\mathbb E{}}(|\mathcal {T}| - z)^{-1} = z^{-1} (H(z) - 1) = \int _0^1\!t^{- z - 1} y(t) \,\textrm{d}t. \end{aligned}$$
(6.14)

Since both the first and third expressions in (6.14) are analytic for all z with \({\text {Re}}z < 1\), they are equal in this halfplane. Changing variables, we then find, for \({\text {Re}}z > -1\), that

$$\begin{aligned} {\mathbb E{}}(|\mathcal {T}| + z)^{-1} = \int _0^{\infty }\!e^{- z x} y(e^{-x}) \,\textrm{d}x. \end{aligned}$$
(6.15)
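
As a numerical sanity check on (6.15), not taken from the paper: for ordered trees (offspring \({\text {Ge}}(\tfrac{1}{2})\); cf. (6.33) below), both sides can be computed independently, since \(y(t) = 1 - \sqrt{1 - t}\) and \({\mathbb P{}}(|\mathcal {T}| = n) = C_{n - 1} 2^{-(2n - 1)}\) with \(C_k\) the Catalan numbers. In the sketch below (Python with scipy), the truncation level N is ours.

```python
import numpy as np
from scipy.integrate import quad

def lhs(z, N=200_000):
    # E(|T| + z)^{-1} by truncated series; P(|T| = n) = C_{n-1} / 2^{2n-1},
    # computed via the ratio P(n+1) / P(n) = (2n - 1) / (2(n + 1))
    total, p = 0.0, 0.5  # p = P(|T| = 1) = 1/2
    for n in range(1, N + 1):
        total += p / (n + z)
        p *= (2.0 * n - 1.0) / (2.0 * (n + 1.0))
    return total

def rhs(z):
    # the integral representation (6.15), with y(t) = 1 - sqrt(1 - t)
    val, _ = quad(lambda x: np.exp(-z * x) * (1.0 - np.sqrt(1.0 - np.exp(-x))),
                  0.0, np.inf)
    return val

for z in (0.0, 0.5, 2.0):
    print(z, lhs(z), rhs(z))  # the two columns should agree closely
```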

In particular, if (6.7) holds, then (recalling (6.5))

$$\begin{aligned} {\mathbb E{}}(|\mathcal {T}_1| - 1 + t)^{-1}&= \int _0^{\infty }\!e^{- t x} e^x y_1(e^{-x}) \,\textrm{d}x \nonumber \\&\leqslant \int _0^{\infty }\!e^{- t x} e^x y_2(e^{-x}) \,\textrm{d}x = {\mathbb E{}}(|\mathcal {T}_2| - 1 + t)^{-1} \end{aligned}$$
(6.16)

for real \(t > 0\).

But more is true. Let \(\Delta (z):= e^z [y_2(e^{-z}) - y_1(e^{-z})]\). Then for \(t > 0\) we have that

$$\begin{aligned} h(t) := {\mathbb E{}}(|\mathcal {T}_2| - 1 + t)^{-1} - {\mathbb E{}}(|\mathcal {T}_1| - 1 + t)^{-1} = \int _0^{\infty }\!e^{- t x} \Delta (x) \,\textrm{d}x \end{aligned}$$
(6.17)

is the Laplace transform of the bounded continuous function \(\Delta \) on \((0, \infty )\). Note that \((-1)^r h^{(r)}(t)/r! = {\mathbb E{}}(|\mathcal {T}_2| - 1 + t)^{-1 - r} - {\mathbb E{}}(|\mathcal {T}_1| - 1 + t)^{-1 - r}\), so the inequalities (6.11) are precisely the (weak) complete monotonicity inequalities for h. It follows from the Bernstein–Widder theorem (e.g., [2, Theorem XIII.4.1a]) that h satisfies these inequalities, i.e.,

$$\begin{aligned} (-1)^r h^{(r)}(t) \geqslant 0 \text{ for } \text{ all } \text{ integers } r \geqslant 0 \text{ and } \text{ all } t \in (0, \infty ), \end{aligned}$$
(6.18)

if and only if \(\Delta (x) \geqslant 0\) for a.e. \(x > 0\), which in turn is true if and only if \(y_1 \leqslant y_2\), or (by (6.5)) equivalently (6.7), holds.

Next, if (6.7) holds, then for real \(\alpha < 0\) and \(t \in (0, 1]\) we have

$$\begin{aligned} {\mathbb E{}}(|\mathcal {T}_1| - 1 + t)^{\alpha }&= \sum _{k = 0}^{\infty } \left( {\begin{array}{c}|\alpha | + k - 1\\ k\end{array}}\right) \,\mu _1(\alpha - k)\,(1 - t)^k \nonumber \\&\leqslant \sum _{k = 0}^{\infty } \left( {\begin{array}{c}|\alpha | + k - 1\\ k\end{array}}\right) \,\mu _2(\alpha - k)\,(1 - t)^k \nonumber \\&= {\mathbb E{}}(|\mathcal {T}_2| - 1 + t)^{\alpha }, \end{aligned}$$
(6.19)

where the inequality holds by Theorem 6.3(i).

Finally, if (6.7) holds, then for real \(\alpha < 0\) and \(t > 1\), Theorem B.1 in Appendix B implies that for \(j \in \{1, 2\}\) we have

$$\begin{aligned} {\mathbb E{}}(|\mathcal {T}_j| - 1 + t)^{\alpha } = (t - 1)^{\alpha } \int _0^1\!\left[ 1 - \frac{c_j(\eta )}{\Gamma (-\alpha )} \right] \,\textrm{d}\eta , \end{aligned}$$
(6.20)

where \(c_j\) is the incomplete gamma function value

$$\begin{aligned} c_j(\eta ) = \int _{(t - 1) \log R_j(\eta )}^{\infty } w^{ - \alpha - 1} e^{-w} \,\textrm{d}w, \end{aligned}$$
(6.21)

and from (6.20) it is evident that \({\mathbb E{}}(|\mathcal {T}_1| - 1 + t)^{\alpha } \leqslant {\mathbb E{}}(|\mathcal {T}_2| - 1 + t)^{\alpha }\), since (6.7) gives \(R_1 \leqslant R_2\) and hence \(c_1 \geqslant c_2\). \(\square \)

Remark 6.6

This remark concerns sufficient conditions for (6.7) (equivalently, by (6.5), for \(y_1 \leqslant y_2\)).

(a) The condition

$$\begin{aligned} |\mathcal {T}^{(1)}| \geqslant |\mathcal {T}^{(2)}| \text{ stochastically } \end{aligned}$$
(6.22)

is stronger than \(y_1 \leqslant y_2\) and is of course equivalent to the condition that

$$\begin{aligned} {\mathbb E{}}g(|\mathcal {T}^{(1)}|) \geqslant {\mathbb E{}}g(|\mathcal {T}^{(2)}|) \end{aligned}$$
(6.23)

for every non-negative nondecreasing function g defined on the positive integers. In particular, (6.22) implies the conclusions of Theorem 6.3 and (6.11) in Theorem 6.5.

Note, however, that (6.22) is strictly stronger than \(y_1 \leqslant y_2\). While the stronger condition (6.22) holds for some of the comparisons in Sect. 6.2 (for example, binary trees vs. labelled trees, for which there is monotone likelihood ratio (MLR); and full binary trees vs. ordered trees, for which there is no MLR but still stochastic ordering), an example satisfying (6.7) (see Lemma 6.11 for a proof) but not (6.22) is \(\xi _1 \sim {\text {Po}}(1)\) (labelled trees) and \(\xi _2 \sim 2 {\text {Bi}}(1, \tfrac{1}{2})\) (full binary trees), because \({\mathbb P{}}(|\mathcal {T}^{(1)}| \leqslant 2) = e^{-1} + e^{-2} > \frac{1}{2} = {\mathbb P{}}(|\mathcal {T}^{(2)}| \leqslant 2)\).

(b) Similarly, the condition

$$\begin{aligned} \xi _1 \geqslant \xi _2 \text{ stochastically } \end{aligned}$$
(6.24)

is stronger than (6.7); indeed, it is even stronger than (6.22). But this stochastic ordering of offspring distributions can hold only if \(\xi _1\) and \(\xi _2\) have the same distribution, because \({\mathbb E{}}\xi _1 = {\mathbb E{}}\xi _2 = 1\). \(\square \)

Remark 6.7

This remark concerns necessary conditions for (6.7).

(a) If (6.7) holds, then by a Taylor expansion near \(t = 1\) [12, (A.6)] (or, alternatively, recalling (6.5), by \(y_1 \leqslant y_2\) and [12, (A.5)]), \(\sigma _1^2 \leqslant \sigma _2^2\). (This does not require the assumption (1.7); when (1.7) holds, we can also use [5, Lemma 12.14] or (2.20).)

(b) More generally, and by similar reasoning, if (6.7) holds and for some integer \(r \geqslant 2\) we have \({\mathbb E{}}\xi _1^j = {\mathbb E{}}\xi _2^j \leqslant \infty \) for \(j = 1, \ldots , r - 1\), then \((-1)^r {\mathbb E{}}\xi _1^r \leqslant (-1)^r {\mathbb E{}}\xi _2^r \leqslant \infty \). See Appendix C for details.

(c) We can also consider a Taylor expansion near \(t = 0\). Thus, if (6.7) holds, then \({\mathbb P{}}(\xi _1 = 0) \leqslant {\mathbb P{}}(\xi _2 = 0)\). More generally, if for some integer \(r \geqslant 0\) we have \({\mathbb P{}}(\xi _1 = j) = {\mathbb P{}}(\xi _2 = j)\) for \(j = 0, \ldots , r - 1\), then \({\mathbb P{}}(\xi _1 = r) \leqslant {\mathbb P{}}(\xi _2 = r)\). \(\square \)

We next address the question of a stronger condition than (6.7) under which the inequalities in (6.8)–(6.10) and (6.11) are all strict. Recall the meaning of \(g_1 < g_2\) described in Definition 6.2.

Theorem 6.8

Consider two Galton–Watson trees, \(\mathcal {T}^{(1)}\) and \(\mathcal {T}^{(2)}\). Suppose

$$\begin{aligned} \Phi _1 < \Phi _2. \end{aligned}$$
(6.25)
  1. (i)

    If \(\alpha < 0\), then

    $$\begin{aligned} \mu _1(\alpha ) < \mu _2(\alpha ). \end{aligned}$$
    (6.26)
  2. (ii)

    If \(0< \alpha < \frac{1}{2}\), then

    $$\begin{aligned} \mu _1(\alpha ) > \mu _2(\alpha ). \end{aligned}$$
    (6.27)
  3. (iii)

    We have

    $$\begin{aligned} \mu _1' > \mu _2'. \end{aligned}$$
    (6.28)

Proof

If (6.25) holds, then (by continuity of \(\Phi _1\) and \(\Phi _2\)) strict inequality \(\Phi _1(t) < \Phi _2(t)\) holds over some interval of positive length. The inequalities (6.26)–(6.28) are then immediate from (6.3)–(6.4). \(\square \)

Theorem 6.9

We have

$$\begin{aligned} {\mathbb E{}}(|\mathcal {T}_1| - 1 + t)^{-m} < {\mathbb E{}}(|\mathcal {T}_2| - 1 + t)^{-m}\quad \text{ for } \text{ all } \text{ integers } m \geqslant 1 \text{ and } \text{ all } t \in (0, \infty ) \end{aligned}$$
(6.29)

if and only if (6.25) holds.

Proof

The forward direction (6.29)\(\implies \)(6.25) follows from Theorem 6.5: the strict inequalities (6.29) imply the weak inequalities (6.11) and hence (6.7); and \(\Phi _1 = \Phi _2\) is impossible, since it would force equality in (6.29). Thus \(\Phi _1 < \Phi _2\).

For the opposite direction, use the representation (6.17) and differentiate under the integral sign with respect to t: if (6.25) holds, then \(\Delta \geqslant 0\), with \(\Delta > 0\) on a set of positive measure, and so \((-1)^r h^{(r)}(t) = \int _0^{\infty }\!x^r e^{- t x} \Delta (x) \,\textrm{d}x > 0\) for all integers \(r \geqslant 0\) and all \(t > 0\), which is (6.29) with \(m = r + 1\). \(\square \)

Remark 6.10

For all the comparison examples in Sect. 6.2 where the condition (6.25) holds, we in fact have the stronger condition \(\Phi _1 \prec \Phi _2\). When (6.25) holds, we cannot have \(\Phi _1 = \Phi _2\) over a nondegenerate interval, because \(\Phi _2(z) - \Phi _1(z)\) is analytic for z in the open unit disk. But it is possible to have \(\Phi _1(t) = \Phi _2(t)\) for some values of \(t \in (0, 1)\). For an example with one such value, namely, \(t = 1/6\), use the notation of Appendix A.1 and take \(\Phi _1 = \Phi \) and \(\Phi _2 = {\widetilde{\Phi }}_0\). \(\square \)

6.2 Comparison Examples

In this subsection we consider the following important examples of critical Galton–Watson trees, and we fix the subscripting notation in (6.30)–(6.34) for the remainder of Sect. 6:

$$\begin{aligned} m{\text {-}}\text{ ary } \text{ trees: }&\xi _{1, m} \sim {\text {Bi}}(m, \tfrac{1}{m})\quad (m \geqslant 2); \end{aligned}$$
(6.30)
$$\begin{aligned} \text{ labelled } \text{ trees: }&\xi _2 \sim {\text {Po}}(1); \end{aligned}$$
(6.31)
$$\begin{aligned} \text{ full } \text{ binary } \text{ trees: }&\xi _3 \sim 2 {\text {Bi}}(1, \tfrac{1}{2}); \end{aligned}$$
(6.32)
$$\begin{aligned} \text{ ordered } \text{ trees: }&\xi _4 \sim {\text {Ge}}(\tfrac{1}{2}); \end{aligned}$$
(6.33)
$$\begin{aligned} \text{ full } m{\text {-}}\text{ ary } \text{ trees: }&\xi _{5, m} \sim m {\text {Bi}}(1, \tfrac{1}{m})\quad (m \geqslant 3). \end{aligned}$$
(6.34)

Observe that

$$\begin{aligned} \sigma _{1, m}^2 = 1 - \tfrac{1}{m} \uparrow \text{ strictly } \text{ as } m \uparrow , \end{aligned}$$
(6.35)

that

$$\begin{aligned} \sigma _{5, m}^2 = m - 1\uparrow \text{ strictly } \text{ as } m \uparrow , \end{aligned}$$
(6.36)

and that, for any \(m \geqslant 2\), we have

$$\begin{aligned} \sigma _{1, m}^2< \sigma _2^2 = \sigma _3^2 < \sigma _4^2 = \sigma _{5, 3}^2. \end{aligned}$$
(6.37)

Further,

$$\begin{aligned} {\mathbb E{}}\xi _2^3 = 5 > 4 = {\mathbb E{}}\xi _3^3 \end{aligned}$$
(6.38)

and

$$\begin{aligned} {\mathbb E{}}\xi _4^3 = 13 > 9 = {\mathbb E{}}\xi _{5, 3}^3. \end{aligned}$$
(6.39)

According to Remark 6.7(a)–(b) and (6.35)–(6.39), the only possible \(\Phi \)-orderings in the order < among the trees listed in (6.30)–(6.34) are

$$\begin{aligned} \Phi _{1, m} \uparrow \text{ strictly } \text{ as } m \uparrow , \end{aligned}$$
(6.40)
$$\begin{aligned} \Phi _{5, m} \uparrow \text{ strictly } \text{ as } m \uparrow , \end{aligned}$$
(6.41)

and, for any \(m \geqslant 2\),

$$\begin{aligned} \Phi _{1, m}< \Phi _2< \Phi _3< \Phi _4 < \Phi _{5, 3}. \end{aligned}$$
(6.42)

Alternatively, we can note that

$$\begin{aligned} {\mathbb P{}}(\xi _{1, m} = 0) = (1 - \tfrac{1}{m})^m \uparrow \text{ strictly } \text{ as } m \uparrow \end{aligned}$$
(6.43)

(see (6.51) below with \(t = 0\)); that

$$\begin{aligned} {\mathbb P{}}(\xi _{5, m} = 0) = 1 - \tfrac{1}{m} \uparrow \text{ strictly } \text{ as } m \uparrow ; \end{aligned}$$
(6.44)

that, for any \(m \geqslant 2\), we have

$$\begin{aligned} {\mathbb P{}}(\xi _{1, m} = 0)< e^{-1} = {\mathbb P{}}(\xi _2 = 0)< {\mathbb P{}}(\xi _3 = 0) = {\mathbb P{}}(\xi _4 = 0) < {\mathbb P{}}(\xi _{5, 3} = 0); \end{aligned}$$
(6.45)

and, further, that

$$\begin{aligned} {\mathbb P{}}(\xi _3 \leqslant 1) = \tfrac{1}{2}< \tfrac{3}{4} = {\mathbb P{}}(\xi _4 \leqslant 1) \end{aligned}$$
(6.46)

to conclude again, now using Remark 6.7(c), that the only possible \(\Phi \)-orderings in the order < for (6.30)–(6.34) are (6.40)–(6.42).

Remarkably, all the inequalities in (6.40)–(6.42) are true, and in fact there is strict inequality at every argument \(t \in (0, 1)\).

Lemma 6.11

For every \(t \in (0, 1)\) we have

$$\begin{aligned} \Phi _{1, m}(t) \uparrow \text{ strictly } \text{ as } m \uparrow , \end{aligned}$$
(6.47)
$$\begin{aligned} \Phi _{5, m}(t) \uparrow \text{ strictly } \text{ as } m \uparrow ; \end{aligned}$$
(6.48)

and, for any \(m \geqslant 2\),

$$\begin{aligned} \Phi _{1, m} \prec \Phi _2 \prec \Phi _3 \prec \Phi _4 \prec \Phi _{5, 3}. \end{aligned}$$
(6.49)

Proof

The proof is a collection of simple exercises in calculus.

Proof of (6.47). Fix \(m \geqslant 2\) and \(t \in (0, 1)\). Observe that

$$\begin{aligned} \Phi _{1, m}(t) = (\tfrac{m - 1}{m} + \tfrac{1}{m} t)^m = [1 - \tfrac{1}{m} (1 - t)]^m. \end{aligned}$$
(6.50)

Thus,

$$\begin{aligned}&\log \Phi _{1, m + 1}(t) - \log \Phi _{1, m}(t) \nonumber \\&\quad = (m + 1) \log \Bigl (1 - \frac{1 - t}{m + 1}\Bigr ) - m \log \Bigl (1 - \frac{1 - t}{m}\Bigr ) \nonumber \\&\quad = - \left[ (1 - t) + \frac{(1 - t)^2}{2 (m + 1)} + \frac{(1 - t)^3}{3 (m + 1)^2} + \cdots \right] + \left[ (1 - t) + \frac{(1 - t)^2}{2 m} + \frac{(1 - t)^3}{3 m^2} + \cdots \right] \nonumber \\&\quad > 0. \end{aligned}$$
(6.51)

Proof of (6.48). Fix \(m \geqslant 3\). Consider \(t \in (0, 1]\) and observe that

$$\begin{aligned} \Phi _{5, m}(t) = \tfrac{1}{m} (m - 1 + t^m). \end{aligned}$$
(6.52)

Let \(f(t):= \Phi _{5, m + 1}(t) - \Phi _{5, m}(t)\). We have \(f(1) = 1 - 1 = 0\) and

$$\begin{aligned} f'(t) = t^m - t^{m - 1} = - t^{m - 1} (1 - t) < 0 \end{aligned}$$
(6.53)

for \(t \in (0, 1)\). Thus, \(f(t) > 0\) for \(t \in (0, 1)\).

Proof of \(\Phi _{1, m} \prec \Phi _2\) for \(2 \leqslant m < \infty \). From (6.50) we see that

$$\begin{aligned} \Phi _{1, \infty }(t) := \lim _{m \rightarrow \infty } \Phi _{1, m}(t) = e^{t - 1} = \Phi _2(t). \end{aligned}$$
(6.54)

The result follows, since by (6.47) the convergence is strictly increasing, whence \(\Phi _{1, m}(t) < \Phi _{1, \infty }(t) = \Phi _2(t)\) for every \(t \in (0, 1)\).

Proof of \(\Phi _2 \prec \Phi _3\). Consider \(t \in (0, 1]\) and let

$$\begin{aligned} f(t) := \ln \Phi _3(t) - \ln \Phi _2(t) = \ln (1 + t^2) - \ln 2 - (t - 1). \end{aligned}$$
(6.55)

We have \(f(1) = 0\) and

$$\begin{aligned} f'(t) = 2 t (1 + t^2)^{-1} - 1 = - (1 - t)^2 (1 + t^2)^{-1} < 0 \end{aligned}$$
(6.56)

for \(t \in (0, 1)\). Thus, \(f(t) > 0\) for \(t \in (0, 1)\).

Proof of \(\Phi _3 \prec \Phi _4\). Consider \(t \in [0, 1]\) and let

$$\begin{aligned} f(t) := \Phi _4(t) - \Phi _3(t) = \tfrac{1}{2}(1 - \tfrac{1}{2}t)^{-1} - \tfrac{1}{2}(1 + t^2) = \tfrac{1}{4} t (1 - t)^2 (1- \tfrac{1}{2}t)^{-1}. \end{aligned}$$
(6.57)

Clearly, \(f(t) > 0\) for \(t \in (0, 1)\).

Proof of \(\Phi _4 \prec \Phi _{5, 3}\). Consider \(t \in (0, 1)\) and let

$$\begin{aligned} f(t) := \frac{\Phi _{5, 3}(t)}{\Phi _4(t)} = \frac{\tfrac{2}{3} + \tfrac{1}{3} t^3}{\tfrac{1}{2}(1 - \tfrac{1}{2}t)^{-1}}. \end{aligned}$$
(6.58)

Then

$$\begin{aligned} f(t) = 1 + \tfrac{1}{3} (1 - 2 t + 2 t^3 - t^4) = 1 + \tfrac{1}{3} (1 - t)^3 (1 + t) > 1, \end{aligned}$$
(6.59)

as desired. \(\square \)
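
The strict pointwise orderings just proved are easy to confirm numerically on a grid; here is a small sketch (Python; the function definitions follow (6.30)–(6.34), (6.50) and (6.52), while the grid and the tested range of m are ours):

```python
import numpy as np

t = np.linspace(0.001, 0.999, 999)

Phi_ary  = lambda m: (1.0 - (1.0 - t) / m) ** m  # m-ary, cf. (6.50)
Phi_lab  = np.exp(t - 1.0)                       # labelled trees, Po(1)
Phi_fbin = 0.5 * (1.0 + t ** 2)                  # full binary trees
Phi_ord  = 0.5 / (1.0 - 0.5 * t)                 # ordered trees, Ge(1/2)
Phi_fary = lambda m: (m - 1.0 + t ** m) / m      # full m-ary, cf. (6.52)

# (6.47)-(6.48): strict monotonicity in m at every grid point
assert all((Phi_ary(m + 1) > Phi_ary(m)).all() for m in range(2, 50))
assert all((Phi_fary(m + 1) > Phi_fary(m)).all() for m in range(3, 50))

# (6.49): the chain Phi_{1,m} < Phi_2 < Phi_3 < Phi_4 < Phi_{5,3}
assert all((Phi_ary(m) < Phi_lab).all() for m in range(2, 50))
assert (Phi_lab < Phi_fbin).all()
assert (Phi_fbin < Phi_ord).all()
assert (Phi_ord < Phi_fary(3)).all()
print("all pointwise orderings hold on the grid")
```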

Theorem 6.12

  1. (i)

    If \(\alpha < 0\), then

    $$\begin{aligned} \mu _{1, m}(\alpha ) \uparrow \text{ strictly } \text{ as } m \uparrow , \end{aligned}$$
    (6.60)
    $$\begin{aligned} \mu _{5, m}(\alpha ) \uparrow \text{ strictly } \text{ as } m \uparrow ; \end{aligned}$$
    (6.61)

    and, for any \(m \geqslant 2\),

    $$\begin{aligned} \mu _{1, m}(\alpha )< \mu _2(\alpha )< \mu _3(\alpha )< \mu _4(\alpha ) < \mu _{5, 3}(\alpha ). \end{aligned}$$
    (6.62)
  2. (ii)

    The orders in (i) are all reversed for \(0< \alpha < \tfrac{1}{2}\) and for \(\mu '\).

Proof

The theorem is immediate from Lemma 6.11 and Theorem 6.8. \(\square \)

Remark 6.13

The only two examples among (6.30)–(6.34) for which \(\xi \leqslant 2\) a.s. are binary trees, with \(\Phi _{1, 2}(t) = \frac{1}{4}(1 + t)^2\), and full binary trees, with \(\Phi _3(t) = \tfrac{1}{2}(1 + t^2)\). These two examples (\(c = \tfrac{1}{2}\) and \(c = 1\), respectively), along with so-called Motzkin trees (\(c = \tfrac{2}{3}\)), are instances of the most general critical Galton–Watson offspring distribution \(\xi _{(c)}\) satisfying \(\xi _{(c)}\leqslant 2\) a.s., with \(0 < c \leqslant 1\) and

$$\begin{aligned} {\mathbb P{}}(\xi _{(c)}= 0) = {\mathbb P{}}(\xi _{(c)}= 2) = \tfrac{1}{2}c, \qquad {\mathbb P{}}(\xi _{(c)}= 1) = 1 - c. \end{aligned}$$
(6.63)

Generalizing \(\Phi _{1, 2} \prec \Phi _3\) from (6.49) in Lemma 6.11, we claim that \(\Phi _{(c)}\) is strictly increasing in c with respect to the order \(\prec \). Indeed, for \(t \in (0, 1)\) we have

$$\begin{aligned} \Phi _{(c)}(t) = t + \tfrac{1}{2}c (1 - t)^2, \end{aligned}$$
(6.64)

which is clearly strictly increasing in \(c \in (0, 1]\). \(\square \)

Remark 6.14

Despite a suggestion to the contrary provided by Lemma 6.11 and Remark 6.13, the partial order \(\leqslant \) on offspring probability generating functions (equivalently, by (6.5), on tree size probability generating functions) is not a linear order. An example of incomparable \(\Phi \) and \({\widetilde{\Phi }}\) is provided in Appendix A.1 (taking \(\varepsilon \in (0, 1]\) in the notation there). For a simpler counterexample, which shows that \(\leqslant \) does not even linearly order cubic probability generating functions, let

$$\begin{aligned} \Phi (t) := \Phi _{1, 2}(t) = \tfrac{1}{4} (1 + t)^2 \end{aligned}$$
(6.65)

correspond to binary trees, as at (6.30); and let

$$\begin{aligned} {\widetilde{\Phi }}(t) := (\tfrac{1}{4} - 2 \varepsilon ) + (\tfrac{1}{2}+ 7 \varepsilon ) t + (\tfrac{1}{4} - 8 \varepsilon ) t^2 + 3 \varepsilon \,t^3 \end{aligned}$$
(6.66)

with \(0 < \varepsilon \leqslant 1 / 32\). (For example, the choice \(\varepsilon = 1 / 32\) gives

$$\begin{aligned} {\widetilde{\Phi }}(t) := \tfrac{3}{16} + \tfrac{23}{32} t + \tfrac{3}{32} t^3.) \end{aligned}$$
(6.67)

Then \({\widetilde{\Phi }}\) has non-negative coefficients and

$$\begin{aligned} {\widetilde{\Phi }}(1) = {\widetilde{\Phi }}'(1) = 1, \end{aligned}$$
(6.68)

as required, and one can simply note that

$$\begin{aligned} {\widetilde{\Phi }}(t) - \Phi (t) = \varepsilon (- 2 + 7 t - 8 t^2 + 3 t^3) = 3 \varepsilon (t - \tfrac{2}{3}) (1 - t)^2 \end{aligned}$$
(6.69)

is negative for \(t < 2/3\) and positive for \(t > 2/3\). Alternatively, one can apply Remark 6.7 and note that

$$\begin{aligned} {\mathbb P{}}(\xi = 0) = \tfrac{1}{4} > \tfrac{1}{4} - 2 \varepsilon = {\mathbb P{}}\bigl (\tilde{\xi }= 0\bigr ) \end{aligned}$$
(6.70)

but

$$\begin{aligned} \tilde{\sigma }^2 = {\mathbb E{}}\left[ \tilde{\xi }\bigl (\tilde{\xi }- 1\bigr ) \right] = 2 (\tfrac{1}{4} - 8 \varepsilon ) + 6 (3 \varepsilon ) = \tfrac{1}{2}+ 2 \varepsilon > \tfrac{1}{2}= \sigma ^2. \end{aligned}$$
(6.71)

For another example of incomparable \(\Phi \) and \({\widetilde{\Phi }}\) with respect to \(\leqslant \), consider quaternary trees (\(m = 4\) in (6.30)) and Motzkin trees (\(c = 2 / 3\) in Remark 6.13). \(\square \)
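
A two-line numerical confirmation of the sign change in (6.69) (the grid is ours and avoids \(t = 2/3\) exactly):

```python
import numpy as np

eps = 1.0 / 32.0
t = np.linspace(0.01, 0.99, 99)
diff = eps * (-2.0 + 7.0 * t - 8.0 * t ** 2 + 3.0 * t ** 3)  # (6.69)
assert (diff[t < 2.0 / 3.0] < 0).all() and (diff[t > 2.0 / 3.0] > 0).all()
```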

6.3 Numerical Computation of \(\mu '\)

In this subsection we will compute the constant \(\mu '\) for several examples of critical Galton–Watson trees. First, to set the stage for what to expect, we consider in the next remark the possible values of \(\mu '\) as \(\xi \) ranges over all critical offspring distributions.

Recall (6.4). For the next remark, we find it convenient to break the integral into two pieces, using the notation \(x^+:=\max \{x,0\}\) and \(x^-:=\max \{-x,0\}\):

$$\begin{aligned} \mu '&= - \gamma - \int _{t \in (0, 1)} [\log \log R(t)]^+ \,\textrm{d}t + \int _{t \in (0, 1)} [\log \log R(t)]^- \,\textrm{d}t \nonumber \\&= - \gamma - J_+ + J_-, \end{aligned}$$
(6.72)

say.

Remark 6.15

In this remark we argue that, over all Galton–Watson trees, \(\mu '\) has neither a finite upper bound nor a positive lower bound.

(a) Referring to Remark 6.13, observe that

$$\begin{aligned} \Phi _{(c)}(t) \searrow t \end{aligned}$$
(6.73)

for each \(t \in (0, 1)\) as \(c \searrow 0\). By the dominated convergence theorem (DCT), \(J_+ \searrow 0\) as \(c \searrow 0\). By the monotone convergence theorem (MCT), \(J_- \nearrow \infty \) as \(c \searrow 0\). Thus, as \(c \searrow 0\) we have

$$\begin{aligned} \mu '_{(c)}\nearrow \infty . \end{aligned}$$
(6.74)

Indeed, it can be shown that \(\mu '_{(c)} = \log \frac{2}{c} + 1 - \gamma + o(1)\) as \(c \searrow 0\).
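
This asymptotic formula is easy to probe numerically. A self-contained sketch (the helper name mu_prime_c is ours; we compute \(R(\eta ) - 1 = c(1 - \eta )^2/(2\eta )\) in closed form and use log1p to avoid cancellation near \(\eta = 1\)):

```python
import numpy as np
from scipy.integrate import quad

def mu_prime_c(c):
    # mu' for xi_(c) via (6.4); here R(eta) - 1 = c (1 - eta)^2 / (2 eta), cf. (6.64)
    integrand = lambda e: np.log(np.log1p(0.5 * c * (1.0 - e) ** 2 / e))
    val, _ = quad(integrand, 0.0, 1.0)
    return -np.euler_gamma - val

for c in (1e-2, 1e-4, 1e-6):
    print(c, mu_prime_c(c), np.log(2.0 / c) + 1.0 - np.euler_gamma)
# the two columns should approach each other as c decreases; cf. (6.86) for c = 1e-6
```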

(b) For the offspring distributions \(\xi _{5,m}\), we have as \({m\rightarrow \infty }\) that

$$\begin{aligned} {\mathbb P{}}(\xi _{5,m}=0)=1-\tfrac{1}{m}\rightarrow 1, \end{aligned}$$
(6.75)

and thus, \(\xi _{5, m} \overset{\textrm{p}}{\longrightarrow }0\), which implies convergence of the probability generating functions for every \(t\in (0,1)\); hence,

$$\begin{aligned} \Phi _{5, \infty }(t) := \lim _{m \rightarrow \infty } \Phi _{5, m}(t) = 1, \end{aligned}$$
(6.76)

which is otherwise obvious by direct calculation (showing also that the convergence is monotonically increasing). By the MCT applied to \(J_+\) and the DCT applied to \(J_-\), we find

$$\begin{aligned} \mu '_{5, m} \searrow - \gamma - \int _0^1\!(\log \log \tfrac{1}{t}) \,\textrm{d}t = 0 \end{aligned}$$
(6.77)

as \(m \nearrow \infty \). Indeed, it can be shown that \(\mu '_{5, m} \sim m^{-1} \ln m\) as \(m \rightarrow \infty \).

(c) We claim that the image of \(\mu '\) over Galton–Watson tree models is in fact \((0, \infty )\). To see this, we first note that \(\mu '_{(c)}\) is continuous in c, with \(\mu '_{(1)}=\mu '_3\), which by (a) implies that the image contains \([\mu '_3, \infty )\). Further, by considering the offspring probability generating functions

$$\begin{aligned} \Phi _{\lambda , m} := \lambda \Phi _{5, m} + (1 - \lambda ) \Phi _3 \end{aligned}$$
(6.78)

with \(\lambda \in [0, 1]\), one can show (by consideration of large m) that the image also contains \((0, \mu '_3)\); we omit the details.

(d) Similarly to (c), for each fixed value of \(\alpha < 0\) the image of \(\mu (\alpha )\) over all Galton–Watson tree models is (0, 1), and for each fixed value of \(\alpha \in (0, \tfrac{1}{2})\) the image is \((1, \infty )\). \(\square \)

Example 6.16

The constant \(\mu '_{1, 2}\) is computed to 50 digits in [4, Section 5.2] using the alternative form

$$\begin{aligned} \mu ' = - \gamma - \int _0^1\!(\log \log \tfrac{1}{t}) y'(t) \,\textrm{d}t \end{aligned}$$
(6.79)

of (6.4), explicit calculation of

$$\begin{aligned} y_{1, 2}(t) = \frac{2 - t - 2 \sqrt{1 - t}}{t} = t (1 + \sqrt{1 - t})^{-2} \end{aligned}$$
(6.80)

and, thence, its derivative

$$\begin{aligned} y_{1, 2}'(t) = (1 - t)^{-1/2} (1 + \sqrt{1 - t})^{-2}, \end{aligned}$$
(6.81)

and numerical integration. But it is easier to use (6.4) for (high-precision) computation of \(\mu '\), especially for the values \(\mu '_{1, m}\) and \(\mu '_{5, m}\).
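
For instance, reusing the mu_prime helper from the sketch following (6.4) (an illustration, not the authors' code), one can check at double precision the four five-digit values displayed next:

```python
# Reusing mu_prime (and numpy as np) from the sketch following (6.4):
print(mu_prime(lambda t: 0.25 * (1.0 + t) ** 2))  # binary,      cf. 2.0254
print(mu_prime(lambda t: np.exp(t - 1.0)))        # labelled,    cf. 1.5561
print(mu_prime(lambda t: 0.5 * (1.0 + t ** 2)))   # full binary, cf. 1.4414
print(mu_prime(lambda t: 1.0 / (2.0 - t)))        # ordered,     cf. 1.1581
```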

As examples, we find, rounded to five digits,

$$\begin{aligned} \mu '_{1, 2} = 2.0254, \qquad \mu '_2 = 1.5561, \qquad \mu '_3 = 1.4414, \qquad \mu '_4 = 1.1581. \end{aligned}$$
(6.82)

Note that

$$\begin{aligned} \infty> \mu '_{1, 2}> \mu '_2> \mu '_3> \mu '_4 > 0, \end{aligned}$$
(6.83)

as guaranteed by Lemma 6.11 and Theorem 6.8(iii); see also Remark 6.15, which shows that the outer bounds \(\infty \) and 0 in (6.83) cannot be improved uniformly over all Galton–Watson tree models.

As other examples, we find, rounded to five digits,

$$\begin{aligned} \mu '_{1, 2} = 2.0254, \qquad \mu '_{1, 3} = 1.8224, \qquad \mu '_{1,10^3} = 1.5567; \end{aligned}$$
(6.84)
$$\begin{aligned} \mu '_{5, 3} = 1.0164, \qquad \mu '_{5, 4} = 0.80800, \qquad \mu '_{5, 10^6} = 1.5372 \times 10^{-5}; \end{aligned}$$
(6.85)

and, in the notation of Remark 6.13,

$$\begin{aligned} \mu '_{(10^{-6})} = 14.931, \qquad \mu '_{(1/2)} = \mu '_{1, 2} = 2.0254, \qquad \mu '_{(1 - 10^{-2})} = 1.4496. \end{aligned}$$
(6.86)

\(\square \)