1 Introduction

A well-known conjecture in the theory of random walk on a group asserts that for the random walk on the symmetric group generated by all permutations of a common cycle structure, the mixing time of the walk depends only on the number of fixed points. Given measure \(\mu \) on \(S_n\) and integer \(t \ge 1\), denote \(\mu ^{*t}\) its t-fold convolution, which is the law of the random walk starting from the identity in \(S_n\) after t steps from \(\mu \). We measure convergence to uniformity in the total variation metric, which for probabilities \(\mu \) and \(\nu \) on \(S_n\) is given by

$$\begin{aligned} \Vert \mu - \nu \Vert _{T.V.} = \max _{A \subset S_n} |\mu (A) - \nu (A)|. \end{aligned}$$

A precise statement of the conjecture is as follows.

Conjecture 1

For each \(n > 1\) let \(C_n\) be a conjugacy class of the symmetric group \(S_n\) having \(n-k_n< n\) fixed points, and let \(\mu _{C_n}\) denote the uniform probability measure on \(C_n\). Let \(t_2, t_3, t_4, \ldots \) be a sequence of positive integer steps and for each \(n \ge 2\) let \(U_n\) be uniform measure on the coset of the alternating group \(A_n\) supporting the measure \(\mu _{C_n}^{*t_n}\). For any \(\varepsilon > 0\), if eventually \(t_n \ge (1 +\varepsilon ) \frac{n}{k_n} \log n\), then

$$\begin{aligned} \lim _{n \rightarrow \infty } \left\| \mu _{C_n}^{*t_n} - U_n\right\| _{T.V.} = 0. \end{aligned}$$

If eventually \(t_n \le (1-\varepsilon ) \frac{n}{k_n} \log n\), then

$$\begin{aligned} \lim _{n \rightarrow \infty } \left\| \mu _{C_n}^{*t_n} - U_n\right\| _{T.V.} = 1. \end{aligned}$$

This conjecture aims to generalize the mixing time analysis of Diaconis and Shahshahani [4] for the random transposition walk. The formal conjecture seems to have first appeared in [14].

When k grows with n like a constant times n the conjecture is known to be false because the walk mixes too rapidly. This is the work of a number of authors, but first [12] and for later results see [10] and references therein. Whenever \(k = o(n)\) the conjecture is expected to hold, however, in part because the proposed lower bound on mixing time follows from standard techniques in the field. We give the second moment method proof in Appendix A.

Since the initial work of Diaconis and Shahshahani, Conjecture 1 has received quite a bit of attention, see [5, 11, 1417, 20] for cases of conjugacy classes with finite numbers of non-fixed points. A discussion of ongoing work of Schlage–Puchta towards the general conjecture is contained in [18]. See [2] and [19] for broader perspective.

Recently Berestycki et al. [1] have established the conjecture for any set of k cycles with k fixed as \(n \rightarrow \infty \). Moreover, they assert that their analysis will go through to treat the case of any conjugacy class having bounded total length of non-trivial cycles, and may cover the case when the total cycle length grows like \(o\left( \sqrt{n}\right) \), although they state that they have not checked carefully the uniformity in k with respect to n.

The purpose of this article is to prove Conjecture 1 for the random k cycle walk in the full range \(k = o(n)\).

Theorem 2

Conjecture 1 holds when C is the conjugacy class of all k cycles with k permitted to be any function of n that satisfies \(k = o(n)\) as \(n \rightarrow \infty \). If \(k = o\left( \frac{n}{\log n}\right) \) then the conclusion of Conjecture 1 remains valid when \(\varepsilon = \varepsilon (n)\) is any function satisfying \(\varepsilon (n) \log n \rightarrow \infty \).

Remark

When \(k = o\left( \frac{n}{\log n}\right) \) the window \(\epsilon (n)\log n \rightarrow \infty \) is essentially best possible.

As remarked above, the lower bound was already known. Prior to [1], which is purely combinatorial, all approaches to Conjecture 1 have followed the work of Diaconis and Shahshahani in passing through bounds for characters on the symmetric group. We return to this previous approach; in particular, we give an asymptotic evaluation of character ratios at a k cycle for many representations, in the range \(k = O\left( n^{\frac{1}{2}-\epsilon }\right) \). The asymptotic formula appears to be new when \(k > 6\), see [9] for the smaller cases. A precise statement of our result appears in Sect. 3, after we introduce the necessary notation.

The basis for our argument is an old formula of Frobenius [6], which gives the value of a character of the symmetric group evaluated at a k cycle as a contour integral of a certain rational function, characterized by the cycle and the representation. In the special case of a transposition this formula was already used by Diaconis and Shahshahani. Previous authors in attempting to extend the result of [4] had also used Frobenius’ formula, but they had attempted to estimate the sum of residues of the function directly, which entails significant difficulties since nearby residues of the function are unstable once the cycle length k becomes somewhat large. We avoid these difficulties by estimating the integral itself rather than the residues in most situations. In doing so, a certain regularity of the character values becomes evident. For instance, while nearby residues appear irregular, by grouping clumps of poles inside a common integral we are able to show that the ‘amortized’ contribution of any pole is slowly varying.

It may be initially surprising that our analysis becomes greatly simplified as k grows. For instance, once k is larger than a sufficiently large constant times \(\log n\), essentially trivial bounds for the contour integral suffice, and the greatest part of the analysis goes into showing that our method can handle the handful of small cases which had been treated previously using the character approach. A similar feature occurs in the related paper [8] where a contour integral is also used to bound character ratios at rotations on the orthogonal group. It seems that the contour method works best when there is significant oscillation in the sum of residues, but is difficult to use in the cases where the residues are generally positive.

In Appendix C we show that Frobenius’ formula has a natural generalization to other conjugacy classes on the symmetric group, but with increasing complexity, since one contour integration enters for each non-trivial cycle. The analysis here applies more broadly than to just the class of k cycles, but we have not pushed the method to its limit because it seems that some new ideas are needed to obtain the full Conjecture 1 when the number of small cycles in the cycle decomposition becomes large. From our point of view, the classes containing \(\ge n^{1-\epsilon }\) 2-cycles would appear to pose the greatest difficulty.

We remark that, as typical in the character ratio approach, our upper bound is a consequence of a corresponding upper bound for the walk in \(L^2\), so that our result gives a broader set of Markov chains on \(S_n\) to which one can apply the comparison techniques of [3].

Regarding notation, it will occasionally be convenient to use the Vinogradov notation \(A \ll B\), with the same meaning as \(A = O(B)\). The implicit constants in notation of both types should be assumed to vary from line to line.

2 Character theory and mixing times

We recall some basic facts regarding the character theory of a finite group. A good reference for these is [2].

A conjugation-invariant measure \(\mu \) on finite group G is a class function, which means that it has a ‘Fourier expansion’ expressing \(\mu \) as a linear combination of the irreducible characters X(G) of G. It will be convenient to normalize the Fourier coefficients by setting

$$\begin{aligned} \forall \chi \in X(G), \qquad \hat{\mu }(\chi )= \frac{1}{\chi (1)}\sum _{g \in G}\mu (g)\overline{\chi (g)}. \end{aligned}$$

By orthogonality of characters, the Fourier expansion takes the form

$$\begin{aligned} \mu = \frac{1}{|G|}\sum _{\chi \in X(G)}\chi (1) \hat{\mu }(\chi ) \chi , \end{aligned}$$
(1)

since, writing \(C_x\) for the conjugacy class of \(x \in G\),

$$\begin{aligned} \frac{1}{|G|} \sum _{\chi \in X(G)} \chi (1)\hat{\mu }(\chi ) \chi (x)&= \frac{1}{|G|} \sum _{\chi \in X(G)} \sum _{g \in G} \mu (g) \overline{\chi }(g) \chi (x)\\&= \sum _{g \in C_x} \frac{\mu (g)}{|C_x|}= \mu (x). \end{aligned}$$

In this setting, the Plancherel identity is

$$\begin{aligned} \Vert \mu \Vert _2^2 = \frac{1}{|G|}\sum _{g \in G} |\mu (g)|^2 = \frac{1}{|G|^2}\sum _{\chi \in X(G)} \chi (1)^2 \left| \hat{\mu }(\chi )\right| ^2. \end{aligned}$$
(2)

Note the somewhat non-standard factor of the inverse of the dimension \(\chi (1)^{-1}\) in the Fourier coefficients. The advantage of this choice is that the Fourier map satisfies the familiar property of carrying convolution to pointwise multiplication: for conjugation-invariant measures \(\mu _1\) and \(\mu _2\)

$$\begin{aligned} \forall \chi \in X(G), \qquad \widehat{\mu _1 *\mu _2}(\chi ) = \hat{\mu }_1(\chi ) \cdot \hat{\mu }_2(\chi ). \end{aligned}$$
(3)

This is because the regular representation in \(L^2(G)\) splits as the direct sum over irreducible representations,

$$\begin{aligned} L^2(G) = \bigoplus _{\rho } d_\rho M_\rho , \end{aligned}$$

and convolution by \(\mu _i\) acts as a scalar multiple of the identity on each representation space \(M_{\rho }\), the scalar of proportionality being \(\hat{\mu }_i(\chi _\rho )\).

When \(G = S_n\) is the symmetric group on n letters, the irreducible representations are indexed by partitions of n, with the partition (n) corresponding to the trivial representation, and partition \((1^n)\) corresponding to the sign. Given a conjugacy class C on \(S_n\) and integer \(t \ge 1\), the measure \(\mu _C^{*t}\) is conjugation invariant, and supported on permutations of a fixed sign, odd if both C and t are odd, and otherwise even. We let \(U_t\) be the uniform measure on permutations of this sign:

$$\begin{aligned} \forall \sigma \in S_n, \qquad U_t(\sigma ) = \frac{1 + \mathrm {sgn}(\sigma )\cdot \mathrm {sgn}(C)^t}{n!} \end{aligned}$$

with Fourier coefficients

$$\begin{aligned} \hat{U_t}(\chi ^n) = 1, \qquad \hat{U_t}\left( \chi ^{1^n}\right) = \mathrm {sgn}(C)^t, \qquad \hat{U_t}(\chi ^\lambda ) = 0, \; \lambda \ne n, 1^n. \end{aligned}$$

The total variation distance between \(\mu _C^{*t}\) and \(U_t\) is equal to

$$\begin{aligned} \left\| \mu _C^{*t} - U_t\right\| _{T.V.}&:= \sup _{A \subset S_n} \left| \mu _C^{*t}(A) - U_t(A)\right| \\&=\frac{1}{2}\sum _{\sigma \in S_n} \left| \mu _C^{*t}(\sigma ) - \frac{1 + \mathrm {sgn}(\sigma )\cdot \mathrm {sgn}(C)^t}{n!}\right| . \end{aligned}$$

Thus Cauchy–Schwarz, Plancherel, and the convolution identity give the upper bound

$$\begin{aligned} \left\| \mu _C^{*t}-U_t\right\| _{T.V.} \le \frac{1}{2} \left( \sum _{\lambda \vdash n, \lambda \ne n, 1^n} \left( \chi ^\lambda (1)\right) ^2 \left( \frac{\chi ^\lambda (C)}{\chi ^\lambda (1)}\right) ^{2t}\right) ^{\frac{1}{2}}. \end{aligned}$$
(4)

The main ingredient in the proof of the upper bound of Theorem 2 will thus be the following estimate for character ratios.

Proposition 3

Let C be the class of k cycles on \(S_n\). There exists a constant \(\delta > 0\) and a constant \(c_1> 0\) such that uniformly in n, partition \(\lambda \vdash n\) and all \(2 \le k < \delta n\)

$$\begin{aligned} \left| \frac{\chi ^\lambda (C)}{\chi ^\lambda (1)}\right| ^{\frac{n}{k}(\log n + c_1)} \le \frac{1}{\chi ^\lambda (1)}. \end{aligned}$$

For the deduction of Theorem 2 we will use the following technical result on the dimensions of the irreducible representations of \(S_n\).

Proposition 4

For a sufficiently large fixed constant \(c_2>0\),

$$\begin{aligned} \sum _{\lambda \vdash n} \left( \chi ^\lambda (1)\right) ^{-\frac{c_2}{\log n}} = O(1), \end{aligned}$$

with the estimate uniform in n.

Deduction of mixing time upper bound of Theorem 2 We use the fact that, apart from the trivial and sign representations, the lowest dimensional irreducible representation of \(S_n\) has dimension \(n-1\) (so \(S_n\) is essentially quasi-random). Let \(c_1, c_2\) as above, and let \(C>0\). Then for \(t > \frac{n}{k_n}(\log n + c_1)\left( 1 + \frac{c_2 + C}{2\log n}\right) \) the application of Cauchy–Schwarz above gives

$$\begin{aligned} \left\| \mu _C^{*t}-U_t\right\| _{T.V.}^2 \le \frac{1}{4} \sum _{\lambda \vdash n, \lambda \ne n, 1^n} \left( \chi ^\lambda (1)\right) ^{- \frac{c_2 + C}{\log n}} = O(e^{-C}). \end{aligned}$$

\(\square \)

The proof of Proposition 3 will require some more detailed information regarding the characters of \(S_n\) evaluated at a cycle. We discuss this in the next section. Proposition 4 is deduced from a very useful approximate dimension formula of Larsen–Shalev [10]. The rather technical proof of this proposition is given in Appendix B.

3 Character theory of \(S_n\)

The irreducible representations of \(S_n\) are indexed by partitions of n. Given a partition

$$\begin{aligned} \lambda \vdash n,\qquad \lambda = (\lambda _1 \ge \lambda _2 \ge \cdots \ge \lambda _n \ge 0),\qquad \sum \lambda _i = n, \end{aligned}$$

the dual partition \(\lambda '\) is found by reflecting the diagram of \(\lambda \) along its diagonal. We write \(\chi ^\lambda \) for the character of representation \(\rho ^\lambda \), and \(f^\lambda = \chi ^\lambda (1)\) for the dimension. Let \(\mu = \lambda + (n-1, n-2, \ldots , 0)\). Then the dimension is given by (see e.g. [13])

$$\begin{aligned} f^\lambda = \frac{n!}{\mu _1!\mu _2!\ldots \mu _n!}\prod _{i < j} (\mu _i - \mu _j). \end{aligned}$$
(5)

Setting apart those terms that pertain to \(\lambda _1\), we find

$$\begin{aligned} f^\lambda = \left( {\begin{array}{c}n\\ \lambda _1\end{array}}\right) \prod _{j=2}^n \frac{\lambda _1-\lambda _j + j-1}{\lambda _1 + j-1} f^{\lambda -\lambda _1}. \end{aligned}$$

The product is bounded by 1, and \(\sum _{\lambda \vdash n} (f^\lambda )^2 = n!\), so that we have the bound employed by Diaconis–Shahshahani

$$\begin{aligned} f^\lambda \le \left( {\begin{array}{c}n\\ \lambda _1\end{array}}\right) \sqrt{(n-\lambda _1)!}. \end{aligned}$$
(6)

The trivial representation is \(\rho ^n\) while the sign representation is \(\rho ^{1^n}\), both one-dimensional. \(\rho ^{n-1,1}\) corresponds to the standard representation, which is the irreducible \((n-1)\)-dimensional sub-representation of the representation \(\rho \) in \({\mathbb {R}}^n\) given by

$$\begin{aligned} \rho (\sigma ) \cdot e_i = e_{\sigma (i)}, \qquad \sigma \in S_n. \end{aligned}$$

This representation and its dual are the lowest dimensional non-trivial irreducible representations of \(S_n\).

Given the character \(\chi ^\lambda \), the character of the dual representation is given by

$$\begin{aligned} \chi ^{\lambda '}(\sigma ) = \mathrm {sgn}(\sigma )\cdot \chi ^\lambda (\sigma ). \end{aligned}$$

The characters corresponding to partitions with long first piece have relatively simple interpretations. For instance, if we write \(i_1\) for the number of fixed points and \(i_2\) for the number of 2-cycles in permutation \(\sigma \) then [7, ex. 4.15]

$$\begin{aligned} \chi ^{n-1,1}(\sigma )&= i_1 -1, \nonumber \\ \chi ^{n-2,1,1}(\sigma )&= \frac{1}{2}(i_1-1)(i_1-2) - i_2,\\ \chi ^{n-2,2}(\sigma )&= \frac{1}{2}(i_1-1)(i_1-2) + i_2-1.\nonumber \end{aligned}$$
(7)

It is now immediate that

$$\begin{aligned} \left( \chi ^{n-1,1}\right) ^2 = \chi ^{n-2,2} + \chi ^{n-2, 1, 1} + \chi ^{n-1,1} + \chi ^{n}. \end{aligned}$$
(8)

Also,

$$\begin{aligned} f^n = 1, \;\; f^{n-1,1} = n-1, \;\; f^{n-2,1,1} = \left( {\begin{array}{c}n-1\\ 2\end{array}}\right) , \;\; f^{n-2,2} = \left( {\begin{array}{c}n-1\\ 2\end{array}}\right) - 1. \end{aligned}$$

We use these formulas in the proof of the lower bound of Conjecture 1.

Many properties of partitions are most readily evident in Frobenius notation, and we will use this notation to state our main technical theorem. In Frobenius notation we identify the partition \(\lambda \vdash n\) by drawing the diagonal, say of length m, and measuring the legs that extend horizontally to the right and vertically below the diagonal:

$$\begin{aligned} \lambda= & {} (a_1, a_2, \ldots , a_m| b_1, b_2, \ldots , b_m);\\ a_ i= & {} \lambda _i - i + \frac{1}{2},\qquad b_i = \lambda _i' -i + \frac{1}{2}. \end{aligned}$$

Notice that

$$\begin{aligned} \sum _{i =1}^m a_i + \sum _{i=1}^m b_i = n. \end{aligned}$$

In Appendix B we consider the quantity \(\Delta \), which is the number of boxes contained neither in the square formed by the diagonal nor in the first row. The notation used here is summarized in Fig. 1.

Fig. 1
figure 1

Partition \(\lambda \) in Frobenius notation

We now state our asymptotic evaluation of the character ratio.

Theorem 5

Let n be large, let \(2 \le k \le n\), and let C be the class of k cycles on \(S_n\). Let \(\lambda \vdash n\) be a partition of n with Frobenius notation \(\lambda = (a_1, \ldots , a_m|b_1, \ldots , b_m)\).

  1. (a)

    (Long first row) Let \(0 < \epsilon < \frac{1}{2}\), let \(r = n-\lambda _1\) and suppose that \(r \,+\, k \,+\, 1 < \left( \frac{1}{2} - \epsilon \right) n\). Then

    $$\begin{aligned} \frac{\chi ^\lambda (C)}{f^\lambda }= & {} \frac{(a_1 - \frac{1}{2})^{\underline{k}}}{n^{\underline{k}}} \prod _{j=2}^m \frac{a_1 -a_j - k}{a_1 - a_j} \prod _{j=1}^m \frac{a_1 + b_j}{a_1 + b_j-k} \nonumber \\&+ O_\epsilon \left( \exp \left( k \left[ \log \frac{(1 + \epsilon )(k+1+r)}{n-k} + O_\epsilon \left( r^{-\frac{1}{2}}\right) \right] \right) \right) . \end{aligned}$$
    (9)

    If \(r < k\) then the error term is actually 0.

  2. (b)

    (Large k, short first row and column) Let \(\theta > \frac{2}{3}\). There exists \(\epsilon (\theta )>0\) such that, for all n sufficiently large, for all k with \(6\log n \le k \le \epsilon n\) and for all \(\lambda \vdash n\) such that \(b_1 \le a_1 \le e^{-\theta }n\),

    $$\begin{aligned} \left| \frac{\chi ^\lambda (C)}{f^\lambda }\right| \le e^{\frac{-k}{2}}. \end{aligned}$$
  3. (c)

    (Asymptotic expansion) Let \(0<\epsilon < \frac{1}{2}\) and suppose now that \(k < n^{\frac{1}{2}-\epsilon }\). We have the approximate formula

    $$\begin{aligned} \frac{\chi ^\lambda (C)}{f^\lambda }&= \sum _{a_i > kn^{\frac{1}{2}}} \frac{a_i^k}{n^k}\left( 1 + O_\epsilon \left( \frac{kn^{\frac{3}{4}+\epsilon }}{a_i}\right) \right) \nonumber \\&\quad + (-1)^{k-1}\sum _{ b_i > kn^{\frac{1}{2} }} \frac{b_i^k}{n^k}\left( 1 + O_\epsilon \left( \frac{kn^{\frac{3}{4}+\epsilon }}{b_i}\right) \right) \nonumber \\&\quad + O_\epsilon \left( n^{\frac{1}{2}}(\log n)^2 \left( \frac{k \log ^2 n}{\sqrt{n}}\right) ^k\right) . \end{aligned}$$
    (10)

This Theorem is the main technical result of the paper. Actually it gives more than we will need, and we apply the detailed statement of part (c) only in the case when the cycle length k is relatively short, \(k \le 6 \log n\). The cruder bounds of parts (a) and (b) suffice when the cycle length k is larger.

4 Proof of Theorem 5

When C is the class of k cycles Frobenius [6] (see [7, p. 52 ex. 4.17 b]) proved a famous formula that expresses the character ratio of a given representation at C as the ‘residue at \(\infty \)’ of a meromorphic function depending upon the representation. In Frobenius notation, and using the falling power

$$\begin{aligned} z^{\underline{r}} =z(z-1)\ldots (z-r+1), \end{aligned}$$

the formula is

$$\begin{aligned} \frac{\chi ^\lambda (C)}{f^\lambda }= & {} \frac{-1}{kn^{\underline{k}}}\frac{1}{2\pi i}\oint F_k^{a,b}(z) dz, \quad F_k^{a,b}(z) = \left( z+ \frac{k-1}{2}\right) ^{\underline{k}}F^a_k(z)F^b_k(z)\nonumber \\ F^a_k(z)= & {} \prod _{j=1}^m \frac{z-a_j-\frac{k}{2}}{z-a_j+\frac{k}{2}} , \quad F^b_k(z) = \prod _{j=1}^m \frac{z + b_j + \frac{k}{2}}{z+b_j-\frac{k}{2}} \end{aligned}$$
(11)

where the integration has winding number 1 around each (finite) pole of the integrand. [Note that our \((a_i, b_i)\) correspond to \(\left( b_{m-i+1} + \frac{1}{2}, a_{m-i + 1} + \frac{1}{2}\right) \) of [7], and replace y there with our \(z = y - \frac{k-1}{2}\).]

Our proof of Theorem 5 is an asymptotic estimation of this integral. We first record several properties of \(F_k^{a,b}\) in the following lemma.

Lemma 6

Let \(\lambda = (a_1, \ldots , a_m|b_1, \ldots , b_m)\) be a partition of n in Frobenius notation.

  1. (1)

    Each of \(F_k^{a}(z)\) and \(F_k^b(z)\) has at most \(\sqrt{n}\) poles.

  2. (2)

    If \(k > n\) then \(F_k^{a,b}(z)\) is holomorphic.

Denote by \(\lambda ' = \lambda {\setminus }\lambda _1 = (a_1', \ldots , a_{m'}'|b_1', \ldots , b_{m'}')\) the partition of \(n-\lambda _1\) found by deleting the first row of \(\lambda \).

  1. (3)

    We have

    $$\begin{aligned} F_k^{a,b}(z) \frac{z- a_1 +\frac{k}{2}}{z-a_1 - \frac{k}{2}} = F_k^{a',b'}(z+1). \end{aligned}$$
  2. (4)

    If \(a_1 > n-k - \frac{1}{2}\) and if \(n \ge 2k\) then \(F_k^{a,b}(z)\) has a single simple pole at \(z= a_1 - \frac{k}{2}\).

Proof

The first item follows from the bound for the diagonal \(m \le \sqrt{n}\), since both \(F_k^a\) and \(F_k^b\) are products of m terms.

For (2), first observe that \(a_i\) and \(b_i\) are strictly decreasing so that the poles of \(F_k^a(z)\) are all simple, as are the poles of \(F_k^b(z)\). Since \(k > n\), \(a_i + b_j - k < 0\), so \(a_i - \frac{k}{2} \ne \frac{k}{2} - b_j\). Thus the poles of \(F_k^a(z)F_k^b(z)\) are all simple, and are all cancelled by the factor of \(\left( z + \frac{k-1}{2}\right) ^{\underline{k}}\).

For (3), observe that deleting the first row of the diagram for \(\lambda \) shifts the diagonal down one square. Thus, if \(b_m = \frac{1}{2}\) then \(m' = m-1\) and

$$\begin{aligned} \forall \; 1 \le i \le m',\quad a_{i}' = a_{i+1} + 1, \quad b_i' = b_i-1. \end{aligned}$$

In this case, accounting for the lost factor from \(b_m\),

$$\begin{aligned} F^a_k(z)F^b_k(z)\cdot \frac{z - a_1 + \frac{k}{2}}{z-a_1 - \frac{k}{2}} = F^{a'}_k(z+1)F^{b'}_k(z+1) \frac{z +\frac{k+1}{2}}{z -\frac{k-1}{2}}. \end{aligned}$$
(12)

If instead \(b_m > \frac{1}{2}\) then \(a_m = \frac{1}{2}\), which implies \(m' = m\), and

$$\begin{aligned} \forall \; 1 \le i \le m-1, \quad a_{i}' = a_{i+1} + 1,\; a_m' = \frac{1}{2}, \quad \forall \; 1 \le j \le m, \quad b_j' = b_j-1. \end{aligned}$$

Thus (12) still holds, the new factor accounting for the introduction of \(a_{m}' = \frac{1}{2}\). Thus, in either case,

$$\begin{aligned} F_k^{a,b}(z)&\cdot \frac{z - a_1 + \frac{k}{2}}{z-a_1 - \frac{k}{2}} = \left( z + \frac{k-1}{2}\right) ^{\underline{k}} F^{a'}_k(z+1)F^{b'}_k(z+1) \frac{z +\frac{k+1}{2}}{z -\frac{k-1}{2}}\\&= \left( z+\frac{k+1}{2}\right) ^{\underline{k}} F^{a'}_k(z+1)F^{b'}_k(z+1) \\&= F_k^{a',b'}(z+1). \end{aligned}$$

For (4), the condition \(a_1 > n-k - \frac{1}{2}\) and \(n \ge 2k\) implies that \(a_1 > k -\frac{1}{2}\). Thus, for all i, \(a_1 + b_i > k\), so that \(a_1 - \frac{k}{2} \ne \frac{k}{2} - b_i\), and \(a_1 -\frac{k}{2}\) is a simple pole of \(F_k^a(z)F_k^b(z)\). Furthermore, it is not cancelled by \(\left( z+\frac{k-1}{2}\right) ^{\underline{k}}\), so that \(a_1 - \frac{k}{2}\) is a simple pole of \(F_k^{a,b}(z)\). It is the only pole, since \(F_k^{a',b'}(z)\) has \(n-\lambda _1 < k\), hence is holomorphic. \(\square \)

Parts (a) and (b) of Theorem 5 are more easily proven. We give these proofs immediately, and then prove several lemmas before proving part (c).

Proof of Theorem 5 part (a)

In this part, the first row \(\lambda _1\) of partition \(\lambda \) is significantly larger than the remainder of the partition, of size r. We extract a residue contribution from the pole corresponding to \(\lambda _1\), and bound the remainder of the character ratio by approximating it with a character ratio on \(S_r\).

Observe that \(\sum _{i>1} a_i + \sum _i b_i =n-a_1 = r + \frac{1}{2}\). Recall that we assume \(r + k + 1 \le (\frac{1}{2}-\epsilon )n\), and that this part of the theorem is the asymptotic

$$\begin{aligned} \frac{\chi ^\lambda (C)}{f^\lambda }= & {} \frac{\left( a_1 - \frac{1}{2}\right) ^{\underline{k}}}{n^{\underline{k}}} \prod _{j=2}^m \frac{a_1 -a_j - k}{a_1 - a_j} \prod _{j=1}^m \frac{a_1 + b_j}{a_1 + b_j-k} \nonumber \\&+ O_\epsilon \left( \exp \left( k \left[ \log \frac{(1 + \epsilon )(k+1+r)}{n-k} + O_\epsilon \left( r^{-\frac{1}{2}}\right) \right] \right) \right) , \end{aligned}$$

with the error equal to 0 if \(r <k\).

The poles of \(F_k^{a,b}\) are among the points \(a_i - \frac{k}{2}\), \(\frac{k}{2} - b_i\), \(i = 1, \ldots , k\), some of which may be cancelled by the numerator. The condition \(k+ 1 + r \le \left( \frac{1}{2}-\epsilon \right) n\) guarantees that

$$\begin{aligned} a_1 - \frac{k}{2} = n - r - \frac{k+1}{2} > \left( \frac{1}{2} + \epsilon \right) n. \end{aligned}$$

Since \(a_i, b_j \le r+\frac{1}{2}\) for \(i \ge 2\), \(j \ge 1\), it follows that \(a_1 - \frac{k}{2}\) is the furthest pole from zero of the function \(F^a_k(z)F^b_k(z)\), and that this is a simple pole of \(F_k^{a,b}(z)\).

Set \(R = (1 + \epsilon )\left( r + \frac{k+1}{2}\right) \) and notice

$$\begin{aligned} R < \frac{n}{2} < a_1 - \frac{k}{2}, \qquad R > \max _{i \ge 2, j \ge 1}\left( \left| a_i - \frac{k}{2}\right| , \left| b_j - \frac{k}{2}\right| \right) , \end{aligned}$$

so that \(a_1 - \frac{k}{2}\) is outside the loop \(|z| = R\), while all other poles are inside. Thus

$$\begin{aligned} \frac{\chi ^\lambda (C)}{f^\lambda } = \frac{-1}{kn^{\underline{k}}} \mathrm {Res}_{z = a_1 - \frac{k}{2}} F_k^{a,b}(z) + \frac{-1}{k n^{\underline{k}}} \frac{1}{2\pi i} \int _{|z| = R} F_k^{a,b}(z)dz. \end{aligned}$$
(13)

The residue term is equal to the main term of the theorem, so it remains to bound the integral.

Recall the notation \(\left( a_1', \ldots , a_{m'}'|b_1', \ldots , b_{m'}'\right) = \lambda {\setminus } \lambda _1\), and that

$$\begin{aligned} F_k^{a,b}(z) = \frac{z-a_1 - \frac{k}{2}}{z-a_1+\frac{k}{2}} F_k^{a',b'}(z+1). \end{aligned}$$

Thus we express the integral of (13) as

$$\begin{aligned} \frac{-1}{k n^{\underline{k}}} \frac{1}{2\pi i} \int _{|z| = R} F_k^{a,b}(z) dz= \frac{-1}{kn^{\underline{k}}} \frac{1}{2\pi i} \int _{|z| = R} F_k^{a',b'}(z+1) \frac{z- a_1 - \frac{k}{2}}{z-a_1 + \frac{k}{2}} dz. \end{aligned}$$
(14)

When \(k > r\), \(F_k^{a',b'}(z+1)\) is holomorphic, and so (14) is zero, which proves the latter claim of the theorem. Thus we may now assume that \(r \ge k\).

On the contour \(|z| = R\),

$$\begin{aligned} \left| z- a_1 + \frac{k}{2}\right| \ge a_1 - R - \frac{k}{2} \ge n - (2 + \epsilon )\left( r + \frac{k+1}{2}\right) \gg _\epsilon n, \end{aligned}$$

so that \(\left| \frac{z- a_1 - \frac{k}{2}}{z-a_1 + \frac{k}{2}} -1\right| = O_\epsilon (\frac{k}{n}).\) Thus (14) is equal to (\(C'\) is the k cycle class on \(S_r\))

$$\begin{aligned} \frac{r^{\underline{k}}}{n^{\underline{k}}} \frac{\chi ^{\lambda {\setminus } \lambda _1}(C')}{f^{\lambda {\setminus } \lambda _1}} + O_\epsilon \left( \frac{k}{n} \frac{1}{ k n^{\underline{k}}} \int _{|z| = R} \left| F_{k}^{a',b'}(z+1)\right| d|z|\right) . \end{aligned}$$
(15)

We bound the first term by \(\frac{r^{\underline{k}}}{n^{\underline{k}}}\), since all character ratios are bounded by 1.

To bound the integral, note that \(F^{a'}_k(z)\) and \(F^{b'}_k(z)\) each have at most \(\sqrt{r}\) poles. On the contour \(|z| = R\),

$$\begin{aligned} \left| \frac{z-a_i -\frac{k}{2}}{z-a_i +\frac{k}{2}}\right| \le 1 + O_\epsilon \left( \frac{k}{r}\right) , \end{aligned}$$

and the terms in \(F^{b'}_k(z)\) are bounded similarly. It follows that on \(|z| = R\),

$$\begin{aligned} \left| F^{a'}_k(z+1)F^{b'}_k(z+1)\right| \le \exp \left( O_\epsilon \left( \frac{k}{\sqrt{r}}\right) \right) . \end{aligned}$$

Meanwhile, also on \(|z| = R\),

$$\begin{aligned} \left| \left( z + \frac{k+1}{2}\right) ^{\underline{k}}\right| \le \left( |z| + \frac{k+1}{2}\right) ^k \le \left( (1 + \epsilon )(k+1+r)\right) ^k. \end{aligned}$$

Putting these bounds together, we deduce a bound for the second term of (15) of

$$\begin{aligned}&\frac{1}{n\cdot n^{\underline{k}}}\frac{1}{2\pi }\int _{|z| = R} \left| F^{a',b'}_k(z+1)\right| d|z| \le \frac{1}{n^{\underline{k}} } \sup _{|z| = R} \left| F_k^{a',b'}(z+1)\right| \\&\quad = O_\epsilon \left( \frac{1}{n^{\underline{k}}} \exp \left( k \left[ \log \left( (1 +\epsilon )(r + k + 1)\right) +O_\epsilon \left( \frac{1}{\sqrt{r}}\right) \right] \right) \right) . \end{aligned}$$

To complete the proof of the theorem, use \(n^{\underline{k}} \ge (n-k)^k\) and note that the term \(\frac{r^{\underline{k}}}{n^{\underline{k}}}\) from the character ratio in (15) is trivially absorbed into this error term. \(\square \)

Proof of Theorem 5 part (b)

Choose \(\theta _1\) with \(\frac{2}{3} < \theta _1 < \theta \). Since the poles of \(F^{a,b}_k\) are among \(a_i - \frac{k}{2}\) and \( \frac{k}{2}- b_i\), \(i = 1, 2, 3, \ldots \), choosing \(\epsilon = \epsilon (\theta )>0 \) sufficiently small, \(a_i, b_i <e^{-\theta }n\) and \(k \le \epsilon n\) guarantees that the contour \(|z| =R = e^{-\theta _1} n\) contains all poles of \(F^{a,b}_k(z)\). Thus

$$\begin{aligned} \frac{\chi ^{\lambda }(C)}{f^\lambda } = \frac{-1}{kn^{\underline{k}}}\frac{1}{2\pi i} \int _{|z| = R} F^{a,b}_k(z) dz. \end{aligned}$$

Write \( \frac{1}{k n^{\underline{k}}} F^{a,b}_k(z) =\frac{\left( z + \frac{k-1}{2}\right) ^{\underline{k}}}{k n^{\underline{k}}} F^a_k(z)F^b_k(z)\). Since \(k \le \epsilon n\), for \(|z| = R\),

$$\begin{aligned} \left| \frac{\left( z + \frac{k-1}{2}\right) ^{\underline{k}}}{n^{\underline{k}}}\right| \le \frac{\left( R+\frac{k}{2}\right) ^k}{(n-k)^k} \le \exp \left( k (-\theta _1 + o(1))\right) , \end{aligned}$$

with o(1) indicating a quantity that may be made arbitrarily small with a sufficiently small choice of \(\epsilon \). Also, \(F^a_k(z)\) and \(F^b_k(z)\) are each composed of at most \( \sqrt{n}\) factors, each of size at most \(1 + O_{\theta ,\theta _1,\epsilon }\left( \frac{k}{n}\right) \). We deduce that for \(|z| = R\),

$$\begin{aligned} \left| \frac{1}{k n^{\underline{k}}} F^{a,b}_k(z)\right| \le \exp \left( k\left( -\theta _1 + o_{\theta ,\theta _1}(1) \right) \right) , \end{aligned}$$

the error term requiring that first \(\epsilon \) be small, and then that n be large. The length of the contour is \(O(n)=O \left( \exp (\frac{k}{6})\right) \). Since \(\theta _1 > \frac{1}{2} + \frac{1}{6}\), it follows that the bound \(\exp \left( -\frac{k}{2}\right) \) holds for the character ratio for all n sufficiently large. \(\square \)

We now turn to part (c) of Theorem 5. We will again estimate the integral

$$\begin{aligned} \frac{\chi ^\lambda (C)}{f^\lambda } =\frac{-1}{kn^{\underline{k}}}\frac{1}{2\pi i}\oint F_k^{a,b}(z) dz, \end{aligned}$$
(16)

but the idea now will be to evaluate clumps of poles together. With an appropriate choice of contour, at any given point the poles that are sufficiently far away make an essentially constant contribution to the integrand, and so may be safely removed, simplifying the integral.

Denote by

$$\begin{aligned} {\mathcal {P}} = \left\{ a_j - \frac{k}{2}: 1\le j \le m\right\} \cup \left\{ \frac{k}{2} - b_j: 1 \le j \le m\right\} \end{aligned}$$

the multi-set (that is, set counted with multiplicities) of poles of \(F_k^{a}(z)\) and \(F_k^b(z)\). One trivial fact concerning the distribution of poles in \({\mathcal {P}}\) is the following bound.

Lemma 7

For any \(x > 0\),

$$\begin{aligned} \#\{p \in {\mathcal {P}}: |p| > x\} \le \frac{n + km}{x}. \end{aligned}$$

Indeed, this follows from the fact that

$$\begin{aligned} \sum _{i=1}^m \left| a_i - \frac{k}{2}\right| + \sum _{i=1}^m\left| \frac{k}{2} - b_i\right| \le km + \sum a_i + \sum b_i = km+n. \end{aligned}$$

A useful consequence is that for x real, \(|x| > k\sqrt{n}\), we can always find a real point y nearby x with large distance from all poles in \({\mathcal {P}}\).

Definition 1

Let \(x \in {\mathbb {R}}{\setminus } 0\) and let \(L>0\) be a parameter. We say that a real number y is L -well-spaced for x with respect to \({\mathcal {P}}\) if the bound is satisfied

$$\begin{aligned} \sum _{p \in {\mathcal {P}}: |p-x|< \frac{|x|}{2}}\frac{1}{|p-y|} \le \frac{L}{|x|}. \end{aligned}$$
(17)

The following lemma says that if |x| is sufficiently large, then there are always many points y nearby x that are well-spaced for x with respect to \({\mathcal {P}}\).

Lemma 8

Let \(n > e^5\) and let \(1<k < \sqrt{n}\). Let x be real with \(|x| > k \sqrt{n}\log n\). Then there exists a real y with \(|y-x| < \sqrt{n}\) which is \(4 \sqrt{n} \log n\)-well-spaced for x with respect to \({\mathcal {P}}\).

Proof

Let I be the interval \(I = \{y: |y-x| < \sqrt{n}\}\). By Lemma 7, the interval I contains at most

$$\begin{aligned} \frac{n+km}{|x|-\sqrt{n}} \le \frac{2n}{|x|-\sqrt{n}} \le \frac{2.5\sqrt{n}}{k \log n} \end{aligned}$$

poles of \({\mathcal {P}}\). Deleting the segment of radius k around each pole in \({\mathcal {P}}\cap I\) removes a set of total length at most

$$\begin{aligned} \frac{5 \sqrt{n}}{ \log n} \le \sqrt{n} \end{aligned}$$

leaving \(I'\) with \(|I'|\ge \frac{|I|}{2} = \sqrt{n}\). Now

$$\begin{aligned} \frac{1}{|I'|} \int _{y \in I'} \sum _{p \in {\mathcal {P}}, |p-x|< \frac{|x|}{2}} \frac{dy}{|y-p|}&\le \frac{1}{|I'|} \sum _{p \in {\mathcal {P}}, |p-x|< \frac{|x|}{2}} \int _{u \in I, |u-p|\ge k} \frac{du}{|u-p|}\nonumber \\&\le \frac{1}{|I'|} \sum _{p \in {\mathcal {P}}, |p-x|< \frac{|x|}{2}}2 \times \int _{k}^{\sqrt{n}} \frac{du}{u} \nonumber \\&\le \frac{\log n}{|I'|} \left| \left\{ p \in {\mathcal {P}}: |p-x| < \frac{|x|}{2}\right\} \right| . \end{aligned}$$
(18)

Applying the cardinality bound of Lemma 7 a second time (recall \(|I'|\ge \sqrt{n}\)) we obtain a bound for (18) of

$$\begin{aligned} \frac{\log n}{\sqrt{n}} \frac{ (n + km)}{\frac{|x|}{2}} \le \frac{4 \sqrt{n} \log n}{|x|}. \end{aligned}$$

It follows that a typical point \(y \in I'\) is \(4 \sqrt{n}\log n\)-well-spaced for x. \(\square \)

Recall that \(k \le n^{\frac{1}{2}-\epsilon }\). We now assume, as we may, that n is sufficiently large to guarantee \(k \le \frac{\sqrt{n}}{(\log n)^2}\) and we choose a sequence of well-spaced points at which we partition our integral.

To find our sequence of points, first set

$$\begin{aligned} x_0 = \frac{k}{2}\sqrt{n}(\log n)^2 \end{aligned}$$

and

$$\begin{aligned} x_j = x_0 + 2j \sqrt{n}, \qquad j = 1,2,\ldots \end{aligned}$$

for those j such that \(x_j \le n + 4 \sqrt{n}\). In each interval

$$\begin{aligned} \Big [x_j-\sqrt{n}, x_j + \sqrt{n}\Big ), \qquad j = 0, 1, 2, \ldots \end{aligned}$$

apply Lemma 8 to find \(y_j^+\), a \(4\sqrt{n}\log n\)-well-spaced point for \(x_j\). Also find a point \(y_j^-\) in each interval \(\left( -x_j-\sqrt{n}, - x_j + \sqrt{n}\right] \) which is \(4\sqrt{n}\log n\)-well-spaced for \(-x_j\). Thus we have the disjoint intervals

$$\begin{aligned} I_0 = \left( y_0^-, y_0^+\right) , \qquad I_j^+ = \left[ y_{j-1}^+,y_j^+\right) , \quad I_j^- = \left( y_j^-, y_{j-1}^-\right] , \quad j = 1, 2, 3, \ldots \end{aligned}$$

which together cover

$$\begin{aligned} \left( -n-\sqrt{n}, n+\sqrt{n}\right) . \end{aligned}$$

Let \(q_j^+\) (resp. \(q_j^-, q_0\)) be the number of poles of \({\mathcal {P}}\) contained in \(I_j^+\), (resp. \(I_j^-, I_0\)).

Around each interval \(I_j^+,\) \(j \ge 1\) we draw a rectangular box

$$\begin{aligned} {\mathcal {B}}_j^+ = \left\{ z \in {\mathbb {C}}: y_{j-1}^+ \le \mathfrak {R}(z) \le y_j^+, \; |\mathfrak {I}(z)| \le k q_j^+\right\} \end{aligned}$$

and similarly around \(I_j^-\). If \(q_j^\pm = 0\) then the box may be discarded. We also draw a large box \({\mathcal {B}}_0\) containing the origin, with endpoints at \(y_0^- \pm i k\sqrt{n}, y_0^+ \pm i k\sqrt{n}\). A permissible contour with which to apply integral formula (16) is given by

$$\begin{aligned} \partial {\mathcal {B}}_0 \cup \bigcup _{j\ge 1} \partial {\mathcal {B}}_j^+ \cup \bigcup _{j\ge 1} \partial {\mathcal {B}}_j^-, \end{aligned}$$

each box being positively oriented, see Fig. 2. We use a somewhat shortened version of this contour.

Fig. 2
figure 2

Schematic of the boxes \({\mathcal {B}}_j\). Between successive equally spaced points \(x_j, x_{j+1}\) denoted by (+), point \(y_j\) is chosen to be well-spaced from the set of poles (red). Box \({\mathcal {B}}_j\) has vertical edges at \(\mathfrak {R}(z) = y_{j-1}, y_j\), and height proportional to the number of poles contained (Color figure online)

In what follows we treat only the integrals around the boxes \({\mathcal {B}}_j^+\) and \({\mathcal {B}}_0\), and we drop superscripts (so we write e.g. \(q_j\) for \(q_j^+\), \(I_j\) for \(I_j^+\) etc). The argument may be carried out symmetrically for \({\mathcal {B}}_j^-\). The main proposition is as follows.

Proposition 9

Let \(j \ge 1\). There exists a piecewise linear contour \({\mathcal {C}}_j\), having total length

$$\begin{aligned} \left| {\mathcal {C}}_j\right| \le 12 k q_j n^{\frac{1}{4}}, \end{aligned}$$

such that \({\mathcal {C}}_j\) has winding number 1 around each pole \(p \in {\mathcal {P}} \cap {\mathcal {B}}_j\) and winding number 0 around all poles \(p' \in {\mathcal {P}}{\setminus } {\mathcal {B}}_j\). For \(z\in {\mathcal {C}}_j\), \(F_k^{a,b}(z)\) satisfies

$$\begin{aligned} F^{a,b}_k(z) = \left( x_j + \frac{k-1}{2}\right) ^{\underline{k}} \prod _{p \in {\mathcal {P}} \cap {\mathcal {B}}_j} \frac{z-p- k}{z-p} + O\left( k x_j^{k-1} \sqrt{n} \log n\right) . \end{aligned}$$
(19)

Proof

Recall that \({\mathcal {B}}_j\) is a box containing \(q_j\) poles and surrounding interval \(I_j = [y_{j-1}, y_j)\), with corners at

$$\begin{aligned} y_{j-1} \pm ikq_j, \qquad y_j \pm ikq_j. \end{aligned}$$

Recall also that \(y_j \in \left[ x_j - \sqrt{n}, x_j+\sqrt{n}\right) \) and \(x_j - x_{j-1} = 2\sqrt{n}\), so that \(|y_{j} - y_{j-1}| \le 4 \sqrt{n}\) (Fig. 2).

To form \({\mathcal {C}}_j\) we shorten the contour \(\partial {\mathcal {B}}_j\). At each pole \(p \in {\mathcal {B}}_j\) consider the segment

$$\begin{aligned} S_p = [p-kq_j, p+kq_j]. \end{aligned}$$

If there exists a subinterval J of interval \(I_j\) that is not covered by \(\cup _{p \in {\mathcal {P}}\cap {\mathcal {B}}_j}S_p\) then such J may be discarded from the box \({\mathcal {B}}_j\) by drawing vertical segments at the endpoints of J and deleting the parts of \({\mathcal {B}}_j\) vertically above and below J. Deleting subintervals in this way if it reduces the total perimeter, or allowing them to remain when the perimeter is increased, we arrive at a contour \({\mathcal {C}}_j\) which is the union of some rectangles, and has total length at most

$$\begin{aligned} \left| {\mathcal {C}}_j\right| \le \min \left( 8 \sqrt{n} + 4 k q_j, 8kq_j^2\right) , \end{aligned}$$

the first term bounding the result of doing nothing, and the second bounding the length of a contour with an individual box around every pole, see Fig. 3 for a schematic of the construction of \({\mathcal {C}}_j\). Using \(\min (a,b) \le \sqrt{ab}\), we have the bound

$$\begin{aligned} \left| {\mathcal {C}}_j\right| \le 4k q_j + \min \left( 8 \sqrt{n}, 8kq_j^2\right) \le 4kq_j + 8 k q_j n^{\frac{1}{4}} \le 12 k q_j n^{\frac{1}{4}}. \end{aligned}$$

We now prove the formula (19). Observe that by Lemma 7, the number of poles \(q_j\) in \({\mathcal {P}} \cap {\mathcal {B}}_j\) satisfies \(q_j \ll \frac{n}{x_j} \ll \frac{\sqrt{n}}{k}\), so that each point of \({\mathcal {C}}_j\) has distance \(O\left( \sqrt{n}\right) \) from \(x_j\). Thus

$$\begin{aligned} \forall z \in {\mathcal {C}}_j, \qquad \left( z + \frac{k-1}{2}\right) ^{\underline{k}} = \left( x_j + \frac{k-1}{2}\right) ^{\underline{k}}\left( 1 + O\left( \frac{k\sqrt{n}}{x_j}\right) \right) . \end{aligned}$$

In particular, it follows that

$$\begin{aligned} \forall z \in {\mathcal {C}}_j, \qquad F_k^{a,b}(z) = \left( 1 + O \left( \frac{k \sqrt{n}}{x_j}\right) \right) \left( x_j + \frac{k-1}{2}\right) ^{\underline{k}}F_k^a(z)F_k^b(z). \end{aligned}$$
(20)

We prove

$$\begin{aligned} \forall z \in {\mathcal {C}}_j, \qquad F_k^a(z)F_k^b(z) = \left( 1 + O\left( \frac{k\sqrt{n} \log n}{x_j}\right) \right) \cdot \prod _{p \in {\mathcal {P}} \cap {\mathcal {B}}_j}\frac{z-p-k}{z-p}, \end{aligned}$$
(21)

and

$$\begin{aligned} \forall z \in {\mathcal {C}}_j, \qquad \prod _{p \in {\mathcal {P}} \cap {\mathcal {B}}_j}\frac{z-p-k}{z-p} = O(1). \end{aligned}$$
(22)

Inserted in (20) these combine to prove the formula (19), (use \(\left( x_j + \frac{k-1}{2}\right) ^{\underline{k}} \le x_j^k\)).

We first consider (21). This estimate is equivalent to

$$\begin{aligned} \forall z \in {\mathcal {C}}_j, \qquad \prod _{p \in {\mathcal {P}} {\setminus } {\mathcal {B}}_j} \frac{z - p \mp k}{z-p} = 1 + O\left( \frac{k\sqrt{n} \log n}{x_j}\right) , \end{aligned}$$

the sign determined according as p is a pole from \(F_k^a\) or \(F_k^b\). Say a pole \(p \in {\mathcal {P}} {\setminus } {\mathcal {B}}_j\) is ‘good’ if \(|p-x_j| > \frac{x_j}{2}\) and ‘bad’ otherwise. For \(z \in {\mathcal {C}}_j\),

$$\begin{aligned} \left| z-x_j\right| = O\left( \sqrt{n}\right) < \frac{x_j}{3} \end{aligned}$$

if n is sufficiently large, which implies that for good p, \(|z-p| \ge \frac{x_j}{6}\). Since the total number of poles is \(O\left( \sqrt{n}\right) \), the product over good poles evidently satisfies the bound

$$\begin{aligned} \prod _{p \text { good}} \left( 1 \mp \frac{k}{z-p}\right) = 1 + O\left( \frac{k\sqrt{n}}{x_j}\right) . \end{aligned}$$

Since it is not contained in the box \({\mathcal {B}}_j\), a bad pole p necessarily lives in the complement \({\mathbb {R}}{\setminus } [y_{j-1},y_j]\), and either satisfies \(p > y_j\) and \(|p- x_j|\le \frac{x_j}{2}\) or else \(p < y_{j-1}\), which entails \(\frac{x_j}{2} \le p \le x_j\). Since \(\frac{x_{j-1}}{2} < \frac{x_j}{2}\), and \(\frac{3 x_{j-1}}{2} > x_j\) (at least if n is sufficiently large), it follows that in the second case, \(|p - x_{j-1}| \le \frac{x_{j-1}}{2}\). Thus, for \(z \in {\mathcal {C}}_j\), the fact that \(y_{j-1} \le \mathfrak {R}(z) \le y_j\) implies

$$\begin{aligned} \sum _{p \text { bad},\; p > y_j} \frac{1}{|z-p|} \le \sum _{\begin{array}{c} p > y_j\\ |p-x_j| < \frac{x_j}{2} \end{array}} \frac{1}{p - y_j} \le \frac{4 \sqrt{n} \log n}{x_j} \end{aligned}$$

and

$$\begin{aligned} \sum _{p \text { bad},\; p< y_{j-1}} \frac{1}{|z-p|} \le \sum _{\begin{array}{c} p<y_{j-1}\\ |p -x_{j-1}| < \frac{x_{j-1}}{2} \end{array}} \frac{1}{y_{j-1} - p} \le \frac{4 \sqrt{n}\log n}{x_{j-1}} \le \frac{8 \sqrt{n} \log n}{x_j}, \end{aligned}$$

by invoking the \(4 \sqrt{n}\log n\) well-spaced property of \(y_{j-1}\) and \(y_j\). It follows

$$\begin{aligned} \prod _{p \text { bad}} \left( \frac{z-p \mp k}{z-p}\right) = 1 + O\left( \frac{k\sqrt{n}\log n}{x_j}\right) , \end{aligned}$$

proving (21).

To prove (22) we consider two cases. When either \(\mathfrak {R}(z) = y_{j-1}\) or \(\mathfrak {R}(z) = y_j\), observe that for each \(p \in {\mathcal {P}} \cap {\mathcal {B}}_j\),

$$\begin{aligned} |p - x_j| \le 2 \sqrt{n} \le \frac{x_j}{2}. \end{aligned}$$

Also, since \(x_j-x_{j-1} = 2\sqrt{n}\),

$$\begin{aligned} |p - x_{j-1}| \le 4 \sqrt{n} \le \frac{x_{j-1}}{2} \qquad (n \text { large}). \end{aligned}$$

Thus, invoking the well-spaced property of \(y_{j-1}\) and \(y_j\),

$$\begin{aligned} \sum _{p \in {\mathcal {P}} \cap {\mathcal {B}}_j} \frac{1}{|p-y_{j-1}|} \le \frac{4 \sqrt{n}\log n}{x_{j-1}}, \qquad \sum _{p \in {\mathcal {P}} \cap {\mathcal {B}}_j} \frac{1}{|p - y_j|} \le \frac{4 \sqrt{n} \log n}{x_j} \end{aligned}$$

so that if \(\mathfrak {R}(z) = y_{j-1}\),

$$\begin{aligned} \prod _{p \in {\mathcal {P}} \cap {\mathcal {B}}_j} \left( 1 - \frac{k}{z-p}\right) \le \exp \left( O \left( \frac{k \sqrt{n}\log n}{x_{j-1}}\right) \right) = O(1), \end{aligned}$$

and similarly when \(\mathfrak {R}(z) = y_j\).

When \(\mathfrak {R}(z) \not \in \{y_{j-1}, y_j\}\) then \(|z-p| \ge k q_j\) for each \(p \in {\mathcal {P}}\cap {\mathcal {B}}_j\), so that in this case the bound

$$\begin{aligned} \prod _{p \in {\mathcal {P}} \cap {\mathcal {B}}_j} \left( 1 - \frac{k}{z-p}\right) = O(1) \end{aligned}$$

is immediate from \(|{\mathcal {P}} \cap {\mathcal {B}}_j| = q_j\). Thus (22) holds in either case. \(\square \)

We now bound integration around the box \({\mathcal {B}}_0\).

Fig. 3
figure 3

Schematic of contour \({\mathcal {C}}_j\). Poles in red, box \({\mathcal {B}}_j\) in blue, \({\mathcal {C}}_j\) in green (Color figure online)

Lemma 10

Assume \(n > e^5\). We have

$$\begin{aligned} \frac{-1}{k n^{\underline{k}}} \frac{1}{2\pi i} \oint _{\partial {\mathcal {B}}_0} F^{a,b}_k(z) dz = O\left( \sqrt{n} (\log n)^2 \left( \frac{k\log ^2 n}{n^{\frac{1}{2}}}\right) ^k \right) . \end{aligned}$$

Proof

Since \({\mathcal {B}}_0\) is a box with corners at \(y_0^{\pm } \pm ik\sqrt{n}\), the integral has length \(O\left( k\sqrt{n}(\log n)^2\right) \), so it will suffice to prove

$$\begin{aligned} \forall z \in \partial {\mathcal {B}}_0, \qquad \left| \frac{F_k^{a,b}(z)}{k n^{\underline{k}}} \right| = O\left( \frac{1}{k}\left( \frac{k\log ^2 n}{n^{\frac{1}{2}}}\right) ^k\right) . \end{aligned}$$

Recall that

$$\begin{aligned} F_{k}^{a,b}(z) = \left( z + \frac{k-1}{2}\right) ^{\underline{k}} F_k^a(z)F_k^b(z). \end{aligned}$$

Recall also that \(\left| y_0^{\pm } \mp x_0\right| \le \sqrt{n}\) and \(x_0 = \frac{k}{2}\sqrt{n}(\log n)^2\). Thus, on \(\partial {\mathcal {B}}_0\) we have \(|\mathfrak {R}z| \le \frac{k}{2}\sqrt{n} (\log n)^2 + \sqrt{n}\). Hence, using \(k < \sqrt{n}\),

$$\begin{aligned} \left( z + \frac{k-1}{2}\right) ^{\underline{k}}&\le \left( |\mathfrak {R}(z)| + |\mathfrak {I}(z)| + \frac{k-1}{2} \right) ^k \\&\le \left( \frac{k}{2}\sqrt{n} (\log n)^2 + 2\sqrt{n} + k \sqrt{n}\right) ^k \\&\le \left( k \sqrt{n}(\log n)^2\right) ^k, \end{aligned}$$

the last bound requiring \(k+2 \le \frac{k}{2}(\log n)^2\), which plainly holds for \(n > e^5\). Since \(n^{\underline{k}} \gg n^k\) when \(k < \sqrt{n}\), we obtain the required bound for \(\sup _{z \in \partial {\mathcal {B}}_0} \left| \frac{F_{k}^{a,b}(z)}{kn^{\underline{k}}}\right| \) by checking that

$$\begin{aligned} \sup _{z \in \partial {\mathcal {B}}_0}\left| F_k^a(z)F_k^b(z)\right| = O(1). \end{aligned}$$

To do so, for each \(p \in {\mathcal {P}}\) write the corresponding factor of \(F^a_k(z)\) or \(F^b_k(z)\) as

$$\begin{aligned} \frac{ z-p\pm k}{z-p} = 1 \pm \frac{k}{z-p}. \end{aligned}$$

For a given \(z \in \partial {\mathcal {B}}_0\), say the pole \(p \in {\mathcal {P}}\) is ‘good’ for z if \(|z-p| \ge k \sqrt{n}\). Since the total number of poles in \({\mathcal {P}}\) is \(O\left( \sqrt{n}\right) \), the part of the product in \(F^a_k(z)F^b_k(z)\) contributed by good poles is bounded by O(1), so we may consider only the bad poles.

Bad poles exist only if z is on one of the two vertical sides of the box, that is \(\mathfrak {R}(z) = y_0^{\pm }\). Notice that, e.g.

$$\begin{aligned} \left| p-y_0^+\right| < k\sqrt{n} \Rightarrow \left| p - x_0\right| < (k+1)\sqrt{n} < \frac{k}{4}\sqrt{n}(\log n)^2 = \frac{x_0}{2} \end{aligned}$$

(similarly if p is near \(y_0^-\)), and so bad poles satisfy \(\min (|p + x_0|, |p - x_0|) < \frac{x_0}{2}\).

By the well-spaced property of \(y_0^\pm \), when \(\mathfrak {R}(z) = y_0^\pm \) we have

$$\begin{aligned} \sum _{p \text { bad}} \frac{1}{|z - p|}&\le \sum _{p \text { bad}, p>0} \frac{1}{\left| y_0^+ - p\right| } + \sum _{p \text { bad}, p < 0} \frac{1}{\left| y_0^- - p\right| }\\&\le \frac{8 \sqrt{n} \log n}{x_0} \le \frac{16}{k \log n}. \end{aligned}$$

It follows that

$$\begin{aligned} \prod _{p \text { bad}}\left( 1 + \frac{1}{|z-p|}\right) = O(1). \end{aligned}$$

\(\square \)

We now have all of the estimates that we need to complete the proof of Theorem 5. We use the following integral formula.

Lemma 11

Let \(p_1, p_2, \ldots , p_s \in {\mathbb {C}}\) be a sequence of poles and let d be an arbitrary complex number. Then

$$\begin{aligned} I(p_1, \ldots , p_s, d) = \frac{1}{2\pi i}\oint \prod _{j=1}^s \frac{z-p_j +d}{z-p_j} = sd \end{aligned}$$

where the contour is such that it has winding number one about each pole.

Proof

The integral is independent of the appropriate contour, so take the contour to be a large loop containing the poles. We may write the integrand as \(\prod _{j=1}^s \left( 1 + \frac{d}{z-p_j}\right) .\) Differentiating the integrand with respect to \(p_j\) results in a rational function having total degree \(-2\). It follows that \(\frac{\partial }{\partial p_j} I(p_1, \ldots , p_s, d) = 0\) for each j, since the integral can be made arbitrarily small by taking the contour to be sufficiently large. Taking \(p_j = 0\) for each j, we find that the residue at 0 is sd. \(\square \)

Proof of Theorem 5 part (c)

Write

$$\begin{aligned} \frac{\chi ^\lambda (C)}{f^\lambda } = \frac{-1}{kn^{\underline{k}}} \frac{1}{2\pi i}\left\{ \oint _{\partial {\mathcal {B}}_0} + \sum _{j \ge 1} \oint _{{\mathcal {C}}_j^+} + \sum _{j \ge 1} \oint _{{\mathcal {C}}_j^-}\right\} F_k^{a,b}(z) dz. \end{aligned}$$

The contribution from \(\partial {\mathcal {B}}_0\) is bounded in Lemma 10 by

$$\begin{aligned} \frac{-1}{kn^{\underline{k}}} \frac{1}{2\pi i} \oint _{\partial {\mathcal {B}}_0} F^{a,b}_k(z) dz = O\left( \sqrt{n} (\log n)^2 \left( \frac{k\log ^2 n}{n^{\frac{1}{2}}}\right) ^k \right) . \end{aligned}$$

which may be absorbed into the error term of the theorem.

For each \(j \ge 1\), write the contribution from the \(q_j^+\) poles inside \({\mathcal {C}}_j^+\) as

$$\begin{aligned} \frac{1}{2\pi i} \int _{{\mathcal {C}}_j^+} \left[ \frac{-\left( x_j + \frac{k-1}{2}\right) ^{\underline{k}}}{k n^{\underline{k}}} \prod _{p \in {\mathcal {P}} \cap {\mathcal {B}}_j} \frac{z-p- k}{z-p} + O\left( \left( \frac{x_j}{n}\right) ^{k-1} \frac{\log n}{\sqrt{n}}\right) \right] dz. \end{aligned}$$

For the main term of this integral, Lemma 11 gives the evaluation

$$\begin{aligned}&\frac{1}{2\pi i} \int _{{\mathcal {C}}_j^+} \frac{-\left( x_j + \frac{k-1}{2}\right) ^{\underline{k}}}{k n^{\underline{k}}} \prod _{p \in {\mathcal {P}} \cap {\mathcal {B}}_j} \frac{z-p- k}{z-p} dz = q_j \frac{\left( x_j + \frac{k-1}{2}\right) ^{\underline{k}}}{n^{\underline{k}}}\\&\quad = \sum _{a_i: a_i - \frac{k}{2} \in {\mathcal {B}}_j^+} \left( \frac{a_i}{n}\right) ^k \left( 1 + O\left( \frac{k\sqrt{n}}{a_i}\right) \right) , \end{aligned}$$

the error resulting since each of the k terms of \(\left( x+\frac{k-1}{2}\right) ^{\underline{k}}\)  is equal to \(a_i + O(\sqrt{n})\)—we use that \(a_i = \Omega \left( k\sqrt{n}\right) \) here. Had we considered \({\mathcal {C}}_j^-\) rather than \({\mathcal {C}}_j^+\) this main term would involve a sum over the \(b_i\), with an appropriate sign factor.

The integral of the error term over \({\mathcal {C}}_j\) is bounded trivially by using \(\left| {\mathcal {C}}_j\right| = O\left( k q_j n^{\frac{1}{4}}\right) \), which gives

$$\begin{aligned} O\left( k q_j n^{\frac{1}{4}} \cdot \left( \frac{x_j}{n}\right) ^{k-1} \frac{\log n}{\sqrt{n}} \right) . \end{aligned}$$

The ratio of the error from \({\mathcal {C}}_j\) to the corresponding main term is

$$\begin{aligned} O\left( \frac{k n^{\frac{3}{4}} \log n}{x_j}\right) , \end{aligned}$$

and since each \(a_i\) in the sum above is within constants of \(x_j\), this proves the Theorem.

Note that in the claimed formula of the Theorem, the sums over \(a_i\) and \(b_j\) contain terms for which \(k\sqrt{n} < a_i \le y_0^+ + \frac{k}{2}\) and \(k \sqrt{n} < b_j \le -y_0^- + \frac{k}{2}\), which do not appear in our evaluation. However, only a bound, rather than an asymptotic, is claimed in this range, so that the theorem is valid with the extra terms discarded. \(\square \)

5 Proof of upper bound for Theorem 2

We now prove the upper bound on mixing time from Theorem 2 by proving Proposition 3. Recall that this proposition is a bound for character ratios at class C of k cycles,

$$\begin{aligned} \left| \frac{\chi ^\lambda (C)}{f^\lambda }\right| ^{\frac{n}{k}(\log n + c)} \le \frac{1}{f^\lambda }, \end{aligned}$$
(23)

uniformly for k less than a fixed constant times n and for all \(\lambda \vdash n\).

Before proving the estimate, we collect together several observations. First note that the character ratio bound is trivial for the one-dimensional representations \(\lambda = (n), (1^n)\). Also, exchanging \(\lambda \) with its dual \(\lambda '\) leaves the dimension \(f^\lambda \) unchanged, and at most changes the sign of \(\chi ^\lambda (C)\), so we will assume without loss of generality that \(\lambda \) has \(a_1 \ge b_1\). We set \(r = n-\lambda _1 = n-a_1 - \frac{1}{2}\).

Lemma 12

(Criterion lemma) To prove the character ratio bound (23), it suffices to prove that for all n larger than the maximum of k and a fixed constant, and for a sufficiently large \(c > 0\), that for all non-trivial \(\lambda \) with \(a_1 \ge b_1\),

$$\begin{aligned} \log \left| \frac{\chi ^\lambda (C)}{f^\lambda }\right| \le \max \left( \frac{-kr}{n} + \frac{kr \log r}{2n \log n} + \frac{ckr}{n\log n}, \frac{-k}{2} + \frac{ck}{\log n}\right) . \end{aligned}$$

Proof

By the bound (6) for the dimension, we have (use \(\log r! \ge r \log \frac{r}{e}\))

$$\begin{aligned} f^\lambda \le \left( {\begin{array}{c}n\\ r\end{array}}\right) \sqrt{r!} \le \frac{n^r}{\sqrt{r!}}\le \exp \left( r \log n - \frac{1}{2}\left( r \log \frac{r}{e}\right) \right) \end{aligned}$$

so that a bound of

$$\begin{aligned} \log \left| \frac{\chi ^\lambda (C)}{f^\lambda }\right| \le \frac{-kr}{n} + \frac{kr \log r}{2 n \log n} + \frac{c kr}{n \log n} \end{aligned}$$

is sufficient.

On the other hand, for all \(\lambda \), \(f^\lambda \le \sqrt{n!}\), so that a bound of

$$\begin{aligned} \log \left| \frac{\chi ^\lambda (C)}{f^\lambda }\right| \le \frac{-k}{2} + \frac{ck}{\log n} \end{aligned}$$

also suffices. \(\square \)

Our proof splits into two cases depending upon whether \(k \ge 6\log n\). The essential tool in both cases will be the evaluation of the character ratio from Theorem 5. For large k we will use only parts (a) and (b) of that theorem. Recall that part (a) had a main term equal to

$$\begin{aligned} {\text {MT}}:= \frac{\left( a_1 - \frac{1}{2}\right) ^{\underline{k}}}{n^{\underline{k}}} \prod _{j=2}^m \frac{a_1 -a_j - k}{a_1 - a_j} \prod _{j=1}^m \frac{a_1 + b_j}{a_1 + b_j-k}. \end{aligned}$$

Lemma 13

Assume \(k + r+1 < \frac{n}{2}\). We have the bound \( {\text {MT}}\le \exp \left( \frac{-kr}{n}\right) . \)

Proof

Recall \(a_1 = n-r-\frac{1}{2}\). We estimate

$$\begin{aligned} {\text {MT}}&\le \prod _{i=1}^k \left( \frac{n-r-i}{n+1-i}\right) \prod _{j=2}^m \left( 1 - \frac{k}{n-r}\right) \prod _{j=1}^m\left( 1 + \frac{k}{n-k-r}\right) \\&\le \frac{n-r}{n-k-r} \prod _{i=1}^k \frac{n-r-i}{n-i+1} \\&= \prod _{i=1}^k \frac{n-r-i+1}{n-i+1} \le \left( 1-\frac{r}{n}\right) ^k \le \exp \left( \frac{-kr}{n}\right) . \end{aligned}$$

\(\square \)

Proof of Proposition 3 when \(6 \log n \le k \le \delta n\) When \(r > 0.49 n\) we have \(a_1 < 0.51 n\). Let \(\theta = 0.67\), and note that \(e^{-\theta } > 0.511\). Thus \(a_1 < e^{-\theta }n\) so that part (b) of Theorem 5 guarantees that there exists \(\delta =\epsilon (0.67)>0\), such that, for n larger than a fixed constant, for all \(6 \log n \le k \le \delta n\), \(\left| \frac{\chi ^\lambda (C)}{f^\lambda }\right| \le e^{\frac{-k}{2}}.\) Thus the second condition of Lemma 12 is satisfied.

So we may suppose that \(r \le 0.49 n\) and appeal to part (a) of Theorem 5. Suppose that \(\delta \) is sufficiently small so that \(r + k + 1 < \frac{n}{2}\). If \(r < k\) then the error term of part (a) of Theorem 5 is zero so that the previous lemma implies

$$\begin{aligned} \left| \frac{\chi ^\lambda (C)}{f^\lambda }\right| = {\text {MT}}\le \exp \left( \frac{-kr}{n}\right) . \end{aligned}$$

Thus the first criterion of Lemma 12 is satisfied with \(c = 0\).

Assume now that \(r \ge k\). Let now \(\delta \) sufficiently small so that we may choose \(\epsilon = \frac{1}{200}\) in part (a), that is, \(r + k + 1 \le \left( \frac{1}{2} - \frac{1}{200}\right) n\), and also assume that \(\frac{\left( 1 + \epsilon \right) \left( k + r+1\right) }{n-k} < 0.5 - \eta \) for some fixed \(\eta > 0\). Then part (a) gives \(\left| \frac{\chi ^\lambda (C)}{f^\lambda }\right| \le {\text {MT}}+ {\text {ET}}\) with

$$\begin{aligned} {\text {ET}}\ll \exp \left( k \left[ \log \frac{(1 + \epsilon ) (k+r+1)}{n-k} + O_\epsilon \left( r^{\frac{-1}{2}}\right) \right] \right) \le 2^{-k} \; (n \text { large}). \end{aligned}$$

Since \(r < \frac{n}{2}\) we deduce that

$$\begin{aligned} {\text {MT}}+ {\text {ET}}&\le \exp \left( \frac{-kr}{n}\right) + \exp \left( -k \log 2\right) \\&\le \exp \left( \frac{-kr}{n}\right) \left( 1 + \exp \left( -k\left( \log 2 - \frac{1}{2}\right) \right) \right) . \end{aligned}$$

Since \(k \ge 6 \log n\) and \(6(\log 2 - .5) >1.15\) we deduce

$$\begin{aligned} \log \left| \frac{\chi ^\lambda (C)}{f^\lambda }\right| \le \log ({\text {MT}}+ {\text {ET}}) \le \frac{-kr}{n} + O\left( n^{-1.15}\right) , \end{aligned}$$

so that the first criteria of Lemma 12 is satisfied. \(\square \)

When \(k \le 6\log n\) we make essential use of the asymptotic evaluation of the character ratio proved in part (c) of Theorem 5. The next lemma shows that we may restrict attention to only the main term of that evaluation.

Lemma 14

(Small k criterion lemma) Let \(2 \le k \le 6 \log n\). We have the bound

$$\begin{aligned} \left| \frac{\chi ^\lambda (C)}{f^\lambda }\right| \le \left( 1 + O \left( \frac{\log n}{n^{\frac{1}{4}}}\right) \right) \left[ \sum _{a_i > k n^{\frac{1}{2}}} \frac{a_i^k}{n^k} + \sum _{b_i > k n^{\frac{1}{2}}} \frac{b_i^k}{n^k}\right] + O\left( \frac{e^{-k}(\log n)^4}{n^{\frac{1}{4}}}\right) . \end{aligned}$$

In particular, if \(r > n^{\frac{5}{6}}\) and n is sufficiently large, and if

$$\begin{aligned}&\log \left( \sum _{a_i > k n^{\frac{1}{2}}} \frac{a_i^k}{n^k} + \sum _{b_i > k n^{\frac{1}{2}}} \frac{b_i^k}{n^k}\right) \\&\qquad \le \max \left( \frac{-kr}{n} + \frac{kr \log r}{2n \log n} + \frac{ckr}{n \log n}, \frac{-k}{2} + \frac{ck}{\log n}\right) \end{aligned}$$

then after changing constants, the same estimate holds for \(\log \left| \frac{\chi ^\lambda (C)}{f^\lambda }\right| \), so that the condition of Lemma 12 is satisfied.

Proof

To deduce the second statement from the first, note that

$$\begin{aligned} \log \left( 1 + O\left( \frac{\log n}{n^{\frac{1}{4}}}\right) \right) = O\left( \frac{\log n}{n^{\frac{1}{4}}}\right) , \end{aligned}$$

which, for \(r > n^{\frac{5}{6}}\) may be absorbed into the RHS of the second statement by increasing the value of c. Regarding the error of \(O\left( \frac{e^{-k}(\log n)^4}{n^{\frac{1}{4}}}\right) \), it is no loss of generality to assume that

$$\begin{aligned} \sum \frac{a_i^k}{n^k} + \sum \frac{b_i^k}{n^k} \ge e^{\frac{-k}{2}}, \end{aligned}$$

so that this error term has relative size \(1 + O \left( \frac{(\log n)^4e^{-\frac{k}{2}}}{n^{\frac{1}{4}}}\right) \). Again, the logarithm of this error may be absorbed by increasing c.

To prove the first statement, part (c) of Theorem 5 gives

$$\begin{aligned} \left| \frac{\chi ^\lambda (C)}{f^\lambda }\right|&\le \sum _{a_i > kn^{\frac{1}{2}}} \frac{a_i^k}{n^k}\left( 1 + O\left( \frac{kn^{\frac{3}{4}}}{a_i}\right) \right) + \sum _{b_i > k n^{\frac{1}{2}}} \frac{b_i^k}{n^k}\left( 1 + O \left( \frac{kn^{\frac{3}{4}}}{b_i}\right) \right) \\&\quad + O\left( n^{\frac{1}{2}}(\log n)^2\left( \frac{k \log n}{\sqrt{n}}\right) ^k\right) . \end{aligned}$$

For \(2 \le k \le 6 \log n\), the last error term is plainly \(O\left( \frac{e^{-k}(\log n)^4}{\sqrt{n}}\right) \). Split the sum over \(a_i\) according to \(a_i \ge \frac{n}{e^2}\). Thus the sum over \(a_i\) is bounded by

$$\begin{aligned}&\left( 1 + O\left( \frac{\log n}{n^{\frac{1}{4}}}\right) \right) \sum _{a_i \ge \frac{n}{e^2}} \frac{a_i^k}{n^k} + \sum _{kn^{\frac{1}{2}} < a_i < \frac{n}{e^2}} \frac{ a_i^k}{n^k} \\&\quad + O\left( \frac{ e^{-k}}{n^{\frac{1}{4}}} \sum _{kn^{\frac{1}{2}} < a_i < \frac{n}{e^2}} \frac{k(e a_i)^{k-1}}{n^{k-1}}\right) . \end{aligned}$$

In the last sum, \(k \left( \frac{ e a_i}{n}\right) ^{k-1}\) is minimized at \(k=2\). Thus

$$\begin{aligned} \sum _{kn^{\frac{1}{2}} < a_i < \frac{n}{e^2}} \frac{k(e a_i)^{k-1}}{n^{k-1}} \le 2e \sum _{a_i} \frac{a_i}{n} = O(1). \end{aligned}$$

Handling the sum over \(b_i\) in the same way proves the lemma. \(\square \)

Recall that we require \(a_1 \ge b_1\) and that \(a_1 = n-r-\frac{1}{2}\). Thinking of r as fixed, set \(\delta := \frac{r}{n}\). Since \(\sum a_i + \sum b_i = n\), an upper bound for \(\sum \frac{a_i^k}{n^k} + \sum \frac{b_i^k}{n^k}\) is given by the solution to the following optimization problem.

Let \(x_1, x_2, x_3, \ldots \) be real variables and let \(k \ge 2\).

$$\begin{aligned} \text {maximize}{:}&\qquad \sum x_i^k \\ \text {subject to}{:}&\qquad 0 \le x_i \le 1-\delta , \; \sum x_i = 1 . \end{aligned}$$

Let \(\ell = \left\lfloor (1-\delta )^{-1} \right\rfloor .\) It is easily checked by varying parameters that an optimal solution of this problem has \(x_1 = \cdots = x_\ell = 1-\delta \), and \(x_{\ell +1} = 1 - \ell (1-\delta )\), \(x_i = 0\) for \(i > \ell +1\), which yields the maximum

$$\begin{aligned} \ell (1-\delta )^k + (1- \ell (1-\delta ))^k \le (1-\delta )^{k-1} \end{aligned}$$

\((1-\delta )^{k-1}\) being the solution of the continuous analogue of the optimization problem

$$\begin{aligned} \text {maximize}{:}&\qquad \Vert f\Vert _{L^k({\mathbb {R}})}^k\\ \text {subject to}{:}&\qquad \Vert f\Vert _{L^1({\mathbb {R}})} =1, \; \Vert f\Vert _{L^\infty ({\mathbb {R}})} \le 1-\delta , \end{aligned}$$

which is less constrained.

Lemma 15

Let \(k \ge 2\) and let \(r = n-a_1 -\frac{1}{2}\). Assume \(a_1 \ge b_1\). There exists a \(c_0 > 0\) such that if \(r \le c_0 n\) then

$$\begin{aligned} \sum \frac{a_i^k}{n^k} + \sum \frac{b_i^k}{n^k} \le e^{-\frac{kr}{n} + \frac{kr^2}{n^2}}. \end{aligned}$$

For all \(r \le n\) we have

$$\begin{aligned} \sum \frac{a_i^k}{n^k} + \sum \frac{b_i^k}{n^k} \le e^{-\frac{kr}{2n}}. \end{aligned}$$

Proof

Set \(\delta =\frac{r}{n} \le c_0\). We may assume that \(c_0 \le \frac{1}{2}\). Then the first bound for the maximum reduces to \(\delta ^k + (1-\delta )^k\) and we require the statement

$$\begin{aligned} \forall \; 0 \le \delta \le c_0, \quad \left( \delta ^k + (1-\delta )^k\right) ^{\frac{1}{k}} \le e^{-\delta + \delta ^2}. \end{aligned}$$

Write the left hand side as \((1-\delta ) \left( 1 + (\frac{\delta }{1-\delta })^k\right) ^{\frac{1}{k}}\). Then it is readily checked with calculus that this is decreasing in k, hence maximized at \(k = 2\). Since

$$\begin{aligned} (1-\delta )^2 + \delta ^2 = 1 - 2\delta + 2 \delta ^2, \qquad e^{-2\delta +2 \delta ^2} = 1-2\delta +4\delta ^2 + O(\delta ^3) \end{aligned}$$

it follows that the left is bounded by the right for \(\delta \) sufficiently small.

For the second statement, we use the second bound of the maximum, so that we need to show \((1-\delta )^{k-1} \le e^{-\frac{k\delta }{2}}\). The worst case is \(k = 2\), which reduces to the true inequality \((1-\delta ) \le e^{-\delta }\) for \(0 \le \delta \le 1\). \(\square \)

We can now prove the case \(2 \le k \le 6\log n\) of Proposition 3.

Proof of Proposition 3 for \(k \le 6\log n\) As usual, assume \(a_1 \ge b_1\). As in the proof of the Proposition in the case \(k > 6 \log n\), we write \(\lambda _1 = a_1 + \frac{1}{2} = n-r\).

For \(r \le n^{\frac{5}{6}}\) we appeal to part (a) of Theorem 5, with the main term \({\text {MT}}\) from Lemma 13. This gives

$$\begin{aligned} \left| \frac{\chi ^\lambda (C)}{f^\lambda }\right| \le e^{\frac{-kr}{n}} + O \left( \exp \left( -k\left( \log \frac{n}{r} +O(1)\right) \right) \right) . \end{aligned}$$

In this range, \(\exp \left( -\frac{kr}{n}\right) = 1 - \frac{kr}{n} + O\left( \left( \frac{kr}{n}\right) ^2\right) = 1-o(1)\), so that, since \(k \ge 2\), the error term is negligible, and the condition of the first criterion lemma, Lemma 12, is satisfied.

Now suppose \(r > n^{\frac{5}{6}}\). Let \(c_0\) be the constant from Lemma 15. Let \(c_1\) be a constant, such that, for \(r \le c_1 n\), \(\frac{r}{\log r} \le \frac{n}{2\log n}\). Notice that this implies \(\frac{r^2}{n^2} \le \frac{r \log r}{2n \log n}\). For \(r \le \min (c_0, c_1) n\), Lemma 15 implies that

$$\begin{aligned} \log \left( \sum \frac{a_i^k}{n^k} + \sum \frac{b_i^k}{n^k}\right) \le \frac{-kr}{n} + \frac{k r^2}{n^2} \le \frac{-kr}{n} + \frac{kr\log r}{2n\log n}, \end{aligned}$$

so that this quantity is bounded by the first term in the maximum of the Small k criterion lemma, Lemma 14. Thus we may assume that \(r > \min (c_0, c_1)n\). In this case, using that \(\log r = \log n + O(1)\), the second bound of Lemma 15 implies that for a sufficiently large constant c

$$\begin{aligned} \log \left( \sum \frac{a_i^k}{n^k} + \sum \frac{b_i^k}{n^k}\right) \le \frac{-kr}{2n} \le \frac{-kr}{n} + \frac{kr \log r}{2 n \log n} + \frac{ c kr}{n \log n}. \end{aligned}$$

Thus again this is bounded by the first term in the maximum of Lemma 14. \(\square \)