Abstract
Consider Young diagrams of n boxes distributed according to the Plancherel measure. So those diagrams could be the output of the RSK algorithm, when applied to random permutations of the set \(\{1,\ldots ,n\}\). Here we are interested in asymptotics, as \(n\rightarrow \infty \), of expectations of certain functions of random Young diagrams, such as the number of bumping steps of the RSK algorithm that leads to that diagram, the side length of its Durfee square, or the logarithm of its probability. We can express these functions in terms of hook lengths or contents of the boxes of the diagram, which opens the door for application of known polynomiality results for Plancherel averages. We thus obtain representations of expectations as binomial convolutions, that can be further analyzed with the help of Rice’s integral or Poisson generating functions. Among our results is a very explicit expression for the constant appearing in the almost equipartition property of the Plancherel measure.
Similar content being viewed by others
1 Introduction
We identify Young diagrams (sets consisting of left aligned decreasingly ordered rows of square boxes) with partitions \(\lambda =(\lambda _1,\ldots ,\lambda _k)\) with \(\lambda _1\ge \lambda _2\ge \cdots \ge \lambda _k\), and denote \(|\lambda |=\lambda _1+\ldots +\lambda _k\). The notation \(\lambda \vdash n\) then signifies that \(\lambda \) is a partition of n, i.e., \(|\lambda |=n\). We let \(Y(\pi )=Y_\lambda :=\sum _{\ell =1}^k\lambda _\ell (\ell -1)\) denote the number of bumping steps of the Robinson–Schensted algorithm (see Figs. 1 and 2) when applied to a permutation \(\pi \) that is mapped to a pair of standard Young tableaux of shape \(\lambda \). A standard Young tableau is a Young diagram \(\lambda \) filled with numbers \(1,\ldots ,|\lambda |\) in a way such that numbers in each row and each column are increasing. See e.g. [23, Sect. 1.6] or [24, Sect. 3.1] for nice expositions of the algorithm and references to the original articles by Gilbert de Beauregard Robinson, by Craige Eugene Schensted, and by Donald Ervin Knuth, who significantly widened the scope of the algorithm, the abbreviation with reference to all three authors, RSK algorithm, now frequently being used also to refer to the original Robinson–Schensted algorithm.
We denote by \(Y_n\) the restriction of \(Y(\pi )\) to permutations of the set \(\{1,2,\ldots ,n\}\) chosen uniformly at random. The Young diagrams \(\lambda \) obtained by the RSK algorithm are then distributed according to the nth Plancherel measure, i.e., \(I \!P\, \!l^{(n)}(\lambda )=\frac{f_\lambda ^2}{n!}=\frac{n!}{p_\lambda ^2}\), where \(f_\lambda \) is the number of standard Young tableaux of shape \(\lambda \), satisfying \(f_\lambda =\frac{n!}{p_\lambda }\), and where \(p_\lambda :=\prod _{u\in \lambda }h_u\) denotes the product of the hook lengths of the diagram \(\lambda \), see [7]. Here the hook length \(h_u\) of a particular box u of \(\lambda \) is one more than the number of boxes to the right of u plus the number of boxes below u. Note that \(Y_\lambda \) has also the meaning of \(|\lambda |\) times the y-coordinate of the barycenter of the set
which is just the set of lower left corners of the boxes of \(\lambda \) in French notation, which addresses boxes by Cartesian coordinates of the first quadrant. Apart from here we always stick to English notation with its matrix style indexing of boxes. Note that \(X_\lambda \), the x-coordinate of the barycenter of \(S_\lambda \), is given by \(Y_{\lambda '}\), where \(\lambda '\) is the partition conjugate to \(\lambda \), its parts being defined by \(\lambda '_j:=|\{i:\lambda _i\ge j\}|\). Stated differently, \(\lambda \) and \(\lambda '\) are mirror images of one another with respect to the main diagonal (upper left to lower right). The sets of hook lengths are therefore the same for \(\lambda \) and \(\lambda '\), which yields invariance of Plancherel measure under conjugation. Thus \(X_n\) and \(Y_n\) are identical in distribution. This allows for a representation of \(I \!E\, Y_n\) and \(Var\, Y_n\) in terms of \(X_n+Y_n\) and \(X_n-Y_n\),
Note that we can express \(X_\lambda -Y_\lambda \), resp. \(X_\lambda +Y_\lambda \), in terms of contents \(\{c_u:u\in \lambda \}\), resp. hook lengths \(\{h_u:u\in \lambda \}\), of the diagram \(\lambda \):
Here the content \(c_u\) of a box \(u=(i,j)\) of \(\lambda \) is \(j-i\), i.e., the column number of u minus the row number of u, see Fig. 3 for an illustration of hook lengths, contents, and bumping step counts. For a proof of (1.1) note
and similarly
leading to \(X_\lambda -Y_\lambda =Y_{\lambda '}-Y_\lambda =\sum \limits _{(i,j)\in \lambda }\Big [(j-1)-(i-1)\Big ] =\sum \limits _{(i,j)\in \lambda }(j-i)=\sum \limits _{u\in \lambda }c_u\) and \(X_\lambda +Y_\lambda +|\lambda |=\sum \limits _{(i,j)\in \lambda }\Big [(\lambda _i-j)+(\lambda '_j-i) +1\Big ]=\sum \limits _{u\in \lambda }h_u\), where \((\lambda _i-j)+(\lambda '_j-i)+1\) is clearly the hook length of box (i, j).
Further functions of \(\lambda \) that can be written in terms of hook lengths or contents are \(\log p_\lambda =\sum _{u\in \lambda }\log (h_u)\) making its appearance in Sect. 3, and \(D(\lambda )=\sum _{u\in \lambda }\delta _{0,c_u}\), the number of boxes of \(\lambda \) on the main diagonal, that we will meet in Sect. 4.
Being able to express some function of \(\lambda \) in terms of the contents or hook lengths of the boxes of \(\lambda \) can allow us to employ the polynomiality results for Plancherel averages derived by Stanley [26].
Theorem 1.1
([26, Theorems 2.1, 4.3]) Let F(x) be a formal power series over \(\mathbb {Q}\) of bounded degree that is symmetric in the variables \(x = (x_1,x_2,\ldots )\). Then both averages
are polynomial functions of n.
Note that an even more general result is given in [26, Theorem 4.4]. See also [20] for alternative proofs and further generalizations.
The proof of [26, Theorem 2.1] restricts w. l. o. g. to elementary symmetric functions indexed by partitions \(\mu =(\mu _1,\ldots ,\mu _k)\), i.e., to functions \(F(\cdot )=e_\mu (\cdot )=\prod _{i=1}^{k}e_{\mu _i}(\cdot )\), where \(e_m(x_1,x_2,\ldots )=\sum _{i_1<\cdots <i_m}x_{i_1}\ldots x_{i_m}\) for \(m\ge 1\). As remarked in [26] right below that proof, the resulting polynomial \(N_\mu \) is of degree \(|\mu |\) if and only if \(|\mu |\) is even and \(\mu _1\le \frac{|\mu |}{2}\), otherwise, \(N_\mu =0\). Here is an immediate application of these degree considerations.
Lemma 1.2
\(Var\, (X_n-Y_n)=\left( {\begin{array}{c}n\\ 2\end{array}}\right) \).
Proof
Since \(I \!E\, (X_n-Y_n)=0\), we have
Note that here we have \(F(c_u:u\in \lambda )=\big (e_1(c_u:u\in \lambda )\big )^2\), i.e., \(\mu =(1,1)\). The polynomial \(N_\mu \) is therefore of degree 2, and it is completely determined by its values at \(n\in \{0,1,2\}\), which are \(N_\mu (0)=N_\mu (1)=0, N_\mu (2)=1\), proving the claim. The result \(N_\mu (n)=\frac{n(n-1)}{2}\) is also stated as a special case in [26, p. 94]. \(\square \)
A workaround is needed for \(Var\, (X_n+Y_n)\), or even \(I \!E\, (X_n+Y_n)\), because \(X_{\lambda }+Y_{\lambda }\) is not a symmetric function of \(\{h^2_u:u\in \lambda \}\), but only of \(\{h_u:u\in \lambda \}\). Finding a series representation \(x=\sum _{k\ge 0}a_kp_k(x^2)\) with polynomials \(p_k\), that holds for integers \(x\ge 1\), (but need not hold or even converge elsewhere) would allow, interchanging summations, to apply the polynomiality results termwise. If we are lucky—and we are—the polynomials \(p_k\) have well known Plancherel averages. Such kind of workaround is employed in this paper to deal with Plancherel averages of several interesting functions of partitions, leading firstly to a representation of the expectation as a binomial convolution, that is free of references to partitions, and can be analyzed using Rice’s integral, or Poisson generating functions. In some cases holonomicity of the sequence of expectations can be inferred from the binomial convolution representation. This then allows for fast computation of many terms, that can be used to numerically confirm error terms, or conduct experiments.
The paper is organized as follows: In Sect. 2 we consider the expected number of bumping steps in the RSK algorithm. In particular, we derive asymptotics for \(I \!E\, (X_n+Y_n)\), thus refining the result obtained by Romik [22]. In Sect. 3 we consider \(I \!E\, \log I \!P\, \! l^{(n)}(\lambda )\), with \(\lambda \) distributed according to Plancherel measure. From the first asymptotic terms we obtain a very explicit representation of a constant appearing in an almost equipartition property (abbreviated AEP) for Plancherel measure, conjectured in [28], and proven in [4]. In Sect. 4 we derive asymptotics of the expectation of the side length \(D(\lambda )\) of the Durfee square of \(\lambda \), i.e., the largest square fitting in the upper left corner of the Young diagram of \(\lambda \), when partitions \(\lambda \) are distributed according to Plancherel measure, see Table 1. Considering in Sect. 5 more generally lengths of south-east directed cuts through the Young diagram of \(\lambda \), we enter the realm of a sequence of random curves \(\psi _\lambda \) known to converge uniformly in probability to the Logan-Shepp-Vershik-Kerov limit shape curve \(\Omega \) for \(|\lambda |\rightarrow \infty \). For any fixed integer a the sequence with terms \(\sqrt{n}I \!E\, \psi _\lambda \left( \frac{a}{\sqrt{2n}}\right) \), with expectation computed with respect to \(I \!P\, \! l^{(n)}\), turns out to be holonomic. Experiments then strongly hint at convergence of \(I \!E\, \psi _\lambda \left( \frac{\lfloor u\sqrt{2n}\rfloor }{\sqrt{2n}}\right) \rightarrow \Omega (u)\), uniformly in u, and reveal that second order terms show interesting fluctuations. However we are only able to prove asymptotic results in the case of fixed a, i.e., in the vicinity of \(u=0\). In Sect. 6 we return to the number \(Y_n\) of bumping steps, giving a heuristic argument for \(Var\, Y_n=\mathcal {O}(n^2)\), based on the limit shape curve.
2 Refined Asymptotics of the Expected Number of Bumping Steps in the RSK Algorithm
Recall that \(I \!E\, (X_n+Y_n)=2I \!E\, Y_n\) denotes twice the expected number of bumping steps of the RSK algorithm when applied to a random permutation of \(\{1,\ldots ,n\}\). Romik [22, Eq. (1)] derived the following asymptotic result, \(I \!E\, Y_n\sim \frac{128}{27\pi ^2}n^{\frac{3}{2}}\), and showed \(Y_n/I \!E\, Y_n\rightarrow 1\) in probability. The sequence of interest starts
The next theorem leads to a refinement of Romik’s asymptotic equivalent for \(I \!E\, Y_n\).
Theorem 2.1
(Expected number of bumping steps in the RSK algorithm) Let \(\delta _n:=\log n+2\gamma +12\log 2\), with \(\gamma \) denoting Euler’s constant. Then
Proof
As \(X_n+Y_n\) is not a symmetric polynomial of the multiset \(\{h_u^2:u\in \lambda \}\), but only of the multiset \(\{h_u:u\in \lambda \}\), we can not expect \(I \!E\, (X_n+Y_n)\) to be a polynomial in \(|\lambda |\). Indeed, by Romik’s result, \(I \!E\, (X_n+Y_n)=\Theta (n^\frac{3}{2})\) is definitely not a polynomial. However, we can invoke polynomiality results via the following identity. Using
the equation
holds for \(x\in \mathbb {N}:=\{1,2,3,\ldots \}\). This will be proved in the Appendix.
Now, by [21, Theorem 1], we have
with \(K_r=\frac{(2r)!(2r+1)!}{(r+1)!^2r!}\), leading to
where C is a contour that encircles integers \(2,3,\ldots ,n\), but neither any other integers, nor poles of f, which is given by
For the method of evaluating a large finite difference via the so-called Rice’s integral used above see the article [9]. By computing (leading asymptotic terms of) residues of \(g_n(z):=f(z)\frac{n!\Gamma (-z)}{\Gamma (n+1-z)}\) at \(\pm \frac{3}{2},1,\pm \frac{1}{2}\), (note that \(g_n(z)\) is analytic for \(z\in \{-1,0\}\)) we obtain
where we recall \(\delta _n=\log n+2\gamma +12\log 2,\) and where \(C'\) encircles the interval \([-\frac{3}{2},n]\), but no poles of the integrand outside that interval. This integral is taken care for in the appendix, yielding the contribution involving \(\cos (8\sqrt{n}+\frac{\pi }{4})\). Such a term is not completely uncommon, see e.g. the example at the end of [9, Sect. 5]. Slightly rearranging the terms completes the proof. \(\square \)
2.1 Holonomicity of the Sequence \((I \!E\, (X_n+Y_n)+n)_{n\in \mathbb {N}}\)
Defining
we have \(I \!E\, (X_n+Y_n)=u_n-n\). The sequence \((u_n)\) satisfies the linear recurrence relation
with initial conditions
Clearly, the terms \(\frac{u_n}{n!}\) comprise a sequence, that is the convolution of two sequences that are obviously holonomic. For said convolution the gfun package [25] then easily produces a recursion.
For the Poisson generating function \(U(z):=e^{-z}\sum _{n\ge 0}\frac{u_n}{n!}z^n\) we obtain
a hypergeometric function that may also be used to recover asymptotics of \(u_n\), see [15, Sect. 5.11.2] for asymptotic expansions of generalized hypergeometric functions. Indeed, the leading terms of an asymptotic expansion of U(z), provided by Maple, together with Depoissonization via the saddle point method, yield an alternative proof of Theorem 2.1.
Note that the recurrence relation allows for easily computing millions of terms of the sequence \((I \!E\, (X_n+Y_n))_{n\ge 1}\), which can be used to numerically confirm the error term in Theorem 2.1.
3 The Constant Appearing in the AEP for Plancherel Measure
We consider the random variables \(Z_n=Z_n(\lambda ):=\sum _{u\in \lambda }\log h_u\), where \(\lambda \vdash n\) is distributed according to the Plancherel measure, and denote \(z_n:=I \!E\, Z_n\). The sequence starts
The first few asymptotic terms of \(z_n\) will lead to a representation of the constant H, conjectured by Vershik and Kerov [28] to exist as the limit in probability of random variables \(-\frac{1}{\sqrt{n}}\log I \!P\, \! l^{(n)}(\lambda )\), where \(I \!P\, \! l^{(n)}(\lambda ):=n!\left( \prod _{u\in \lambda }\frac{1}{h_u}\right) ^2\), with \(\lambda \vdash n\) again distributed according to the Plancherel measure. A strengthening of the conjecture (convergence in \(L_p\) for \(p<\infty \)) has been proved by Bufetov [4], from which we borrowed above notation, and an expression for H in terms of a threefold integral has been given in [4, Eq. (15)]. We aim here at a less involved representation of H, and at more terms of an asymptotic expansion of \(I \!E\, [n^{-\frac{1}{2}}(2Z_n-\log n!)]\).
Theorem 3.1
Let \(H_n=1+\frac{1}{2}+\cdots +\frac{1}{n}\) denote the nth harmonic number. Then, as \(n\rightarrow \infty \), we have
where
and
Proof
As we will prove in the appendix, the Kronecker delta defined on \(\mathbb {N}\times \mathbb {N}\) can be expressed in terms of the polynomials p(x, r) given in (2.1) as follows,
From this we deduce
for \(n\in \mathbb {N}\), where g is given by
We want to extend g to a meromorphic function in the right halfplane \(\Re r>-1\). Therefore we employ
and the identities (all with easy proofs, only the last one is proven in the appendix)
By Euler’s reflection formula, for complex \(r\not \in \mathbb {Z}\) and for real \(\ell \rightarrow \infty \) we have
Therefore the following series
converges for \(\Re r>-1\), and satisfies \(h(1)=h(0)=0\), with \(h'(0)\) as given in the theorem. Hence
is the sought extension, meromorphic for \(\Re z>-1\). Now
where
Here C is a contour that encircles integers \(2,\ldots ,n\), but neither any other integers, nor poles of \(\phi \).
By computing (leading asymptotic terms of) residues of \(\Phi _n(z):=\phi (z)\frac{n!\Gamma (-z)}{\Gamma (n+1-z)}\) at \(1,\,\frac{1}{2}\), and 0, we obtain
where \(C'\) encircles the interval [0, n], but no poles of the integrand outside that interval. As shown in the appendix, the latter integral is o(1), thus we arrive at
Finally, for the evaluation of \(h(\frac{1}{2})\), we use \(\Gamma (\frac{1}{2}+\ell +1)\Gamma (\frac{1}{2}-\ell +1)=(-1)^{\ell +1}\frac{\pi }{4}(4\ell ^2-1)\), as well as \(\sum _{\ell \ge 2}(4\ell ^2-1)^{-1}=\frac{1}{6}\), leading to
which completes the proof. \(\square \)
Remark 3.2
Note that the term \(2\sum _{r\ge 2}(-1)^{r}g(r)K_{r-1}\left( {\begin{array}{c}n\\ r\end{array}}\right) \) can be used to compute \(z_n\) for values of n so large that naively generating all partitions \(\lambda \vdash n\) is not an option. Of course, care has to be taken, since cancellations will occur in numerical computations because of alternating signs of summands. Table 2 shows that \(\frac{1}{\sqrt{n}}(2z_n-\log n!)\) is slowly approaching H from below, with the values obtained in [29, Table 1] from simulations fitting neatly into this pattern. The convergence rate is in good accordance with the error term given in Theorem 3.1. Observe that \(\frac{13\gamma }{12}+\log \sqrt{2\pi }+\frac{1}{4}-h'(0)\approx 1.792693.\) For \(n=2048\) we then get \(H-(\frac{13}{24}\log {2048}+1.792693)/\sqrt{2048}\approx 1.746154\), which matches the table entry fairly well.
4 The Expected Side Length of the Durfee Square
Here we consider the side length of the Durfee square of partition \(\lambda \),
and we denote the restriction of \(D(\lambda )\) to \(\lambda \vdash n\) distributed according to Plancherel measure by \(D_n\). With respect to uniform measure, where all partitions \(\lambda \vdash n\) are equally likely, the expectation and the most likely value of \(D(\lambda )\) have been studied in [5, 6, 18]. Regarding Plancherel measure, it is known since the days of the limit shape theorem (see Theorem 5.1 in the next section) that \(\frac{1}{\sqrt{n}}D_n\rightarrow \frac{2}{\pi }\) in probability. Furthermore, we may deduce from [2, Theorem 3.6] convergence in distribution of \(\frac{\pi }{\sqrt{\log n}}\left( D_n-\frac{2}{\pi }\sqrt{n}\right) \) to a standard normal random variable. Should that convergence in distribution be accompanied by convergence of second moments, \(I \!E\, D_n=\frac{2}{\pi }\sqrt{n}+\mathcal {O}\left( \sqrt{\log n}\right) \) would follow. We are not aware of a proof of such result, let alone of any results in the literature regarding fine asymptotics of \(I \!E\, D_n\).
Theorem 4.1
Let \(d_n:=I \!E\, D_n\). Then, as \(n\rightarrow \infty \), we have
Proof
In terms of contents \(c_u\) of a Young diagram \(\lambda \), we have
Define polynomials in terms of the polynomials p(x, r) given in (2.1) via
These also allow for a representation of the Kronecker delta, similar to (3.1),
now valid for non-negative integers \(\ell ,n\). By [26, Eq. (7)], see also [8, Theorem A.1], we have
This leads to the representation
For the Poisson generating function \(D(z):=e^{-z}\sum _{n\ge 0}\frac{d_n}{n!}z^n\) we obtain
where \(J_0\) and \(J_1\) are Bessel functions of the first kind. We may use D(z) to recover asymptotics of \(d_n\), see [15, Sect. 5.11.4] for asymptotic expansions of Bessel functions. We find
for \(|z|\rightarrow \infty \), \(|\arg z|\le \pi -\delta \) with \(\delta >0\). A uniform bound is furnished by \(|D(z)|\le \cosh (4\sqrt{|z|})\). Evaluating now \(d_n=\frac{n!}{2\pi \textrm{i}}\oint _C z^{-n-1}e^zD(z){\text {d}}z\), with contour \(C:=\{z\in \mathbb {C}:|z|=n\}\), observing that there is an approximate saddle point at \(z=n\), finishes the proof. \(\square \)
Remark 4.2
The sequence starts \(\big (d_n\big )_{n=1}^{10}\!=\!(1, 1, 1, \frac{7}{6}, \frac{17}{12}, \frac{33}{20}, \frac{109}{60}, \frac{3217}{1680}, \frac{39703}{20160}, \frac{364859}{181440})\), and it again satisfies a linear recurrence relation,
with initial conditions
readily obtained from (4.3) using gfun.
5 Expected Fluctuations Around the Limit Shape Curve
Let us introduce the limit shape curve
The lower right boundary of the Young diagram of a partition \(\lambda \vdash n\), scaled to have unit area, rotated together with parts of positive x-axis and negative y-axis by \(135^\circ \), gives rise to a piecewise linear function \(\psi _{\lambda }\), also defined on \(\mathbb {R}\). When \(\lambda \) is distributed according to Plancherel measure, the random functions \(\psi _{\lambda }\) approach the limit shape curve, as \(n\rightarrow \infty \), in a sense that is made precise in the following result by Vershik and Kerov [27] and Logan and Shepp [14], which we present following closely [23, Theorem 1.22].
Theorem 5.1
(Limit shape theorem for Plancherel-random partitions) For all \(\varepsilon >0\), we have \(\mathbb {P}(\Vert \psi _\lambda -\Omega \Vert _\infty >\varepsilon )\rightarrow 0\) as \(n\rightarrow \infty \), i.e., the random functions \(\psi _\lambda \) converge to \(\Omega \) in probability in the norm \(\Vert \cdot \Vert _\infty \).
A discretized version of \(\psi _{\lambda }\), defined on the set \(\big \{\frac{a}{\sqrt{2n}}:a\in \mathbb {Z}\big \}\), can be expressed in terms of contents \(c_u\) of \(\lambda \) via \(\Psi _\lambda (a):=\sum _{u\in \lambda } \delta _{{-}a,c_u}\).
Indeed, the set \(\Big \{\Big (\frac{a}{\sqrt{2n}},\frac{2}{\sqrt{2n}}\big (\Psi _\lambda (a)+\frac{|a|}{2}\big )\Big ):a\in \mathbb {Z}\Big \}\) is a subset of the graph of \(\psi _\lambda \) containing, among others, all the points where the slope of \(\psi _\lambda \) changes from 1 to \(-1\) or back. For example, if \(\lambda =(5,3,1,1)\), then \((\Psi _\lambda (a))_{a=-4}^4=(1,1,1,2,2,1,1,1,0)\). Define now a related function, \(\Phi _\lambda (a):=\frac{1}{2}\big (\Psi _\lambda (a)+\Psi _\lambda (-a)\big )\), i.e.,
This symmetrised function is used because it can be expressed in terms of Kronecker deltas restricted to pairs of nonnegative integers, thus allowing to use the representation (4.1). Next, let \(\omega _{a,n}:=I \!E\, \Phi _\lambda (a)\), with \(\lambda \vdash n\) distributed according to the Plancherel measure, and define a sequence of functions
that one would expect to converge to \(\Omega (u)\), although such convergence is not implied by Theorem 5.1. By [2, Theorem 3.6] we have convergence in distribution of \(\frac{\pi \sqrt{n}}{\sqrt{2\log n}}\left[ \sqrt{\frac{2}{n}}\Big (\Phi _\lambda (\lfloor \sqrt{2n}u\rfloor ) +\frac{1}{2}|\lfloor \sqrt{2n}u\rfloor |\Big )-\Omega (u)\right] \) to a standard normal random variable in case that \(|u|<\sqrt{2}\). Should there also be convergence of second moments, \(\tilde{\Omega }_n(u)=\Omega (u)+\mathcal {O}\left( \sqrt{\frac{\log n}{n}}\right) \) would follow for \(|u|<\sqrt{2}\). See Fig. 4 for the limit shape curve, and, scaled to unit area, a superimposed partition of 10, and values \(\tilde{\Omega }_{10}(u)\) for \(u\in \{\frac{a}{\sqrt{20}}:-6\le a\le 6\}\). There is a seeming coincidence on the y-axes, yet \(\Omega (0)=\frac{2\sqrt{2}}{\pi }\approx .900316\), \(\frac{\omega _{0,10}}{\sqrt{5}}=\frac{364859}{181440\sqrt{5}}\approx .899305\), and the ordinate of the upper corner of the rotated Young diagram, \(\frac{2}{\sqrt{5}}\approx .894427\), are all different.
An alternating sum representation of \(\omega _{a,n}\), building upon (4.1), is the following
which again gives rise to a linear recurrence relation (obtained using gfun)
holding for \(n\ge a-2\), with initial conditions
Note that there is a common factor \((n+1)\) in the recurrence relation, when \(a=2\). Note also that setting \(a=0\) yields a recurrence relation with both order and degree one larger than the one given in (4.4).
As was done for \(d_n\), asymptotics via Poisson generating functions (which can again be expressed in terms of Bessel functions) can be obtained also for \(\omega _{a,n}\) for fixed integer \(a>0\):
In order to obtain asymptotics of \(\omega _{a,n}\) for n and a simultaneously approaching \(\infty \), which would be needed for asymptotics of \(\tilde{\Omega }_n(u)\), one could use the parametrization \(n=2\kappa ^2\in \mathbb {N}, a=\lfloor 2\kappa u\rfloor \), and consider
implied by (5.2), where \(C''\) is a contour that encircles integers \(1,\ldots ,2\kappa ^2\), but neither any other integers, nor poles of the integrand. Outside \(C''\), the integrand has poles at \(\frac{1}{2}\), at 0, and at all negative half-integers. For fixed u it turns out that each residue contributes to the leading (constant) term of the asymptotics in the limit \(\kappa \rightarrow \infty \), with the sum of those contributions converging, but for fixed \(\kappa \) the sum of residues does not converge. Balancing those two limiting processes (taking more and more residues into account, letting \(\kappa \rightarrow \infty \)) and at the same time bounding the integral over a sequence of correspondingly deformed contours appears to be intricate, so unfortunately we have not been able to prove \(\tilde{\Omega }_n(u)\rightarrow \Omega (u)\) for \(u\ne 0\).
Using holonomicity of \((\omega _{a,n})_{n\ge 0}\) to generate many terms of that sequence for many values of a, we obtain the plots in Figs. 5 and 6. The values for n in Fig. 5 and in the second plot in Fig. 6 have been chosen to satisfy \(\sin (4\sqrt{n})\approx 1\) in order to give maximal weight to the term \((-1)^a\) present in (5.3) and thus ensure better comparability of the plots in the vicinity of 0. The value of n in the first plot of Fig. 6 satisfies \(\sin (4\sqrt{n})\approx 0\). We conclude this section with some (non-rigorous) observations based on these plots.
-
(a)
Regarding convergence of the sequence \(\big (\tilde{\Omega }_n(\cdot )\big )_{n\ge 1}\) to \(\Omega (\cdot )\), the plots in Fig. 5 suggest that for any \(\epsilon >0\) we have \(\max \limits _{0\le u\le \sqrt{2}-\epsilon }|\tilde{\Omega }_n(u)-\Omega (u)|=\mathcal {O}\big (\frac{1}{n}\big )\). Near \(\sqrt{2}\) we only seem to have \(\max \limits _{|u-\sqrt{2}|\le \epsilon }|\tilde{\Omega }_n(u)-\Omega (u)| =\mathcal {O}\big (\frac{1}{\sqrt{n}}\big )\), with this slower convergence in agreement with the convergence rate stated in [2, Eq. (3.4)] for convergence of \(\tilde{\Omega }_n(\sqrt{2})\rightarrow \Omega (\sqrt{2})\) in probability, building upon results from [1, 3, 12, 19] addressing the problem of the longest increasing subsequence, see also [23].
-
(b)
A common feature of all plots in Figs. 5 and 6 is the presence of subintervals of “smooth” behavior surrounded by regions of more “irregular” behavior. In order to enforce “smooth” dependence of \(\omega _{a,n}\) on a, one would, in the light of (5.3), restrict to odd (or to even) a. However, this would only work for \(0\le a\le \alpha _n\) with \(\alpha _n=o(\sqrt{n})\). The location of the first “peak” to the right of 0 seems to suggest, that \(\alpha _n=\Theta \big ( n^{\frac{1}{4}}\big )\) may hold. For larger a it is no longer useful to distinguish between even and odd, instead one should consider \(\omega _{a,n}\) evaluated at a belonging to other arithmetic progressions: Near \(\frac{a}{\sqrt{2n}}=\sqrt{2}\cos \frac{\pi }{3}\approx 0.707\) the way to go would be to consider \(\big (\omega _{a+3k,n}\big )_{k}\), whereas near \(\frac{a}{\sqrt{2n}}=\sqrt{2}\cos \frac{\pi }{4}=1\) it would be \(\big (\omega _{a+4k,n}\big )_{k}\). Every fifth term should be taken near \(\sqrt{2}\cos \frac{\pi }{5}\approx 1.144\) and \(\sqrt{2}\cos \frac{2\pi }{5}\approx 0.437\). We expect this pattern to continue, with regions of smoothness near \(\sqrt{2}\cos \frac{\ell \pi }{m}\) for \(1\le \ell <\frac{m}{2}\), and \(\ell ,m\) coprime. For larger m these regions will become noticeable only if n gets large enough, and those regions will shrink with n further increasing, making room for yet other regions to pop up.
6 A Heuristic Upper Bound for the Variance of the Number of Bumping Steps in the RSK Algorithm
Let \(L_n:=X_n+Y_n\), \(\ell _n:=I \!E\, (X_n+Y_n)\), and \(v_n:=Var\, (X_n+Y_n)\). We now give a heuristic derivation of \(\ell _n\), and an upper bound for \(v_n\) based on [13]. Let \(\Omega (x)\) be the function defined in (5.1), describing the limit shape of normalized Young diagrams with respect to Plancherel measure. Denote \(s(x):=\frac{1}{\pi }\sqrt{2-x^2}\), the density of the semicircle distribution with support \([-\sqrt{2},\sqrt{2}]\). As shown in [13], this is also the limiting density of the random abscissa of a newly inserted box into a scaled and rotated Young diagram that closely resembles the limit shape curve, when new insertions are made according to the Plancherel growth process, that ensures that at each stage of the process the Young diagram is distributed according to Plancherel measure, see also [23, Sect. 1.19]. This leads to
and thus \(\ell _n\sim \frac{256}{27\pi ^2}n^{\frac{3}{2}}\). Moreover, assuming independence of \(L_{n-1}\) and \(L_n-L_{n-1}\),
and thus
Numerically we have e.g. \(\frac{v_{50}}{50^2}\approx 0.01216526413\).
Indeed, \(L_{n-1}\) and \(L_n-L_{n-1}\) seem to be negatively correlated. The sequence of covariances \(\big (Cov\, (L_n-L_{n-1},L_{n-1})\big )_{n\ge 2}\) starts \((0,0,-\frac{1}{9},-\frac{17}{180},-\frac{1}{15},-\frac{61}{450},-\frac{863}{5600},\ldots )\), staying negative up to \(n=40\) with roughly linear growth, see Fig. 7.
So it seems that in the light of Lemma 1.2 one can safely guess that for the number \(Y_n\) of bumping steps \(Var\, Y_n=\Theta (n^2)\) holds. It would be desirable to have a proof for that, and also know at least the leading asymptotic term of \(Var\, Y_n\).
7 Conclusion
In this paper we have obtained asymptotics of expectations of certain statistics of Plancherel distributed Young diagrams. That these statistics could be expressed in terms of hook lengths and contents of the boxes of such diagram was essential, as it allowed us to invoke polynomiality results for Plancherel averages, leading to representations of expectations as binomial convolutions, that make for easier asymptotic treatment. We hope that this approach will help to analyse further statistics of Plancherel distributed Young diagrams. Now polynomiality results have also been found for measures different from Plancherel (such as the Jack deformation of Plancherel measure, see [20]), or for subclasses of Plancherel distributed Young diagrams, such as strict partitions (see [11, 16, 17]). In case that appropriate substitutes for (2.3) or (4.2) are at hand, it is reasonable to believe that certain statistics in these settings could also be analysed along the lines of this paper.
Data Availability
Not applicable.
References
Baik, J., Deift, P., Johansson, K.: On the distribution of the length of the longest increasing subsequence of random permutations. J. Am. Math. Soc. 12, 1119–1178 (1999)
Bogachev, L.V., Su, Z.: Gaussian fluctuations of Young diagrams under the Plancherel measure. Proc. R. Soc. Lond. Ser. A 463, 1069–1080 (2007)
Borodin, A., Okounkov, A., Olshanski, G.: Asymptotics of Plancherel measures for symmetric groups. J. Am. Math. Soc. 13, 481–515 (2000)
Bufetov, A.I.: On the Vershik-Kerov conjecture concerning the Shannon-McMillan-Breiman theorem for the Plancherel family of measures on the space of Young diagrams. Geom. Funct. Anal. 22, 938–975 (2012)
Canfield, E.R., Corteel, S., Savage, C.D.: Durfee polynomials. Electron. J. Combin. 5, 32 (1998)
Canfield, E.R.: From recursions to asymptotics: Durfee and dilogarithmic deductions. Adv. Appl. Math. 34, 768–797 (2005)
Frame, J.S., Robinson, G. de B., Thrall, R.M.: The hook graphs of the symmetric groups. Can. J. Math. 6, 316–324 (1954)
Fujii, S., Kanno, H., Moriyama, S.: Instanton calculus and chiral one-point functions in supersymmetric gauge theories. Adv. Theor. Math. Phys. 12, 1401–1428 (2008)
Flajolet, P., Sedgewick, R.: Mellin transforms and asymptotics: finite differences and Rice’s integrals. Theor. Comput. Sci. 144, 101–124 (1995)
Graham, R.L., Knuth, D.E., Patashnik, O.: Conrete mathematics: a foundation for computer science. Addison-Wesley, Boston (1994)
Han, G.-N., Xiong, H.: Polynomiality of Plancherel averages of hook-content summations for strict, doubled distinct and self-conjugate partitions. J. Combin. Theory Ser. A 168, 50–83 (2019)
Johansson, K.: Discrete orthogonal polynomial ensembles and the Plancherel measure. Ann. Math. 153, 259–296 (2001)
Kerov, S.: A differential model for the growth of Young diagrams. In: Proceedings of the St. Petersburg Mathematical Society, , Vol. IV, Amer. Math. Soc. Transl. Ser. 2, vol. 188, pp. 111–130. American Mathematical Society, Providence (1999)
Logan, B.F., Shepp, L.A.: A variational problem for random Young tableaux. Adv. Math. 26, 206–222 (1977)
Luke, Y.L.: The special functions and their approximations. Academic Press, Cambridge (1969)
Matsumoto, S.: Polynomiality of shifted Plancherel averages and content evaluations. Ann. Math. Blaise Pascal 24, 55–82 (2017)
Matsumoto, S., Śniady, P.: Random strict partitions and random shifted tableaux. Sel. Math. New Ser. 26, 10 (2020)
Mutafchiev, L.R.: On the size of the Durfee square of a random integer partition. J. Comput. Appl. Math. 142, 173–184 (2002)
Okounkov, A.: Random matrices and random permutations. Int. Math. Res. Not. 20, 1043–1095 (2000)
Olshanski, G.: Plancherel averages: remarks on a paper by Stanley. Electron. J. Combin. 17, 43 (2010)
Panova, G.: Polynomiality of some hook-length statistics. Ramanujan J. 27, 349–356 (2012)
Romik, D.: The number of steps in the Robinson-Schensted algorithm. Funct. Anal. Appl. 39, 152–155 (2005)
Romik, D.: The surprising mathematics of longest increasing subsequences. Cambridge University Press, New York (2015)
Sagan, B.E.: The symmetric group. Springer, New York (2001)
Salvy, B., Zimmermann, P.: Gfun: a Maple package for the manipulation of generating and holonomic functions in one variable. ACM Trans. Math. Softw. 20(2), 163–177 (1994)
Stanley, R.P.: Some combinatorial properties of hook lengths, contents, and parts of partitions. Ramanujan J. 23, 91–105 (2010)
Vershik, A.M., Kerov, S.V.: Asymptotics of the Plancherel measure of the symmetric group and the limiting shape of Young tableaux. Soviet Math. Dokl. 18, 527–531 (1977)
Vershik, A.M., Kerov, S.V.: Asymptotic behavior of the maximum and generic dimensions of irreducible representations of the symmetric group. Funktsional. Anal. i Prilozhen. 19, 25–36 (1985)
Vershik, A., Pavlov, D.: Numerical experiments in problems of asymptotic representation theory. J. Math. Sci. 168, 351–361 (2010)
Acknowledgements
We would like to thank two anonymous referees, whose suggestions led to substantial improvements of the paper.
Funding
Open access funding provided by University of Vienna.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The author states that there is no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
8. Appendix
8. Appendix
1.1 8.1 Proof of Equation (2.2)
We use \(p(n,r)=\frac{(2r+1)!}{n}\left( {\begin{array}{c}n+r\\ 2r+1\end{array}}\right) \) and rewrite (2.2) as
Denoting by \(\Delta \) the forward difference operator, we will show that \(\Delta ^3S\) is the zero sequence, which together with \(S_1=1,\Delta S_1=3,\Delta ^2 S_1=2\) yields \(S_n=n^2\) for \(n\in \mathbb {N}\). Now
where the last equality follows from [10, Eq. (5.24)]. \(\square \)
1.2 8.2 Proof of Equation (3.1)
The equation is easily checked for \(\ell > n\), since then also \(r\ge n\) and thus \(p(n,r)=0\). For \(1\le \ell \le n\) we have
treating the case \(\ell =n\) directly, and using [10, Eq. (5.24)] again for \(n>\ell \). \(\square \)
1.3 8.3 Proof of Equation (4.1)
We use \(q(n,r)=\frac{n(2r)!}{n+r}\left( {\begin{array}{c}n+r\\ 2r\end{array}}\right) \) for \(n>0\), and \(q(0,0)=1\). The equation is easily checked for \(n=0\), and for \(\ell > n\), since then also \(r> n\) and thus \(q(n,r)=0\). For \(n>0,\,0\le \ell \le n\) we have
treating the case \(\ell =n\) directly, and using [10, Eq. (5.24)] again for \(n>\ell \). \(\square \)
1.4 8.4 Proof of Equation (3.2d)
Using \(H_\ell -1=\sum _{k=2}^\ell \frac{1}{k}\), and interchanging summation, we obtain, using [10, Eq. (5.16)] at several places,
1.5 8.5 Saddle Point Evaluation of the Integral \(I_n:=\frac{1}{2\pi \textrm{i}}\oint _{C'}f(z)\frac{n!\Gamma (-z)}{\Gamma (n+1-z)}{\text {d}}z\)
This integral appears in the proof of Theorem 2.1, see Sect. 2 for relevant notation. Putting \(n=m^2\), the integrand may be rewritten as
Denoting by \(\psi \) the digamma function, we have
with error terms \(\mathcal {O}(m^{-2})+\mathcal {O}(z^{-2})\), holding for \(m\rightarrow +\infty ,|z|\rightarrow \infty \), subject to \(z=o(m^2)\) and \(\delta \le |\arg z|\le \pi -\delta \) for some \(\delta >0\).
Two approximate saddle points of G(z) are \(\zeta :=6+4m\textrm{i}\) and \(\bar{\zeta }=6-4m\textrm{i}\). Indeed, \(\frac{\textrm{d}}{{\text {d}}z}\log G(\zeta )=\frac{G'(\zeta )}{G(\zeta )}=\mathcal {O}(\frac{1}{m^2})\), and , which suggests a contour directed north-west in the point \(\zeta \): Let \(z=\zeta +e^{\textrm{i}\frac{\pi }{4}}u\) and observe
Also note that a cumbersome evaluation results in
Define the counter-clockwise oriented contour \(C'\) as the polygon connecting the points
with \(\varepsilon >0\) small, and with segment \(c_i\) connecting \(z_i\) and \(z_{i+1}\) for \(0\le i\le 5\), and \(c_6\) connecting \(z_6\) and \(z_0\). It turns out that the integrals along \(c_0\) and \(c_6\) are of order \(\mathcal {O}(m^{-5+2\varepsilon })\), and \(c_i\), for \(2\le i\le 4\), make even smaller contributions. Moreover, the combined contribution of \(c_1\) and \(c_5\) is \(-2\sqrt{\frac{m}{\pi }}\Im (e^{\textrm{i} \frac{\pi }{4}}G(\zeta )(1+\mathcal {O}(\frac{1}{m})))\), which, up to error terms of order \(\mathcal {O}(m^{-\frac{9}{2}})\), simplifies to
1.6 8.6 Bounding the Integral \(J_n:=\frac{1}{2\pi \textrm{i}}\oint _{C'}\Phi _n(z)\,{\text {d}}z\)
This integral appears in the proof of Theorem 3.1, see Sect. 3 for relevant notation. Let \(C'\) be the boundary of the rectangle with corners \(-\frac{1}{2}\pm \textrm{i} 4em\), \(n+\frac{1}{2}\pm \textrm{i} 4em\), and \(m=\sqrt{n}\).
Observe \((-1)^{\ell }\ell ^2\big (\log \ell -H_\ell +\gamma +\frac{1}{2\ell }-\frac{1}{12\ell ^2}\big )=\mathcal {O}(\ell ^{-2})\), and, abbreviating \(\rho =r+\frac{1}{2}\),
for \(\Re \rho \ge 0\), i.e., for \(\Re r\ge -\frac{1}{2}\), therefore \(\Gamma ^2(r+1)h(r)=\mathcal {O}(1)\) for \(\Re r\ge -\frac{1}{2}\), which leads to \(\Gamma ^2(z+1)g(z)=\mathcal {O}(1)\) for \(\Re z\ge -\frac{1}{2}\), \(|z-w|\ge \frac{1}{2}\) for \(w\in \{0,\frac{1}{2},1\}\). Hence
by the reflection and duplication formulas. For \(z=-\frac{1}{2}+\textrm{i} t\), with \(t=\mathcal {O}\big (\sqrt{n}\big )\), we have \(|\Gamma ^2(z+1)\sin \pi z|=\pi \) and \(\left| \frac{\Gamma (n+1)}{\Gamma (n+1-z)}\right| =\mathcal {O}\big (n^{-\frac{1}{2}}\big )\), yielding a contribution \(\mathcal {O}\big (n^{-\frac{1}{2}}\big )\) from the integral over the left segment of \(C'\). The contributions from the two horizontal segments is \(\mathcal {O}\big (n^{-\frac{3}{2}}\big )\), while the right segment makes an exponentially small contribution. All this can be seen from the estimates
holding for \(z=\sigma +\textrm{i} t\) with \(\sigma \ge -\frac{1}{2}\) and \(|z-w|\ge \frac{1}{2}\) for \(w\in \mathbb {Z}\), and
holding for \(z=\sigma +\textrm{i} t\) with \(-\frac{1}{2}\le \sigma \le n+\frac{1}{2}\) and \(t=\mathcal {O}\big (\sqrt{n}\big )\).
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Schachinger, W. Asymptotics of Some Plancherel Averages Via Polynomiality Results. La Matematica 2, 668–691 (2023). https://doi.org/10.1007/s44007-023-00061-2
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s44007-023-00061-2