1 Introduction

We identify Young diagrams (sets consisting of left aligned decreasingly ordered rows of square boxes) with partitions \(\lambda =(\lambda _1,\ldots ,\lambda _k)\) with \(\lambda _1\ge \lambda _2\ge \cdots \ge \lambda _k\), and denote \(|\lambda |=\lambda _1+\ldots +\lambda _k\). The notation \(\lambda \vdash n\) then signifies that \(\lambda \) is a partition of n, i.e., \(|\lambda |=n\). We let \(Y(\pi )=Y_\lambda :=\sum _{\ell =1}^k\lambda _\ell (\ell -1)\) denote the number of bumping steps of the Robinson–Schensted algorithm (see Figs. 1 and 2) when applied to a permutation \(\pi \) that is mapped to a pair of standard Young tableaux of shape \(\lambda \). A standard Young tableau is a Young diagram \(\lambda \) filled with numbers \(1,\ldots ,|\lambda |\) in a way such that numbers in each row and each column are increasing. See e.g. [23, Sect. 1.6] or [24, Sect. 3.1] for nice expositions of the algorithm and references to the original articles by Gilbert de Beauregard Robinson, by Craige Eugene Schensted, and by Donald Ervin Knuth, who significantly widened the scope of the algorithm, the abbreviation with reference to all three authors, RSK algorithm, now frequently being used also to refer to the original Robinson–Schensted algorithm.

Fig. 1
figure 1

Tableaux P and Q, as they evolve when subjecting the RSK algorithm to the permutation \(\pi =(75186342)\). P is constructed by row insertions of elements of \(\pi \) one by one, while Q is recording the position of boxes as they are added

Fig. 2
figure 2

Detailed insertion of 2 into the second to last tableau P. Inserting 2, 3, and 5 into their (shaded) destination boxes causes bumps of elements down one row, being now in need of insertion themselves. In the last step, 7 is the largest element of row four, so this insertion happens without a pump

We denote by \(Y_n\) the restriction of \(Y(\pi )\) to permutations of the set \(\{1,2,\ldots ,n\}\) chosen uniformly at random. The Young diagrams \(\lambda \) obtained by the RSK algorithm are then distributed according to the nth Plancherel measure, i.e., \(I \!P\, \!l^{(n)}(\lambda )=\frac{f_\lambda ^2}{n!}=\frac{n!}{p_\lambda ^2}\), where \(f_\lambda \) is the number of standard Young tableaux of shape \(\lambda \), satisfying \(f_\lambda =\frac{n!}{p_\lambda }\), and where \(p_\lambda :=\prod _{u\in \lambda }h_u\) denotes the product of the hook lengths of the diagram \(\lambda \), see [7]. Here the hook length \(h_u\) of a particular box u of \(\lambda \) is one more than the number of boxes to the right of u plus the number of boxes below u. Note that \(Y_\lambda \) has also the meaning of \(|\lambda |\) times the y-coordinate of the barycenter of the set

$$\begin{aligned} S_\lambda :=\{(i,j)\in \mathbb {Z}^2:0\le j\le k-1,0\le i\le \lambda _{j+1}-1\}, \end{aligned}$$

which is just the set of lower left corners of the boxes of \(\lambda \) in French notation, which addresses boxes by Cartesian coordinates of the first quadrant. Apart from here we always stick to English notation with its matrix style indexing of boxes. Note that \(X_\lambda \), the x-coordinate of the barycenter of \(S_\lambda \), is given by \(Y_{\lambda '}\), where \(\lambda '\) is the partition conjugate to \(\lambda \), its parts being defined by \(\lambda '_j:=|\{i:\lambda _i\ge j\}|\). Stated differently, \(\lambda \) and \(\lambda '\) are mirror images of one another with respect to the main diagonal (upper left to lower right). The sets of hook lengths are therefore the same for \(\lambda \) and \(\lambda '\), which yields invariance of Plancherel measure under conjugation. Thus \(X_n\) and \(Y_n\) are identical in distribution. This allows for a representation of \(I \!E\, Y_n\) and \(Var\, Y_n\) in terms of \(X_n+Y_n\) and \(X_n-Y_n\),

$$\begin{aligned} I \!E\, Y_n&=\tfrac{1}{2}I \!E\, (X_n+Y_n),\\ Var\, Y_n&=\tfrac{1}{2}(Var\, X_n+Var\, Y_n)=\tfrac{1}{4}\big (Var\, (X_n+Y_n)+Var\, (X_n-Y_n)\big ). \end{aligned}$$

Note that we can express \(X_\lambda -Y_\lambda \), resp. \(X_\lambda +Y_\lambda \), in terms of contents \(\{c_u:u\in \lambda \}\), resp. hook lengths \(\{h_u:u\in \lambda \}\), of the diagram \(\lambda \):

$$\begin{aligned} X_\lambda -Y_\lambda =\sum _{u\in \lambda }c_u,\qquad X_\lambda +Y_\lambda =\sum _{u\in \lambda }h_u-|\lambda |. \end{aligned}$$
(1.1)

Here the content \(c_u\) of a box \(u=(i,j)\) of \(\lambda \) is \(j-i\), i.e., the column number of u minus the row number of u, see Fig. 3 for an illustration of hook lengths, contents, and bumping step counts. For a proof of (1.1) note

$$\begin{aligned} Y_\lambda= & {} \sum _{i=1}^k\lambda _i(i-1)=\sum _{(i,j)\in \lambda }(i-1) =\sum _{j=1}^{\lambda _1}\sum _{i=1}^{\lambda '_j}(i-1)=\sum _{j=1}^{\lambda _1}\sum _{i=1}^{\lambda '_j}(\lambda '_j-i)\\= & {} \sum _{(i,j)\in \lambda }(\lambda '_j-i), \end{aligned}$$

and similarly

$$\begin{aligned} X_\lambda =Y_{\lambda '}=\sum _{(i,j)\in \lambda }(j-1) =\sum _{(i,j)\in \lambda }(\lambda _i-j), \end{aligned}$$

leading to \(X_\lambda -Y_\lambda =Y_{\lambda '}-Y_\lambda =\sum \limits _{(i,j)\in \lambda }\Big [(j-1)-(i-1)\Big ] =\sum \limits _{(i,j)\in \lambda }(j-i)=\sum \limits _{u\in \lambda }c_u\) and \(X_\lambda +Y_\lambda +|\lambda |=\sum \limits _{(i,j)\in \lambda }\Big [(\lambda _i-j)+(\lambda '_j-i) +1\Big ]=\sum \limits _{u\in \lambda }h_u\), where \((\lambda _i-j)+(\lambda '_j-i)+1\) is clearly the hook length of box (ij).

Further functions of \(\lambda \) that can be written in terms of hook lengths or contents are \(\log p_\lambda =\sum _{u\in \lambda }\log (h_u)\) making its appearance in Sect. 3, and \(D(\lambda )=\sum _{u\in \lambda }\delta _{0,c_u}\), the number of boxes of \(\lambda \) on the main diagonal, that we will meet in Sect. 4.

Fig. 3
figure 3

The partition \((5,3,1,1)\vdash 10\), drawn as Young diagram, filled from left to right with bumping step counts, sums of box coordinates, hook lengths, and contents. The sum of the entries of the second diagram is 10 less than the sum of the hook lengths

Being able to express some function of \(\lambda \) in terms of the contents or hook lengths of the boxes of \(\lambda \) can allow us to employ the polynomiality results for Plancherel averages derived by Stanley [26].

Theorem 1.1

([26, Theorems 2.1,  4.3]) Let F(x) be a formal power series over \(\mathbb {Q}\) of bounded degree that is symmetric in the variables \(x = (x_1,x_2,\ldots )\). Then both averages

$$\begin{aligned} \frac{1}{n!}\sum _{\lambda \vdash n}f_\lambda ^2F(c_u:u\in \lambda ) \qquad and \qquad \frac{1}{n!}\sum _{\lambda \vdash n}f_\lambda ^2F(h^2_u:u\in \lambda ) \end{aligned}$$

are polynomial functions of n.

Note that an even more general result is given in [26, Theorem 4.4]. See also [20] for alternative proofs and further generalizations.

The proof of [26, Theorem 2.1] restricts w. l. o. g. to elementary symmetric functions indexed by partitions \(\mu =(\mu _1,\ldots ,\mu _k)\), i.e., to functions \(F(\cdot )=e_\mu (\cdot )=\prod _{i=1}^{k}e_{\mu _i}(\cdot )\), where \(e_m(x_1,x_2,\ldots )=\sum _{i_1<\cdots <i_m}x_{i_1}\ldots x_{i_m}\) for \(m\ge 1\). As remarked in [26] right below that proof, the resulting polynomial \(N_\mu \) is of degree \(|\mu |\) if and only if \(|\mu |\) is even and \(\mu _1\le \frac{|\mu |}{2}\), otherwise, \(N_\mu =0\). Here is an immediate application of these degree considerations.

Lemma 1.2

\(Var\, (X_n-Y_n)=\left( {\begin{array}{c}n\\ 2\end{array}}\right) \).

Proof

Since \(I \!E\, (X_n-Y_n)=0\), we have

$$\begin{aligned} Var\, (X_n-Y_n)=\frac{1}{n!}\sum _{\lambda \vdash n}f_\lambda ^2\bigg (\sum _{u\in \lambda }c_u\bigg )^2=\frac{n(n-1)}{2}. \end{aligned}$$

Note that here we have \(F(c_u:u\in \lambda )=\big (e_1(c_u:u\in \lambda )\big )^2\), i.e., \(\mu =(1,1)\). The polynomial \(N_\mu \) is therefore of degree 2, and it is completely determined by its values at \(n\in \{0,1,2\}\), which are \(N_\mu (0)=N_\mu (1)=0, N_\mu (2)=1\), proving the claim. The result \(N_\mu (n)=\frac{n(n-1)}{2}\) is also stated as a special case in [26, p. 94]. \(\square \)

A workaround is needed for \(Var\, (X_n+Y_n)\), or even \(I \!E\, (X_n+Y_n)\), because \(X_{\lambda }+Y_{\lambda }\) is not a symmetric function of \(\{h^2_u:u\in \lambda \}\), but only of \(\{h_u:u\in \lambda \}\). Finding a series representation \(x=\sum _{k\ge 0}a_kp_k(x^2)\) with polynomials \(p_k\), that holds for integers \(x\ge 1\), (but need not hold or even converge elsewhere) would allow, interchanging summations, to apply the polynomiality results termwise. If we are lucky—and we are—the polynomials \(p_k\) have well known Plancherel averages. Such kind of workaround is employed in this paper to deal with Plancherel averages of several interesting functions of partitions, leading firstly to a representation of the expectation as a binomial convolution, that is free of references to partitions, and can be analyzed using Rice’s integral, or Poisson generating functions. In some cases holonomicity of the sequence of expectations can be inferred from the binomial convolution representation. This then allows for fast computation of many terms, that can be used to numerically confirm error terms, or conduct experiments.

The paper is organized as follows: In Sect. 2 we consider the expected number of bumping steps in the RSK algorithm. In particular, we derive asymptotics for \(I \!E\, (X_n+Y_n)\), thus refining the result obtained by Romik [22]. In Sect. 3 we consider \(I \!E\, \log I \!P\, \! l^{(n)}(\lambda )\), with \(\lambda \) distributed according to Plancherel measure. From the first asymptotic terms we obtain a very explicit representation of a constant appearing in an almost equipartition property (abbreviated AEP) for Plancherel measure, conjectured in [28], and proven in [4]. In Sect. 4 we derive asymptotics of the expectation of the side length \(D(\lambda )\) of the Durfee square of \(\lambda \), i.e., the largest square fitting in the upper left corner of the Young diagram of \(\lambda \), when partitions \(\lambda \) are distributed according to Plancherel measure, see Table 1. Considering in Sect. 5 more generally lengths of south-east directed cuts through the Young diagram of \(\lambda \), we enter the realm of a sequence of random curves \(\psi _\lambda \) known to converge uniformly in probability to the Logan-Shepp-Vershik-Kerov limit shape curve \(\Omega \) for \(|\lambda |\rightarrow \infty \). For any fixed integer a the sequence with terms \(\sqrt{n}I \!E\, \psi _\lambda \left( \frac{a}{\sqrt{2n}}\right) \), with expectation computed with respect to \(I \!P\, \! l^{(n)}\), turns out to be holonomic. Experiments then strongly hint at convergence of \(I \!E\, \psi _\lambda \left( \frac{\lfloor u\sqrt{2n}\rfloor }{\sqrt{2n}}\right) \rightarrow \Omega (u)\), uniformly in u, and reveal that second order terms show interesting fluctuations. However we are only able to prove asymptotic results in the case of fixed a, i.e., in the vicinity of \(u=0\). In Sect. 6 we return to the number \(Y_n\) of bumping steps, giving a heuristic argument for \(Var\, Y_n=\mathcal {O}(n^2)\), based on the limit shape curve.

Table 1 Some functions of partitions \(\lambda \vdash 4\) and their expectations with respect to Plancherel measure

2 Refined Asymptotics of the Expected Number of Bumping Steps in the RSK Algorithm

Recall that \(I \!E\, (X_n+Y_n)=2I \!E\, Y_n\) denotes twice the expected number of bumping steps of the RSK algorithm when applied to a random permutation of \(\{1,\ldots ,n\}\). Romik [22, Eq. (1)] derived the following asymptotic result, \(I \!E\, Y_n\sim \frac{128}{27\pi ^2}n^{\frac{3}{2}}\), and showed \(Y_n/I \!E\, Y_n\rightarrow 1\) in probability. The sequence of interest starts

$$\begin{aligned} \big (I \!E\, (X_n+Y_n)\big )_{n=1}^{10}=\big (0,1, \tfrac{7}{3}, \tfrac{25}{6}, \tfrac{19}{3}, \tfrac{44}{5}, \tfrac{347}{30}, \tfrac{8181}{560}, \tfrac{541273}{30240}, \tfrac{1943453}{90720}\big ). \end{aligned}$$

The next theorem leads to a refinement of Romik’s asymptotic equivalent for \(I \!E\, Y_n\).

Theorem 2.1

(Expected number of bumping steps in the RSK algorithm) Let \(\delta _n:=\log n+2\gamma +12\log 2\), with \(\gamma \) denoting Euler’s constant. Then

$$\begin{aligned} I \!E\, (X_n+Y_n)&=\frac{256}{27\pi ^2}n^{\frac{3}{2}}-n +\frac{9\delta _n-77}{9\pi ^2}n^{\frac{1}{2}} +\frac{3510\delta _n-31589}{27648\pi ^2}n^{-\frac{1}{2}}\\&\quad +\frac{5565\delta _n-62224}{786432\pi ^2}n^{-\frac{3}{2}} +\frac{e^8}{2^{12}\pi ^{3/2}}\cos \left( 8\sqrt{n}+\frac{\pi }{4}\right) n^{-\frac{7}{4}}+\mathcal {O}\Big (n^{-\frac{9}{4}}\Big ). \end{aligned}$$

Proof

As \(X_n+Y_n\) is not a symmetric polynomial of the multiset \(\{h_u^2:u\in \lambda \}\), but only of the multiset \(\{h_u:u\in \lambda \}\), we can not expect \(I \!E\, (X_n+Y_n)\) to be a polynomial in \(|\lambda |\). Indeed, by Romik’s result, \(I \!E\, (X_n+Y_n)=\Theta (n^\frac{3}{2})\) is definitely not a polynomial. However, we can invoke polynomiality results via the following identity. Using

$$\begin{aligned} p(x,r):=\prod _{i=1}^r(x^2-i^2), \end{aligned}$$
(2.1)

the equation

$$\begin{aligned} x=1+\sum _{r=1}^\infty \left( {\begin{array}{c}2r\\ r\end{array}}\right) \frac{(-1)^{r}}{(1-2r)(2r+1)!}p(x,r) \end{aligned}$$
(2.2)

holds for \(x\in \mathbb {N}:=\{1,2,3,\ldots \}\). This will be proved in the Appendix.

Now, by [21, Theorem 1], we have

$$\begin{aligned} \frac{1}{n!}\sum _{\lambda \vdash n}f_\lambda ^2\sum _{u\in \lambda }p(h_u,r)=K_r\left( {\begin{array}{c}n\\ r+1\end{array}}\right) , \end{aligned}$$
(2.3)

with \(K_r=\frac{(2r)!(2r+1)!}{(r+1)!^2r!}\), leading to

$$\begin{aligned} I \!E\, \,(X_n+Y_n)&=\frac{1}{n!}\sum _{\lambda \vdash n}f_\lambda ^2\sum _{u\in \lambda }(h_u\!-\!1) \overset{(2.2)}{=}\frac{1}{n!}\sum _{\lambda \vdash n}f_\lambda ^2\sum _{u\in \lambda }\sum _{r=1}^\infty \left( {\begin{array}{c}2r\\ r\end{array}}\right) \frac{(-1)^{r}}{(1-2r)(2r+1)!}p(h_u,r)\\&\overset{(2.3)}{=}\sum _{r=1}^\infty \left( {\begin{array}{c}2r\\ r\end{array}}\right) \frac{(-1)^{r}}{(1-2r)(2r+1)!}K_r\left( {\begin{array}{c}n\\ r+1\end{array}}\right) \\&=\sum _{r=1}^\infty \left( {\begin{array}{c}2r\\ r\end{array}}\right) ^2\frac{(-1)^{r+1}}{(2r-1)(r+1)(r+1)!}\left( {\begin{array}{c}n\\ r+1\end{array}}\right) =\sum _{r=2}^n\left( {\begin{array}{c}2r-2\\ r-1\end{array}}\right) ^2\frac{(-1)^{r}}{(2r^2-3r)r!}\left( {\begin{array}{c}n\\ r\end{array}}\right) \\&=(-1)^n\frac{n!}{2\pi \textrm{i}}\oint _C\frac{f(z)}{z(z-1)(z-2)\cdots (z-n)}{\text {d}}z, \end{aligned}$$

where C is a contour that encircles integers \(2,3,\ldots ,n\), but neither any other integers, nor poles of f, which is given by

$$\begin{aligned} f(z):=\frac{\Gamma ^2(2z-1)}{z^2(2z-3)\Gamma ^5(z)}. \end{aligned}$$

For the method of evaluating a large finite difference via the so-called Rice’s integral used above see the article [9]. By computing (leading asymptotic terms of) residues of \(g_n(z):=f(z)\frac{n!\Gamma (-z)}{\Gamma (n+1-z)}\) at \(\pm \frac{3}{2},1,\pm \frac{1}{2}\), (note that \(g_n(z)\) is analytic for \(z\in \{-1,0\}\)) we obtain

$$\begin{aligned} I \!E\, \,(X_n+Y_n)=&\,\frac{256 n^{\frac{3}{2}}}{27\pi ^2}\left( 1-\frac{3}{8n} -\frac{7}{128n^2}-\frac{9}{1024n^3}\right) \\&-n\\&+\delta _n\frac{n^{\frac{1}{2}}}{\pi ^2}\Big (1+\frac{1}{8n}+\frac{1}{128n^2}\Big )- \frac{5 n^{\frac{1}{2}}}{\pi ^2}\left( 1+\frac{1}{8n} -\frac{1}{1920n^2}\right) \\&+\delta _n\frac{n^{-\frac{1}{2}}}{512\pi ^2}\Big (1-\frac{3}{8n}\Big ) +\frac{n^{-\frac{1}{2}}}{1024\pi ^2}\left( 1+\frac{13}{8n}\right) \\&-\delta _n\frac{n^{-\frac{3}{2}}}{2^{18}\pi ^2} +\frac{n^{-\frac{3}{2}}}{2^{14}\!\cdot \!3\pi ^2}\\&+\mathcal {O}\left( n^{-\frac{5}{2}}\log n\right) +\frac{1}{2\pi \textrm{i}}\oint _{C'}f(z)\frac{n!\Gamma (-z)}{\Gamma (n+1-z)}{\text {d}}z, \end{aligned}$$

where we recall \(\delta _n=\log n+2\gamma +12\log 2,\) and where \(C'\) encircles the interval \([-\frac{3}{2},n]\), but no poles of the integrand outside that interval. This integral is taken care for in the appendix, yielding the contribution involving \(\cos (8\sqrt{n}+\frac{\pi }{4})\). Such a term is not completely uncommon, see e.g. the example at the end of [9, Sect. 5]. Slightly rearranging the terms completes the proof. \(\square \)

2.1 Holonomicity of the Sequence \((I \!E\, (X_n+Y_n)+n)_{n\in \mathbb {N}}\)

Defining

$$\begin{aligned} u_n:=\sum _{r=1}^n\left( {\begin{array}{c}2r-2\\ r-1\end{array}}\right) ^2\frac{(-1)^{r}}{(2r^2-3r)r!}\left( {\begin{array}{c}n\\ r\end{array}}\right) , \end{aligned}$$

we have \(I \!E\, (X_n+Y_n)=u_n-n\). The sequence \((u_n)\) satisfies the linear recurrence relation

$$\begin{aligned} u_{n+4} =\tfrac{4n^3+23n^2+63n+78}{(n+3)(n+4)^2}u_{n+3} -\tfrac{(2n-1)(3n+2)}{(n+4)^2}u_{n+2} +\tfrac{(4n-7)(n+2)}{(n+4)^2}u_{n+1} -\tfrac{(n+1)(n+2)}{(n+4)^2}u_n, \end{aligned}$$

with initial conditions

$$\begin{aligned} u_0=0,\quad u_1=1,\quad u_2=3,\quad u_3=\frac{16}{3}. \end{aligned}$$

Clearly, the terms \(\frac{u_n}{n!}\) comprise a sequence, that is the convolution of two sequences that are obviously holonomic. For said convolution the gfun package [25] then easily produces a recursion.

For the Poisson generating function \(U(z):=e^{-z}\sum _{n\ge 0}\frac{u_n}{n!}z^n\) we obtain

$$\begin{aligned} U(z)=-z\sum _{k\ge 0}\left( {\begin{array}{c}2k\\ k\end{array}}\right) ^2\frac{(-z)^k}{(k+1)(2k-1)(k+1)!^2}=z\, {}_2 F_3\left[ \begin{matrix}-\frac{1}{2},\, \frac{1}{2}~\\ 2,\,2,\,2\end{matrix};-16z\right] , \end{aligned}$$

a hypergeometric function that may also be used to recover asymptotics of \(u_n\), see [15, Sect. 5.11.2] for asymptotic expansions of generalized hypergeometric functions. Indeed, the leading terms of an asymptotic expansion of U(z), provided by Maple, together with Depoissonization via the saddle point method, yield an alternative proof of Theorem 2.1.

Note that the recurrence relation allows for easily computing millions of terms of the sequence \((I \!E\, (X_n+Y_n))_{n\ge 1}\), which can be used to numerically confirm the error term in Theorem 2.1.

3 The Constant Appearing in the AEP for Plancherel Measure

We consider the random variables \(Z_n=Z_n(\lambda ):=\sum _{u\in \lambda }\log h_u\), where \(\lambda \vdash n\) is distributed according to the Plancherel measure, and denote \(z_n:=I \!E\, Z_n\). The sequence starts

$$\begin{aligned} (z_1,\ldots ,z_{5})&= \left( 0,~\log 2,~\tfrac{\log 2}{3}\!+\!\log 3,~\tfrac{17}{6}\log 2\! +\!\tfrac{\log 3}{4},~\tfrac{13}{6}\log 2\!+\!\tfrac{7}{10}\log 3\!+\!\tfrac{7}{12}\log 5 \right) \\&\approx (0,~ 0.6931471806,~ 1.329661349,~ 2.238570083,~ 3.209686276) \end{aligned}$$

The first few asymptotic terms of \(z_n\) will lead to a representation of the constant H, conjectured by Vershik and Kerov [28] to exist as the limit in probability of random variables \(-\frac{1}{\sqrt{n}}\log I \!P\, \! l^{(n)}(\lambda )\), where \(I \!P\, \! l^{(n)}(\lambda ):=n!\left( \prod _{u\in \lambda }\frac{1}{h_u}\right) ^2\), with \(\lambda \vdash n\) again distributed according to the Plancherel measure. A strengthening of the conjecture (convergence in \(L_p\) for \(p<\infty \)) has been proved by Bufetov [4], from which we borrowed above notation, and an expression for H in terms of a threefold integral has been given in [4, Eq. (15)]. We aim here at a less involved representation of H, and at more terms of an asymptotic expansion of \(I \!E\, [n^{-\frac{1}{2}}(2Z_n-\log n!)]\).

Theorem 3.1

Let \(H_n=1+\frac{1}{2}+\cdots +\frac{1}{n}\) denote the nth harmonic number. Then, as \(n\rightarrow \infty \), we have

$$\begin{aligned} -\frac{I \!E\, \log I \!P\, \! l^{(n)}(\lambda )}{\sqrt{n}}=H-\left( \frac{13}{24}\log n+\frac{13\gamma }{12}+\log \sqrt{2\pi }+\frac{1}{4}-h'(0)\right) \frac{1}{\sqrt{n}}+o\big (n^{-\frac{1}{2}}\big ), \end{aligned}$$

where

$$\begin{aligned} H&=\frac{16}{3\pi ^2}(4\gamma +1)+\frac{64}{\pi ^2} \sum _{\ell \ge 2}\frac{\ell ^2}{4\ell ^2-1}\Big (\log \ell -H_\ell +\gamma +\frac{1}{2\ell }\Big )\\&\approx 1.87702830628 \end{aligned}$$

and

$$\begin{aligned} h'(0)=\sum _{\ell \ge 2}\ell \left( H_\ell -\log \ell -\gamma - \frac{1}{2\ell }+\frac{1}{12\ell ^2}\right) \approx 0.001562493. \end{aligned}$$

Proof

As we will prove in the appendix, the Kronecker delta defined on \(\mathbb {N}\times \mathbb {N}\) can be expressed in terms of the polynomials p(xr) given in (2.1) as follows,

$$\begin{aligned} \delta _{\ell ,n}=\sum _{r=\ell -1}^\infty (-1)^{\ell +r+1}\frac{2\ell ^2}{(r+\ell +1)!(r-\ell +1)!}p(n,r). \end{aligned}$$
(3.1)

From this we deduce

$$\begin{aligned} \log n=\sum _{\ell \ge 2}\log \ell \sum _{r\ge \ell -1}\frac{2(-1)^{\ell +r+1}\ell ^2}{(r+\ell +1)!(r-\ell +1)!}p(n,r)=2\sum _{r\ge 2}(-1)^{r}g(r)p(n,r-1), \end{aligned}$$

for \(n\in \mathbb {N}\), where g is given by

$$\begin{aligned} g(r)=\sum _{\ell =2}^r\frac{(-1)^{\ell }\ell ^2\log \ell }{\Gamma (r+\ell +1)\Gamma (r-\ell +1)}. \end{aligned}$$

We want to extend g to a meromorphic function in the right halfplane \(\Re r>-1\). Therefore we employ

$$\begin{aligned} \log \ell =H_\ell -\gamma -\frac{1}{2\ell }+\frac{1}{12\ell ^2}+\mathcal {O}(\ell ^{-4}), \end{aligned}$$

and the identities (all with easy proofs, only the last one is proven in the appendix)

$$\begin{aligned} \sum _{\ell =2}^r\frac{(-1)^{\ell }\ell ^2}{\Gamma (r+\ell +1)\Gamma (r-\ell +1)}&=\frac{1}{\Gamma (r)\Gamma (r+2)} \end{aligned}$$
(3.2a)
$$\begin{aligned} \sum _{\ell =2}^r\frac{(-1)^{\ell }\ell }{\Gamma (r+\ell +1)\Gamma (r-\ell +1)}&=\frac{3(r-1)}{2(2r-1)\Gamma (r)\Gamma (r+2)} \end{aligned}$$
(3.2b)
$$\begin{aligned} \sum _{\ell =2}^r\frac{(-1)^{\ell }}{\Gamma (r+\ell +1)\Gamma (r-\ell +1)}&=\frac{r-1}{2r\Gamma (r)\Gamma (r+2)} \end{aligned}$$
(3.2c)
$$\begin{aligned} \sum _{\ell =2}^r\frac{(-1)^{\ell }\ell ^2(H_\ell -1)}{\Gamma (r+\ell +1)\Gamma (r-\ell +1)}&=\frac{1}{4(r-1)(2r-1)\Gamma (r)^2}. \end{aligned}$$
(3.2d)

By Euler’s reflection formula, for complex \(r\not \in \mathbb {Z}\) and for real \(\ell \rightarrow \infty \) we have

$$\begin{aligned} \Gamma (r+\ell +1)\Gamma (r-\ell +1)\frac{\sin \pi (\ell -r)}{\pi }= \frac{\Gamma (r+\ell +1)}{\Gamma (\ell -r)} \sim \ell ^{2r+1}. \end{aligned}$$

Therefore the following series

$$\begin{aligned} h(r):=\sum _{\ell \ge 2}(-1)^{\ell }\ell ^2\frac{\log \ell -H_\ell +\gamma +\frac{1}{2\ell }-\frac{1}{12\ell ^2}}{\Gamma (r+\ell +1)\Gamma (r-\ell +1)} \end{aligned}$$

converges for \(\Re r>-1\), and satisfies \(h(1)=h(0)=0\), with \(h'(0)\) as given in the theorem. Hence

$$\begin{aligned} g(z)=h(z)+\frac{1-\gamma }{\Gamma (z)\Gamma (z+2)}+\frac{1}{4(z-1)(2z-1)\Gamma (z)^2} -\frac{(z-1)(16z+1)}{24z(2z-1)\Gamma (z)\Gamma (z+2)} \end{aligned}$$

is the sought extension, meromorphic for \(\Re z>-1\). Now

$$\begin{aligned} z_n=\frac{1}{n!}\sum _{\lambda \vdash n}f_\lambda ^2\sum _{u\in \lambda }\log h_u&=2\sum _{r\ge 2}(-1)^{r}g(r)\frac{1}{n!}\sum _{\lambda \vdash n}f_\lambda ^2\sum _{u\in \lambda }p(h_u,r-1)\\&=2\sum _{r\ge 2}(-1)^{r}g(r)K_{r-1}\left( {\begin{array}{c}n\\ r\end{array}}\right) \\&=(-1)^n\frac{n!}{2\pi \textrm{i}}\oint _C\frac{\phi (z)}{z(z-1)(z-2)\cdots (z-n)}{\text {d}}z, \end{aligned}$$

where

$$\begin{aligned} \phi (z)=2g(z)\frac{\Gamma (2z)\Gamma (2z-1)}{\Gamma (z+1)^2\Gamma (z)}. \end{aligned}$$

Here C is a contour that encircles integers \(2,\ldots ,n\), but neither any other integers, nor poles of \(\phi \).

By computing (leading asymptotic terms of) residues of \(\Phi _n(z):=\phi (z)\frac{n!\Gamma (-z)}{\Gamma (n+1-z)}\) at \(1,\,\frac{1}{2}\), and 0, we obtain

$$\begin{aligned} z_n=&\,\frac{n\log n-n}{2}-\frac{1}{4} +\frac{4}{9\pi ^2}\Big (24\gamma +7-18\pi h(\tfrac{1}{2})\Big )n^{\frac{1}{2}}\\&-\frac{\log n}{48}-\frac{13\gamma }{24}+\frac{1}{8}+\frac{h'(0)}{2} +\mathcal {O}(n^{-\frac{1}{2}})+\frac{1}{2\pi \textrm{i}}\oint _{C'}\Phi _n(z){\text {d}}z, \end{aligned}$$

where \(C'\) encircles the interval [0, n], but no poles of the integrand outside that interval. As shown in the appendix, the latter integral is o(1), thus we arrive at

$$\begin{aligned} -I \!E\, \frac{\log I \!P\, \! l^{(n)}(\lambda )}{\sqrt{n}}&=I \!E\, \frac{2Z_n-\log n!}{\sqrt{n}}\\&=\frac{8}{9\pi ^2}\Big (24\gamma +7-18\pi h(\tfrac{1}{2})\Big ) -\frac{13}{24}\frac{\log n}{\sqrt{n}}\\&\ \ \ \ -\left( \frac{13\gamma }{12}+\log \sqrt{2\pi }+\frac{1}{4}-h'(0)\right) \frac{1}{\sqrt{n}} +o\big (n^{-\frac{1}{2}}\big ). \end{aligned}$$

Finally, for the evaluation of \(h(\frac{1}{2})\), we use \(\Gamma (\frac{1}{2}+\ell +1)\Gamma (\frac{1}{2}-\ell +1)=(-1)^{\ell +1}\frac{\pi }{4}(4\ell ^2-1)\), as well as \(\sum _{\ell \ge 2}(4\ell ^2-1)^{-1}=\frac{1}{6}\), leading to

$$\begin{aligned} h(\tfrac{1}{2})=-\frac{4}{\pi }\sum _{\ell \ge 2}\frac{\ell ^2}{4\ell ^2-1} \Big (\log \ell -H_\ell +\gamma +\frac{1}{2\ell }\Big )+\frac{1}{18\pi }, \end{aligned}$$

which completes the proof. \(\square \)

Table 2 Some terms of the sequence approaching H

Remark 3.2

Note that the term \(2\sum _{r\ge 2}(-1)^{r}g(r)K_{r-1}\left( {\begin{array}{c}n\\ r\end{array}}\right) \) can be used to compute \(z_n\) for values of n so large that naively generating all partitions \(\lambda \vdash n\) is not an option. Of course, care has to be taken, since cancellations will occur in numerical computations because of alternating signs of summands. Table 2 shows that \(\frac{1}{\sqrt{n}}(2z_n-\log n!)\) is slowly approaching H from below, with the values obtained in [29, Table 1] from simulations fitting neatly into this pattern. The convergence rate is in good accordance with the error term given in Theorem 3.1. Observe that \(\frac{13\gamma }{12}+\log \sqrt{2\pi }+\frac{1}{4}-h'(0)\approx 1.792693.\) For \(n=2048\) we then get \(H-(\frac{13}{24}\log {2048}+1.792693)/\sqrt{2048}\approx 1.746154\), which matches the table entry fairly well.

4 The Expected Side Length of the Durfee Square

Here we consider the side length of the Durfee square of partition \(\lambda \),

$$\begin{aligned} D(\lambda ):=\max \{i:\lambda _i\ge i\}, \end{aligned}$$

and we denote the restriction of \(D(\lambda )\) to \(\lambda \vdash n\) distributed according to Plancherel measure by \(D_n\). With respect to uniform measure, where all partitions \(\lambda \vdash n\) are equally likely, the expectation and the most likely value of \(D(\lambda )\) have been studied in [5, 6, 18]. Regarding Plancherel measure, it is known since the days of the limit shape theorem (see Theorem 5.1 in the next section) that \(\frac{1}{\sqrt{n}}D_n\rightarrow \frac{2}{\pi }\) in probability. Furthermore, we may deduce from [2, Theorem 3.6] convergence in distribution of \(\frac{\pi }{\sqrt{\log n}}\left( D_n-\frac{2}{\pi }\sqrt{n}\right) \) to a standard normal random variable. Should that convergence in distribution be accompanied by convergence of second moments, \(I \!E\, D_n=\frac{2}{\pi }\sqrt{n}+\mathcal {O}\left( \sqrt{\log n}\right) \) would follow. We are not aware of a proof of such result, let alone of any results in the literature regarding fine asymptotics of \(I \!E\, D_n\).

Theorem 4.1

Let \(d_n:=I \!E\, D_n\). Then, as \(n\rightarrow \infty \), we have

$$\begin{aligned} d_n=\frac{2}{\pi }\sqrt{n}+\left( \frac{3}{16\pi } -\frac{e^2}{8\pi }\sin \left( 4\sqrt{n}\right) \right) \frac{1}{\sqrt{n}}+\mathcal {O}\big (n^{-1}\big ). \end{aligned}$$

Proof

In terms of contents \(c_u\) of a Young diagram \(\lambda \), we have

$$\begin{aligned} D(\lambda )=\sum _{u\in \lambda }\delta _{0,c_u}. \end{aligned}$$

Define polynomials in terms of the polynomials p(xr) given in (2.1) via

$$\begin{aligned} q(x,r):=\prod _{i=0}^{r-1}(x^2-i^2)={\left\{ \begin{array}{ll}x^2p(x,r-1),&{}r\ge 1,\\ 1,&{}r=0.\end{array}\right. } \end{aligned}$$

These also allow for a representation of the Kronecker delta, similar to (3.1),

$$\begin{aligned} \delta _{\ell ,n}=\sum _{r=\ell }^\infty (-1)^{\ell +r}\frac{2-\delta _{0,n}}{(r+\ell )!(r-\ell )!}q(n,r), \end{aligned}$$
(4.1)

now valid for non-negative integers \(\ell ,n\). By [26, Eq. (7)], see also [8, Theorem A.1], we have

$$\begin{aligned} \frac{1}{n!}\sum _{\lambda \vdash n}f_\lambda ^2\sum _{u\in \lambda }q(c_u,r)=\frac{(2r)!}{(r+1)!}\left( {\begin{array}{c}n\\ r+1\end{array}}\right) . \end{aligned}$$
(4.2)

This leads to the representation

$$\begin{aligned} d_n=\frac{1}{n!}\sum _{\lambda \vdash n}f_\lambda ^2\sum _{u\in \lambda }\delta _{0,c_u}= \sum _{r\ge 1}\frac{(-1)^{r+1}(2r-2)!}{(r-1)!^2r!}\left( {\begin{array}{c}n\\ r\end{array}}\right) . \end{aligned}$$
(4.3)

For the Poisson generating function \(D(z):=e^{-z}\sum _{n\ge 0}\frac{d_n}{n!}z^n\) we obtain

$$\begin{aligned} D(z)=z\sum _{k\ge 0}\frac{(-z)^k(2k)!}{k!^2(k+1)!^2} =z\left( 2J_0^2(2\sqrt{z})-\frac{J_0(2\sqrt{z})J_1(2\sqrt{z})}{\sqrt{z}}+2J_1^2(2\sqrt{z})\right) , \end{aligned}$$

where \(J_0\) and \(J_1\) are Bessel functions of the first kind. We may use D(z) to recover asymptotics of \(d_n\), see [15, Sect. 5.11.4] for asymptotic expansions of Bessel functions. We find

$$\begin{aligned} D(z)=\frac{2}{\pi }\sqrt{z}-\frac{1}{16\pi \sqrt{z}} -\frac{\sin (4\sqrt{z})}{8\pi \sqrt{z}} +\frac{3\cos (4\sqrt{z})}{64\pi z}+\mathcal {O}\big (z^{-\frac{3}{2}}\big ), \end{aligned}$$

for \(|z|\rightarrow \infty \), \(|\arg z|\le \pi -\delta \) with \(\delta >0\). A uniform bound is furnished by \(|D(z)|\le \cosh (4\sqrt{|z|})\). Evaluating now \(d_n=\frac{n!}{2\pi \textrm{i}}\oint _C z^{-n-1}e^zD(z){\text {d}}z\), with contour \(C:=\{z\in \mathbb {C}:|z|=n\}\), observing that there is an approximate saddle point at \(z=n\), finishes the proof. \(\square \)

Remark 4.2

The sequence starts \(\big (d_n\big )_{n=1}^{10}\!=\!(1, 1, 1, \frac{7}{6}, \frac{17}{12}, \frac{33}{20}, \frac{109}{60}, \frac{3217}{1680}, \frac{39703}{20160}, \frac{364859}{181440})\), and it again satisfies a linear recurrence relation,

$$\begin{aligned} d_{n+3} =\frac{3n^2+9n+8}{(n+2)(n+3)}d_{n+2} -\frac{3n+1}{n+3}d_{n+1} +\frac{n+1}{n+3}d_{n}, \end{aligned}$$
(4.4)

with initial conditions

$$\begin{aligned} d_0=0,\quad d_1=1,\quad d_2=1, \end{aligned}$$

readily obtained from (4.3) using gfun.

Fig. 4
figure 4

The Logan–Shepp–Vershik–Kerov limit shape curve \(\Omega \) (upper boundary of the grey region). Superimposed is the partition \((5,3,1,1)\vdash 10\), properly scaled and rotated by \(135^{\circ }\). The dots have coordinates \(\Big (\frac{a}{\sqrt{20}},{\frac{1}{\sqrt{5}}}\big (\omega _{a,10}+\frac{|a|}{2}\big )\Big )\), for \(a\in \{-6,\ldots ,6\}\)

5 Expected Fluctuations Around the Limit Shape Curve

Let us introduce the limit shape curve

$$\begin{aligned} \Omega (u)={\left\{ \begin{array}{ll}\frac{2}{\pi }\left( u\arcsin \frac{u}{\sqrt{2}}+\sqrt{2-u^2}\right) ,&{} if |u|\le \sqrt{2},\\ |u|, &{} if |u|>\sqrt{2}. \end{array}\right. } \end{aligned}$$
(5.1)

The lower right boundary of the Young diagram of a partition \(\lambda \vdash n\), scaled to have unit area, rotated together with parts of positive x-axis and negative y-axis by \(135^\circ \), gives rise to a piecewise linear function \(\psi _{\lambda }\), also defined on \(\mathbb {R}\). When \(\lambda \) is distributed according to Plancherel measure, the random functions \(\psi _{\lambda }\) approach the limit shape curve, as \(n\rightarrow \infty \), in a sense that is made precise in the following result by Vershik and Kerov [27] and Logan and Shepp [14], which we present following closely [23, Theorem 1.22].

Theorem 5.1

(Limit shape theorem for Plancherel-random partitions) For all \(\varepsilon >0\), we have \(\mathbb {P}(\Vert \psi _\lambda -\Omega \Vert _\infty >\varepsilon )\rightarrow 0\) as \(n\rightarrow \infty \), i.e., the random functions \(\psi _\lambda \) converge to \(\Omega \) in probability in the norm \(\Vert \cdot \Vert _\infty \).

A discretized version of \(\psi _{\lambda }\), defined on the set \(\big \{\frac{a}{\sqrt{2n}}:a\in \mathbb {Z}\big \}\), can be expressed in terms of contents \(c_u\) of \(\lambda \) via \(\Psi _\lambda (a):=\sum _{u\in \lambda } \delta _{{-}a,c_u}\).

Indeed, the set \(\Big \{\Big (\frac{a}{\sqrt{2n}},\frac{2}{\sqrt{2n}}\big (\Psi _\lambda (a)+\frac{|a|}{2}\big )\Big ):a\in \mathbb {Z}\Big \}\) is a subset of the graph of \(\psi _\lambda \) containing, among others, all the points where the slope of \(\psi _\lambda \) changes from 1 to \(-1\) or back. For example, if \(\lambda =(5,3,1,1)\), then \((\Psi _\lambda (a))_{a=-4}^4=(1,1,1,2,2,1,1,1,0)\). Define now a related function, \(\Phi _\lambda (a):=\frac{1}{2}\big (\Psi _\lambda (a)+\Psi _\lambda (-a)\big )\), i.e.,

$$\begin{aligned} \Phi _\lambda (a)={\left\{ \begin{array}{ll}D(\lambda ),&{}a=0,\\ \frac{1}{2}\sum _{u\in \lambda } \delta _{|a|,|c_u|},&{}else. \end{array}\right. } \end{aligned}$$

This symmetrised function is used because it can be expressed in terms of Kronecker deltas restricted to pairs of nonnegative integers, thus allowing to use the representation (4.1). Next, let \(\omega _{a,n}:=I \!E\, \Phi _\lambda (a)\), with \(\lambda \vdash n\) distributed according to the Plancherel measure, and define a sequence of functions

$$\begin{aligned} \tilde{\Omega }_n(u):=\sqrt{\frac{2}{n}}\left( \omega _{\lfloor \sqrt{2n}u\rfloor ,n} +\frac{1}{2}|\lfloor \sqrt{2n}u\rfloor |\right) , \end{aligned}$$

that one would expect to converge to \(\Omega (u)\), although such convergence is not implied by Theorem 5.1. By [2, Theorem 3.6] we have convergence in distribution of \(\frac{\pi \sqrt{n}}{\sqrt{2\log n}}\left[ \sqrt{\frac{2}{n}}\Big (\Phi _\lambda (\lfloor \sqrt{2n}u\rfloor ) +\frac{1}{2}|\lfloor \sqrt{2n}u\rfloor |\Big )-\Omega (u)\right] \) to a standard normal random variable in case that \(|u|<\sqrt{2}\). Should there also be convergence of second moments, \(\tilde{\Omega }_n(u)=\Omega (u)+\mathcal {O}\left( \sqrt{\frac{\log n}{n}}\right) \) would follow for \(|u|<\sqrt{2}\). See Fig. 4 for the limit shape curve, and, scaled to unit area, a superimposed partition of 10, and values \(\tilde{\Omega }_{10}(u)\) for \(u\in \{\frac{a}{\sqrt{20}}:-6\le a\le 6\}\). There is a seeming coincidence on the y-axes, yet \(\Omega (0)=\frac{2\sqrt{2}}{\pi }\approx .900316\), \(\frac{\omega _{0,10}}{\sqrt{5}}=\frac{364859}{181440\sqrt{5}}\approx .899305\), and the ordinate of the upper corner of the rotated Young diagram, \(\frac{2}{\sqrt{5}}\approx .894427\), are all different.

Fig. 5
figure 5

Plots of \(\tilde{\Omega }_n(u)-\Omega (u)\) for roughly quadrupling values of \(n\in \{1573,6230,24798,98943\}\) from top left to bottom right

An alternating sum representation of \(\omega _{a,n}\), building upon (4.1), is the following

$$\begin{aligned} \omega _{a,n}=\sum _{r>a}\frac{(-1)^{r+a+1}(2r-2)!}{(r-1+a)!(r-1-a)!r!}\left( {\begin{array}{c}n\\ r\end{array}}\right) , \end{aligned}$$
(5.2)

which again gives rise to a linear recurrence relation (obtained using gfun)

$$\begin{aligned}&(n+4)(n+a+3)(n-a+3)\omega _{a,n+4}\\&\quad =[4n^3+32n^2+(86-2a^2)n+78-7a^2]\omega _{a,n+3}\\&\qquad -(n+3)(6n^2+22n+20-a^2)\omega _{a,n+2}\\&\qquad +(n+1)(n+2)(n+3)(4\omega _{a,n+1}-\omega _{a,n}), \end{aligned}$$

holding for \(n\ge a-2\), with initial conditions

$$\begin{aligned} \omega _{a,n}=0, for \min (0,a-2)\le n\le a,\quad \omega _{a,a+1}=\frac{1}{(a+1)!}. \end{aligned}$$

Note that there is a common factor \((n+1)\) in the recurrence relation, when \(a=2\). Note also that setting \(a=0\) yields a recurrence relation with both order and degree one larger than the one given in (4.4).

As was done for \(d_n\), asymptotics via Poisson generating functions (which can again be expressed in terms of Bessel functions) can be obtained also for \(\omega _{a,n}\) for fixed integer \(a>0\):

$$\begin{aligned} \omega _{a,n}+\frac{a}{2}=\frac{2}{\pi }\sqrt{n}+\left( \frac{4a^2+3}{16\pi }-(-1)^a\frac{e^2}{8\pi }\sin \left( 4\sqrt{n}\right) \right) \frac{1}{\sqrt{n}}+\mathcal {O}\big (n^{-1}\big ).\qquad \quad \end{aligned}$$
(5.3)

In order to obtain asymptotics of \(\omega _{a,n}\) for n and a simultaneously approaching \(\infty \), which would be needed for asymptotics of \(\tilde{\Omega }_n(u)\), one could use the parametrization \(n=2\kappa ^2\in \mathbb {N}, a=\lfloor 2\kappa u\rfloor \), and consider

$$\begin{aligned} \frac{\omega _{\lfloor 2\kappa u\rfloor ,2\kappa ^2}}{\kappa }=\frac{1}{2\pi \textrm{i}}\oint _{C''}\frac{1}{\kappa }\frac{(-1)^{\lfloor 2\kappa u\rfloor +1}\Gamma (-z)\Gamma (2z-1)}{\Gamma (z+\lfloor 2\kappa u\rfloor )\Gamma (z-\lfloor 2\kappa u\rfloor )\Gamma (z+1)} \frac{(2\kappa ^2)!\Gamma (-z)}{\Gamma (2\kappa ^2+1-z)}{\text {d}}z, \end{aligned}$$

implied by (5.2), where \(C''\) is a contour that encircles integers \(1,\ldots ,2\kappa ^2\), but neither any other integers, nor poles of the integrand. Outside \(C''\), the integrand has poles at \(\frac{1}{2}\), at 0, and at all negative half-integers. For fixed u it turns out that each residue contributes to the leading (constant) term of the asymptotics in the limit \(\kappa \rightarrow \infty \), with the sum of those contributions converging, but for fixed \(\kappa \) the sum of residues does not converge. Balancing those two limiting processes (taking more and more residues into account, letting \(\kappa \rightarrow \infty \)) and at the same time bounding the integral over a sequence of correspondingly deformed contours appears to be intricate, so unfortunately we have not been able to prove \(\tilde{\Omega }_n(u)\rightarrow \Omega (u)\) for \(u\ne 0\).

Using holonomicity of \((\omega _{a,n})_{n\ge 0}\) to generate many terms of that sequence for many values of a, we obtain the plots in Figs. 5 and 6. The values for n in Fig. 5 and in the second plot in Fig. 6 have been chosen to satisfy \(\sin (4\sqrt{n})\approx 1\) in order to give maximal weight to the term \((-1)^a\) present in (5.3) and thus ensure better comparability of the plots in the vicinity of 0. The value of n in the first plot of Fig. 6 satisfies \(\sin (4\sqrt{n})\approx 0\). We conclude this section with some (non-rigorous) observations based on these plots.

Fig. 6
figure 6

Plots of \(\tilde{\Omega }_n(u)-\Omega (u)\) for \(n=200415\) (upper plot) and for \(n=200767\) (lower plot), with u varying in the discrete set \(\{\frac{a}{\sqrt{2n}}:0\le a\le 800\}\)

  1. (a)

    Regarding convergence of the sequence \(\big (\tilde{\Omega }_n(\cdot )\big )_{n\ge 1}\) to \(\Omega (\cdot )\), the plots in Fig. 5 suggest that for any \(\epsilon >0\) we have \(\max \limits _{0\le u\le \sqrt{2}-\epsilon }|\tilde{\Omega }_n(u)-\Omega (u)|=\mathcal {O}\big (\frac{1}{n}\big )\). Near \(\sqrt{2}\) we only seem to have \(\max \limits _{|u-\sqrt{2}|\le \epsilon }|\tilde{\Omega }_n(u)-\Omega (u)| =\mathcal {O}\big (\frac{1}{\sqrt{n}}\big )\), with this slower convergence in agreement with the convergence rate stated in [2, Eq. (3.4)] for convergence of \(\tilde{\Omega }_n(\sqrt{2})\rightarrow \Omega (\sqrt{2})\) in probability, building upon results from [1, 3, 12, 19] addressing the problem of the longest increasing subsequence, see also [23].

  2. (b)

    A common feature of all plots in Figs. 5 and  6 is the presence of subintervals of “smooth” behavior surrounded by regions of more “irregular” behavior. In order to enforce “smooth” dependence of \(\omega _{a,n}\) on a, one would, in the light of (5.3), restrict to odd (or to even) a. However, this would only work for \(0\le a\le \alpha _n\) with \(\alpha _n=o(\sqrt{n})\). The location of the first “peak” to the right of 0 seems to suggest, that \(\alpha _n=\Theta \big ( n^{\frac{1}{4}}\big )\) may hold. For larger a it is no longer useful to distinguish between even and odd, instead one should consider \(\omega _{a,n}\) evaluated at a belonging to other arithmetic progressions: Near \(\frac{a}{\sqrt{2n}}=\sqrt{2}\cos \frac{\pi }{3}\approx 0.707\) the way to go would be to consider \(\big (\omega _{a+3k,n}\big )_{k}\), whereas near \(\frac{a}{\sqrt{2n}}=\sqrt{2}\cos \frac{\pi }{4}=1\) it would be \(\big (\omega _{a+4k,n}\big )_{k}\). Every fifth term should be taken near \(\sqrt{2}\cos \frac{\pi }{5}\approx 1.144\) and \(\sqrt{2}\cos \frac{2\pi }{5}\approx 0.437\). We expect this pattern to continue, with regions of smoothness near \(\sqrt{2}\cos \frac{\ell \pi }{m}\) for \(1\le \ell <\frac{m}{2}\), and \(\ell ,m\) coprime. For larger m these regions will become noticeable only if n gets large enough, and those regions will shrink with n further increasing, making room for yet other regions to pop up.

6 A Heuristic Upper Bound for the Variance of the Number of Bumping Steps in the RSK Algorithm

Let \(L_n:=X_n+Y_n\), \(\ell _n:=I \!E\, (X_n+Y_n)\), and \(v_n:=Var\, (X_n+Y_n)\). We now give a heuristic derivation of \(\ell _n\), and an upper bound for \(v_n\) based on [13]. Let \(\Omega (x)\) be the function defined in (5.1), describing the limit shape of normalized Young diagrams with respect to Plancherel measure. Denote \(s(x):=\frac{1}{\pi }\sqrt{2-x^2}\), the density of the semicircle distribution with support \([-\sqrt{2},\sqrt{2}]\). As shown in [13], this is also the limiting density of the random abscissa of a newly inserted box into a scaled and rotated Young diagram that closely resembles the limit shape curve, when new insertions are made according to the Plancherel growth process, that ensures that at each stage of the process the Young diagram is distributed according to Plancherel measure, see also [23, Sect. 1.19]. This leads to

$$\begin{aligned} \ell _n-\ell _{n-1}\sim \sqrt{2n}\int _{-\sqrt{2}}^{\sqrt{2}}\Omega (x)s(x){\text {d}}x=\frac{128}{9\pi ^2}\sqrt{n}, \end{aligned}$$

and thus \(\ell _n\sim \frac{256}{27\pi ^2}n^{\frac{3}{2}}\). Moreover, assuming independence of \(L_{n-1}\) and \(L_n-L_{n-1}\),

$$\begin{aligned} v_n-v_{n-1}\sim 2n\int _{-\sqrt{2}}^{\sqrt{2}}\left( \Omega (x)-\frac{64\sqrt{2}}{9\pi ^2}\right) ^2s(x){\text {d}}x =\frac{54\pi ^4+2835\pi ^2-32768}{162\pi ^4}n, \end{aligned}$$

and thus

$$\begin{aligned} v_n\sim \frac{54\pi ^4+2835\pi ^2-32768}{324\pi ^4}n^2\approx 0.01496867061\,n^2. \end{aligned}$$

Numerically we have e.g. \(\frac{v_{50}}{50^2}\approx 0.01216526413\).

Fig. 7
figure 7

A plot of \(Cov\, (L_n-L_{n-1},L_{n-1})\) for \(n\in \{2,\ldots ,40\}\)

Indeed, \(L_{n-1}\) and \(L_n-L_{n-1}\) seem to be negatively correlated. The sequence of covariances \(\big (Cov\, (L_n-L_{n-1},L_{n-1})\big )_{n\ge 2}\) starts \((0,0,-\frac{1}{9},-\frac{17}{180},-\frac{1}{15},-\frac{61}{450},-\frac{863}{5600},\ldots )\), staying negative up to \(n=40\) with roughly linear growth, see Fig. 7.

So it seems that in the light of Lemma 1.2 one can safely guess that for the number \(Y_n\) of bumping steps \(Var\, Y_n=\Theta (n^2)\) holds. It would be desirable to have a proof for that, and also know at least the leading asymptotic term of \(Var\, Y_n\).

7 Conclusion

In this paper we have obtained asymptotics of expectations of certain statistics of Plancherel distributed Young diagrams. That these statistics could be expressed in terms of hook lengths and contents of the boxes of such diagram was essential, as it allowed us to invoke polynomiality results for Plancherel averages, leading to representations of expectations as binomial convolutions, that make for easier asymptotic treatment. We hope that this approach will help to analyse further statistics of Plancherel distributed Young diagrams. Now polynomiality results have also been found for measures different from Plancherel (such as the Jack deformation of Plancherel measure, see [20]), or for subclasses of Plancherel distributed Young diagrams, such as strict partitions (see [11, 16, 17]). In case that appropriate substitutes for (2.3) or (4.2) are at hand, it is reasonable to believe that certain statistics in these settings could also be analysed along the lines of this paper.