1 Introduction

Let G be a group. Under a character of G, we shall understand a positive-definite class function χ:G→ℂ. A character which satisfies χ(e)=1 will be called normalized.

For gR g , a finite-dimensional matrix representation of G, the trace

is a character. For a finite or, more generally, compact group, the set of normalized characters is a simplex whose extreme points are normalized traces gχ(g)/χ(e) of irreducible representations of G. For infinite groups, the connection is more delicate since there are many infinite-dimensional representations and the matrix traces are of no use. A classical construction associates extreme normalized characters with factor representations of finite von Neumann type, see [8, 34]. Yet another approach exploits spherical representations of the Gelfand pairs, see [26, 27]. The representation theory is mainly focused on the classification of extreme characters and the decomposition of the generic character in a convex sum of the extremes, the latter being as a counterpart of the decomposition of a representation into the irreducible ones. However, the extreme characters may be complicated functions and the set of the extremes may be too large, so it is of interest to study smaller tractable families of reducible characters, for instance, those which have some kind of symmetry or depend on some simple statistic on the group.

In this paper, we study symmetric groups \({\mathfrak{S}}_{n}\) and their block characters which depend on a permutation \(g\in {\mathfrak{S}}_{n}\) only through the number of cycles n (g). Our interest to block characters is motivated by the analogous concept of derangement characters of the general linear group GL n (q) of invertible matrices over a Galois field, as studied in [13]. A derangement character depends on the matrix hGL n (q) only through the dimension of the space ker(hId) of fixed vectors of h. See [29, 35] for a connection of the derangement characters to representation theory of GL (q) and [10] for a connection of these characters to some random walks. The natural embedding \({\mathfrak{S}}_{n}\) in GL n (q), which maps a permutation g to a permutation matrix h, yields a link between two families of characters. Indeed, it is easily seen that n (g) is equal to the dimension of the space of fixed vectors of h, so the restriction of a derangement character from GL n (q) to \({\mathfrak{S}}_{n}\) is a block character.

The convex set of normalized block characters of \({\mathfrak{S}}_{n}\) is a simplex whose extreme points are normalized versions of the characters \(\tau_{1}^{n},\dots,\tau^{n}_{n}\) introduced by Foulkes [9]. It should be stressed that, a priori, there is no general reason for the set of normalized block characters to be a simplex. To compare, the set of normalized derangement characters of GL n (q) is a simplex for some n, and not a simplex for other [13]. Characters \(\tau_{k}^{n}\) are related to the descent statistics of permutations. In particular, \(\tau_{k}^{n}(e)\) coincides with the Eulerian number, which counts permutations with k−1 descents. Using a decomposition of the coinvariant algebra, we will find representations of \({\mathfrak{S}}_{n}\) whose traces are the \(\tau_{k}^{n}\)’s. Versions of decompositions of the coinvariant algebra is a classical topic (see [32, Proposition 4.11], [23], [28, Sect. 8.3]) which has been studied recently in [1] in connection with descent representations [30].

Foulkes [9] defined the \(\tau^{n}_{k}\)’s by summing ‘rim hook’ characters which are not block functions at all. In this paper, we take a more straightforward approach, starting with a collection of block characters associated with a natural action of \({\mathfrak{S}}_{n}\) on words. On this way, we derive ‘from scratch’ a number of known results on decomposition and branching of the Foulkes characters, as found in [6, 19].

Extending the finite-n case, we shall consider the infinite symmetric group \({\mathfrak{S}}_{\infty}\) of bijections g:ℕ→ℕ satisfying g(j)=j for all sufficiently large j. The counterparts of the block characters of \({\mathfrak{S}}_{\infty}\) are the characters depending on the permutation through its decrement defined by c(g):=n n (g), with any large enough n. We will show that the set of normalized block characters of \({\mathfrak{S}}_{\infty}\) is a Choquet simplex with extreme points \(\sigma_{z}^{\infty}(g):=z^{c(g)}\) where \(z\in \mathbb{V}=\{0,\pm 1,\pm 1/2, \pm 1/3,\dots\}\) (the instance, z=0 is understood as the delta function at e). Recall that the characteristic property of a Choquet simplex is the uniqueness of decomposition of the generic point in a convex mixture of extremes [15].

The extreme normalized characters of \({\mathfrak{S}}_{\infty}\) were parameterized in a seminal paper by Thoma [36] (see also [21, 25, 37]) via two infinite sequences

$$ \alpha_1\ge \alpha_2\ge\dots \ge 0,\quad\quad \beta_1\ge \beta_2\ge\dots \ge 0,\quad\quad \sum_i( \alpha_i+\beta_i) \le 1. $$
(1)

It turns out that the extreme normalized block characters of \({\mathfrak{S}}_{\infty}\) are extreme among all normalized characters, with \(\sigma_{1/k}^{\infty}\) corresponding to the parameters α 1=α 2=…=α k =1/k and \(\sigma_{-1/k}^{\infty}\) to the parameters β 1=β 2=…=β k =1/k (where k>0). The character \(\sigma_{0}^{\infty}\) has α j β j ≡0 and corresponds to the regular representation of \({\mathfrak{S}}_{\infty}\).

The block characters also possess an additional extremal property. Observe that the set (1) of parameters of extreme normalized characters of \({\mathfrak{S}}_{\infty}\) is a simplex by itself. The block characters \(\sigma_{z}^{\infty}(g)\) correspond precisely to all extreme points of this simplex.

Every normalized character χ n of the symmetric group \({\mathfrak{S}}_{n}\) defines a probability measure on the set \(\mathbb{Y}_{n}\) of Young diagrams with n boxes. Indeed, recall that irreducible representations of \({\mathfrak{S}}_{n}\) are parameterized by the elements of \(\mathbb{Y}_{n}\) and decompose χ n into the linear combination of their (conventional) characters χ λ:

The numbers p n(λ) are non-negative and sum up to 1, thus, they define a probability distribution on \(\mathbb{Y}_{n}\) or random Young diagram \(Y^{\chi^{n}}\). As n→∞, these random Young diagrams may possess intriguing properties depending on the sequence of characters χ n .

Kerov, Vershik [38] and, independently, Logan, Shepp [24] proved in the 1970s that if we choose χ n to be the character of the regular representation of \({\mathfrak{S}}_{n}\) then after a proper rescaling the boundary of the Young diagram \(Y^{\chi^{n}}\) converges to a deterministic smooth curve called the limit shape. We will prove a similar result for the extreme block characters \(\tau_{k}^{n}\). More precisely, if n→∞ and \(k\sim c\sqrt{n}\) then the (rescaled) boundary of the Young diagram \(Y^{\tau_{k}^{n}}\) converges to the deterministic limit shape depending on c; similar result holds if \(n-k\sim c\sqrt{n}\). Our limit shapes are the same as those obtained by Biane [4] in the context of tensor representations of \({\mathfrak{S}}_{n}\). This fact could have been predicted since the characters considered by Biane coincide with restrictions of \(\sigma_{z}^{\infty}\) on finite symmetric groups \({\mathfrak{S}}_{n}\) and extreme block characters \(\tau_{k}^{n}\) approximate \(\sigma_{z}^{\infty}\) as n tends to infinity.

From another probabilistic viewpoint, the characters \(\sigma_{1/k}^{\infty}\) (k>0) have been studied in [18] and [20, Sect. III.3]; it was shown that the probability measures on the set of Young diagrams which they define are related to the distributions of eigenvalues of random matrices.

Like ‘supercharacters’ of Diaconis and Isaacs [7], the block characters are constant on big blocks of conjugacy classes (‘superclasses’). The latter feature motivated our choice of the name for this family of functions. However, the block characters do not fit in the theory of ‘supercharacters’ since the extremes \(\tau_{k}^{n}\) (hence their mixtures) are not disjoint in their decomposition over the irreducible traces. The same distinction applies to the derangement characters of finite linear groups as well.

The rest of the paper is organized as follows. In Sect. 2, we introduce families of block characters of \({\mathfrak{S}}_{n}\) and derive their properties. In Sect. 3, we prove that the set of normalized block characters of \({\mathfrak{S}}_{n}\) is a simplex and identify its extreme points (the normalized Foulkes characters). In Sect. 4, we study block characters \(\tau_{k}^{n}\) and their relation with the coinvariant algebra of \({\mathfrak{S}}_{n}\). In Sect. 5, we prove the limit shape theorem for extreme block characters of \({\mathfrak{S}}_{n}\). In Sect. 6, we derive the branching rule for the characters \(\tau_{k}^{n}\)’s as n varies. In Sect. 7, we prove that the set of normalized block characters of \({\mathfrak{S}}_{\infty}\) is a simplex and identify its extreme points. Finally, in Sect. 8, we comment on the relation between the block characters and the derangement characters of GL n (q) for n≤∞.

2 The block characters

Let \({\mathfrak{S}}_{n}\) denote the group of permutations of {1,…,n} and let n (g) be the number of cycles of permutation \(g\in {\mathfrak{S}}_{n}\). A block function on \({\mathfrak{S}}_{n}\) is a function which depends on g only through n (g).

In general, a character of a group G is a complex-valued function χ on G which is

  1. 1.

    Central, i.e., a class function: χ(a −1 ba)=χ(b).

  2. 2.

    Positive definite, i.e., for any finite collection (g i ) of elements of G the matrix with entries \(\chi(g_{i} g_{j}^{-1})\) is a Hermitian non-negative definite matrix.

If gR g is a finite-dimensional matrix representation of a group G, then its trace \(\chi(g)=\operatorname{Trace}(R_{g})\) is a character.

A block character of \({\mathfrak{S}}_{n}\) is a positive-definite block function. A conjugacy class of \(g\in {\mathfrak{S}}_{n}\) is determined by the partition of n into parts equal to the cycle-sizes of g, hence every block function is central, and every block character is indeed a character.

Our starting point is an elementary construction of a family of block characters. Fix an integer k>0 and let \(A^{n}_{k}\) be the set of all words of length n in the alphabet {1,2,…,k}. The group \({\mathfrak{S}}_{n}\) naturally acts in \(A^{n}_{k}\) by permuting positions of letters, and this action defines a unitary representation of the group in \(\mathcal{L}_{2}(A^{n}_{k})\) by the formula:

Let \(R^{n}_{k}\) denote this representation and let \(\sigma^{n}_{k}\) be its character. Furthermore, let \(\widehat{S}^{n}\) be the one-dimensional sign representation of \({\mathfrak{S}}_{n}\), which has the block character \(g\mapsto (-1)^{n-\ell_{n}(g)}\), and let \(\widehat{R}^{n}_{k}\) be the tensor product of the representations \(\widehat{S}^{n}\) and \(R^{n}_{k}\):

Let \(\widehat{\sigma}_{k}^{n}\) be the matrix trace of \(\widehat{R}^{n}_{k}\).

Proposition 2.1

The functions \(\sigma^{n}_{k}\) and \(\widehat{\sigma}^{n}_{k}\) are block characters of \({\mathfrak{S}}_{n}\). Explicitly,

Proof

The trace of \(R_{k}^{n}(g)\) is equal to the number of words in \(A_{n}^{k}\) fixed by g. A word is fixed if within each set of positions comprising a cycle the letters occupying these positions are the same, whence the first formula. The second formula follows from the multiplication rule for traces of tensor products. □

The possible values of the function n on \({\mathfrak{S}}_{n}\) are integers 1,…,n. On the other hand, the determinant of the matrix (k ) k,∈{1,…,n} is a nonzero Vandermonde determinant; therefore, the n characters \(\sigma^{n}_{1},\dots, \sigma^{n}_{n}\) comprise a basis of the linear space of block functions.

We are mostly interested in the extreme rays of the cone of block characters. Now we define another family of block characters which (as we will see later) generate these rays.

Definition 2.2

For k=1,…,n define

$$ \tau_k^n:=\sum_{j=0}^{k-1} (-1)^j {{n+1} \choose {j}} \sigma^n_{k-j}. $$
(2)

Clearly, \(\tau_{k}^{n}\) is a block function, although it is not obvious whether it is a character. The formula (2.2) can be inverted as follows:

Proposition 2.3

We have

$$ \sigma_k^n=\sum_{j=0}^{k-1} {n+j\choose j} \tau^n_{k-j}. $$
(3)

Proof

The inversion formula is equivalent to the identity (for α>0)

(where and henceforth 1(⋯) is 1 when ⋯ is true and 0 otherwise). The identity is derived by substituting the generating function

in (1−x)α(1−x)α−1=0 and equating the coefficients to 0. □

Our next aim is to decompose \(\sigma_{k}^{n}\) and \(\tau_{k}^{n}\) into linear combinations of irreducible characters of \({\mathfrak{S}}_{n}\). Let us introduce some notations first.

A partition of n is a finite nondecreasing sequence λ=(λ 1,λ 2,…,λ ) of positive integers such that |λ|:=∑λ i =n. We identify partition λ with its Young diagram defined as the set \({\{(i,j)\in\mathbb{Z}^{2}_{+}: 1\le j\le \lambda_{i}\}}\). We call an element x=(i,j)∈λ a box, and draw it as a unit square at location (i,j) (with the English convention that (1,1) is at the top left and the first coordinate is vertical). Let \(\mathbb{Y}_{n}\) denote the set of all partitions of n and let \(\mathbb{Y}_{n}^{k}\) denote the set of all partitions of n with at most k non-zero parts (i.e., such that k).

It is well-known that irreducible representations of \({\mathfrak{S}}_{n}\) are enumerated by the elements of \(\mathbb{Y}_{n}\). For \(\lambda\in \mathbb{Y}_{n}\) we denote V λ and χ λ the irreducible representation corresponding to λ and the character (matrix trace) of this representation, respectively.

A Young tableau T of shape λ is a map assigning to boxes of the Young diagram λ positive integer entries which are non-decreasing along rows and columns. We denote T(x) the entry assigned to box x. A Young tableau is semistandard if the entries strictly increase along the columns. A Young tableau T of shape λ is standard if the set of entries of T is {1,2,…,|λ|}. For a standard Young tableau T of shape λ, a descent is an integer 0<i<|λ| such that the entry i+1 appears in T below entry i, that is to say, the vertical coordinate of T −1(i+1) is greater than that of T −1(i). The number of descents is denoted d(T). Figure 1 gives an example.

Fig. 1
figure 1

Standard Young tableau of shape (4,3,1) with the set of descents {3,5,6}

Now we proceed to the decomposition of the characters \(\sigma_{k}^{n}\) and \(\tau_{k}^{n}\).

Proposition 2.4

We have

where s k (λ) is the number of semistandard Young tableaux of shape λ with entries belonging to the set {1,…,k}.

Proof

Let W k be a k-dimensional vector space with basis w 1,…,w k . The symmetric group \({\mathfrak{S}}_{n}\) acts on the tensor power \(W^{\otimes n}_{k}\) by permuting the factors. Note that the basis of \(W^{\otimes n}_{k}\) is enumerated by the elements of \(A^{n}_{k}\), thus the representation in \(W^{\otimes n}_{k}\) is equivalent to \(R^{n}_{k}\). The decomposition of the representation in \(W^{\otimes n}_{k}\) into irreducibles is a well-known fact related to the Schur–Weyl duality (see, e.g., [39]):

$$ W^{\otimes n}_k=\bigoplus_{\lambda\in \mathbb{Y_n^k}} \operatorname{Dim}_k(\lambda)\cdot V^{\lambda}, $$
(4)

where \(\operatorname{Dim}_{k}(\lambda)\) is equal to the number of semistandard Young tableaux with entries from {1,…,k} (which, in turn, equals to the dimension of the irreducible representation, corresponding to λ, of the group of unitary matrices of size k). The proposition easily follows. □

Proposition 2.5

We have

where m k (λ) is the number of standard Young tableaux of shape λ with k−1 descents.

Proof

Proposition 2.4 and Definition 2.2 imply that

where

Thus, it remains to prove that h k (λ)=m k (λ), which amounts to showing that for every \(\lambda\in\mathbb{Y}_{n}\) and every k=1,2,…,n we have

$$ m_k(\lambda)=\sum _{j=0}^{k-1} (-1)^j {{n+1} \choose {j}} s_{k-j}(\lambda). $$
(5)

Similarly to the proof of Proposition 2.3, Eq. (5) is equivalent to the inversion formula

$$ s_k(\lambda)=\sum _{j=0}^{k-1} {n+j\choose j} m_{k-j}(\lambda), $$
(6)

which will be shown combinatorially (see also [31, Eq. (7.96)] for the proof of a more general fact).

Fix a standard Young tableau T of shape λ. We call a non-decreasing integer sequence X=(X 1,…,X n ) T-admissible if 1≤X 1, X n k, and X i <X i+1 for every descent i of T. Given a pair (T,X), where X is T-admissible, we define a semistandard tableau Y T,X of the same shape λ by setting Y T,X (T −1(j))=X j for j=1,…,n. An example of such procedure is shown in Fig. 2.

Fig. 2
figure 2

Standard Young tableau T of shape λ=(4,3,1) with 3 descents (top-left), numbers {1,…,8} with descents of T marked by hats (top-right), lexicographically minimal T-admissible sequence X (bottom-right) and corresponding semistandard Young tableau Y T,X (bottom-left)

Observe that for a given standard Young tableau T with kj−1 descents there are \({n+j\choose j}\) ways to choose a T-admissible sequence X. One easily proves that the map (T,X)→Y T,X is a bijection between the set of pairs (T,X) and the set of semistandard Young tableaux of shape λ with entries in {1,…,k}. The number of pairs (T,X), where T is a standard Young tableau of shape λ and X is a T-admissible sequence, is equal to the right-hand side of (6), while the number of semistandard Young tableaux of shape λ with entries in {1,…,k} is s k (λ), so we are done. □

Corollary 2.6

Block functions \(\tau_{k}^{n}\) are traces of some representations of \({\mathfrak{S}}_{n}\), in particular, they are characters.

Proof

By Proposition 2.5, \(\tau_{k}^{n}\) is the matrix trace of the representation

$$ \pi_k^n \cong \bigoplus_{\lambda\in \mathbb{Y}_n} m_k(\lambda) V^\lambda. $$
(7)

 □

There is a certain duality among the characters \(\tau_{k}^{n}\), as described in the following proposition.

Proposition 2.7

Let \(\pi_{k}^{n}\) be the representation with character \(\tau_{k}^{n}\). As above, let \(\widehat{S}^{n}\) be the one-dimensional sign representation of \({\mathfrak{S}}_{n}\). Then the representation \(\pi_{k}^{n} \otimes \widehat{S}^{n}\) is equivalent to \(\pi^{n}_{n+1-k}\), in particular, its character coincides with \(\tau_{n+1-k}^{n}\).

Proof

We multiply (7) by \(\widehat{S}^{n}\) and use the fact that \(V^{\lambda}\otimes\widehat{S}^{n}\) is equivalent to V λ, where λ′ is the transposed diagram obtained by reflecting λ about the main diagonal, so that the row lengths of λ′ become the column lengths of λ. We obtain:

We claim that m k (λ′)=m n+1−k (λ). Indeed, if T is a standard Young tableau of shape λ, then every i=1,…,n−1 is either a descent in T or a descent in T′, where T′ is the tableau of shape λ′ defined by T′(i,j)=T(j,i). In particular, d(T)+d(T′)=n−1. □

Corollary 2.8

We have

$$ \widehat{\sigma}_k^n=\sum_{j=0}^{k-1} {n+j\choose j} \tau^n_{n+1-k+j}. $$
(8)

We shall give now two further examples of block characters. Formula (3) generalizes as

$$ \theta^{\ell_n(g)}= \sum_{j=1}^n{\theta+n-j\choose n} \tau_j^n(g), $$
(9)

where the generalized binomial coefficient involves parameter θ>0. This is a polynomial identity which may be shown by extrapolating from the integer values θ∈{1,…,n}. For θ≥0 the function \(g\mapsto p\theta^{\ell_{n}(g)}\), where p −1=θ(θ+1)⋯(θ+n−1), is a probability on \({\mathfrak{S}}_{n}\), known as Ewens’ distribution; see, e.g., [3, 33]. Ewens’ distribution was first discovered in the context of population genetics and has become central in many contexts of pure and applied probability. It is also intensively used in the harmonic analysis on \({\mathfrak{S}}_{\infty}\), see [22].

As a simple corollary of the results in the next section, the function \(\theta^{\ell_{n}(g)}\) is a character for θn−1, but it is not positive definite for non-integer 0<θ<n−1.

Another natural series of characters is obtained by splitting the set \(A_{k}^{n}\) of words on k letters in subsets invariant under \({\mathfrak{S}}_{n}\). Taking the set of words with exactly k letters, the character of the corresponding representation is equal to the number of such words fixed by g, which by the inclusion–exclusion principle is equal to

$$ \psi_k^n:=\sum_{j=0}^{k-1} (-1)^j{k\choose j}\sigma_{k-j}^n=\sum _{j=1}^k (-1)^{k-j}{k\choose j} \sigma_j^n. $$
(10)

From Proposition 2.4 and an inversion formula, one finds

Both ψ- and τ-characters can be obtained by the iterated differencing of the sequence \(\sigma^{n}_{\cdot}\) (with \(\sigma_{k}^{n}=0\) for k≤0). Specifically, introducing the forward and backward difference operators acting as ∇(x ) k =x k x k−1 and Δ(x ) k =x k+1x k , respectively, we have \(\tau_{k}^{n}=\nabla^{n}(\sigma_{\cdot}^{n})_{k}\) and \(\psi_{k}^{n}=\Delta^{k}(\sigma^{n}_{\cdot})_{0}\).

3 The simplex of normalized block characters of \({\mathfrak{S}}_{n}\)

The set of all block characters is a convex cone of non-negative linear combinations of the extreme normalized block characters. In this section, we prove that the normalized block characters form a simplex whose extreme points are the normalized versions of characters \(\tau_{k}^{n}\). A closely related result can be found in Sect. 2 of [19].

Because the functions \(\{\sigma_{k}^{n}\}_{k=1,\dots,n}\) form a basis of the linear space of block functions, and because the systems of functions \(\{\tau_{k}^{n}\}\) and \(\{\sigma_{k}^{n}\}\) are related by a triangular linear transform, it follows that every block function φ on \({\mathfrak{S}}_{n}\) can be uniquely written in the basis of characters \(\tau_{1}^{n}, \dots,\tau_{n}^{n}\) as a linear combination

On the other hand, the traces of irreducible representations χ λ, \(\lambda\in \mathbb{Y}_{n}\), comprise an orthonormal basis in the space of central functions on \({\mathfrak{S}}_{n}\) endowed with the scalar product

Now, immediately from Proposition 2.5, we have

for \(\rho_{k}^{n}\) the hook diagram with k rows: \(\rho_{k}^{n}=(n-k+1,1^{k-1})\). Indeed, there are \({n-1\choose k-1}\) standard Young tableaux of shape \(\rho_{k}^{n}\), and every such tableau has exactly k−1 descents.

Corollary 3.1

A block function φ is a character if and only if a k ≥0 for k=1,…,n.

Proof

If a k ≥0 for k=1,…,n, then

$$ \varphi=\sum_{\lambda\in \mathbb{Y}_n} b( \lambda) \chi^{\lambda}, $$
(11)

with non-negative coefficients b(λ). Since the traces of irreducible representation χ λ are positive-definite, so is φ.

If φ is positive-definite, then in the decomposition (11) all coefficients b(λ) are non-negative, in particular, \(b(\rho^{n}_{k})\ge 0\) for k=1,…,n. It follows that a k ≥0 for k=1,…,n. □

Corollary 3.2

The set of normalized block characters is a simplex whose extreme points are the normalized characters \(\tau_{k}^{n}(\cdot)/\tau_{k}^{n}(e)\), k=1,…,n.

Corollary 3.3

Suppose θ is not an integer. Then \(\theta^{\ell_{n}(g)}\) is a character if and only if θ>n−1.

Proof

Indeed, all the coefficients in (9) are non-negative if and only if θ>n−1. □

4 Characters \(\tau_{k}^{n}\) and the coinvariant algebra

In this section, we construct representations \(\pi^{n}_{k}\) with traces \(\tau_{k}^{n}\). We start with computation of the dimension \(\tau^{n}_{k}(e)\). We give first a purely combinatorial proof, an alternative representation-theoretic proof will be given at the end of this section, and yet another proof based on the branching rule will be retained for the next section.

Let \(g=(g(1),\dots,g(n))\in {\mathfrak{S}}_{n}\) be a permutation written in the one-row notation. The descent number d(g) counts descents, that is, positions j such that g(j+1)<g(j). In particular, the descent number of the identity permutation e=(1,2,…,n) is 0, while the descent number of the reverse permutation (n,…,2,1) is n−1. The Eulerian number counts permutations from \({\mathfrak{S}}_{n}\) whose descent number is k−1.

Proposition 4.1

The dimension of the representation of \({\mathfrak{S}}_{n}\) with character \(\tau_{k}^{n}\) is the Eulerian number

Proof

We refer to [11] for the Robinson–Schensted–Knuth (RSK) correspondence between permutations and pairs of standard Young tableaux. The RSK has the following property: g(j+1)<g(j) in permutation g if and only if entry j is a descent in the recording tableau (see [11]).

It follows that the RSK sends permutations with k descents to pairs of Young tableaux such that the recording tableau has k descents. Thus

$$ \biggl < \begin{matrix} n\\ k\end{matrix} \biggr >=\sum_{\lambda\in \mathbb{Y}_n} m_k(\lambda) \operatorname{dim}(\lambda), $$
(12)

where \(\operatorname{dim}(\lambda)=\chi^{\lambda}(e)\) is the dimension of the corresponding irreducible representation, equal to the number of standard Young tableaux of shape λ. From the latter classical fact and comparing (12) with Proposition 2.5, the proof of the proposition is concluded. □

Now we proceed to the construction of representations \(\pi_{k}^{n}\).

Proposition 4.2

The regular representation \(R_{\mathrm{reg}}^{n}\) of \({\mathfrak{S}}_{n}\) can be decomposed as

$$ R^n_\mathrm{reg}=\pi_1^n \oplus \pi_2^n\oplus\dots \oplus \pi_n^n, $$
(13)

where representation \(\pi_{k}^{n}\) has trace \(\tau_{k}^{n}\).

Proof

Indeed, we have

where \(\operatorname{\dim}(\lambda)=\sum_{k} m_{k}(\lambda)\) is the dimension of π λ, that is, the number of standard Young tableaux of shape λ. Here χ reg(g) is the character of the regular representation of \({\mathfrak{S}}_{n}\). □

In the remaining part of this section, we focus on an explicit construction of the decomposition (13).

Let us denote \({\mathcal{R}}_{n}\) the algebra of polynomials in variables x 1,…,x n . The symmetric group \({\mathfrak{S}}_{n}\) acts naturally on \({\mathcal{R}}_{n}\) by permuting the variables. Let \({\mathcal{R}}_{n}^{{\mathfrak{S}}_{n}}\) denote the subalgebra of invariants of this action, which is the algebra of symmetric polynomials. The coinvariant algebra \({\mathcal{R}}^{*}_{n}\) is the quotient-algebra

where I n is the ideal in \({\mathcal{R}}_{n}\) spanned by the symmetric polynomials without constant term. The elementary symmetric polynomials

is the set of algebraic generators of \({\mathcal{R}}_{n}^{{\mathfrak{S}}_{n}}\), thus I n is an ideal spanned by the polynomials e 1,…,e n . We denote by P n the canonical projection:

Observe that \({\mathcal{R}}^{*}_{n}\) inherits from \({\mathcal{R}}_{n}\) the structure of a \({\mathfrak{S}}_{n}\)-module, and that the projection P n is an intertwining operator. It is known that the representation of \({\mathfrak{S}}_{n}\) in \({\mathcal{R}}^{*}_{n}\) is equivalent to the left regular representation; see, e.g., [5] or [17, Sect. 3.6].

Given a multidegree p=(p 1,…,p n ), let λ(p) be a partition obtained by rearranging the coordinates of p in a non-increasing order. In what follows, we write λ<μ for two partitions λ and μ if (λ 1,λ 2,…) precedes (μ 1,μ 2,…) in the lexicographic order. For a polynomial

its partition degree \(\operatorname{pdeg}(f)\) is a minimal partition μ such that b p =0 each time λ(p)>μ.

We need the following filtration \({\mathcal{F}}_{n}(k), k=0,1,\dots\) of the algebra \({\mathcal{R}}_{n}\):

where index 1 refers to the largest part of the partition. Note that \({\mathcal{F}}_{n}(k)\) is a \({\mathfrak{S}}_{n}\) submodule of \({\mathcal{R}}_{n}\). Also, denote

and observe that \({\mathcal{F}}_{n}^{*}(k)\) is a \({\mathfrak{S}}_{n}\) submodule of \({\mathcal{R}}_{n}^{*}\).

Next, we want to introduce the so-called Garsia–Stanton descent basis in \({\mathcal{R}}^{*}_{n}\), which agrees with filtration \({\mathcal{F}}^{*}_{n}(k)\).

For a permutation \(g\in {\mathfrak{S}}_{n}\), let D(g) be the set of its descents and let d i (g)=|D(g)∩{i,i+1,…,n}|. For a standard Young tableau T of shape λ with |λ|=n, let D(T) be the set of its descents and let d i (T)=|D(T)∩{i,i+1,…,n}|.

The descent monomial u g of permutation g is defined as

In our notation, \(\operatorname{pdeg}(u_{g})=(d_{1}(g),\dots,d_{n}(g))\).

Theorem 4.3

[1, 2, 12]

Classes u g +I n , \(g\in {\mathfrak{S}}_{n}\), form a linear basis of \({\mathcal{R}}^{*}_{n}\). If a polynomial f belongs to u g +I n , then \(\operatorname{pdeg}(f)\ge \operatorname{pdeg}(u_{g})\).

Theorem 4.3 implies the following description of \({\mathcal{F}}^{*}_{n}(k)\). For a class \(f^{*}\in {\mathcal{R}}^{*}_{n}\), we denote by \(\operatorname{pdeg}(f^{*})\) the minimum value of \(\operatorname{pdeg}\) on all polynomials of the class f . Then

and also

The following theorem explains the relevance of the filtration \({\mathcal{F}}^{*}_{n}(k)\) to the study of block characters.

Theorem 4.4

Let \(\pi_{k}^{n}\) be the representation of \({\mathfrak{S}}_{n}\) with character \(\tau_{k}^{n}\). We have the following isomorphism of \({\mathfrak{S}}_{n}\)-modules:

and

Remark

This theorem gives another way to prove that the dimension of \(\pi_{k}^{n}\) is .

Proof of Theorem 4.4

Let \(\bar{q} =(q_{1},q_{2},q_{3},\dots)\) be a sequence of formal variables. For a permutation \(s\in {\mathfrak{S}}_{n}\), let \(\operatorname{ Tr}^{\bar{q}}_{{\mathcal{R}}^{*}_{n}}(s)\) denote its graded trace in the representation in \({{\mathcal{R}}^{*}_{n}}\):

where 〈⋅,⋅〉 is the inner product on \({{\mathcal{R}}^{*}_{n}}\), such that the elements u g form an orthonormal basis. The following formula for the above graded trace was obtained in Sect. 4.2 of [1]:

where SYT(λ) is the set of all standard Young tableau of shape λ. Now setting q 1=q, q 2=⋯=q n =1, we obtain the identity

$$ \sum_{g\in {\mathfrak{S}}_n} \bigl\langle s(u_g),u_g \bigr\rangle q^{d_1(g)}= \sum _{\lambda\in \mathbb{Y}_n} \chi^{\lambda}(s) \sum _{T\in \,\mathrm {SYT}(\lambda)} q^{d_1(T)}. $$
(14)

Note that

$$ \sum_{\lambda\in \mathbb{Y}_n} \chi^{\lambda}(s) \sum_{T\in \,\mathrm {SYT}(\lambda)} q^{d_1(T)}= \sum_{k=1}^n q^{k-1} \sum _{\lambda} \chi^{\lambda}(s) m_k(\lambda) $$
(15)

and

$$ \sum_{g\in {\mathfrak{S}}_n} \bigl\langle s(u_g),u_g \bigr\rangle q^{d_1(g)}=\sum _{k=1}^n q^{k-1} \widetilde{\tau}^n_k(s), $$
(16)

where \(\widetilde{\tau}^{n}_{k}\) is the matrix trace of the representation of \({\mathfrak{S}}_{n}\) in \({\mathcal{F}}^{*}_{n}(k) / {\mathcal{F}}^{*}_{n}(k-1)\). Combining (14), (15), and (16), we get

Therefore,

$$ \widetilde{\tau}_k^n(s)= \sum _{\lambda} \chi^{\lambda}(s) m_k( \lambda). $$
(17)

Comparing (17) with Proposition 2.5, we conclude that \(\tau^{n}_{k}=\widetilde{\tau}^{n}_{k}\). □

5 Limit shapes

In this section, we prove that the decomposition of the characters \(\tau^{k}_{n}\) into the linear combination of the characters of irreducible representations of \({\mathfrak{S}}_{n}\) leads to certain limit shape theorems.

Following [38] and [4], given a Young diagram λ, we construct a piecewise-linear function f λ (x), x∈ℝ with slopes ±1 as shown in Fig. 3.

Fig. 3
figure 3

Young diagram λ=(3,3,1) and the graph y=f λ (x) of the corresponding piecewise-linear function

Now let χ be a character of \({\mathfrak{S}}_{n}\). Recall that irreducible representations of \({\mathfrak{S}}_{n}\) are parameterized by the Young diagrams with n boxes and write

Define

and note that P(λ) are non-negative numbers summing up to 1. Therefore, P(λ) defines a probability distribution on the set \(\mathbb{Y}_{n}\) or, equivalently, on piecewise-linear functions. Let us denote by f χ(⋅) the resulting random piecewise-linear function.

Biane [4] proved the following concentration theorem about the behavior of random piecewise-linear functions corresponding to characters \(\sigma^{n}_{k}\).

Theorem 5.1

Let n,k→∞ in such a way that \(k/\sqrt{n}\to w>0\), then for any ε>0

Here g w (x) is a deterministic function (depending on w).

The explicit formulas for the functions g w (x) are quite involved, they can be found in [4, Sect. 3]. As w→∞, the curves g w (x) approach the celebrated Vershik–Kerov–Logan–Shepp curve, which is a limit shape for the Plancherel random Young diagrams, see [24, 38].

It turns out that the limit behavior of the random functions corresponding to the extreme block characters \(\tau^{n}_{k}\) is described by the very same curves g w (x).

Theorem 5.2

If n,k→∞ in such a way that \(k/\sqrt{n}\to w>0\), then for any ε>0

If n,k→∞ in such a way that \((n-k)/\sqrt{n}\to w>0\), then for any ε>0

The proof of Theorem 5.2 is based on the following observation.

Proposition 5.3

The following estimate holds:

where c(k,n)→0 as n,k→∞ in such a way that \(k/\sqrt{n}\to w>0\).

Proof

Propositions 2.5 and 4.1 imply that

while Proposition 2.4 yields

Then Eq. (5) implies that

(18)

Let us analyze the coefficients c j,k,n . We have

Now if n,k→∞ in such a way that \(k/\sqrt{n}\to w>0\) and j<k, then for large enough n,

On the other hand, summing (18) over all \(\lambda\in\mathbb{Y}_{n}\), we conclude that

Therefore, for large enough n, we have

and for 0<j<k we have

Hence,

which vanishes as n→∞. □

Now Theorem 5.2 becomes a simple corollary of Theorem 5.1.

Proof of Theorem 5.2

First, suppose that n,k→∞ in such a way that \(k/\sqrt{n}\to w>0\), then for any ε>0

Next, suppose that n,k→∞ in such a way that \((n-k)/\sqrt{n}\to w>0\). Proposition 2.7 yields that the distributions of \(f^{\tau^{n}_{k}}(x)\) and \(f^{\tau^{n}_{n-k}}(-x)\) coincide. Hence we get the second claim of Theorem 5.2. □

6 Branching rules

Let us embed \({\mathfrak{S}}_{n-1}\) into \({\mathfrak{S}}_{n}\) as the subgroup of permutations fixing n. For a central function χ on \({\mathfrak{S}}_{n}\) the restriction \(\operatorname{Res}_{n-1}(\chi)\) on \({\mathfrak{S}}_{n-1}\) is a central function on \({\mathfrak{S}}_{n-1}\). If χ is a block function, then so is the restriction \(\operatorname{Res}_{n-1} \chi\). The following two propositions describe what happens with the characters \(\sigma_{k}^{n}, \tau_{k}^{n}\) by restricting them to the subgroup; we call these formulas the branching rules.

Proposition 6.1

For 1≤kn we have

In terms of the normalized characters, the relations can be rewritten as:

Proof

This immediately follows from n+1(g)= n (g)+1. □

Proposition 6.2

For 1<k<n we have

Proof

Observe the binomial coefficients identity

Indeed, this is equivalent to

which is easy to check.

Then using the definition of \(\tau_{k}^{n}\) and the branching rule for \(\sigma_{k}^{n}\), we obtain

as wanted. □

Remark

Proposition 6.2 suggests another way to prove Proposition 4.1. Indeed, evaluating at e, we obtain the relation

which coincides with the well-known recursion for Eulerian numbers.

7 The Choquet simplex of block characters of \({\mathfrak{S}}_{\infty}\)

We proceed with a classification of block characters of the infinite symmetric group \({\mathfrak{S}}_{\infty}\). Let us recall the definition of the group. Let ℤ+ denote the set of positive integers. Realize the group \({\mathfrak{S}}_{n}\) as the group of bijections g:ℤ+→ℤ+ which may only move the first n integers, that is, satisfy g(j)=j for j>n. This yields a natural embedding \({\mathfrak{S}}_{n}\subset {\mathfrak{S}}_{n+1}\) and allows one to introduce the infinite symmetric group as an inductive limit

whose elements are the bijections g:ℤ+→ℤ+ satisfying g(j)=j for all sufficiently large j.

The notion of a block character for the group \({\mathfrak{S}}_{\infty}\) needs to be adapted. We call the statistic c(g):=n n (g) the decrement of permutation \(g\in {\mathfrak{S}}_{n}\). Note that the concept is consistent with embeddings, that is, considering g as an element of \({\mathfrak{S}}_{n+1}\), the decrement remains the same: n n (g)=n+1− n+1(g). Thus, c(g) is a well defined function on \({\mathfrak{S}}_{\infty}\).

A block function on \({\mathfrak{S}}_{\infty}\) is defined as a function which depends on a permutation through its decrement. A positive definite normalized block function will be called normalized block character of \({\mathfrak{S}}_{\infty}\). Two trivial examples are the unit character and the delta function at e.

We introduce next the analogues of characters \(\sigma^{n}_{k}\) and \(\widehat{\sigma}^{n}_{k}\). Denote

and set

$$ \sigma^\infty_z(g):=z^{c(g)}, \quad\mbox{for } z\in \mathbb{V}, $$
(19)

where \(\sigma^{\infty}_{0}\) is understood as the delta-function at e

Keep in mind that in our parametrization \(\sigma^{\infty}_{1/k}\) is a counterpart of \(\sigma_{k}^{n}\) and \(\sigma^{\infty}_{-1/k}\) is an analogue of \(\widehat{\sigma}_{k}^{n}\), for k=1,2,….

Proposition 7.1

The functions \(\sigma^{\infty}_{z}, z\in \mathbb{V}\), are normalized block characters of \({\mathfrak{S}}_{\infty}\).

Proof

For \(\sigma^{\infty}_{0}\) the statement is trivial. As for \(\sigma^{\infty}_{\pm 1/k}\) they are, clearly, central normalized functions on \({\mathfrak{S}}_{\infty}\). Note that

$$ \sigma^{\infty}_{1/k} (g)= \frac{\sigma^n_k(g)}{\sigma^n_k(e)} $$
(20)

for \(g\in {\mathfrak{S}}_{n}\), and similarly for \(\sigma^{\infty}_{-1/k}\). Therefore, the positive-definiteness of \(\sigma^{n}_{k}\) and \(\widehat{\sigma}^{n}_{k}\) implies the positive-definiteness of \(\sigma^{\infty}_{\pm 1/k}\). □

The characters \(\sigma_{z}^{\infty}\) can be associated with certain infinite-dimensional representations of \({\mathfrak{S}}_{\infty}\). One way to establish the connection is to realize \(\sigma_{z}^{\infty}\) as the trace of a finite von Neumann factor representation, which may be seen as a substitute of irreducible representation, see [34, 36]. Another approach is to consider spherical representations of the Gelfand pair \((\bar{G}, \bar{K})\), where \(\bar{G}\) and \(\bar{K}\) are certain extensions of the groups \({\mathfrak{S}}_{\infty}\times {\mathfrak{S}}_{\infty}\) and \({\mathfrak{S}}_{\infty}\), respectively, see [26, 27] and also [22] for details and references. By both approaches, the character \(\sigma^{\infty}_{0}\) corresponds to the regular representation of \({\mathfrak{S}}_{\infty}\).

It follows readily from a multiplicative property of the extremes [36] that the characters \(\sigma^{\infty}_{z}, z\in \mathbb{V}\), are in fact extreme points in the set of all normalized characters of \({\mathfrak{S}}_{\infty}\). In Thoma’s parametrization (see [36] and [21, 25, 37]) of the extremes by two nonincreasing sequences (α,β), \(\sigma^{\infty}_{1/k}\) corresponds to α 1=⋯=α k =1/k,β=0, while \(\widehat{\sigma}^{\infty}_{1/k}\) corresponds to the pair α=0,β 1=⋯=β k =1/k.

Note that for a finite n the situation is different: as we saw, the representations with traces \(\sigma^{n}_{k}\) and \(\widehat{\sigma}^{n}_{k}\) are reducible, so their normalized traces are not extreme characters. A priori, there is no reason for the list \(\{\sigma^{\infty}_{z}, z\in\mathbb{V}\}\) to exhaust all extreme normalized block characters of \({\mathfrak{S}}_{\infty}\). The following theorem asserts that this is indeed the case.

Theorem 7.2

Let Ω be the infinite-dimensional simplex of non-negative sequences \((\gamma_{z}, z\in\mathbb{V})\) with1/z∈ℤ∪{∞} γ z =1. Endowed with the topology of point-wise convergence, the set of normalized block characters of \({\mathfrak{S}}_{\infty}\) is a Choquet simplex. The correspondence

is an affine homeomorphism between Ω and the Choquet simplex of normalized block characters of \({\mathfrak{S}}_{\infty}\). In particular, the set of extreme normalized block characters of \({\mathfrak{S}}_{\infty}\) is \(\{\sigma^{\infty}_{z}, z\in \mathbb{V}\}\).

Proof

Let χ be a block character of \({\mathfrak{S}}_{\infty}\) and let χ n denote the restriction of χ on the subgroup \({\mathfrak{S}}_{n}\). Then χ n is a normalized block character of \({\mathfrak{S}}_{n}\), hence, by Proposition 3.2, it can be uniquely decomposed as a convex combination as follows:

where the array of coefficients satisfies

$$ a^n_k\ge 0, \quad \sum_{k=1}^n a^n_k=1. $$
(21)

By Proposition 6.2, the coefficients \(a^{n}_{k}\) satisfy the following backward recursion: for 1≤knn=1,2,…:

(22)

Moreover, the correspondence \(\chi \leftrightarrow (a^{n}_{k})\) is an affine homeomorphism between normalized block characters and arrays \((a_{k}^{n})\) satisfying (21) and (22) (Condition (22) implies that in (21) it is enough to require \(a_{1}^{1}=1\)).

The fact that the set of arrays \((a_{k}^{n})\) satisfying (21) and (22) is a Choquet simplex is just a particular case of a very general result; see, e.g., [15, Proposition 11.6]. As for the extreme points of this simplex, it was shown in [14] that all extreme solutions to the recursion(22) subject to the constraints (21) are of one of the types

(23)
(24)
(25)

It is possible to write all three types as a single factorial formula (see [14], Eq. (2)).

Calculating with (23), we arrive, in view of Proposition 2.3, at

which means that the array (23) corresponds to \(\sigma_{z}^{\infty}\) with 1/z=K∈{1,2,…}. Similarly, with (24) and (8) the convex combination is \(\widehat{\sigma}_{K}^{n}\), so we arrive at \(\sigma_{z}^{\infty}\) with 1/z=−K∈{−1,−2,…}. Finally, with the array (25) the convex combination obtained is the normalized character of the regular representation of \({\mathfrak{S}}_{n}\), as it follows from (13), and this corresponds to \(\sigma_{0}^{\infty}\). Thus all extreme block characters of \({\mathfrak{S}}_{\infty}\) have been identified with (19). □

8 Connection to the characters of the linear groups over the Galois fields

The group GL (q) is the group of infinite matrices of the kind

where h is a finite square matrix with coefficients from the Galois field \(\mathbb{F}_{q}\) with q elements, 1 denotes the infinite unit matrix, and 0’s are zero matrices of suitable dimensions. From this definition it is clear that the group has the structure of inductive limit GL (q)=⋃ n≥1 GL n (q). Thoma [35] conjectured that all extreme normalized characters of GL (q) are of the form

$$ \chi(g)= \epsilon(\det g) q^{-mc(g)}, \quad m\in \mathbb{Z}_{\ge 0}\cup\{\infty\}, $$
(26)

where ϵ is a one-dimensional character of the cyclic group \(\mathbb{F}_{q}^{*}\), and c(g) is the rank of the matrix gId. The conjecture was proved by Skudlarek [29]. Thoma’s characters gq mc(g) belong to the class of derangement characters introduced in [13] as the characters of GL n (q) (respectively, GL (q)) which only depend on c(g).

As mentioned in the Introduction, a connection between block characters and derangement characters is established via the natural embedding \({\mathfrak{S}}_{\infty}\hookrightarrow \mathit{GL}_{\infty}(q)\) obtained by writing a permutation as a permutation matrix. The decrement of permutation is equal to the rank of the matrix gId. Furthermore, the determinant of the permutation matrix is ±1 depending on the parity of the permutation.

Each extreme character (26) restricts from GL (q) to \({\mathfrak{S}}_{\infty}\) as one of the characters \(\sigma^{\infty}_{z}\) with \(z=\pm q^{-m}, m\in \mathbb{Z_{\ge 0}}\cup \{+\infty\}\), unless q is a power of 2. It q is a power of 2, then −1=1 in \(\mathbb{F}_{q}\), hence the negative values are excluded. For other values of z, the function gz c(g) is not positive definite on GL (q).

For \({\mathfrak{S}}_{n}\) we constructed n extreme characters \(\tau_{k}^{n}\) from the basic characters in just one step, by differencing the series \(\sigma_{1}^{n},\dots,\sigma_{n}^{n}\). For GL n (q) the situation is more involved: one first needs to construct q-analogues of characters (10), then proceed with further differencing and search for the extremes. Moreover, the set of normalized derangement characters of GL n (q) is known to be a simplex for n∈{1,2,3,4,5,6,8,9,11,12} and it is not a simplex (has more than n extremes) for n∈{7,10,13,14,15,16,17,18,19,20,21,22} (see [13]). Another, more substantial, distinction occurs on the level of infinite groups. For GL (q) the extreme normalized derangement characters exhaust all extreme normalized characters of the group (up to tensoring with a linear character, and literally all for the special linear group SL (q)). For \({\mathfrak{S}}_{\infty}\) the extreme normalized block characters comprise only a countable subset of the infinite-dimensional Thoma simplex of all extreme normalized characters.