1 Introduction

The notion of k-closure of a permutation group G on a set \(\Omega \) was introduced in 1969 by Wielandt [30, Definition 5.3]: namely the k-closure \(G^{(k),\Omega }\) of G is the set of all \(g\in \textrm{Sym}(\Omega )\) (permutations of \(\Omega \)) such that g leaves invariant each G-orbit in the induced G-action on ordered k-tuples from \(\Omega \). The k-closure \(G^{(k),\Omega }\) is a subgroup of \(\textrm{Sym}(\Omega )\) containing G [30, Theorem 5.4], and a permutation group G is said to be k-closed if \(G^{(k),\Omega }=G\). Determining when G is k-closed can be important when studying the relational complexity of G. This is the smallest positive integer r such that, whenever A and B are ordered n-tuples from \(\Omega \) with \(n \geqslant r\), and each r-subtuple of A lies in the same G-orbit as the corresponding r-subtuple of B, it is also the case that \(A \in B^G\). The relational complexity of G is very difficult to calculate in general, but it is bounded below by the smallest integer k such that G is k-closed, as shown in [16, Lemmas 1.6.5]; see that monograph for an excellent introduction to relational complexity.

The k-closures for different faithful permutation representations of the same group G may be quite different, even for the symmetric group \(S_3\) (see [13, p. 240]). We are interested in the stronger concept of total k-closure, which is independent of the permutation representation of the group. A group G is said to be totally k-closed if for each set \(\Omega \) upon which G acts faithfully we have \(G=G^{(k),\Omega }\). The concept was suggested in 2016 by Holt [1], and first studied by Abdollahi and Arezoomand [2] when \(k=2\).

For a permutation group \(G\leqslant \textrm{Sym}(\Omega )\), and \(k\geqslant 2\), Wielandt [30, Theorem 5.8] proved that

$$\begin{aligned} G\leqslant G^{(k),\Omega }\leqslant G^{(k-1),\Omega }, \end{aligned}$$
(1)

and in addition, see [30, Theorem 5.12], that if G is finite then \(G=G^{(k),\Omega }\) for \(k=|G|\). Thus if a finite group G is totally \((k-1)\)-closed then it is automatically totally k-closed, and also G is totally k-closed for sufficiently large k. We define the closure number k(G) of a finite group G as the least positive integer k such that G is totally k-closed.

In this paper we study the closure numbers for finite simple groups. In previous work it has been shown that \(k(G)=2\) if G is a cyclic group of prime order [13, Theorem 1.2], and for exactly six nonabelian simple groups [3, Theorem 1.1], namely the Janko groups \(J_1, J_3\) and \(J_4\), together with LyTh and the Monster M. By (1), \(k(G)\geqslant 3\) for all other finite simple groups. The problem of determining the closure number k(G) for the remaining simple groups was posed in [3, Question 3], and in particular for determining all those with closure number 3 (if any exist), see also [13, Problem 3] and the Kourovka Notebook [19, New Problems 20.2 and 20.3].

A crucial tool for this work uses the notion of a base for a permutation group \(G\leqslant \textrm{Sym}(\Omega )\), that is to say, a set of points \(\alpha _1,\dots ,\alpha _{b}\in \Omega \) such that the only element of G fixing each \(\alpha _i\) is the identity. If \(G\leqslant \textrm{Sym}(\Omega )\) and G has a base of size b, then Wielandt [30, Theorem 5.12] proved that \(G=G^{(b+1),\Omega }\). To see how this is used, let us consider the finite alternating groups \(G=A_n\), \(n\geqslant 5\). For the natural action of G on \(\Omega =\{1,\dots ,n\}\), and for \(k\leqslant n-2\), the groups G and \(\textrm{Sym}(\Omega )\) have exactly the same orbits on \(\Omega ^k\), and hence \(G^{(k),\Omega }=\textrm{Sym}(\Omega )\ne G\), so by definition the closure number satisfies \(k(G)\geqslant n-1\). On the other hand any \((n-2)\)-subset of \(\Omega \) is a base for G and hence \(G^{(n-1),\Omega }= G\) by [30, Theorem 5.12]. We prove in Lemma 3.7 that, in fact, in every faithful permutation representation of G, the minimum size of a base is at most \(n-2\), and hence by [30, Theorem 5.12], the action is \((n-1)\)-closed. Thus the closure number is determined.

Theorem 1.1

For \(n\geqslant 5\), the closure number \(k(A_n)=n-1\).

On the other hand we exploit known upper bounds on the base sizes of primitive permutation representations of the remaining simple groups in order to prove the following upper bounds for their closure numbers.

Theorem 1.2

Suppose that G is a finite nonabelian simple group that is not isomorphic to an alternating group. Then the following hold.

  1. (a)

    If G is a sporadic group, then \(k(G)\leqslant 6\).

  2. (b)

    If G is an exceptional group of Lie type, then \(k(G)\leqslant 7\).

  3. (c)

    Suppose that G is a classical group, with natural module of dimension n. Then either G and an upper bound for k(G) are given in Table 1, or \(k(G) \leqslant \lfloor n/3+12 \rfloor \). In particular, if \(n \geqslant 14\) in any case, then \(k(G) \leqslant n+2\).

Table 1 Upper bounds for k(G), for finite simple classical groups G that satisfy the specified conditions, and are not isomorphic to alternating groups

We show in Lemma 5.4 that \(k(M_{24})=6\). Thus Theorem 1.2(a) and [3, Theorem 1.1] leave 19 sporadic groups G for which \(3\leqslant k(G)\leqslant 6\) and where we do not know the exact value.

Question 1

Determine k(G) for these remaining 19 sporadic simple groups.

The bound \(k(G)\leqslant 7\) for exceptional groups of Lie type is derived (see Corollary 3.3) by using the fact that the minimum base size of primitive actions of such groups is at most 6. By [7], equality only holds for certain actions of \(E_6(q)\) and \(E_7(q)\), and so these are the only possibilities to have \(k(G)=7\). However, we do not know if this bound is met.

Question 2

Determine a tight upper bound for k(G) for an exceptional group G of Lie type.

In many of the low-dimensional classical cases not specified here, our work in fact yields a smaller upper bound than the general bound of \(\lfloor n/3+12 \rfloor \); see the comment following the proof of Theorem 1.2(c), before Corollary 4.4. For simple n-dimensional classical groups we do not know whether the closure numbers grow linearly with n or are in fact asymptotically smaller.

Question 3

For a simple n-dimensional classical group G, is \(k(G) = o(n)\)?

In particular, deciding whether or not the closure numbers of finite simple classical groups are bounded or not, as the dimension of their natural module increases, would answer the open question [19, 20.3] in the Kourovka Notebook.

For a group G we define \(k_{\textrm{trans}}(G)\) to be the smallest integer k such that \(G^{(k),\Omega }=G\) for each set \(\Omega \) upon which G acts faithfully and transitively. Similarly, we define \(k_{\textrm{prim}}(G)\) to be the smallest integer k such that \(G^{(k),\Omega }=G\) for each set \(\Omega \) upon which G acts faithfully and primitively. Clearly,

$$\begin{aligned}k(G)\geqslant k_{\textrm{trans}}(G)\geqslant k_{\textrm{prim}}(G).\end{aligned}$$

We note that we can use the results of [23] and [27] to give absolute upper bounds on \(k_{\textrm{prim}}(G)\). In particular \(k_{\textrm{prim}}(G)\leqslant 6\) for all complete simple groups G.

2 Preliminary results about k-closures

We begin by noting the following result of Wielandt which we will often use without reference.

Lemma 2.1

[30, Theorem 5.6] Let G act on a set \(\Omega \) and \(k\geqslant 1\) be an integer. Then \(h\in G^{(k),\Omega }\) if and only if for all \(\alpha _1,\ldots ,\alpha _k\in \Omega \) there exists \(g\in G\) such that \((\alpha _1,\ldots ,\alpha _k)^g=(\alpha _1,\ldots ,\alpha _k)^h\).

Note that, because of our focus on the set of all faithful permutation representations of a given group G, we never write \(G^{(k)}\) for a k-closure without giving also the set acted upon. So, for example, if G acts on a set \(\Omega \) and \(\Delta \) is a G-invariant subset of \(\Omega \), then \((G^{(k)})^\Delta \) has no meaning in our work. We can write \(G^{(k),\Delta }\) for the k-closure of the action of G on \(\Delta \), and \((G^{(k),\Omega })^\Delta \) for the permutation group induced on \(\Delta \) by the k-closure of the action of G on \(\Omega \).

We also note the following result.

Lemma 2.2

Let G act on a set \(\Omega \) and let \(\Delta \) be a G-invariant subset of \(\Omega \). Then \((G^{(k),\Omega })^\Delta \leqslant G^{(k),\Delta }\).

Proof

[30, Exercise 5.25]. \(\square \)

Next we prove a result about the k-closure of an intransitive action of a nonabelian simple group.

Lemma 2.3

Let G be a nonabelian simple group acting on a set \(\Omega \) with orbits \(\Delta _1,\ldots ,\Delta _t\) with \(t\geqslant 2\). Suppose that for each \(i,j\in \{1,\ldots , t\}\) there is a point \(\beta \in \Delta _i\) such that \(G_\beta \) is not transitive on \(\Delta _j\), and also that \(G^{(k),\Delta _i}=G^{\Delta _i}\cong G\) for each \(i=1,\ldots , t\) and some \(k\geqslant 2\). Then \(G^{(k),\Omega }=G^\Omega \cong G\).

Proof

For each \(i=1,\ldots , t\), G acts faithfully on \(\Delta _i\) since \(G^{\Delta _i}\cong G\). Also, by Lemma 2.2, \(G^{(k),\Omega } \leqslant \prod _{i=1}^t (G^{(k),\Omega })^{\Delta _i} \leqslant \prod _{i=1}^t G^{(k),\Delta _i} \cong G^t\). It follows that \(G^{(k),\Omega }\) is a subdirect subgroup of \(G^{(k),\Delta _1}\times \cdots \times G^{(k),\Delta _t}\cong G^t\), and so, by a lemma of Scott ([29, p. 328] or see [28, Theorem 4.16]), the group \(G^{(k),\Omega }\) is equal to a direct product of \(\ell \) diagonal subgroups \(H_j\cong G\) for \(j=1,\ldots , \ell \) (where \(\ell \geqslant 1\)), with pairwise disjoint supports, that is to say, there is a partition \(\{\mathcal {O}_1,\ldots ,\mathcal {O}_\ell \}\) of \(\{\Delta _1,\ldots ,\Delta _t\}\) such that, for each j, \((H_j)^\Delta \cong G\) for all \(\Delta \in \mathcal {O}_j\) and \((H_j)^\Delta =1\) for all \(\Delta \notin \mathcal {O}_j\). In particular, \(G^{(k),\Omega }\cong G^\ell \). Suppose that \(\ell >1\). Choose \(\Delta _i\in \mathcal {O}_1\) and \(\Delta _j\in \mathcal {O}_2\). Then the kernel \((G^{(k),\Omega })_{(\Delta _i)}\) of the action on \(\Delta _i\) contains the diagonal subgroup \(H_2\), which induces G on \(\Delta _j\). By assumption, there exists \(\beta \in \Delta _i\) such that \(G_\beta \) is not transitive on \(\Delta _j\). Choose \(\alpha \in \Delta _j\) and consider the k-tuple \((\alpha ,\beta ,\beta ,\ldots ,\beta )\) of points. By definition of the k-closure, for each \(h\in H_2 < G^{(k),\Omega }\), there exists \(g\in G\) such that \((\alpha ,\beta ,\beta ,\ldots ,\beta )^g=(\alpha ,\beta ,\ldots ,\beta )^h=(\alpha ^h,\beta ,\beta ,\ldots ,\beta )\). In particular \(\alpha ^g=\alpha ^h\) and \(g\in G_\beta \). Since \(H_2\) is transitive on \(\Delta _j\) it follows that \(G_\beta \) is transitive on \(\Delta _j\), which is a contradiction. Thus \(\ell =1\), and so \(G^{(k),\Omega }=G^\Omega \cong G\). \(\square \)

Corollary 2.4

Let G be a nonabelian simple group acting on a set \(\Omega \) with orbits \(\Delta _1,\ldots ,\Delta _t\) with \(t\geqslant 2\) such that the actions of G on each orbit are pairwise equivalent. If, for some k, \(G^{(k),\Delta _i}=G^{\Delta _i}\cong G\), then \(k\geqslant 2\) and \(G^{(k),\Omega }=G^\Omega \cong G\).

Proof

Since the actions of G are pairwise equivalent, if \(\beta \in \Delta _i\) then \(G_\beta \) fixes a point in each orbit and so is not transitive on any \(\Delta _j\). Suppose that \(G^{(k),\Delta _i}=G^{\Delta _i}\cong G\), for some k. Then \(k\geqslant 2\), since \(G^{(1),\Delta _i}=\textrm{Sym}(\Delta _i) > G^{\Delta _i}\) as \(G^{\Delta _i}\cong G\) is a nonabelian simple group. The assertion now follows from Lemma 2.3. \(\square \)

Next we need some results about the k-closure of an imprimitive group.

Lemma 2.5

Let G be a group with a transitive action on a set \(\Omega \), and suppose that \(\mathcal {B}\) is a nontrivial G-invariant partition of \(\Omega \). Let k be an integer such that \(2\leqslant k\leqslant |\mathcal {B}|\) and let \(B\in \mathcal {B}\). Then the following hold:

  1. (a)

    \(\mathcal {B}\) is a \(G^{(k),\Omega }\)-invariant partition.

  2. (b)

    \((G^{(k),\Omega })^{\mathcal {B}}\leqslant G^{(k),\mathcal {B}}\).

  3. (c)

    \((G^{(k),\Omega })_B^B\leqslant (G_B)^{(k),B}\).

  4. (d)

    If there exist \(B_1,\ldots ,B_k\in \mathcal {B}\) such that \(G_{B_1,\ldots ,B_k}=1\), then \(G^{(k),\Omega }\) acts faithfully on \(\mathcal {B}\).

Proof

Let \(g\in G^{(k),\Omega }\). Let \(B\in \mathcal {B}\) and choose \(\alpha ,\beta \in B\) with \(\alpha \ne \beta \), and consider the k-tuple \((\alpha ,\beta ,\beta ,\ldots ,\beta )\). Since \(g\in G^{(k),\Omega }\), there exists \(h\in G\) such that \((\alpha ,\beta ,\beta ,\ldots ,\beta )^h=(\alpha ,\beta ,\beta ,\ldots ,\beta )^g\). Then since \(\mathcal {B}\) is a G-invariant partition of \(\Omega \), it follows that \(\alpha ^g\) and \(\beta ^g\) lie in the same part of \(\mathcal {B}\). Hence \(\mathcal {B}\) is also invariant under \(G^{(k),\Omega }\) and part (a) holds

Now consider \(g^{\mathcal {B}}\). Let \(B_1,\ldots , B_k\in \mathcal {B}\) and choose \(\beta _i\in B_i\), for \(i=1,\dots ,k\). Then there exists \(h\in G\) such that \((\beta _1,\ldots ,\beta _k)^g= (\beta _1,\ldots ,\beta _k)^h\). Thus \((B_1,\ldots ,B_k)^{g^\mathcal {B}}=(B_1,\ldots ,B_k)^{h^{\mathcal {B}}}\) and so \(g^{\mathcal {B}}\in G^{(k),\mathcal {B}}\), as required for part (b).

Next, let \(h\in (G^{(k),\Omega })_B\), so that \(h^B\in (G^{(k),\Omega })_B^B\). To show that \(h^B\in (G_B)^{(k),B}\) we prove the criteria of Lemma 2.1, namely for all k-tuples \((\alpha _1,\dots ,\alpha _k)\in B^k\), we find an element \(x\in (G_B)^{B}\) such that \((\alpha _1,\dots ,\alpha _k)^{h^B}=(\alpha _1,\dots ,\alpha _k)^x\). Since \(h\in (G^{(k),\Omega })_B\), there exists \(y\in G\) such that \((\alpha _1,\dots ,\alpha _k)^{h}=(\alpha _1,\dots ,\alpha _k)^y\). Further, since h fixes B setwise it follows that \(y\in G_B\). Then taking \(x:=y^B\) we have the required equality for (c).

Finally, suppose that there exist \(B_1,\ldots ,B_k\in \mathcal {B}\) such that \(G_{B_1,\ldots ,B_k}=1\). Let h lie in the kernel of the action of \(G^{(k),\Omega }\) on \(\mathcal {B}\), and choose \(\alpha \in \Omega \). Since G acts transitively on \(\Omega \) there exists \(g\in G\) such that \(\alpha ^{g^{-1}}\in B_1\). For each \(i=2,3,\ldots ,k\) choose \(\beta _i\in B_i^g\). Since \(h\in G^{(k),\Omega }\), there exists \(x\in G\) such that \((\alpha ,\beta _2,\ldots ,\beta _k)^h=(\alpha ,\beta _2,\ldots ,\beta _k)^x\). By its definition, h fixes each \(B_i^g\) setwise, and so \(\beta _i^x\in B_i^g\) for all i and \(\alpha ^x\in B_1^g\). Thus \(x\in G_{B_1^g,B_2^g,\ldots ,B_k^g}=(G_{B_1,\ldots ,B_k})^g=1\). Hence \(\alpha ^h=\alpha ^x=\alpha \). Since \(\alpha \) was arbitrary and \(G^{(k),\Omega }\leqslant \textrm{Sym}(\Omega )\), it follows that \(h=1\) and so \(G^{(k),\Omega }\) acts faithfully on \(\mathcal {B}\) as stated in part (d). \(\square \)

3 Links between bases and closures

For a finite group G acting on a set \(\Omega \), let \(b(G^\Omega )\) denote the minimum size of a base for the permutation group \(G^\Omega \leqslant \textrm{Sym}(\Omega )\) induced by G on \(\Omega \); \(b(G^\Omega )\) is called the base size of \(G^\Omega \). When G is transitive, a maximal block system for G on \(\Omega \) is a G-invariant partition \(\Sigma \) of \(\Omega \) such that \(|\Sigma |>1\) and the induced action \(G^\Sigma \) is primitive. In other words, the only G-invariant partition strictly coarser than \(\Sigma \) (that is, with each part a union of more than one part of \(\Sigma \)) is the universal partition \(\{\Omega \}\) with just one part. Note that \(\Sigma \) is allowed to be the partition into singletons.

Lemma 3.1

Let G be a finite nonabelian simple group, and suppose that G has a faithful permutation representation on a set \(\Omega \). Let \(\Omega _0\) be a G-orbit on \(\Omega \) with \(|\Omega _0|>1\), and let \(\Sigma \) be a maximal block system for G on \(\Omega _0\). Then the base size \(b(G^\Omega )\) satisfies \(b(G^\Omega )\leqslant b(G^\Sigma )\).

Proof

Since \(|\Sigma |>1\) and G is simple it follows that G acts faithfully on \(\Sigma \). Let \(b=b(G^\Sigma )\). Then G has a base \(\{\sigma _1,\dots , \sigma _b\}\) in \(\Sigma \), that is \(G_{\sigma _1,\dots , \sigma _b}=1\). Let \(\alpha _i\in \sigma _i\) for each i. Then \(G_{\alpha _1,\dots , \alpha _b}\) fixes each of the \(\sigma _i\) setwise, and hence \(G_{\alpha _1,\dots , \alpha _b}\leqslant G_{\sigma _1,\dots , \sigma _b}=1\). It follows that \(b(G^\Omega )\leqslant b\). \(\square \)

We may restate Wielandt’s result [30, Theorem 5.12] as follows.

Lemma 3.2

If G is a finite group, and G has a permutation representation on a set \(\Omega \), such that \(b(G^\Omega )=b\), then \(G^{(b+1),\Omega }=G\).

This has the following useful consequence.

Corollary 3.3

Let G be a finite nonabelian simple group, and let \(b_{maxprim}(G)\) denote the maximum of \(b(G^\Omega )\) over each faithful primitive permutation representation of G on a set \(\Omega \). Then the closure number k(G) is at most \(b_{maxprim}(G)+1\).

Proof

Let G be faithfully represented on a set \(\Omega \). Then, by Lemma 3.1, \(b(G^\Omega )\) is at most the base size of some primitive representation of G, and hence \(b(G^\Omega )\leqslant b_{maxprim}(G)=b\), say. It follows from Lemma 3.2 that \(G^{(b+1),\Omega }=G\). Hence, by definition of the closure number, \(k(G)\leqslant b+1\). \(\square \)

In the light of Corollary 3.3, we next summarise some recent results on base sizes of finite primitive permutation groups that are nonabelian simple groups. For several families of such actions the base size is much larger than all others; these are called standard actions and are defined as follows.

Definition 1

Let G be a finite nonabelian simple group. A primitive action of G on a set \(\Omega \) is standard if, up to equivalence of actions, one of (a) or (b) holds for \(G^\Omega \), and the action is non-standard otherwise.

  1. (a)

    \(G = A_n\) for some \(n\geqslant 5\), and \(\Omega \) is an orbit of subsets or partitions of \(\{1, \dots , n\}\); or

  2. (b)

    G is a classical simple group with natural module V over a field of characteristic p, so \(G=\textrm{PSL}(V), \textrm{PSp}(V), \textrm{PSU}(V)\) or \(\textrm{P}\Omega ^\varepsilon (V)\), and G has a subspace action on \(\Omega \), that is to say, one of the following holds:

    1. (i)

      for some positive integer \(k \leqslant \dim (V)/2\), \(\Omega \) is a G-orbit of k-dimensional totally singular, nondegenerate, or (if G is orthogonal, \(p = 2\) and \(k = 1\)) nonsingular subspaces of V; or

    2. (ii)

      \(G = \textrm{PSp}(V)\), \(p=2\), and \(\Omega \) is the set of nondegenerate quadratic forms on V of fixed type \(+\) or − that polarise to the symplectic form preserved by G.

If \(G = \textrm{PSL}(V)\) in case (b)(i) above, then \(\Omega \) consists of all k-dimensional subspaces of V. Note also that, as the group G is simple, the definition above is slightly simpler than that for a standard action of an almost simple group as given by Cameron and KantorFootnote 1in [12], where they conjectured that for each non-standard primitive action of an almost simple group G on a finite set \(\Omega \), the base size \(b(G^\Omega )\) is bounded above by an absolute constant c. In [24, Theorem 1.3], Liebeck and Shalev proved this conjecture, but did not specify the constant c. In the case of a non-standard primitive group G with socle \(A_n\), a sketch of a proof was given in [12, Proposition 2.3] that \(c=2\) should usually suffice. A detailed proof in [9, Theorem 1.1 & Corollary 1.2] showed that this is indeed true for all \(n\geqslant 13\), and all possibilities for non-standard actions of \(A_n\) and \(S_n\) with base-size at least 3 were given in [9, Table 1] (although the group \(A_6\) appears in this table, the corresponding action is standard). Burness and others [6, 10, 11] determined the best possible value for c such that \(b(G^\Omega )\leqslant c\) for all other non-standard primitive actions of almost simple groups G. In particular, upper bounds for \(b(G^\Omega )\) were specified in [6, Theorem 1.1 & Proposition 4.1] in the case where \(G^\Omega \) is a non-standard action of an almost simple classical group. Thus we obtain the following.

Lemma 3.4

Let G be a finite nonabelian simple group, and suppose that G has a non-standard primitive action on a finite set \(\Omega \) with \(|\Omega |>1\). Then the following statements hold.

  1. (i)

    \(b(G^\Omega )\leqslant 7\), with equality if and only if \(G=M_{24}\) in its 5-transitive action of degree 24.

  2. (ii)

    If \(G=A_n\) for some \(n\geqslant 5\), then either \(b(G^\Omega )< n-2\) or \((n, |\Omega |, b(G^\Omega ))=(5,6,3)\).

  3. (iii)

    If G is a classical group, then \(b(G^\Omega ) \leqslant 4\).

  4. (iv)

    If \(b(G^\Omega ) = 4\) and G is equal to \(\textrm{PSL}(n,q)\), \(\textrm{PSU}(n,q)\) or \(\textrm{PSp}(n,q)\) with \(n \leqslant 5\), then \(G = \textrm{PSU}(4,3)\).

We are now in a position to prove part of Theorem 1.2.

Proof of Theorem 1.2(b)

Let G be an exceptional group of Lie type. Then all primitive actions of G are non-standard and so by Lemma 3.4, \(b_{maxprim}(G)\leqslant 6\). Hence Corollary 3.3 implies that \(k(G)\leqslant 7\), as required. \(\square \)

3.1 Bases for alternating groups

For the actions of \(A_n\) or \(S_n\) on k-subsets, the best general bounds have been obtained by Halasi in [17]. To extract the information we need from his work it is simplest to use his monotonicity result from [17, Sect. 2].

Lemma 3.5

Let \(G=A_n\) or \(S_n\), where \(n\geqslant 5\) and \(n=3\,m+\delta \) with \(\delta \in \{0,1,2\}\). Consider the action of G on the set \(\Omega \) of k-subsets of \(X=\{1,2,\dots , n\}\), where \(2\leqslant k\leqslant n/2\). Then \(b(G^\Omega )\leqslant 2m\) if \(\delta \in \{0,1\}\) and \(b(G^\Omega )\leqslant 2\,m+1\) if \(\delta =2\); so in particular \(b(G^\Omega )\leqslant 2n/3\). Moreover, \(b(G^\Omega )\leqslant n-2\), with equality if and only if \((n,k, G)=(6,2, S_6)\) or \((5,2, S_5)\).

Proof

Since \(b(A_n^\Omega )\leqslant b(S_n^\Omega )\), we may assume that \(G=S_n\) for proving the main assertion. As in [17], let f(nk) denote the (minimum) base size of the action of \(S_n\) on the set of k-subsets of X. By [17, Corollary 2.2], \(f(n,k)\leqslant f(n,2)\), so we may assume further that \(k=2\). Write \(n=3m+\delta \) as in the statement, and consider the following subset of \(\Omega \):

$$\begin{aligned} B=\{\{ 3j+1, 3j+2\}, \{ 3j+2, 3j+3\} \mid 0\leqslant j\leqslant m-1\}. \end{aligned}$$

Let H be the pointwise stabiliser of B in G. Then H fixes each point i of X such that \(1\leqslant i\leqslant 3m\). Suppose first that \(\delta =0\) or 1 (so that \(n \geqslant 6\)). Then \(H=1\) and hence \(b(G^\Omega )\leqslant |B|=2m\leqslant 2n/3\leqslant n-2\). Further if \(b(G^\Omega )= n-2\) then \(2n/3 = n-2\), and hence \(n=6\) and \(b(G^\Omega )= 4\). By [17, Theorem 3.1], \(f(6,3)=\lceil \log _2 6\rceil = 3\), which is less than \(b(G^\Omega )= 4\), and hence \(k=2\). Also, by [17, Theorem 3.2], \(f(6,2)\geqslant \lceil (2n-2)/(k+1)\rceil = 4\), and it follows that \(b(S_6^\Omega )= 4\). However \(b(A_6^\Omega )= 3\) since the pairs \(\{1,2\}, \{1,3\}, \{1,4\}\) form a base. Thus the result holds if \(\delta \leqslant 1\).

Suppose now that \(\delta =2\). Then \(H = \langle (n-1, n)\rangle \), and the pointwise stabiliser of \(B\cup \{ \{1, n\}\}\) is trivial. Hence \(b(G^\Omega )\leqslant |B|+1=2m+1\) and we have both \(2m+1<2n/3\) and \(2m+1\leqslant 3m = n-2\). In this case, if \(b(G^\Omega )= n-2\), then \(2m+1 = 3m\), so \(n=5\). The proof is complete on noting that \(b(S_5^\Omega )= 3\) while \(b(A_5^\Omega )= 2\). \(\square \)

Similarly for the base sizes of the actions of \(A_n\) or \(S_n\) on partitions, precise results have been obtained by Burness, Garonzi and Lucchini in [8] for small part sizes, building on the work of Benbenishty, Cohen and Niemeyer [4], and by Morris and Spiga in [26] for large part sizes. Again, the information we require is most easily obtained for most cases from earlier work, this time [8] and an argument in Liebeck’s 1984 paper [21].

Lemma 3.6

Let \(G=A_n\) or \(S_n\), where \(n\geqslant 5\) and \(n=ab\) with \(a\geqslant 2, b\geqslant 2\). Consider the action of G on the set \(\Omega \) of partitions of \(X=\{1,2,\dots , n\}\) with b parts of size a. Then \(b(G^\Omega )\leqslant n-2\) with equality if and only if \((a,b, G)=(2,3, S_6)\) or \((a,b,G)=(3,2,S_6)\).

Proof

Since \(b(A_n^\Omega )\leqslant b(S_n^\Omega )\), we may assume that \(G=S_n\) for proving the main assertion. If \(a\geqslant 3\) then in [21, Case (i) on pp. 13–14], Liebeck proved that \(b(G^\Omega )\leqslant (a-1)(b-1)+2=ab-a-b+3\leqslant n-2\), where the second equality is strict unless \((a,b)=(3,2)\). In this case we have that [26, Theorems 1.1 & 1.2] \(b(S_n^\Omega )=4=n-2\) while \(b(A_n^\Omega )=3<n-2\). Thus we may assume that \(a=2\), so \(b\geqslant 3\). In this case, by [8, Theorem 2], the base size satisfies \(b(S_n^\Omega )=3< n-2\) if \(b\geqslant 4\), and \(b(S_n^\Omega )=4= n-2\) if \(b=3\). Further, \(b(A_n^\Omega )=3< n-2\) if \(b=3\), see [8, Remark 2.8]. \(\square \)

Finally, we prove the following lemma, which completes the proof of Theorem 1.1.

Lemma 3.7

Let \(G=A_n\) where \(n\geqslant 5\), and suppose that G has a faithful primitive permutation representation on a finite set \(\Omega \). Then \(b(G^\Omega )\leqslant n-2\), with equality if and only if either \(|\Omega |=n\) or \((n, |\Omega |, b(G^\Omega ))=(5,6,3)\). Moreover the assertions of Theorem 1.1 are valid.

Proof

If \(|\Omega |=n\) then the G-action on \(\Omega \) is permutationally isomorphic to its natural action and \(b(G^\Omega )=n-2\). Suppose this is not the case. If the primitive permutation group \(G^\Omega \) is standard, as in Definition 1(a), then by Lemmas 3.5 and 3.6, \(b(G^\Omega )<n-2\). On the other hand if \(G^\Omega \) is non-standard, then by Lemma 3.4, either \(b(G^\Omega )< n-2\) or \((n, |\Omega |, b(G^\Omega ))=(5,6,3)\). This proves the first assertion.

Now fix \(n\geqslant 5\), and let \(\Omega \) be such that \(G=A_n\) has a faithful primitive action on \(\Omega \) and \(b(G^\Omega )\) is maximal over all such actions, that is, \(b(G^\Omega ) = b_{maxprim}(G)\). By Corollary 3.3, the closure number k(G) satisfies \(k(G)\leqslant b(G^\Omega )+1\), and we have just proved that \(b(G^\Omega )\leqslant n-2\). Thus \(k(G)\leqslant n-1\). On the other hand, as we noted in the introduction, for the natural action of G on \(\Omega =\{1,\dots ,n\}\), G and \(S_n\) have the same orbits on \(\Omega ^{n-2}\), and hence \(G^{(n-2),\Omega }=S_n\ne G\). Thus \(k(G)>n-2\), and hence \(k(G)=n-1\). This proves Theorem 1.1. \(\square \)

4 Bases for simple classical groups

In this subsection, we consider in detail base sizes of finite simple classical groups G that are not isomorphic to alternating groups. We are concerned with

$$\begin{aligned} b_{maxprim}(G):=\max \{b(G^\Omega ) \mid G \text { acts faithfully and primitively on the set } \Omega \}. \end{aligned}$$
(2)

In particular, we prove the following theorem, which gives lower and upper bounds for \(b_{maxprim}(G)\). Together with Corollary 3.3, these bounds allow us to prove Theorem 1.2(c). The bounds depend on certain quantities which we define as follows.

$$\begin{aligned} \mathcal {O}\!:=\!\{ \textrm{P}\Omega ^\varepsilon (n,q)\mid n\,\, \text {even}, \varepsilon \!=\!\pm , q \!\leqslant \! 3, \text {and if}\,\,q \!=\! 3, \text {then}\,\,n\equiv 1\!-\!\varepsilon .1\pmod {4}\},\quad \end{aligned}$$
(3)
(4)
Table 2 Lower and upper bounds for \(b_{maxprim}(G)\), for finite simple classical groups G not isomorphic to alternating groups

Theorem 4.1

Let G be a finite simple classical group that is not isomorphic to an alternating group. Then Table 2 gives a lower and upper bound for \(b_{maxprim}(G)\).

In order to prove this theorem, it will be necessary to consider primitive subspace actions \(G^\Omega \), as in Definition 1. The next result, Lemma 4.2, summarises results from [18]. Note that if \(\Omega \) is a G-orbit of k-dimensional subspaces of the natural module V for G, with \(n:=\dim (V)\), then (up to equivalence of actions) \(1 \leqslant k \leqslant n/2\). In fact, if \(G = \textrm{P}\Omega ^-(n,q)\), then \(n/2-1\) is the maximum dimension of a totally singular subspace of V. On the other hand, if \(G \ne \textrm{P}\Omega ^-(n,q)\) or if \(4 \not \mid n\), then G does not act primitively on any set of n/2-dimensional nondegenerate subspaces of V (see [5, Table 2.2]).

Lemma 4.2

Let G be a finite simple classical group with natural module V of dimension n. Additionally, let \(\Omega \) be a set of k-dimensional nondegenerate or totally singular subspaces of V, such that \(G^\Omega \) is a primitive subspace action. Then \(b(G^\Omega ) \leqslant n/k+11\). Furthermore, \(b(G^\Omega ) \leqslant n/k+10\) if \(G = \textrm{PSp}(n,q)\) and k is odd; if \(G = \textrm{P}\Omega ^\pm (n,q)\), k is odd and q is even; or if \(k = n/2\) and either \(G \ne \textrm{P}\Omega ^-(n,q)\) or \(4 \not \mid n\).

Proof

Theorem 3.3 of [18] shows that \(b(G^\Omega ) \leqslant n/k+11\). If \(k = n/2\) and either \(G \ne \textrm{P}\Omega ^-(n,q)\) or \(4 \not \mid n\), then \(\Omega \) is a set of totally singular subspaces of V (since \(G^\Omega \) is primitive). This is also true if k is odd and either \(G = \textrm{PSp}(n,q)\), or \(G = \textrm{P}\Omega ^\pm (n,q)\) with q even, as here no nondegenerate subspace of V has odd dimension. Thus in any of these special cases, \(b(G^\Omega ) \leqslant n/k+10\) by [18, p. 28]. \(\square \)

It would be possible to use Lemmas 3.4 and 4.2 directly (along with some additional arguments) to obtain upper bounds for \(b_{maxprim}(G)\), at least in the case where G has no primitive action on (degenerate) nonsingular 1-dimensional subspaces of V. However, in most cases, we can deduce significantly improved bounds by considering in detail the known bounds on base sizes of primitive actions of G on subspaces of dimension at most 2.

By case (b) of Definition 1, any subspace action of G is defined with respect to the natural module for G. Thus G may have standard actions that are not subspace actions of G, but are equivalent to subspace actions of isomorphic classical groups. Therefore, in the lemma below, it is not sufficient to consider G only up to isomorphism. For example, the orthogonal groups \(G = \textrm{P}\Omega ^\varepsilon (n,q)\) (with \(\varepsilon = -\) if \(n = 4\) so that G is simple) occur as classical simple groups for any integer \(n\geqslant 3\), even though these groups are often considered as linear, unitary or symplectic groups of lower dimension when \(n \leqslant 6\).

Lemma 4.3

Let G be a finite simple classical group that is not isomorphic to an alternating group, and let V be the natural module for G. Additionally, let \(\Omega \) be a set of k-dimensional subspaces of V, such that \(k \leqslant 2\) and \(G^\Omega \) is a primitive subspace action. Then Table 3 gives an upper bound for \(b(G^\Omega )\).

Table 3 Upper bounds for the base sizes of primitive subspace actions in Lemma 4.3

Proof

Let b(Gk) be the value in the final column of Table 3 corresponding to the pair (Gk). We first consider the case where \(G = \textrm{PSL}(n,q)\) and \(k = 1\). Let \(\{e_1,\ldots ,e_n\}\) be a basis for V. It is well known (and easy to verify) that \(B:=\{\langle e_1 \rangle , \ldots , \langle e_n \rangle \}\) is a base of minimal size for \(G^\Omega \) if \(q = 2\), and that \(B \cup \{\langle e_1+\cdots +e_n \rangle \}\) is a base of minimal size if \(q > 2\). Hence \(b(G^\Omega ) = b(G,k)\).

Next, if \(G \in \{\textrm{PSU}(2,q),\textrm{PSp}(2,q)\}\) with \(k=1\), then the subspaces in \(\Omega \) are 1-dimensional and totally singular. Every 1-dimensional subspace of \(\textrm{PSp}(2,q)\) is totally singular, and so the action in this case is precisely the unique primitive subspace action of \(\textrm{PSL}(2,q)\) (which is equal to \(\textrm{PSp}(2,q)\)). This action is in fact also equivalent to the unique subspace action of \(\textrm{PSU}(2,q)\) (see [5, Tables 2.3 & 8.1]). As G is simple, \(q \ne 2\) for these groups, and we deduce that \(b(G^\Omega ) = 3 = b(G,k)\).

Thus we may assume that \(n \geqslant 3\), and that if \(k = 1\) then G is not linear. Additionally, if G is orthogonal, we assume here that \(n \geqslant 4\), that \(k = 1\) if \(n \leqslant 6\), and that \(\Omega \) is a set of nondegenerate or nonsingular subspaces if \(n = 4\) (we deal with the other orthogonal cases in the next paragraph). The results in [25, Sect. 2.1-\(-\)2.2] show that b(Gk) is an upper bound for the base size of the action on \(\Omega \) of the projective isometry group \(\textrm{PGL}(n,q),\textrm{PGU}(n,q),\textrm{PSp}(n,q)\) or \(\textrm{PGO}^\varepsilon (n,q)\) associated with and containing G. As \(b(G^\Omega ) \leqslant b(R^\Omega )\) for any overgroup R of G, it follows that \(b(G^\Omega ) \leqslant b(G,k)\).

Finally we deal with the actions of small dimensional orthogonal groups not covered by the previous paragraph, namely the groups \(G= \textrm{P}\Omega ^\varepsilon (n,q) \cong H\), where \((n,\varepsilon ,H)\) is one of the following:

$$\begin{aligned}{} & {} (3,\circ ,\textrm{PSL}(2,q)),(4,-,\textrm{PSL}(2,q^2)),(5,\circ ,\textrm{PSp}(4,q)),\\{} & {} (6,+,\textrm{PSL}(4,q)),(6,-,\textrm{PSU}(4,q)). \end{aligned}$$

We can use the tables in [5, Sect. 2.2 & 8.2] to determine the maximal subgroup type of a point stabiliser of \(G^\Omega \), considered as a subgroup of H. Using this information, we deduce from [6, Tables 2–3] that if \(G^\Omega \) is not equivalent to a subspace action of H, then \(b(G^\Omega ) \leqslant 3\), and in each case this is at most the relevant bound b(Gk). We also see that if \(G^\Omega \) is equivalent to a subspace action of H in the case where \(n = 6\) and \(k = 2\), then \(\varepsilon = -\) and the action of H is on totally singular 1-dimensional subspaces. In each case, the bounds on base sizes of subspace actions of H from above show that \(b(G^\Omega ) \leqslant b(G,k)\). \(\square \)

We can now prove Theorem 4.1. Since \(b_{maxprim}(G)\) is a property of G as an abstract group, we may now consider G up to isomorphism.

Proof of Theorem 4.1 Let V be the natural module for G and \(n:=\dim (V)\). We begin by determining a lower bound for \(b_{maxprim}(G)\). For each positive integer \(k \leqslant n/2\), [15, Theorem 3.1.9] gives a lower bound for the base size of a primitive action of G on a set of k-dimensional subspaces of V (depending only on G and k). Additionally, the tables in [20, Sect. 3.5] and [5, Sect. 8.2] show that G has a primitive action on a set of 1-dimensional subspaces of V. In each case, the lower bound for \(b_{maxprim}(G)\) given in Table 2 is the lower bound from [15] in the case \(k = 1\) (when G is linear, this bound is well known; see the proof of Lemma 4.3).

Now, let \(b_{maxsub}(G)\) and \(b_{maxstand}(G)\) denote the maximum of the base sizes of all primitive subspace actions of G, and of all primitive standard actions of G, respectively. The latter accounts for all actions of G that are equivalent to subspace actions of isomorphic classical groups. Additionally, let \(b_{upper}(G)\) be the upper bound for \(b_{maxprim}(G)\) from Table 2. We will divide the remainder of the proof into several cases, corresponding to the type of G. In each case, we utilise the upper bounds from Lemma 4.3 for base sizes of primitive subspace actions of G on subspaces of V of dimension at most two.

Case (a): \(G \cong \textrm{PSL}(n,q)\). Any primitive action of G on k-dimensional subspaces of V, with \(3 \leqslant k \leqslant n/2\), has base size at most \(n/k+3\) if \(k \mid n\), or at most \(n/k+5\) otherwise [18, p. 24]. As \(\lceil n/2 \rceil +2\) and \(\lfloor n/3+5 \rfloor \) are less than or equal to n for all \(n \geqslant 7\), it follows easily from Lemma 4.3 (noting that \(G \not \cong \textrm{PSL}(4,2) \cong A_8\)) that \(b_{maxsub}(G) \leqslant n+1-\delta _{2q} = b_{upper}(G)\).

We now consider standard actions of G that are not subspace actions. The relevant isomorphisms here are \(\textrm{PSL}(2,7) \cong \textrm{PSL}(3,2)\), \(\textrm{PSL}(2,q) = \textrm{PSp}(2,q) \cong \textrm{PSU}(2,q) \cong \Omega (3,q)\), \(\textrm{PSL}(2,q) \cong \textrm{P}\Omega ^-(4,q^{1/2})\) (when q is a square) and \(\textrm{PSL}(4,q) \cong \textrm{P}\Omega ^+(6,q)\). The tables in [5, Sect. 2.2 & 8.2] show that the action of \(\textrm{P}\Omega ^+(6,q)\) on 3-dimensional totally singular subspaces is equivalent to a subspace action of \(\textrm{PSL}(4,q)\). It follows from Lemma 4.3 that each subspace action of \(\textrm{P}\Omega ^+(6,q)\) or of \(\textrm{PSL}(4,q)\) has a base size of at most 5, while the remaining actions on subspaces here all have a base size of at most 3. When q is even, the action of \(\textrm{PSp}(2,q)\) corresponding to case (b)(ii) of Definition 1 also has base size at most 3 [18, p. 30]. Thus \(b_{maxstand}(G) \leqslant b_{upper}(G)\) in each case.

It remains to consider the cases where our upper bound \(b_{upper}(G)\) for \(b_{maxstand}(G)\) may be strictly less than \(b_{maxprim}(G)\). By Lemma 3.4, this can occur only if \(n+1-\delta _{2q} < 4\), i.e., only if \(n = 2\) or \(G = \textrm{PSL}(3,2)\). This same lemma now shows that the base size of a non-standard action of G is at most \(3 = n+1-\delta _{2q}\). Therefore, in each linear case, \(b_{maxprim}(G) \leqslant b_{upper}(G)\).

Case (b): \(G \cong \textrm{PSU}(n,q)\), with \(n \geqslant 3\). As \(\lfloor n/3+11 \rfloor \leqslant n\) for all \(n \geqslant 16\), we see from Lemmas 4.2 and 4.3 that \(b_{maxsub}(G) \leqslant b_{upper}(G)\).

The remaining standard actions correspond to the isomorphisms \(\textrm{PSU}(4,q) \cong \textrm{P}\Omega ^-(6,q)\) and \(\textrm{PSU}(4,2) \cong \textrm{PSp}(4,3) \cong \textrm{P}\Omega (5,3)\). Lemma 4.3 shows that \(b_{maxstand}(G) \leqslant b_{upper}(G)\) (since \(4 \not \mid 6\), the group \(\textrm{P}\Omega ^-(6,q)\) has no primitive subspace action on three-dimensional subspaces of its natural module). Moreover, if \(b_{upper}(G) < 4\), then \(b_{maxsub}(G) \leqslant 3 = n\). Hence Lemma 3.4 yields \(b_{maxprim}(G) \leqslant b_{upper}(G)\) in each case.

Case (c): \(G \cong \textrm{PSp}(n,q)\), with \(n \geqslant 4\). When q is even, the subspace action of G from case (b)(ii) of Definition 1 has base size n if \(n \geqslant 6\) [25, Proposition 1], or base size at most 5 if \(n = 4\) [18, p. 30]. Since \(\lfloor n/3+10 \rfloor \) and \(\lfloor n/4+11 \rfloor \) are less than or equal to n for all \(n \geqslant 14\), we obtain \(b_{maxsub}(G) \leqslant b_{upper}(G)\) from Lemmas 4.2 and 4.3.

We now consider the isomorphisms \(\textrm{PSp}(4,3) \cong \textrm{PSU}(4,2) \cong \textrm{P}\Omega ^-(6,2)\) and \(\textrm{PSp}(4,q) \cong \textrm{P}\Omega (5,q)\). In each case, Lemma 4.3 yields \(b_{maxstand}(G) \leqslant b_{upper}(G)\). As no symplectic G satisfies \(b_{upper}(G) < 4\), it follows from Lemma 3.4 that \(b_{maxprim}(G) \leqslant b_{upper}(G)\).

Case (d): \(G \cong \textrm{P}\Omega ^\varepsilon (n,q)\), with \(n \geqslant 7\). Noting that \(\lfloor n/3+11 \rfloor \leqslant n-1\) for all \(n \geqslant 17\) and \(\lfloor n/4+11 \rfloor \leqslant \lfloor n/3+10 \rfloor \) for all \(n \geqslant 9\), we see from Lemmas 4.2 and 4.3 that \(b_{maxsub}(G) \leqslant b_{upper}(G)\).

As \(n \geqslant 7\), there are no nontrivial isomorphisms of classical groups to consider. Moreover, \(b_{upper}(G) > 3\), and so \(b_{maxprim}(G) \leqslant b_{upper}(G)\) by Lemma 3.4. \(\square \)

We can now prove Theorem 1.2 for classical groups.

Proof of Theorem 1.2(c). Let G be a classical group that is not isomorphic to an alternating group. Then Theorem 1.2(c) follows from Corollary 3.3 and the upper bound for \(b_{maxprim}(G)\) given in Theorem 4.1. \(\square \)

Of course, if G is a low-dimensional unitary, symplectic or orthogonal group that is not explicitly mentioned in Theorem 1.2(c), then the general upper bound of \(\lfloor n/3+12 \rfloor \) for k(G) from this theorem is often higher than the actual upper bound that follows from Theorem 4.1. Notice also that the lower and upper bounds for \(b_{maxprim}(G)\) from Table 2 often coincide. The following corollary of Theorem 4.1 summarises when this occurs. Here, \(\delta \) again denotes the Kronecker delta, and \(\mathcal {O}\) is as in (3).

Corollary 4.4

Let G be a finite simple classical group that is not isomorphic to an alternating group.

  1. (i)

    If \(G = \textrm{PSL}(n,q)\), then \(b_{maxprim}(G) = n+1-\delta _{2,q}\).

  2. (ii)

    If \(G = \textrm{PSU}(n,q)\) with \((q+1) \not \mid n\), and with \(n \in \{3,5\}\) or \(n \geqslant 16\), then \(b_{maxprim}(G) = n\).

  3. (iii)

    If \(G = \textrm{PSp}(n,q)\) with \(n \geqslant 14\), then \(b_{maxprim}(G) = n\).

  4. (iv)

    If \(G = \textrm{P}\Omega ^\varepsilon (n,q)\) with \(n \geqslant 16\), with q even if \(n = 16\), and with \(G \notin \mathcal {O}\), then \(b_{maxprim}(G) = n-1\).

Remark 4.5

Observe that if G is one of the groups in Corollary 4.4 for which \(b_{maxprim}(G)\) is known exactly, then the upper bound for the closure number k(G) from Theorem 1.2 is equal to \(b_{maxprim}(G)+1\). Hence this theorem gives the best upper bound for k(G) that can be obtained solely using Corollary 3.3. On the other hand, for many low-dimensional unitary, symplectic and orthogonal groups G, improved upper bounds on the base sizes of primitive actions of G on subspaces of dimension greater than two would likely result in improved upper bounds for \(b_{maxprim}(G)\) and hence for k(G).

5 k-closures of sporadic groups

We start with the following lemma on certain simple groups.

Lemma 5.1

Let G be a finite nonabelian simple group acting primitively on a set \(\Omega \) such that \({{\,\textrm{Out}\,}}(G)=1\) and G is maximal in \(\textrm{Alt}(\Omega )\). If G is k-transitive on \(\Omega \) but not \((k+1)\)-transitive for some integer k such that \(1\leqslant k < |\Omega |-2\), then \(G^{(k+1),\Omega }=G\) but \(G^{(k),\Omega }\ne G\).

Proof

Note that since G is k-transitive we have that \(G^{(k),\Omega }=\textrm{Sym}(\Omega )\ne G\). Since G is maximal in \(\textrm{Alt}(\Omega )\) it follows that \(G^{(k+1),\Omega }\cap \textrm{Alt}(\Omega )=G\) or \(\textrm{Alt}(\Omega )\). However, \(\textrm{Alt}(\Omega )\) is \((k+1)\)-transitive on \(\Omega \) while G is not, so \(\textrm{Alt}(\Omega )\not \leqslant G^{(k+1),\Omega }\). Hence \(G^{(k+1),\Omega }\cap \textrm{Alt}(\Omega )=G\). Thus \(|G^{(k+1),\Omega }:G|\leqslant 2\). Since G acts primitively on \(\Omega \) (and |G| is not prime) we have that \(C_{\textrm{Sym}(\Omega )}(G)=1\). Then as \({{\,\textrm{Out}\,}}(G)=1\) it follows that \(G^{(k+1),\Omega }=G\). \(\square \)

Corollary 5.2

  1. (a)

    Let \(G=M_{23}\leqslant \textrm{Sym}(\Omega )\) with \(|\Omega |=23\). Then \(G^{(5),\Omega }=G\).

  2. (b)

    Let \(G=Co_3\leqslant \textrm{Sym}(\Omega )\) with \(|\Omega |=276\). Then \(G^{(3),\Omega }=G\).

  3. (c)

    Let \(G=Co_2\leqslant \textrm{Sym}(\Omega )\) with \(|\Omega |=2300\). Then \(G^{(2),\Omega }=G\).

Proof

By [22], G is maximal in \(\textrm{Alt}(\Omega )\). Additionally, \( {{\,\textrm{Out}\,}}(G)=1\), and so the result follows from Lemma 5.1. \(\square \)

When \(G=M_{23}\), \(Co_2\) or \(Co_3\) we have from [11, Table 1] that \(b_{maxprim}(G)=6\) so our next result is a slight improvement on what can be deduced from Corollary 3.3. We suspect that it is not best possible.

Lemma 5.3

Let \(G=M_{23}\), \(Co_2\) or \(Co_3\). Then \(k(G)\leqslant 6\).

Proof

Let \(n=23\) if \(G=M_{23}\), \(n=276\) if \(G=Co_3\) and \(n=2300\) if \(G=Co_2\), and let G act faithfully on a set \(\Omega \). We will prove that \(G^{(6),\Omega }=G^\Omega \cong G\), and from this it will follow that \(k(G)\leqslant 6\). If G has an orbit with a maximal block system \(\mathcal {B}\) such that \(|\mathcal {B}|\ne n\), then \(b(G^{\mathcal {B}})\leqslant 5\), by [11, Theorem 1], and so \(b(G^\Omega )\leqslant 5\), by Lemma 3.1. Hence by Lemma 3.2, \(G^{(6),\Omega }=G^\Omega \cong G\). Thus it remains to consider the case where on each orbit \(\Delta \), each maximal block system has size n. Let \(\Delta \) be an orbit of G and let \(\mathcal {B}\) be a maximal block system on \(\Delta \). By Lemma 2.5 and Corollary 5.2 we have that \(G^{(6),\Delta }\) preserves \(\mathcal {B}\) and \((G^{(6),\Delta })^{\mathcal {B}}=G^{(6),\mathcal {B}}=G^\mathcal {B}\cong G\). Moreover, by [11, Theorem 1] there exist \(B_1,\ldots ,B_6\in \mathcal {B}\) such that \(G_{B_1,\ldots ,B_6}=1\) (since G is faithful on \(\mathcal {B}\)), and it follows from Lemma 2.5 that \(G^{(6),\Delta }\) acts faithfully on \(\mathcal {B}\). Thus \(G^{(6),\Delta }\cong G\) so \(G^{(6),\Delta }=G^\Delta \).

If \(\Omega =\Delta \) then we are done so we may assume that G has more than one orbit. Recall that, on each G-orbit, each maximal block system has size n. Let \(\Delta _1\) be a G-orbit and let \(\mathcal {B}_1\) be a maximal block system for G on \(\Delta _1\). Let \(\beta \in B\in \mathcal {B}_1\). Then \(G_\beta \leqslant G_B\). When \(G=M_{23}\) we have that \(G_B=M_{22}\), when \(G=Co_3\) we have \(G_B=McL.2\) and when \(G=Co_3\) we have \(G_B=\textrm{PSU}_6(2).2\). In all cases \(|G_B|\) is not divisible by 23 but n is, and so \(G_B\) does not have a transitive action of degree n. Thus \(G_B\), and hence also \(G_{\beta }\) is not transitive on any orbit of G. Since \(G^{(6),\Delta }=G^\Delta \) for each G-orbit \(\Delta \), it follows from Lemma 2.3 that \(G^{(6),\Omega }=G^\Omega \cong G\) and the result follows. \(\square \)

We use Lemma 5.3 for \(M_{23}\), as well as the fundamental Lemma 2.1, to determine the closure number of \(M_{24}.\)

Lemma 5.4

\( k(M_{24})=6\).

Proof

Let \(G=M_{24}\) act faithfully on a set \(\Omega \). We divide the orbits \(\Delta \) of G into three types:

  1. (a)

    \(|\Delta |=24\).

  2. (b)

    G has a maximal block system on \(\Delta \) of size other than 24.

  3. (c)

    Each maximal block system of G on \(\Delta \) has size 24 and at least one of these maximal block systems is not the partition into singletons.

Recall that we allow a maximal block system to be the partition into singletons.

Let \(\Delta \) be an orbit of G and suppose first that \(\Delta \) is of type (a). Then G acts 5-transitively but not 6-transitively on \(\Delta \). By [22] G is a maximal subgroup of \(A_{24}\) and so by Lemma 5.1, \(k(G) \geqslant 6\) and \(G^{(6),\Delta }=G^\Delta \cong G\).

Next suppose that \(\Delta \) has type (b) and let \(\mathcal {B}\) be a maximal block system on \(\Delta \) with \(|\mathcal {B}|\ne 24\). Then by [11, Theorem 1] the base size \(b(G^{\mathcal {B}})\leqslant 4\). Hence by Lemma 3.1, also \(b(G^\Delta )\leqslant 4\) and so by Lemma 3.2, \(G^{(5),\Delta }=G^\Delta \cong G\). This implies that \(G^{(6),\Delta }=G^\Delta \cong G\).

Finally suppose that \(\Delta \) is of type (c) and let \(\mathcal {B}\) be a maximal block system of size 24 that is not the partition into singletons. Let \(B \in \mathcal {B}\). By Lemma 2.5 and our result on orbits of type (a), we have \(M_{24}\cong G^{\mathcal {B}}\leqslant (G^{(6),\Delta })^{\mathcal {B}}\leqslant G^{(6),\mathcal {B}}\cong M_{24}\), and so \((G^{(6),\Delta })^{\mathcal {B}}\cong G\). Moreover, by Lemma 2.5, \(M_{23}\cong (G^\Delta )_B^B\leqslant (G^{(6),\Delta })_B^B\leqslant (G_B)^{(6),B}=M_{23}^{(6),B}\), which is isomorphic to \( M_{23}\) by Lemma 5.3. Thus \(G^{(6),\Delta }\leqslant M_{23}\wr M_{24}\) acting imprimitively. Furthermore, since the minimal degree of a transitive representation of \(M_{23}\) is 23 (see for example [14, p71]) we have \(|B|\geqslant 23\). Let K be the kernel of the action of \(G^{(6),\Delta }\) on \(\mathcal {B}\) and suppose that \(K\ne 1\). Then by a lemma of Scott ([29, p. 328] or see [28, Theorem 4.16]) the group K is a direct product of \(\ell \) diagonal subgroups \(H_j\cong M_{23}\) for \(j=1,\ldots , \ell \) (where \(\ell \geqslant 1\)), with pairwise disjoint supports, that is to say, there is a partition \(\{\mathcal {O}_1,\ldots ,\mathcal {O}_\ell \}\) of \(\mathcal {B}\) such that, for each j, \((H_j)^B\cong M_{23}\) for all \(B\in \mathcal {O}_j\) and \((H_j)^B=1\) for all \(B\notin \mathcal {O}_j\). Suppose that \(\ell \geqslant 2\). Then there exist distinct \(B_j,B_k\in \mathcal {B}\) such that for all \(\alpha _1,\ldots ,\alpha _5\in B_j\) and all \(\beta _1,\beta _2\in B_k\), there exists \(h\in K\) such that \((\alpha _1,\ldots ,\alpha _5)^h=(\alpha _1,\ldots ,\alpha _5)\) and \(\beta _1^h=\beta _2\). Since \(K\leqslant G^{(6),\Delta }\), there exists \(g\in G\) such that \((\alpha _1,\ldots ,\alpha _5,\beta _1)^g=(\alpha _1,\ldots ,\alpha _5, \beta _1)^h=(\alpha _1,\ldots ,\alpha _5,\beta _2)\). As \(\beta _1, \beta _2\) were arbitrary points of \(B_k\), it follows that \(G_{\alpha _1,\ldots ,\alpha _5}\) acts transitively on \(B_k\). Note that \(G_{B_j}=M_{23}\), so \(G_{B_j}\) acts faithfully on \(B_j\). Now \(G_{\alpha _1,\ldots ,\alpha _5}= (G_{B_j})_{\alpha _1,\ldots ,\alpha _5}= (M_{23})_{\alpha _1,\ldots ,\alpha _5}\) while \(|B_k|\geqslant 23\). Suppose first that \(G_{B_j}\) has a maximal block system of size greater than 23 on \(B_j\). Then by [11, Theorem 1] and Lemma 3.1, we have \(b(G_{B_j}^{B_j})\leqslant 4\). Hence we could have chosen \(\alpha _1,\ldots ,\alpha _5\) such that \((M_{23})_{\alpha _1,\ldots ,\alpha _5}=1\), contradicting \(G_{\alpha _1,\ldots ,\alpha _5}\) acting transitively on \(B_k\). Hence all maximal block systems \(\Sigma \) of \(G_{B_j}\) on \(B_j\) have size 23. However, in this case we can choose \(\alpha _1,\ldots , \alpha _5\in B_j\) from pairwise distinct blocks of \(\Sigma \) such that \(|(M_{23})_{\alpha _1,\ldots ,\alpha _5}|<23\), again contradicting \(G_{\alpha _1,\ldots ,\alpha _5}\) acting transitively on \(B_k\).

Thus \(\ell =1\) so \(K\cong M_{23}\), and \(G^{(6),\Delta }=M_{23}\times M_{24}\). Hence \(G^{(6),\Delta }\) has an intransitive normal subgroup N isomorphic to \(M_{24}\) such that the normal subgroup \(L\cong M_{23}\) acts transitively on the set \(\Sigma \) of N-orbits. Now \(\Sigma \) is a G-invariant partition of \(\Delta \), and a maximal block system for the G-action on \(\Sigma \) corresponds to a maximal block system, say \(\mathcal {B}'\), for \(G^\Delta \). By assumption all maximal block systems for G on \(\Delta \) have size 24 and so \(|\mathcal {B}'|=24\). Hence the G-action on \(\Sigma \) has a block system of size 24. Thus the subgroup L of G is transitive on \(\Sigma \) preserving a block system of size 24. This implies that \(L\cong M_{23}\) has a subgroup of index 24, which is a contradiction (see for example [14, p71]). Therefore \(K=1\) and \(G^{(6),\Delta }=M_{24}\).

Thus we have shown that \(G^{(6),\Delta }=G^\Delta \cong G\) for each orbit \(\Delta \). In particular, if G is transitive on \(\Omega \) then we have shown that \(G^{(6),\Omega }=G^{\Omega }\cong G\). Suppose now that G is intransitive on \(\Omega \). If there is some orbit \(\Delta \) of type (b) then we saw above that \(b(G^\Delta )\leqslant 4\). Since G acts faithfully on \(\Delta \) we have \(b(G^{\Omega })\leqslant 4\) and so by Lemma 3.2, \(G^{(5),\Omega }=G^\Omega \cong G\). If all orbits are of type (a) then all actions of G on orbits are pairwise equivalent and so by Corollary 2.4 we have \(G^{(6),\Omega }=G^\Omega \cong G\).

It remains to consider the case where there is at least one orbit \(\Delta \) of type (c) and all the remaining orbits are of type (a) or (c). Recall that this implies that \(G^\Delta \) preserves a block system \(\mathcal {B}\) of size 24 on \(\Delta \), and moreover in each G-orbit \(\Delta '\), each maximal block system for G on \(\Delta '\) has size 24. Let \(\beta \in B\in \mathcal {B}\). Then \(G_\beta \leqslant G_B= M_{23}\). Since \(M_{23}\) does not have any transitive actions of degree 24, it follows that \(G_B\), and hence also \(G_{\beta }\), is not transitive on any orbit of G. Thus by Lemma 2.3, and the fact that \(G^{(6),\Delta '}=M_{24}\) for all G-orbits \(\Delta '\), we deduce that \(G^{(6),\Omega } \cong G\). Thus we have shown that \(k(G)\leqslant 6\). Moreover, equality holds since if \(|\Omega |=24\) then G is 5-transitive on \(\Omega \) and hence \(G^{(5),\Omega }=\textrm{Sym}(\Omega )\). \(\square \)

We can now complete the proof of Theorem 1.2.

Proof of Theorem 1.2(a). Let G be a sporadic simple group. If \(G\cong M_{24}\) then by Lemma 5.4 we have \(k(G)=6\), so suppose further that \(G\not \cong M_{24}\). By Corollary 3.3, the closure number k(G) satisfies \(k(G)\leqslant b_{maxprim}(G)+1\), where \(b_{maxprim}(G)\) is as in (2). Let \(\Omega \) be a finite set on which G acts faithfully and primitively such that \(b(G^\Omega ) = b_{maxprim}(G)\), so \(k(G)\leqslant b(G^\Omega )+1\). By [11, Theorem 1] we in fact have that either \(b(G^{\Omega })\leqslant 5\) and so \(k(G)\leqslant 6\), or \(G=M_{23}\), \(Co_2\) or \(Co_3\). Thus Theorem 1.2(a) follows from Lemma 5.3. \(\square \)