1 Introduction

Social choice theory is the study of how to aggregate individual preferences and opinions of agents on a set of alternatives in order to reach a collective decision. In many practical situations, the relationship between the alternatives is represented by a dominance relation, which specifies the relative strength of the alternatives in any pairwise comparison. For example, in sports competitions the dominance relation signifies the match outcome when two players or teams play each other, while in elections the relation represents the pairwise majority comparisons among the candidates. The structure consisting of the alternatives and their dominance relation is called a tournament, and the analysis of tournament winner selection methods—also known as tournament solutions—has received significant attention from researchers in the past few decades [4, 27].

Among the vast array of tournament solutions proposed in the literature, two of the earliest and best-known ones are the top cycle [17, 34, 42] and the uncovered set [14, 35]. An alternative belongs to the top cycle if it can reach every other alternative via a directed path in the tournament. Note that if the tournament contains n alternatives, any such path has length \(n-1\) or less (the length of a path refers to the number of edges in the path).Footnote 1 Similarly, the uncovered set—also known as the set of kings [32]—consists of the alternatives that can reach every other alternative via a path of length at most two. It is clear from the definitions that the uncovered set is always a subset of the top cycle. Moreover, both tournament solutions can be viewed as special cases of a generalized notion of kings called k-kings, which correspond to the alternatives that can reach every other alternative via a path of length at most k. Indeed, the uncovered set is the set of 2-kings, while the top cycle contains precisely the \((n-1)\)-kings. Examples of tournaments and k-kings can be found in Section 2.

Given that tournament solutions are meant to distinguish the best alternatives from the rest, it is natural to ask how selective each tournament solution is. Moon and Moser [37] and Fey [13] addressed this question and showed that all alternatives are likely to be 2-kings, and hence also \((n-1)\)-kings, when the tournament is large. In particular, their results hold under the uniform random model, wherein each edge is oriented in one direction or the other with equal probability independently of other edges. Saile and Suksompong [41] extended these results to the generalized random model, in which the orientation of each edge is determined by probabilities within the range \([p, 1-p]\) for some parameter \(p \le 1/2\), and these probabilities may vary across edges. The generalized random model allowed these authors to demonstrate a difference between the two tournament solutions—while all alternatives are likely to be \((n-1)\)-kings as long as \(p\in \omega (1/n)\), the same is true for 2-kings only when \(p\in \Omega (\sqrt{\log n/n})\), so the two thresholds differ by roughly \(\Theta (\sqrt{n})\). This raises the following question: How does the probability threshold change as we transition from \(k=2\) to \(k=n-1\)? Does it already decrease at around \(k = \sqrt{n}\), or does it remain the same until, say, \(k \approx n/2\)?

Table 1 Summary of the bounds on the probability p at which the respective tournament solutions select all alternatives with high probability under the corresponding random model

In this paper, we show that, perhaps surprisingly, all of the changes in the probability threshold occur when k is constant. In fact, when \(k=6\), all alternatives are already likely to be k-kings provided that \(p\in \omega (1/n)\), the same threshold as \(k = n-1\)—this significantly strengthens the result of Saile and Suksompong [41] on \((n-1)\)-kings. For \(k=3\) and 4, we establish an asymptotically tight bound of \(p\in \Omega (\log n/n)\), while for \(k=5\) we leave the only (small) gap between \(\Omega (\log \log n/n)\) and \(\omega (1/n)\). Besides the generalized random model, we consider a more specific model which has nevertheless been studied in several papers called the Condorcet random model [16, 22, 28, 49]. In this model, there is a parameter p and a linear order of alternatives from strongest to weakest, and the probability that a stronger alternative dominates a weaker one is \(1-p\), independently of other pairs of alternatives.Footnote 2 For the Condorcet random model, we show that the threshold for \(k=5\) is \(\omega (1/n)\), whereas the thresholds for other values of k remain tight. Our results are summarized in Table 1 and presented in Section 3. Taken together, they reveal the intriguing facts that (i) 2-kings are distinctly more selective than k-kings for \(k\ge 3\); (ii) 3-kings and 4-kings are slightly more selective than higher-order kings; and (iii) there is virtually no difference in discriminative power from \(k=5\) all the way to \(k=n-1\).

In addition to k-kings, we also consider the set of single-elimination winners, which are alternatives that can win a (balanced) single-elimination tournament under some bracket, where the match outcomes in the single-elimination tournament are determined according to the dominance relation in the original tournament.Footnote 3 Kim et al. [22] showed that all alternatives are likely to be single-elimination winners in the Condorcet random model as long as \(p\in \Omega (\log n/n)\), and this bound is tight.Footnote 4 For the generalized random model, they established an analogous statement in the range \(p\in \Omega (\sqrt{\log n/n})\). We close this gap by proving that even for the generalized random model, \(p\in \Omega (\log n/n)\) already suffices for all alternatives to be single-elimination winners with high probability; moreover, a winning bracket for each alternative can be computed in polynomial time. Our result, which can be found in Section 4, further lends credence to the observation that real-world tournaments can be easily manipulated [31].

In Sect. 5, we move beyond the question of when all alternatives are likely to be selected by a tournament solution, and instead ask when this is the case for a large or small number of alternatives. Even though \(p\in \Omega (\sqrt{\log n/n})\) is required in order for all alternatives to be 2-kings with high probability [41], we show that most of them are already likely to be 2-kings as long as \(p\in \Omega (\log n/n)\). This threshold is exactly where the transition occurs: if \(p\in O(\log n/n)\), we prove that it is almost surely the case that only a small fraction of the alternatives are 2-kings. Furthermore, for any \(k\ge 3\), we establish that a large fraction of alternatives are likely to be k-kings provided that \(p\in \omega (1/n)\). These results illustrate the probability range under which each tournament solution is discriminative, and again exhibit a clear difference between 2-kings and higher-order kings. Finally, in Sect. 6, we complement our theoretical findings with results from computer experiments.

Before proceeding further, let us provide some discussion and intuition on our models and results. The uniform random model, while being the most basic choice to start probabilistic investigations of tournaments with, is clearly unrealistic since alternatives in most applications have different levels of strength. The Condorcet random model addresses this shortcoming by allowing some alternatives to be stronger than others. However, as Saile and Suksompong [41] pointed out, the Condorcet random model still suffers from limitations such as using the same probability for all pairs of alternatives (regardless of the extent to which one alternative is stronger than the other) or not allowing for “bogey teams” (i.e., weak teams that often beat certain stronger teams). The generalized random model only assumes that each match is sufficiently random, and therefore does not have these limitations. The fact that there are usually a large number of k-kings when the tournaments are drawn from the uniform random model, as well as from the Condorcet and generalized random models for sufficiently large p, can be explained intuitively by noting that, for large tournaments, there are a high number of potential paths through with one alternative can reach another. As we will see later in our experimental results (Section 6), when the tournament size is roughly 100, the fraction of k-kings is already very high for most parameters and models.

1.1 Related work

Tournament solutions have been extensively studied for the past several decades from the axiomatic [6, 7, 21, 25, 26], computational [1, 5, 10, 12, 36], and probabilistic perspectives [13, 43].Footnote 5 This strong interest is due to the fact that tournaments can be used to model a variety of scenarios including voting, webpage ranking, and biological interactions—indeed, the well-known PageRank algorithm can be viewed as a form of tournament solution [7], and the pecking order in a flock of chickens can be represented by a tournament [32]. In an early work on tournament solutions, Maurer [32] characterized the pairs (nr) for which there exists a tournament of size n such that the number of 2-kings is exactly r, and proved that all alternatives are 2-kings with high probability in the uniform random model.Footnote 6 There are several containment relations among common tournament solutions. For example, the Copeland set, Slater set, Markov set, and Banks set are all contained in the set of 2-kings (i.e., the uncovered set), which is in turn contained in the set of \((n-1)\)-kings (i.e., the top cycle)—this provides a range of options in terms of discriminative power and other properties. Even though k-kings admit a simple and elegant definition generalizing both 2-kings and \((n-1)\)-kings, and have attracted interest from graph theorists [9, 39, 48], as far as we know, they have not been studied in the social choice context until recently. Kim and Vassilevska Williams [23] and Kim et al. [22] identified conditions under which a 3-king can win a single-elimination tournament. Brill et al. [10] showed that computing the “margin of victory” of k-kings can be done efficiently for \(k\le 3\) but becomes NP-hard for \(k\ge 4\). They also illustrated through experiments that the margin of victory of 3-kings behaves much more similarly to that of \((n-1)\)-kings than to the corresponding notion of 2-kings; our results therefore complement theirs by exhibiting that analogous behavior can be observed with respect to discriminative power.

As we mentioned earlier, the study of tournament solutions under the probabilistic lens was initiated by Moon and Moser [37], who showed that all alternatives are likely to be \((n-1)\)-kings when the tournament is generated by the uniform random model. Fey [13] and Scott and Fey [43] established the same property for two other tournament solutions, the Banks set and the minimal covering set, while Fisher and Ryan [15] showed that the bipartisan set includes half of the alternatives on average.Footnote 7 Brandt et al. [6] showed that any tournament solution that satisfies an attractive property called stability, including the set of \((n-1)\)-kings, the minimal covering set, and the bipartisan set, must choose at least half of the alternatives on average. The Condorcet random model has been analyzed, among others, by Frank [16], Łuczak et al. [28], Vassilevska Williams [49], and Kim et al. [22], with the last paper also proposing the generalized random model.

Finally, single-elimination tournaments have constituted a popular topic of study in the past decade; see the survey by Vassilevska Williams [50] and a more recent one by Suksompong [47]. In particular, even though the problem of determining whether an alternative can win a single-elimination tournament is known to be NP-hard [2], a wide range of algorithmic and complexity results have been developed by this active line of work [11, 18,19,20, 22,23,24, 40, 44, 45, 49, 51]. For instance, Kim and Vassilevska Williams [23] gave an algorithm running in time \(O(2^n\text {poly}(n))\) that not only decides whether a given alternative can win a single-elimination tournament but also counts the number of brackets for which this alternative is the winner. Kim et al. [22] presented several conditions under which an alternative is guaranteed to be a single-elimination winner and its winning bracket is computable in polynomial time. Ramanujan and Szeider [40] and Gupta et al. [18, 20] studied this problem from the perspective of parameterized complexity. Issues of bribery and scheduling manipulation have also been extensively studied with respect to single-elimination as well as other tournament formats such as round-robin [3, 19, 20, 23, 24, 30, 46].

2 Preliminaries

A tournament T consists of a set \(V=\{x_1,\dots ,x_n\}\) of vertices, also called alternatives, and a set E of directed edges. For any two alternatives \(x_i,x_j\in V\), there exists either an edge from \(x_i\) to \(x_j\) or an edge from \(x_j\) to \(x_i\), but not both. The edges represent a dominance relation between the alternatives: an edge from \(x_i\) to \(x_j\) means that \(x_i\) dominates \(x_j\), a relation which we denote by \(x_i\succ x_j\). The outdegree (resp., indegree) of an alternative \(x_i\) is the number of alternatives that \(x_i\) dominates (resp., that dominate \(x_i\)). We extend the dominance relation to sets of alternatives: for \(V_1,V_2\subseteq V\), we write \(V_1\succ V_2\) to mean that \(x\succ x'\) for all \(x\in V_1\) and \(x'\in V_2\), and \(V_1\succ x'\) to mean that \(x\succ x'\) for all \(x\in V_1\). A set \(V'\subseteq V\) is called a dominating set if for every \(x\in V\setminus V'\), there exists \(x'\in V'\) such that \(x'\succ x\).

Fig. 1
figure 1

A tournament with five alternatives. The outdegrees of abcde are 3, 3, 2, 2, 0, respectively. The alternatives a, b, and d are 2-kings, c is a 3-king but not a 2-king, while e is not a k-king for any k

We can now define the key notions of this paper.

  • For any integer \(k\ge 2\), an alternative is said to be a k-king if it can reach every other alternative via a directed path of length at most k. An example is shown in Fig. 1.

  • Suppose that \(n=2^r\) for some nonnegative integer r. An alternative is said to be a single-elimination winner if it wins a (balanced) single-elimination tournament under some bracket, where the outcome of each match is determined according to the dominance relation. Formally, a single-elimination tournament is represented by a balanced binary tree with n leaves corresponding to the alternatives; the assignment of the alternatives to the leaves is called a bracket. The winner of the tournament is determined recursively: the winner of a leaf is the alternative at the leaf, and the winner of a subtree rooted at node u is the winner of the match between the winners of the subtrees rooted at the two children of u. An example is shown in Fig. 2.

Fig. 2
figure 2

A single-elimination tournament with eight alternatives, where we assume that \(b\succ a\), \(c\succ d\), \(f\succ e\), \(h\succ g\), \(c\succ b\), \(h\succ f\), and \(c\succ h\); all other dominance relations are arbitrary. Under this bracket, alternative c is the single-elimination winner

We will consider two random models for generating tournaments. In the Condorcet random model, there is a parameter \(0\le p\le 1/2\). For \(i<j\), alternative \(x_j\) dominates \(x_i\) with probability p (so \(x_i\) dominates \(x_j\) with probability \(1-p\)), independently of other pairs of alternatives. In the generalized random model, there is a parameter \(p_{i,j}\) for each pair \(i\ne j\), where \(p_{i,j} + p_{j,i} = 1\). For any pair ij, alternative \(x_i\) dominates \(x_j\) with probability \(p_{i,j}\), independently of other pairs. We will generally allow each probability \(p_{i,j}\) to be chosen from the range \([p,1-p]\) for a given parameter \(0\le p\le 1/2\). See Fig. 3 for an illustration. Before a tournament is generated from the generalized random model, the expected outdegree of \(x_i\) is defined as \(\sum _{j\ne i}p_{i,j}\). Following standard terminology in probability theory, we say that an event whose probability depends on n occurs “with high probability” if the probability that it occurs approaches 1 as \(n\rightarrow \infty\).

Fig. 3
figure 3

Examples of probabilities for generating tournaments according to the Condorcet random model (left) and the generalized random model (right) with \(p = 0.2\). For each pair \(i < j\), the corresponding number in the figure indicates the probability \(p_{i,j}\) with which alternative \(x_i\) dominates \(x_j\); the reverse domination occurs with probability \(1 - p_{i,j}\)

We now list two well-known probabilistic statements that will be used multiple times in this paper. The first statement provides an upper bound on the probability that a sum of independent random variables is far from its expectation.

Lemma 2.1

(Chernoff bound). Let \(X_1, \dots , X_k\) be independent random variables taking values in [0, 1], and let \(X := X_1 + \cdots + X_k\). Then, for any \(\delta \in [0, 1]\),

$$\begin{aligned} \Pr [X \ge (1 + \delta )\mathbb {E}[X]] \le \exp \left( \frac{-\delta ^2 \mathbb {E}[X]}{3}\right) \end{aligned}$$

and

$$\begin{aligned} \Pr [X \le (1 - \delta )\mathbb {E}[X]] \le \exp \left( \frac{-\delta ^2 \mathbb {E}[X]}{2}\right) . \end{aligned}$$

The second statement is a simple upper bound on the expression \(1+x\).

Lemma 2.2

For every real number x, we have \(1+x \le e^x\).

We end this section with a lemma on the degree of alternatives in a tournament.

Lemma 2.3

Let \(1\le r\le n\). For any tournament T, the average outdegree and the average indegree of the alternatives in any subset of size r are at least \((r-1)/2\). Similarly, before a tournament is generated from the generalized random model, the average expected outdegree of the alternatives in any subset of size r is at least \((r-1)/2\).

Proof

Given a tournament T, in a subset of alternatives of size r, there are a total of \(r(r-1)/2\) edges. Hence, the sum of the outdegrees of the r alternatives is at least \(r(r-1)/2\), implying that their average is at least \((r-1)/2\). Likewise, the sum of the indegrees of the r alternatives is at least \(r(r-1)/2\), so their average is also at least \((r-1)/2\).

Now, consider a tournament with probabilities \(p_{i,j}\) as in the generalized random model, and a subset B of r alternatives. Each edge between two alternatives in B adds 1 to the sum of expected outdegree of these two alternatives. Since there are \(r(r-1)/2\) edges between alternatives in B, the sum of expected outdegree of the alternatives in B is at least \(r(r-1)/2\), and the average expected outdegree is therefore at least \((r-1)/2\). \(\square\)

Unless a base is explicitly specified, \(\log\) refers to the natural logarithm.

3 Generalized kings

Recall the result of Saile and Suksompong [41] that in the generalized random model, all alternatives are 2-kings with high probability only if \(p\in \Omega (\sqrt{\log n/n})\). For our first result, we show that 3-kings are not as selective: even when \(p\in \Omega (\log n/n)\), it is already likely that none of the alternatives is excluded by this set.

Theorem 3.1

Assume that a tournament T is generated according to the generalized random model, and that \(p_{i,j} \in \left[ 30\log n/n, 1 - 30\log n/n\right]\) for all \(i\ne j\). Then with high probability, all alternatives in T are 3-kings.

To prove this theorem, we establish a rather general lemma on the probability of one set dominating another, which will also be useful in our analysis of 5-kings later.

Lemma 3.2

Let \(T_0\) be a tournament with \(n_0 \ge 10\) alternatives \(V_0 := \{x_1, \dots , x_{n_0}\}\), and let \(q_{1,1}, q_{1,2}, \dots , q_{n_0,1}, q_{n_0,2} \in \left[ \frac{10 \lambda }{n_0},1\right]\) for some \(1 \le \lambda \le \frac{n_0}{10}\). Suppose that we randomly create a set \(S_1\) by including each alternative \(x_i\) independently with probability \(q_{i,1}\), and a set \(S_2\) by including each alternative \(x_i\) independently with probability \(q_{i,2}\). Then, \(\Pr [S_1 \cap S_2 = \emptyset \text { and } S_1 \succ S_2] \le e^{-\lambda }.\)

Proof

It suffices to prove the lemma when \(q_{1,1} = q_{1,2} = \dots = q_{n_0,1} = q_{n_0,2} = \frac{10 \lambda }{n_0}\) since increasing these probabilities does not decrease \(\Pr [S_1 \cap S_2 = \emptyset \text { and } S_1 \succ S_2]\). Observe that if \(S_1\succ S_2\), then every alternative in \(S_1\) must have a strictly higher outdegree than every alternative in \(S_2\) when we restrict the tournament \(T_0\) to \(S_1\cup S_2\).

Consider any set S of alternatives; let \(z_1, \dots , z_t\) be its elements in nondecreasing order of outdegree in the restriction of \(T_0\) to S. Notice that, when we condition on \(S_1 \cap S_2 = \emptyset\) and \(S_1 \cup S_2 = S\), since all of our probabilities \(q_{i,j}\) are equal, the set \(S_1\) has a uniform distribution over all subsets of S. Furthermore, \(S_1 \not \succ S_2\) unless \(S_1 = \{z_1, \dots , z_i\}\) for some \(i \in \{0, \dots , t\}\). Hence, we have

$$\begin{aligned} \Pr [S_1 \succ S_2 \mid S_1 \cup S_2 = S \text { and } S_1 \cap S_2 = \emptyset ] \le \frac{t + 1}{2^t}. \end{aligned}$$

This implies that

$$\begin{aligned} \Pr [S_1 \cap S_2 = \emptyset \text { and } S_1 \succ S_2 \mid S_1 \cup S_2 = S] \le \frac{t + 1}{2^t}. \end{aligned}$$

Using the law of total expectation, we can bound the desired probability as follows:

$$\begin{aligned}&\Pr [S_1 \cap S_2 = \emptyset \text { and } S_1 \succ S_2] \\&\quad = \sum _{S \subseteq V_0} \Pr [S_1 \cap S_2 = \emptyset \text { and } S_1 \succ S_2 \mid S_1 \cup S_2 = S] \cdot \Pr [S_1 \cup S_2 = S]\\&\quad \le \sum _{S \subseteq V_0} \frac{|S| + 1}{2^{|S|}} \cdot \Pr [S_1 \cup S_2 = S]. \end{aligned}$$

Note that each alternative is included in \(S_1 \cup S_2\) with probability exactly \(\gamma := 1 - (1 - \frac{10\lambda }{n_0})^2\). Hence, we have \(\Pr [S_1 \cup S_2 = S] = \gamma ^{|S|} (1 - \gamma )^{n_0 - |S|}\). Substituting this in the above inequality, we get

$$\begin{aligned}&\Pr [S_1 \cap S_2 = \emptyset \text { and } S_1 \succ S_2] \\&\quad \le \sum _{S \subseteq V_0} \frac{|S| + 1}{2^{|S|}} \cdot \gamma ^{|S|} (1 - \gamma )^{n_0 - |S|}\\&\quad = \sum _{t=0}^{n_0} \left( {\begin{array}{c}n_0\\ t\end{array}}\right) \cdot (t + 1) \cdot (0.5\gamma )^t (1 - \gamma )^{n_0 - t} \\&\quad = \sum _{t=0}^{n_0} \left( {\begin{array}{c}n_0\\ t\end{array}}\right) \cdot t \cdot (0.5\gamma )^t (1 - \gamma )^{n_0 - t} + \sum _{t=0}^{n_0} \left( {\begin{array}{c}n_0\\ t\end{array}}\right) \cdot (0.5\gamma )^t (1 - \gamma )^{n_0 - t} \\&\quad = n_0 \sum _{t=1}^{n_0} \left( {\begin{array}{c}n_0 - 1\\ t - 1\end{array}}\right) \cdot (0.5\gamma )^t (1 - \gamma )^{n_0 - t} + \sum _{t=0}^{n_0} \left( {\begin{array}{c}n_0\\ t\end{array}}\right) \cdot (0.5\gamma )^t (1 - \gamma )^{n_0 - t} \\&\quad = n_0 (0.5\gamma ) (1 - 0.5\gamma )^{n_0 - 1} + (1 - 0.5\gamma )^{n_0}, \end{aligned}$$

where we use the binomial expansion of \((0.5\gamma +(1-\gamma ))^{n_0-1}\) and \((0.5\gamma +(1-\gamma ))^{n_0}\) for the last equality.

Finally, one can check that \(\gamma \in [10\lambda / n_0, 20\lambda /n_0]\). Therefore, we have

$$\begin{aligned} \Pr [S_1 \cap S_2 = \emptyset \text { and } S_1 \succ S_2]&\le n_0 (0.5\gamma ) \left( 1 - \frac{5 \lambda }{n_0}\right) ^{n_0 - 1} + \left( 1 - \frac{5 \lambda }{n_0}\right) ^{n_0} \\&\le n_0 \left( \frac{10\lambda }{n_0}\right) \left( 1 - \frac{5 \lambda }{n_0}\right) ^{n_0 - 1} + \left( 1 - \frac{5 \lambda }{n_0}\right) ^{n_0} \\&\le 10 \lambda \cdot e^{-4\lambda } + e^{-5\lambda } \\&\le e^{-\lambda }, \end{aligned}$$

where in the third inequality we use Lemma 2.2 and the fact that \(n_0 \ge 10\), which follows from our assumption that \(1\le \lambda \le n_0/10\), and in the last inequality we use the facts that \(\lambda \ge 1\), the function \(f(y) := 10y\cdot e^{-3y} + e^{-4y}\) is decreasing for \(y\in [1,\infty )\), and \(f(1) < 1\). \(\square\)

Lemma 3.2 allows for a short proof of Theorem 3.1.

Proof of Theorem 3.1

Fix a pair of distinct alternatives \(x_i,x_j\). We first bound the probability that \(x_j\) cannot reach \(x_i\) via a directed path of length at most three.

Consider the tournament \(T^0\) defined by restricting T to \(V^0 := V \setminus \{x_i, x_j\}\). Let \(S_1\) denote the set of alternatives in \(V^0\) that dominate \(x_i\) with respect to T, and let \(S_2\) denote the set of alternatives in \(V^0\) that are dominated by \(x_j\) with respect to T. Notice that if \(S_1 \cap S_2 \ne \emptyset\) or \(S_1 \not \succ S_2\), then there is a path of length at most three from \(x_j\) to \(x_i\). Furthermore, from the assumption of the theorem, each alternative belongs to each of \(S_1\) and \(S_2\) independently with probability at least \(\frac{30 \log n}{n}\), which is at least \(\frac{25 \log n}{|V^0|}\) for any \(n\ge 12\). As a result, we may apply Lemma 3.2 with \(\lambda = 2.5 \log n\) (note that the size of \(T^0\) is \(n-2\ge 10\)), which gives

$$\begin{aligned}&\quad \Pr [\text {there is } \text {no path of length at most three from } x_j \text { to } x_i] \\&\le \Pr [S_1 \cap S_2 = \emptyset \text { and } S_1 \succ S_2] \\&\le e^{-\lambda } \\&= 1 / n^{2.5}. \end{aligned}$$

Finally, applying the union bound over all (ordered) pairs of alternatives \(x_i\ne x_j\), the probability that some alternative cannot reach some other alternative via a directed path of length at most three is no more than \(1/n^{0.5}\), which converges to 0 as n goes to infinity. \(\square\)

Next, we show that in the Condorcet random model, if \(p\in \Theta (\log n/n)\) and the associated constant is low enough, there is likely to be an alternative that is not a 4-king. Combined with Theorem 3.1, this implies that the bound \(\Theta (\log n/n)\) is asymptotically tight for both 3- and 4-kings in both random models that we consider.

Theorem 3.3

Assume that a tournament T is generated according to the Condorcet random model, and that \(p \le 0.1\log n/n.\) Then with high probability, there exists an alternative in T that is not a 4-king.

Proof

Let \(r = \left\lceil n^{0.45} \right\rceil\). We assume that \(n \ge 7\); in this range, it holds that \(n > 2r\). Let us partition the set \(V = \{x_1, \dots , x_n\}\) into three parts: \(V_{\text {top}} := \{x_1, \dots , x_r\}\), \(V_{\text {mid}} := \{x_{r + 1}, \dots , x_{n - r}\}\), and \(V_{\text {bottom}} := \{x_{n - r + 1}, \dots , x_n\}\). Furthermore, we define the following four events:

  • \(E_1\): There exists \(x_{i^*} \in V_{\text {top}}\) such that \(x_{i^*} \succ V_{\text {mid}}\).

  • \(E_2\): There exists \(x_{j^*} \in V_{\text {bottom}}\) such that \(V_{\text {mid}} \succ x_{j^*}\).

  • \(E_3\): \(V_{\text {top}} \succ V_{\text {bottom}}\).

  • \(E_4\): For every \(x_\ell \in V_{\text {mid}}\), either \(V_{\text {top}}\succ x_\ell\) or \(x_\ell \succ V_{\text {bottom}}\).

Before we proceed, let us note that if all four events occur, then \(x_{j^*}\) is not a 4-king. To see that this is the case, assume for the sake of contradiction that \(x_{j^*}\) is a 4-king; in particular, there exists a directed path of length at most four from \(x_{j^*}\) to \(x_{i^*}\). From \(E_2\) and \(E_3\), the alternatives dominated by \(x_{j^*}\) form a subset of \(V_{\text {bottom}}\). From \(E_1\) and \(E_3\), the alternatives dominating \(x_{i^*}\) form a subset of \(V_{\text {top}}\). Since there is a path of length at most four from \(x_{j^*}\) to \(x_{i^*}\), there must exist \(x_j \in V_{\text {bottom}}\) and \(x_i \in V_{\text {top}}\) such that \(x_j\) can reach \(x_i\) via a path of length at most two; however, this is impossible by \(E_3\) and \(E_4\).

Below, we will argue that each of \(E_1, E_2, E_3\), and \(E_4\) occurs with high probability. Once this is the case, the union bound together with the argument in the previous paragraph completes the proof of Theorem 3.3.

We start by showing that \(E_1\) occurs with high probability. We may write the probability of the complement event \(\lnot E_1\) as

$$\begin{aligned} \Pr [\lnot E_1]&= \Pr [\forall x_i \in V_{\text {top}}, \exists x_j \in V_{\text {mid}}, x_j \succ x_i] \\&= \prod _{i=1}^r \Pr [\exists x_j \in V_{\text {mid}}, x_j \succ x_i] \\&= \prod _{i=1}^r \left( 1 - \Pr [\forall x_j \in V_{\text {mid}}, x_i \succ x_j]\right) \\&= \prod _{i=1}^r \left( 1 - \prod _{j=r+1}^{n - r} \Pr [x_i \succ x_j]\right) \\&= \prod _{i=1}^r \left( 1 - (1 - p)^{n-2r}\right) \\&= \left( 1 - (1 - p)^{n-2r}\right) ^r \\&\le \left( 1 - (1 - p)^n\right) ^r \\&\le \left( 1 - \frac{1}{(1 + 2p)^n}\right) ^r \\&\le \left( 1 - \frac{1}{e^{2pn}}\right) ^r \\&\le \left( 1 - \frac{1}{n^{0.2}}\right) ^r \le e^{-r / n^{0.2}} \le e^{-n^{0.1}}. \end{aligned}$$

The second inequality holds since \((1-p)(1+2p)\ge 1\) for all \(p\in [0,1/2]\), while the third and fifth inequalities follow from Lemma 2.2. Thus, \(E_1\) occurs with high probability. A symmetric argument implies that \(E_2\) also occurs with high probability.

Next, consider \(E_3\). We may use the union bound to bound \(\Pr [\lnot E_3]\) as follows:

$$\begin{aligned} \Pr [\lnot E_3]&= \Pr [\exists x_i \in V_{\text {top}}, \exists x_j \in V_{\text {bottom}}, x_j \succ x_i] \\&\le \sum _{i=1}^r \sum _{j=n-r+1}^n \Pr [x_j \succ x_i] \\&= \sum _{i=1}^r \sum _{j=n-r+1}^n p \\&= r^2 p \\&\in O\left( \frac{\log n}{n^{0.1}}\right) , \end{aligned}$$

which converges to zero as \(n \rightarrow \infty\). As a result, \(E_3\) also occurs with high probability.

Finally, for \(E_4\), the union bound implies that

$$\begin{aligned} \Pr [\lnot E_4]&= \Pr [\exists x_\ell \in V_{\text {mid}}, \exists x_i \in V_{\text {top}}, \exists x_j \in V_{\text {bottom}}, x_j\succ x_\ell \text { and } x_\ell \succ x_i] \\&\le \sum _{\ell = r+1}^{n-r}\Pr [\exists x_i \in V_{\text {top}}, \exists x_j \in V_{\text {bottom}}, x_j\succ x_\ell \text { and } x_\ell \succ x_i]. \end{aligned}$$

For each \(x_\ell \in V_{\text {mid}}\), the latter probability is

$$\begin{aligned}&\Pr [\exists x_i \in V_{\text {top}}, \exists x_j \in V_{\text {bottom}}, x_j\succ x_\ell \text { and } x_\ell \succ x_i] \\&\quad = \Pr [\exists x_i \in V_{\text {top}}, x_\ell \succ x_i]\cdot \Pr [\exists x_j \in V_{\text {bottom}}, x_j\succ x_\ell ] \\&\quad \le \sum _{i=1}^r \Pr [x_\ell \succ x_i]\cdot \sum _{i=n-r+1}^n \Pr [x_j\succ x_\ell ] \\&\quad = (pr)^2, \end{aligned}$$

where we use independence for the first equality and the union bound for the inequality. Hence,

$$\begin{aligned} \Pr [\lnot E_4] \le n(pr)^2\in O(\log ^2n/n^{0.1}), \end{aligned}$$

which vanishes for large n. This means that \(E_4\) occurs with high probability as well, concluding the proof. \(\square\)

Our results so far demonstrate that k-kings for any \(k\ge 3\) are much closer to the \((n-1)\)-kings than to 2-kings in terms of discriminative power. In the remainder of this section, we show that there is virtually no difference in selectiveness between k-kings for \(k\ge 6\) and \((n-1)\)-kings. We begin by showing that in the Condorcet random model, all alternatives are likely to be 5-kings provided that \(p\in \omega (1/n)\)—this gives a complete characterization of the probability threshold for the Condorcet random model.

Theorem 3.4

Assume that a tournament T is generated according to the Condorcet random model, and that \(p \in \omega (1/n).\) Then with high probability, all alternatives in T are 5-kings.

Proof

If \(p\in \omega (\log n/n)\), the result already follows from Theorem 3.1, so assume that \(p\in O(\log n/n)\). In particular, \(p \in o(1/\sqrt{n})\). Let \(n\ge 5\), \(V_{\text {top}} := \{x_1,x_2\}\), and \(V_{\text {bottom}} := \{x_{n-1},x_n\}\). We define the following three events:

  • \(E_1\): For every \(x_m \not \in V_{\text {top}}\), there exists \(x_i\in V_{\text {top}}\) such that \(x_i\succ x_m\).

  • \(E_2\): For every \(x_\ell \not \in V_{\text {bottom}}\), there exists \(x_j\in V_{\text {bottom}}\) such that \(x_\ell \succ x_j\).

  • \(E_3\): For every \(x_i\in V_{\text {top}}\) and \(x_j\in V_{\text {bottom}}\), there exists a directed path of length at most three from \(x_j\) to \(x_i\).

Before we proceed, let us note that if all three events occur, then all alternatives in T are 5-kings. Indeed, consider any pair of distinct alternatives \(x_\ell\) and \(x_m\). If \(x_\ell \not \in V_{\text {bottom}}\), let \(x_j\in V_{\text {bottom}}\) be such that \(x_\ell \succ x_j\); the existence of such j is guaranteed by \(E_2\). Else, let \(j = \ell\). Analogously, if \(x_m\not \in V_{\text {top}}\), let \(x_i\in V_{\text {top}}\) be such that \(x_i\succ x_m\); the existence of such i is guaranteed by \(E_1\). Else, let \(i=m\). From \(E_3\), \(x_j\) can reach \(x_i\) via a directed path of length at most three. This yields a path of length at most five from \(x_\ell\) to \(x_m\).

By the union bound, it suffices to show that each of \(E_1, E_2\), and \(E_3\) occurs with high probability. For \(E_1\), the probability that its complement \(\lnot E_1\) occurs is

$$\begin{aligned} \Pr [\lnot E_1]&= \Pr [\exists x_m\not \in V_{\text {top}}, \forall x_i\in V_{\text {top}}, x_m\succ x_i] \\&\le \sum _{x_m\not \in V_{\text {top}}} \Pr [\forall x_i\in V_{\text {top}}, x_m\succ x_i] \\&= \sum _{x_m\not \in V_{\text {top}}} \Pr [x_m\succ x_1]\cdot \Pr [x_m\succ x_2] \\&= \sum _{x_m\not \in V_{\text {top}}} p^2 \\&\le p^2n \\&\in o(1), \end{aligned}$$

where the first inequality follows from the union bound and the second equality from independence. An analogous argument shows that \(E_2\) also occurs with high probability.

Finally, consider \(E_3\). Since both \(V_{\text {top}}\) and \(V_{\text {bottom}}\) are of constant size, by the union bound, it suffices to show that for each fixed pair \(x_i\in V_{\text {top}}\) and \(x_j\in V_{\text {bottom}}\), a path of length at most three from \(x_j\) to \(x_i\) exists with high probability. We show this for \(x_1\) and \(x_n\); the proofs for other pairs are similar. Define \(V_{\text {top-half}} = \{x_1,\dots ,x_{\lfloor n/2\rfloor }\}\) and \(V_{\text {bottom-half}} = \{x_{\lceil n/2\rceil + 1},\dots ,x_n\}\). The probability that \(V_{\text {top-half}} \succ x_n\) is

$$\begin{aligned} (1-p)^{\lfloor n/2\rfloor }&\le (1-p)^{n/3} \le e^{-pn/3}, \end{aligned}$$

which converges to 0 as \(n\rightarrow \infty\) since \(p\in \omega (1/n)\); here, the second inequality holds by Lemma 2.2. Hence, with high probability, there exists \(x_\ell \in V_{\text {top-half}}\) such that \(x_n\succ x_\ell\). Likewise, with high probability, there exists \(x_m\in V_{\text {bottom-half}}\) such that \(x_m\succ x_1\). Since \(\ell < m\), the probability that \(x_\ell \succ x_m\) is \(1-p\), which approaches 1 for large n. Hence, the union bound again implies that with high probability, \(x_n\) can reach \(x_1\) via the path \(x_n\succ x_\ell \succ x_m\succ x_1\), as desired. \(\square\)

For the generalized random model, we show that as long as \(p\in \omega (1/n)\), all alternatives are likely to be 6-kings.

Theorem 3.5

Assume that a tournament T is generated according to the generalized random model, and that \(p_{i,j} \in \omega (1/n)\) for all \(i\ne j\). Then with high probability, all alternatives in T are 6-kings.

To prove Theorem 3.5, we first establish the following auxiliary lemma.

Lemma 3.6

For any tournament T, there exists an alternative with both indegree and outdegree larger than \(n/4 - 1\).

Proof

Let us order the alternatives of T as \(x_{\sigma (1)},\dots ,x_{\sigma (n)}\) in nondecreasing order of outdegree, and let \(x^* = x_{\sigma (\lfloor n/2\rfloor )}\). By applying Lemma 2.3 on \(x_{\sigma (1)}, \dots , x_{\sigma (\lfloor n/2\rfloor )}\), we can conclude that the outdegree of \(x^*\) is at least \((\lfloor n/2\rfloor - 1)/2 > n/4 - 1\). Similarly, applying Lemma 2.3 on \(x_{\sigma (\lfloor n/2\rfloor )}, \dots , x_{\sigma (n)}\) implies that the indegree of \(x^*\) is at least \(((n - \lfloor n/2\rfloor + 1) - 1) / 2 > n/4 - 1\). \(\square\)

The high-level idea of the proof of Theorem 3.5 is to identify an alternative \(x^*\) as a “hub” that can both reach and be reached by every other alternative via a path of length at most three. The proof below formalizes this idea.

Proof of Theorem 3.5

Define a tournament \(T'\) on alternatives \(x_1,\dots ,x_n\) so that for each pair \(i\ne j\), we have \(x_i\succ x_j\) if \(p_{i,j}> 1/2\). Then, our tournament T is generated by reversing the edges of \(T'\) so that the edge between \(x_i\) and \(x_j\) is reversed with probability \(q_{i,j} := \min \{p_{i,j},1-p_{i,j}\}\le 1/2\), independently of other edges. Note that \(q_{i,j}\in \omega (1/n)\).

From Lemma 3.6, there exists an alternative \(x^*\) whose indegree and outdegree are both larger than \(n/4 - 1\), which is at least 0.24n for any \(n\ge 100\).

We claim that with high probability, in T, every alternative has a path of length at most three to \(x^*\), and \(x^*\) has a path of length at most three to every alternative; this is sufficient to obtain the desired conclusion. By symmetry, we only need to show the latter claim.

Let \(r=\lceil 0.24n\rceil\), and let \(y_1,\dots ,y_r\) be alternatives dominated by \(x^*\) in \(T'\). For \(1\le i\le r\), let \(Y_i\) be an indicator random variable that indicates whether the edge from \(x^*\) to \(y_i\) is reversed when generating T from \(T'\)—in particular, \(Y_i\) takes on the value 1 if the edge is reversed, and 0 otherwise. Note that \(\mathbb {E}[Y_i] \le 1/2\) for each i, and define \(Y := \sum _{i=1}^r Y_i\). Moreover, define \(Y_i' := Y_i + 1/2 - \mathbb {E}[Y_i]\) and \(Y' := \sum _{i=1}^r Y_i'\). We have \(\mathbb {E}[Y_i'] = 1/2\) and \(\mathbb {E}[Y'] = r/2\). By Lemma 2.1, it follows that

$$\begin{aligned} \Pr [Y \ge 7r/12]&\le \Pr [Y' \ge 7r/12] \\&= \Pr \left[ Y' \ge \frac{7}{6}\cdot \mathbb {E}[Y'] \right] \\&\le \exp \left( -\left( \frac{1}{6}\right) ^2\cdot \frac{\mathbb {E}[Y']}{3}\right) = \exp \left( -\frac{r}{216}\right) , \end{aligned}$$

which converges to 0 for large n. This means that with high probability, we have \(Y < 7r/12\); when this happens, the outdegree of \(x^*\) in T is at least \(5r/12 \ge 0.1n\). From now on, we condition on the event that \(x^*\) has outdegree at least 0.1n in T. Note that this conditioning only involves edges adjacent to \(x^*\). In particular, for any \(i \ne j\) such that \(x^*\not \in \{x_i, x_j\}\), we still have \(x_i \succ x_j\) independently with probability \(p_{i, j}\).

Let S be the set of alternatives dominated by \(x^*\) in T; by our assumption, \(|S| \ge 0.1 n\). Since \(x^*\) can reach each alternative in S via a single edge, it suffices to show that, with high probability, \(x^*\) can reach every alternative in \({\bar{S}} := \{x_1, \dots , x_n\} \setminus (S \cup \{x^*\})\) via a path of length at most three.

Consider the tournament \(T_{\text {res}}\), which is the restriction of T on the alternatives in \({\bar{S}}\). Let \(z_1, \dots , z_{|{\bar{S}}|}\) be the alternatives in \({\bar{S}}\) in nondecreasing order of indegree. For each \(\ell\), let \(D_\ell \subseteq {\bar{S}}\) denote the set of alternatives that dominate \(z_\ell\) in \(T_{\text {res}}\), and let \(\widehat{D}_\ell = D_\ell \cup \{z_\ell \}\). From Lemma 2.3, we have \(|D_\ell |\ge (\ell -1)/2\), and so \(|\widehat{D}_\ell | \ge (\ell + 1) / 2\).

Notice that if some \(s\in S\) dominates some \(d\in \widehat{D}_\ell\), then \(x^*\) can reach \(z_\ell\) via a path of length at most three, i.e., the path \(x^*\succ s\succ z_\ell\) if \(d = z_\ell\), or the path \(x^*\succ s\succ d\succ z_\ell\) if \(d\in D_\ell\). Hence, if there is no path from \(x^*\) to \(z_\ell\) of length at most three, we must have \(\widehat{D}_\ell \succ S\) in T. This happens with probability

$$\begin{aligned} \prod _{x_i \in \widehat{D}_\ell } \prod _{x_j \in S} p_{i, j}&\le (1 - p_{\min })^{|\widehat{D}_\ell | \cdot |S|} \le (1 - p_{\min })^{0.05 (\ell + 1) n}. \end{aligned}$$

where \(p_{\min } := \min _{i \ne j} p_{i, j} \in \omega (1/n)\).

Hence, by union bound, the probability that there exists an alternative in \({\bar{S}}\) which cannot be reached from \(x^*\) via a path of length at most three is bounded above by

$$\begin{aligned} \sum _{\ell =1}^{|{\bar{S}}|} (1 - p_{\min })^{0.05 (\ell + 1) n}&= \sum _{\ell =1}^{|{\bar{S}}|} \left( (1 - p_{\min })^{0.05n}\right) ^{(\ell + 1)} \le \sum _{\ell =1}^{\infty } \left( e^{-0.05p_{\min } n}\right) ^{(\ell + 1)}, \end{aligned}$$

where the inequality holds by Lemma 2.2. Let \(\zeta := e^{-0.05p_{\min } n}\). Since \(p_{\min } \in \omega (1/n)\), we have \(\zeta \in o(1)\). This means that the right-hand side expression above is equal to \(\frac{\zeta ^2}{1 - \zeta } \in o(1)\), which concludes our proof. \(\square\)

In light of Theorem 3.5, the only remaining gap in our probability threshold characterization is for 5-kings in the generalized random model. We conjecture that the true threshold is \(\omega (1/n)\), but our proof of Theorem 3.4 relies on the ordering of the alternatives in the Condorcet random model and cannot be easily extended. Instead, we present a slightly weaker bound of \(\Omega (\log \log n/n)\)—this shows that 5-kings are closer to 6-kings than to 4-kings with respect to selective power. To establish this bound, we need a lemma on generalized dominating sets.

Definition 3.7

Given a positive integer r, a set of alternatives D is said to be an r-dominating set of a tournament T if every alternative \(x \notin D\) is dominated by at least r alternatives in D.

Lemma 3.8

For any tournament T and any positive integer r, there exists an r-dominating set of T of size at most \(r \lceil \log _2 n \rceil\).

Proof

Let us start with \(D = \emptyset\) and repeat the following procedure r times: find a minimum dominating set S of the restriction of T on \(V \setminus D\), and update D to \(D \cup S\). It is clear that the final set D is an r-dominating set of T. Furthermore, it is well-known [33, Fact 2.5] that any tournament has a dominating set of size at most \(\lceil \log _2 n\rceil\), which implies that the final set D is of size at most \(r \lceil \log _2 n \rceil\). \(\square\)

We are now ready to establish our result on 5-kings.

Theorem 3.9

Assume that a tournament T is generated according to the generalized random model, and that \(p_{i,j} \in \left[ 50 (\log \log n)/n, 1 - 50 (\log \log n)/n\right]\) for all \(i\ne j\). Then with high probability, all alternatives in T are 5-kings.

Proof

Define a tournament \(T'\) on alternatives \(x_1,\dots ,x_n\) so that for each pair \(i\ne j\), we have \(x_i\succ x_j\) if \(p_{i,j}> 1/2\). Then, our tournament T is generated by reversing the edges of \(T'\) so that the edge between \(x_i\) and \(x_j\) is reversed with probability \(q_{i,j} := \min \{p_{i,j},1-p_{i,j}\}\le 1/2\), independently of other edges. For each alternative x, let I(x) denote the set of alternatives that dominate x in \(T'\).

Let \(r = \lceil 1.1\log _2 n\rceil\), and let D and \(D_{\text {inv}}\) be a minimum r-dominating set of the tournament \(T'\) and its “inverse” constructed by reversing all edges in \(T'\), respectively. From Lemma 3.8, we have \(|D|, |D_{\text {inv}}| \le r \lceil \log _2 n \rceil \in O((\log n)^2)\).

We define the following three events in T:

  • \(E_1\): For every \(x_m \not \in D\), there exists \(x_i\in D\) such that \(x_i\succ x_m\).

  • \(E_2\): For every \(x_\ell \not \in D_{\text {inv}}\), there exists \(x_j\in D_{\text {inv}}\) such that \(x_\ell \succ x_j\).

  • \(E_3\): For every \(x_i\in D\) and \(x_j\in D_{\text {inv}}\), there exists a directed path of length at most three from \(x_j\) to \(x_i\).

Similarly to the proof of Theorem 3.4, when \(E_1\), \(E_2\), and \(E_3\) all occur, every alternative is a 5-king. As a result, it suffices to show that each of the three events occurs with high probability.

For \(E_1\), we have

$$\begin{aligned} \Pr [\lnot E_1]&= \Pr [\exists x_m\not \in D, \forall x_i\in D, x_m\succ x_i] \\&\le \sum _{x_m\not \in D} \Pr [\forall x_i\in D, x_m\succ x_i] \\&\le \sum _{x_m\not \in D} \Pr [\forall x_i\in D \cap I(x_m), x_m\succ x_i] \\&= \sum _{x_m\not \in D} \prod _{x_i\in D \cap I(x_m)} \Pr [x_m\succ x_i] \\&= \sum _{x_m\not \in D} \prod _{x_i\in D \cap I(x_m)} q_{i, m} \\&\le \sum _{x_m\not \in D} (1/2)^{|D \cap I(x_m)|} \\&\le \sum _{x_m\not \in D} (1/2)^r \le \sum _{x_m\not \in D} 1/n^{1.1} \in o(1), \end{aligned}$$

where the first inequality follows the union bound and the fourth inequality from the fact that D is an r-dominating set in \(T'\). An analogous argument shows that \(E_2\) also occurs with high probability.

Finally, consider \(E_3\). Since both D and \(D_{\text {inv}}\) are of size \(O((\log n)^2)\), by the union bound, it suffices to show that for each fixed pair \(x_i\in D\) and \(x_j\in D_{\text {inv}}\), a path of length at most three from \(x_j\) to \(x_i\) exists with probability \(1 - o(1 / (\log n)^{4})\).

To prove this, consider the tournament \(T^0\) defined by restricting T to \(V^0 := V \setminus \left( D \cup D_{\text {inv}}\right)\). Let \(S_1\) denote the set of alternatives in \(V^0\) that dominate \(x_i\) with respect to T, and let \(S_2\) denote the set of alternatives in \(V^0\) that are dominated by \(x_j\) with respect to T. Notice that if \(S_1 \cap S_2 \ne \emptyset\) or \(S_1 \not \succ S_2\), then there is a path of length at most three from \(x_j\) to \(x_i\). Furthermore, from the assumption of the theorem, each alternative belongs to each of \(S_1\) and \(S_2\) independently with probability at least \(\frac{50 \log \log n}{n}\), which is at least \(\frac{45\log \log n}{|V^0|}\) for any n such that \(|D|,|D_{\text {inv}}|\le n/20\); this holds when \(n\ge 4000\). As a result, we may apply Lemma 3.2 with \(\lambda = 4.5 \log \log n\), which gives

$$\begin{aligned}&\Pr [\text {there is } \text {no path of length at most three from } x_j \text { to } x_i] \\&\quad \le \Pr [S_1 \cap S_2 = \emptyset \text { and } S_1 \succ S_2] \\&\quad \le e^{-\lambda } \\&\quad = 1 / (\log n)^{4.5} \\&\quad \in o(1/ (\log n)^4), \end{aligned}$$

which concludes our proof. \(\square\)

4 Single-elimination winners

In this section, we turn our attention to single-elimination winners and derive a tight bound of \(\Omega (\log n/n)\) for the generalized random model, thereby strengthening the bound \(\Omega (\sqrt{\log n/n})\) of Kim et al. [22] and matching their bound for the Condorcet random model. As in previous work on this subject, we assume for simplicity that \(n=2^r\) for some positive integer r, so \(r = \log _2 n\). In order to construct a winning bracket, a useful notion is that of a “superking”, introduced by Vassilevska Williams [49].

Definition 4.1

Given a tournament T, an alternative x is said to be a superking if for every alternative \(x'\) such that \(x'\succ x\), there exist at least \(\log _2 n\) alternatives \(x''\) such that \(x\succ x''\) and \(x''\succ x'\).

Lemma 4.2

([49]). In any tournament, every superking is a single-elimination winner, and its winning bracket can be computed in polynomial time.

Before we proceed to our result, let us briefly recap the proofs of the two aforementioned results by Kim et al. [22], and explain why they cannot be used to establish our desired strengthening. In order to show that all alternatives are single-elimination winners with high probability when \(p\in \Omega (\sqrt{\log n/n})\), Kim et al. showed that all alternatives are likely to be superkings in this range; it is not difficult to verify that this condition no longer holds when \(p\in o(\sqrt{\log n/n})\).Footnote 8 For the \(\Omega (\log n/n)\) bound in the Condorcet random model, they argued that the weakest alternative x is likely to dominate one of the top n/6 alternatives, and constructed a winning bracket for this latter alternative y among the top half of the alternatives, so that x can play y in the final round and win the single-elimination tournament. Since there is no clear notion of strength in the generalized random model (indeed, all alternatives may be roughly equally strong, with no linear order), this approach also does not work for our purpose.

At a high level, our proof proceeds by choosing \(r = \log _2n\) alternatives that our desired winning alternative x dominates. In order to ensure that x can play against these alternatives in the r rounds, we partition the r alternatives along with the remaining alternatives into subsets of size \(1,2,\dots ,2^{r-1}\), so that the r alternatives are superkings in the respective subtournament and can therefore win a single-elimination tournament with respect to their subset by Lemma 4.2.

Theorem 4.3

Assume that a tournament T is generated according to the generalized random model, and that \(p_{i,j} \in \left[ 100\log n/n, 1 - 100\log n/n\right]\) for all \(i\ne j\). Then with high probability, all alternatives in T are single-elimination winners, and a winning bracket of each alternative can be computed in polynomial time.

Proof

By union bound, it suffices to show that for each alternative x, with probability \(1-o(1/n)\), x is a single-elimination winner and its winning bracket can be computed in polynomial time.

Fix an alternative x. First, note that by Lemma 2.3, any subset of alternatives of size at least \(0.94n-1\) has average expected outdegree at least \((0.94n-2)/2\), which is greater than \(0.45n + 1\) for \(n > 100\) (since n is assumed to be a power of two in this section, we must have \(n\ge 128\)). Hence, there exist at least 0.06n alternatives besides x with expected outdegree at least \(0.45n + 1\). Call this set of alternatives H. Let \(h := |H| \ge 0.06n\), and let \(z_1,\dots ,z_h\) be the alternatives in H.

For \(1\le i\le h\), let \(Z_i\) be an indicator random variable that indicates whether x dominates \(z_i\) with respect to T—in particular, \(Z_i\) takes on the value 1 if \(x\succ z_i\), and 0 otherwise. Note that \(\mathbb {E}[Z_i] \ge 80\log n/n\) for each i. Define \(Z := \sum _{i=1}^h Z_i\), so \(\mathbb {E}[Z] \ge (100\log n/n) \cdot |H| \ge 6 \log n \ge 3.7r\). By Lemma 2.1, we have

$$\begin{aligned} \Pr [Z \le r]&\le \Pr \left[ Z \le \frac{1}{3.7}\cdot \mathbb {E}[Z] \right] \\&\le \exp \left( -\left( \frac{2.7}{3.7}\right) ^2\cdot \frac{\mathbb {E}[Z]}{2}\right) \\&\le \exp \left( -\left( \frac{2.7}{3.7}\right) ^2\cdot 3 \log n\right) \\&\le n^{-1.5} \\&\in o(1/n). \end{aligned}$$

From now on, we assume that \(Z > r\), meaning that x dominates at least r alternatives in H with respect to T. Let \(y_1,\dots ,y_r\) be r such alternatives. We will show that with probability \(1-o(1/n)\), there exists a partition of \(V\setminus \{x\}\) into r sets \(V_1,\dots ,V_r\) such that for each \(1\le i\le r\), the set \(V_i\) has size \(2^{i-1}\) and contains \(y_i\), and moreover \(y_i\) is a single-elimination winner in the tournament T restricted to \(V_i\). This is sufficient because we can then choose a bracket so that x beats \(y_1\) in the first round, \(y_2\) in the second round, and so on, until beating \(y_r\) in the final round to win the single-elimination tournament.

Consider an alternative \(y\in \{y_1,\dots ,y_r\}\), and observe that by definition of H, for \(n > 100\), the expected outdegree of y with respect to alternatives other than x is at least 0.45n. Applying Lemma 2.1 in a similar manner as above, the probability that the outdegree of y in T upon excluding the edge between y and x is less than 0.4n is at most

$$\begin{aligned} \exp \left( -\left( \frac{1}{9}\right) ^2\cdot \frac{0.45n}{2}\right) \le e^{-0.002n} \in o(1/n^2). \end{aligned}$$

By union bound, we have that with probability at least \(1-o(1/n)\), every alternative \(y_i\) has outdegree at least 0.4n in T. Assume from now on that this is the case.

We now construct our desired partition according to the following steps:

  1. 1.

    For each \(1\le i\le r\), add \(y_i\) to \(V_i\).

  2. 2.

    For each \(1\le i\le r - 3\), add \(2^{i-1}-1\) remaining alternatives that \(y_i\) dominates in T to \(V_i\).

  3. 3.

    For each \(r-2\le i\le r\), add \(\lfloor 0.091n \rfloor -1\) remaining alternatives that \(y_i\) dominates in T to \(V_i\).

  4. 4.

    For each \(r-2\le i\le r\), add remaining alternatives to \(V_i\) so that \(|V_i| = 2^{i-1}\).

To see that this construction procedure is well-defined, observe that in the first three steps, we add a total of

$$\begin{aligned} r + \sum _{i=1}^{r-3}(2^{i-1}-1) + \sum _{i=r-2}^r(\lfloor 0.091n \rfloor -1)&= \sum _{i=1}^{r-3} 2^{i-1} + \sum _{i=r-2}^r \lfloor 0.091n \rfloor \\&\le (2^{r-3}-1) + 3\cdot 0.091n \\&= 0.398n - 1 \end{aligned}$$

alternatives. Including x, the total number of unavailable alternatives at any point during these three steps is therefore at most \(0.398n < 0.4n\). Since every \(y_i\) has outdegree at least 0.4n in T, there always exist available alternatives.

It remains to show that each \(y_i\) is a superking in \(V_i\) with sufficiently high probability. For \(1\le i\le r-3\), this holds with probability 1 because \(y_i\) dominates all other alternatives in \(V_i\). For \(r-2\le i\le r\), by construction, for \(n > 100\), \(y_i\) dominates at least \(0.091n - 2 \ge 0.071n\) alternatives in \(V_i\) with respect to T; call this set of alternatives W. Let u be any alternative in \(V_i\) that dominates \(y_i\). The expected number of alternatives in W that dominate u is at least \((100\log n/n)\cdot (0.071n) = 7.1\log n > 4.9r\). Applying Lemma 2.1 again, the probability that the number of alternatives in W dominating u is less than r is bounded above by

$$\begin{aligned} \exp \left( -\left( \frac{3.9}{4.9}\right) ^2\cdot \frac{7.1\log n}{2}\right) \le n^{-2.2}. \end{aligned}$$

Taking the union bound over all u, we find that \(y_i\) is a superking with probability \(1-o(1/n)\). A union bound over all \(r-2\le i \le r\) implies that with this probability, \(y_i\) is a superking in \(V_i\) for all i, and therefore x is a single-elimination winner.

Finally, from Lemma 4.2, a winning bracket for \(y_i\) in the subtournament with alternatives \(V_i\) can be found in polynomial time. Moreover, our partitioning procedure can be implemented efficiently. Hence, we can compute a winning bracket for x in polynomial time. \(\square\)

5 Number of kings

So far, we have addressed the question of when each tournament solution is likely to select all alternatives, i.e., the case where the solution is decidedly not useful. In this section, we move beyond this question (which is also the focus of several previous works) and ask when a small or large number of alternatives are chosen. Our first result shows that even though \(p\in \Omega (\sqrt{\log n/n})\) is required for all alternatives to be 2-kings with high probability [41], the smaller threshold of \(p\in \Omega (\log n/n)\) suffices in order for most of the alternatives to be included.

Theorem 5.1

Assume that a tournament T is generated according to the generalized random model, and that \(p_{i,j} \in \left[ 50\log n/n, 1 - 50\log n/n\right]\) for all \(i\ne j\). Then with high probability, at least 0.9n alternatives in T are 2-kings.

Proof

Assume without loss of generality that the alternatives before the tournament T is generated are \(x_1,\dots ,x_n\) in nonincreasing order of expected outdegree. Let \(r = \lceil 0.9n \rceil\), and consider any \(1\le i\le r\). We will show that the probability that \(x_i\) is a 2-king in T is at least \(1 - o(1/n)\). The union bound then implies that with high probability, at least 0.9n alternatives in T are 2-kings.

To show that \(\Pr [x_i \text { is a } 2\text {-king in } T]\) is at least \(1 - o(1/n)\), the union bound again implies that it suffices to prove that the probability that there is no path of length at most two from \(x_i\) to \(x_j\) is at most \(o(1/n^2)\) for each alternative \(x_j \ne x_i\).

We henceforth fix \(1\le i\le r\) and \(x_j \ne x_i\). By Lemma 2.3 on \(x_i,x_{i+1}, \dots , x_n\), the expected outdegree of \(x_i\) is at least \((n - i) / 2\), which is at least 0.045n for any \(n\ge 100\). As a result, for this range of n, we have

$$\begin{aligned}&\Pr [\text {there is no path of length at most two from } x_i \text { to } x_j] \\&\quad = \Pr [x_j \succ x_i]\cdot \Pr [\forall k \notin \{i,j\}, x_k \succ x_i \text { or } x_j \succ x_k] \\&\quad = (1 - p_{i,j})\cdot \prod _{k \notin \{i,j\}} \Pr [x_k \succ x_i \text { or } x_j \succ x_k] \\&\quad = (1 - p_{i,j})\cdot \prod _{k \notin \{i,j\}} \left( 1 - p_{i,k} p_{k,j}\right) \\&\quad \le \exp (-p_{i,j})\cdot \prod _{k \notin \{i,j\}} \exp (- p_{i,k} p_{k,j}) \\&\quad \le \exp \left( -\frac{50\log n}{n} \cdot p_{i,j}\right) \cdot \prod _{k \notin \{i,j\}} \exp \left( -\frac{50\log n}{n} p_{i,k}\right) \\&\quad = \exp \left( -\frac{50\log n}{n} \left( \sum _{k \ne i} p_{i,k}\right) \right) \\&\quad \le \exp \left( -\frac{50\log n}{n} \cdot 0.045 n\right) \\&\quad = 1/n^{2.25} \\&\quad \in o(1/n^2). \end{aligned}$$

We use Lemma 2.2 for the first inequality, and the assumption that the expected outdegree of \(x_i\) is at least 0.045n for the last inequality. This concludes our proof. \(\square\)

Our next result establishes the asymptotic tightness of the threshold in Theorem 5.1, and implies that the transition from a small to a large number of 2-kings occurs at \(p\in \Theta (\log n/n)\) for both random models.

Theorem 5.2

Assume that a tournament T is generated according to the Condorcet random model, and that \(p \le 0.1\log n/n.\) Then with high probability, at most \(\sqrt{n} \log n\) alternatives in T are 2-kings.

Proof

Let \(n\ge 4\) and \(r = \left\lceil \sqrt{n} \right\rceil\). Let us partition the set \(V = \{x_1, \dots , x_n\}\) into two parts: \(V_{\text {top}} := \{x_1, \dots , x_r\}\) and \(V_{\text {bottom}} := \{x_{r + 1}, \dots , x_n\}\). Furthermore, let S denote the set of all alternatives in \(V_{\text {bottom}}\) that dominate at least one alternative in \(V_{\text {top}}\). We define the following two events:

  • \(E_1\): There exists \(x_{i^*} \in V_{\text {top}}\) such that \(x_{i^*} \succ V_{\text {bottom}}\).

  • \(E_2\): \(|S| \le 0.2 r \log n\).

Before we proceed, let us note that if both events occur, there are at most \(\sqrt{n} \log n\) 2-kings. To see this, observe that when \(E_1\) occurs, an alternative \(x \in V_{\text {bottom}}\) can be a 2-king only if \(x \in S\). As a result, when both events occur, the number of 2-kings is at most \(r + |S|\), which is at most \(\sqrt{n} \log n\) for any sufficiently large n. We now argue that each of \(E_1, E_2\) occurs with high probability; the union bound then completes the proof.

We start by showing that \(E_1\) occurs with high probability. We write the probability of the complement event \(\lnot E_1\) as

$$\begin{aligned} \Pr [\lnot E_1]&= \Pr [\forall x_i \in V_{\text {top}}, \exists x_j \in V_{\text {bottom}}, x_j \succ x_i] \\&= \prod _{i=1}^r \Pr [\exists x_j \in V_{\text {bottom}}, x_j \succ x_i] \\&= \prod _{i=1}^r \left( 1 - \Pr [\forall x_j \in V_{\text {bottom}}, x_i \succ x_j]\right) \\&= \prod _{i=1}^r \left( 1 - \prod _{j=r+1}^n \Pr [x_i \succ x_j]\right) \\&= \prod _{i=1}^r \left( 1 - (1 - p)^{n-r}\right) \\&= \left( 1 - (1 - p)^{n-r}\right) ^r \\&\le \left( 1 - (1 - p)^n\right) ^r \\&\le \left( 1 - \frac{1}{(1 + 2p)^n}\right) ^r \\&\le \left( 1 - \frac{1}{e^{2pn}}\right) ^r \\&\le \left( 1 - \frac{1}{n^{0.2}}\right) ^r \le e^{-r / n^{0.2}} \le e^{-n^{0.3}}. \end{aligned}$$

The second inequality holds since \((1-p)(1+2p)\ge 1\) for all \(p\in [0,1/2]\), while the third and fifth inequalities follow from Lemma 2.2. Thus, \(E_1\) occurs with high probability.

Next, we consider \(E_2\). Let \(P := \{(x_i, x_j) \in V_{\text {top}} \times V_{\text {bottom}} \mid x_j \succ x_i\}\). Notice that each \((x_i, x_j) \in V_{\text {top}} \times V_{\text {bottom}}\) belongs to P independently with probability \(p \le 0.1 \log n / n\). We will bound the probability that \(|P| \ge 0.2 r \log n\). This probability does not increase when p decreases, so it suffices to consider \(p = 0.1\log n / n\). For this p, we have \(\mathbb {E}[|P|] = p r (n - r) \le 0.1r \log n\), and moreover \(pr(n - r) \ge 0.01r \log n\) for any \(n \ge 4\). As a result, by Lemma 2.1, we have

$$\begin{aligned} \Pr \left[ |P| \ge 0.2 r \log n\right]&\le \Pr \left[ |P| \ge 2\mathbb {E}[|P|]\right] \\&\le \exp \left( -\frac{\mathbb {E}[P]}{3}\right) \\&\le \exp \left( -\frac{0.01r \log n}{3}\right) \in o(1). \end{aligned}$$

Finally, notice that \(|S| \le |P|\), which implies that \(E_2\) happens with high probability, as desired. \(\square\)

Finally, we show that as long as \(p\in \omega (1/n)\), a large number of alternatives are already likely to be 3-kings (and therefore k-kings for every \(k\ge 3\)). This bound is again tight: when \(p\in O(1/n)\) and the tournament is generated from the Condorcet random model, there is at least a constant probability that the strongest alternative dominates all remaining alternatives (in which case it is the only k-king for each \(k\ge 2\)). In particular, for each \(k\ge 3\), the transition from a small to a large number of k-kings occurs at \(p\in \Theta (1/n)\) for both random models.

Theorem 5.3

Assume that a tournament T is generated according to the generalized random model, and that \(p_{i,j} \in \omega (1/n)\) for all \(i\ne j\). Then with high probability, at least 0.9n alternatives in T are 3-kings.

Proof

Assume without loss of generality that the alternatives before the tournament T is generated are \(x_1,\dots ,x_n\) in nonincreasing order of expected outdegree. Let \(p = \min _{i \ne j} p_{i, j}\); by our assumption, \(p \in \omega (1/n)\). Furthermore, let \(r = \lceil 0.95n \rceil\), and consider any \(1\le i\le r\).

We will show that the probability that \(x_i\) is not a 3-king in T is at most o(1). Let \(q_i\) denote this probability, and let \(q = \max _{i=1}^r q_i\). By linearity of expectation, for any \(n\ge 100\), we have

$$\begin{aligned} \mathbb {E}[|\{i \mid 1 \le i \le r \text { and } x_i \text { is not a 3-king}\}|] \le 0.96nq. \end{aligned}$$

As a result, by Markov’s inequality, we can derive

$$\begin{aligned}&\Pr [\text {there are less than } 0.9n \,3\text {-kings}] \\&\le \Pr [|\{i \mid 1 \le i \le r \text { and } x_i \text { is not a}\, 3\text {-king}\}| \ge 0.05 n] \\&\quad \le \frac{0.96nq}{0.05n} = 96q/5 \in o(1), \end{aligned}$$

as desired.

It remains to show that for each \(1\le i\le r\), the probability that \(x_i\) is not a 3-king is o(1). By Lemma 2.3 on \(x_i,x_{i+1}, \dots , x_n\), the expected outdegree of \(x_i\) is at least \((n - i) / 2\), which is at least 0.02n for any \(n\ge 100\). Let U denote the set of alternatives dominated by \(x_i\) in T. Note that |U| is a sum of \(n - 1\) independent Bernoulli random variables, and \(\mathbb {E}[|U|] \ge 0.02 n\). Applying Lemma 2.1, we have

$$\begin{aligned} \Pr [|U| < 0.01n] \le \exp \left( -\frac{1}{4} \cdot \frac{0.02n}{2}\right) \in o(1). \end{aligned}$$

Thus, we may henceforth assume that \(|U| \ge 0.01n\).

Now, let \(T_i\) denote the tournament T restricted to \(V \setminus (U \cup \{x_i\})\). Let us order the alternatives in \(T_i\) as \(x_{\psi (1)}, \dots , x_{\psi (L)}\) in nondecreasing order of indegree in \(T_i\), where \(L = n - |U| - 1\). For each \(j = 1, \dots , L\), let \(D_j\) denote the set of alternatives in \(T_i\) that dominate \(x_{\psi (j)}\). We have

$$\begin{aligned}&\Pr [\text {there is no path of length at most three from } x_i \text { to } x_{\psi (j)}] \\&\quad \le \Pr [D_j \cup \{x_j\} \succ U] \\&\quad = \prod _{x_k \in D_j \cup \{x_j\}} \prod _{x_\ell \in U}\Pr [x_k \succ x_\ell ] \\&\quad = \prod _{x_k \in D_j \cup \{x_j\}} \prod _{x_\ell \in U} (1 - p_{\ell k}) \\&\quad \le \prod _{x_k \in D_j \cup \{x_j\}} \prod _{x_\ell \in U} (1 - p) \\&\quad = (1 - p)^{(|D_j| + 1)|U|} \le e^{-0.02np(|D_j| + 1)} \le e^{-0.01np(j + 1)}, \end{aligned}$$

where in the last two inequalities we use Lemma 2.2 and the fact that \(|D_j| \ge (j - 1)/2\), which follows from Lemma 2.3, respectively.

Hence, by the union bound, the probability that some alternative in \(T_i\) is unreachable from \(x_i\) in at most three steps is no more than

$$\begin{aligned} \sum _{j=1}^L e^{-0.01np(j + 1)} \le \frac{e^{-0.02np}}{1 - e^{0.01np}}, \end{aligned}$$

which is o(1) due to our assumption that \(np = \omega (1)\). As a result, the probability that \(x_i\) is not a 3-king is o(1), as claimed. \(\square\)

6 Experiments

In this section, we present experimental results on the behavior of k-kings in random tournaments. Besides the Condorcet random model, we also consider an arguably more realistic model introduced by Saile and Suksompong [41] called the “gap model”.

6.1 Condorcet random model

We start with the Condorcet random model. For each \(n\in \{10,20,40,60,80,100\}\), each \(p\in \{0, 0.02, 0.04, 0.06, \dots , 0.5\}\), and each \(k\in \{2,3,4,n-1\}\), we generated 10000 tournaments according to the Condorcet random model with parameters n and p, and took the average of the number of k-kings in the generated tournaments. In order to determine the number of k-kings, a simple iterative algorithm running in time \(O(n^2)\) suffices for the case \(k = n-1\) [4, p. 71], while an efficient algorithm based on matrix multiplication can be used for the cases \(k=2,3,4\) [4, p. 68].Footnote 9 The results are shown in Fig. 4; for ease of comparison, we illustrate the percentage (as opposed to the number) of k-kings among all alternatives in the tournament.Footnote 10

Fig. 4
figure 4

Percentage of alternatives that are k-kings in tournaments generated by the Condorcet random model with probability p, for \(k\in \{2,3,4,n-1\}\) and different sizes n of the tournament. The horizontal and vertical axes correspond to the probability p and the percentage, respectively. Averages are taken over 10000 generated tournaments

From the resulting graphs, it is clear that 3-, 4-, and \((n-1)\)-kings behave much more similarly to one another than to 2-kings, as our theoretical findings predict. Note that \(p=0\) always gives rise to a transitive tournament, which has only one k-king for every k, so the percentage of k-kings is always \((100/n)\%\) in this case. As p increases, the tournament becomes less skewed, and the number of k-kings also rises as a result. This rise is significantly faster for larger n, since there are more intermediate alternatives through which an alternative can reach another alternative. Indeed, for \(n=100\), even when p is only 0.1, already \(80\%\) of the alternatives are 2-kings and \(100\%\) are 3, 4, and \((n-1)\)-kings. A perhaps surprising observation is that 4-kings behave less similarly to 3-kings than to \((n-1)\)-king, even though Table 1 appears to predict the opposite outcome. This behavior may be due to the facts that the difference between \(\Theta (\log n/n)\) and \(\Theta (\log \log n/n)\) is small, so the difference in the constant factors plays a more noticeable role.

Our results in Sect. 5 show that the transition from a small number to a large number of 2-kings occurs in the range \(p\in \Theta (\log n/n)\), with the threshold \(p \ge 50\log n/n\) guaranteeing that at least \(90\%\) of the alternatives are 2-kings. For \(n=100\), this percentage of 2-kings is reached when \(p = 0.14\approx 3\log 100/100\), which means that there is likely to be room for improvement in Theorem 5.1 with respect to the constant factor. A similar improvement is also plausible for Theorem 5.2, as \(0.1\log 100/100 \approx 0.005\), whereas our experiments exhibit that even when \(p = 0.02\), only \(10\%\) of the alternatives are 2-kings.

For completeness, we also provide experimental results on the fraction of tournaments in which all alternatives are k-kings in Appendix 1.

6.2 Gap model

Next, we consider another random model for generating tournaments called the gap model [41]. Like in the Condorcet random model, in the gap model there is a parameter \(p\in [0,1/2]\) and an ordering \(x_1,\dots ,x_n\) of the alternatives. However, instead of the probability of \(x_j\) dominating \(x_i\) being p for all \(i < j\), this probability is now \(0.5 - \frac{(0.5-p)(j-i)}{n-1}\). As a result, although the domination probability remains p when \((i,j) = (1,n)\), the probability is much closer to 0.5 when i and j are close to each other. In particular, even when \(p=0\), the tournament can be far from transitive. We conducted the same experiments for the gap model as we did for the Condorcet random model, with the exception being that we chose lower values of n, i.e., \(n\in \{5,10,20,30,40,60\}\). The reason behind this choice is that when \(n > 60\), very close to \(100\%\) of the alternatives are k-kings for every k. The results of our experiments are shown in Fig. 5. Note that for \(n=5\), even though 4-kings and \((n-1)\)-kings coincide, their corresponding curves slightly differ because the tournaments are generated anew for each k.

Fig. 5
figure 5

Percentage of alternatives that are k-kings in tournaments generated by the gap model with probability p, for \(k\in \{2,3,4,n-1\}\) and different sizes n of the tournament. The horizontal and vertical axes correspond to the probability p and the percentage, respectively. Averages are taken over 10000 generated tournaments

Similarly to the Condorcet random model, our results for the gap model reveal a clear separation between 2-kings and k-kings for \(k\ge 3\). Moreover, the average number of k-kings is much higher in the gap model (especially for low values of p), which indicates that the proportion of k-kings in real-life tournaments is likely to be higher than in tournaments generated according to the Condorcet random model. We remark that even when \(p = 0\), the experimental findings suggest that the percentage of k-kings converges to \(100\%\) for every k as n grows. This result has not been established theoretically by our work (or any prior work), because the probability that alternative \(x_n\) dominates alternative \(x_1\) is 0 when \(p = 0\), so the results on the generalized random model do not apply. Establishing this result and studying the gap model theoretically is an interesting direction for future work.

As with the Condorcet random model, we present results on the fraction of tournaments in which all alternatives are k-kings in Appendix 1.

7 Concluding remarks

In this paper, we have extensively investigated the behavior of generalized kings and single-elimination winners in random tournaments in view of their discriminative power. Our results reveal surprisingly clear distinctions between 2-kings and k-kings for \(k\ge 3\), and illustrate why manipulating a single-elimination tournament is often possible in practice despite the problem being NP-hard. All of the bounds that we obtained are asymptotically tight except for the bound for 5-kings in the generalized random model (Theorem 3.9); one could try to close this gap.

An exciting future direction in our view is to study k-kings and single-elimination winners with respect to axiomatic and computational properties, as is commonly done for other tournament solutions [4, 27]. For example, the set of 2-kings (i.e., the uncovered set) is known to be the finest tournament solution satisfying Condorcet consistency, neutrality, and expansion [38]. Which axioms does the set of 3-kings satisfy, and can we characterize it by a collection of axioms? One could also study the relationship between these tournament solutions and traditional ones—this was partially done by Kim et al. [22], who showed for instance that any alternative in the Copeland set or the Slater set can always win a single-elimination tournament. Another possible avenue is to extend our results to other stochastic models for tournaments—several interesting models have been studied experimentally by Brandt and Seedig [8] and Brill et al. [10]. Such questions illustrate the richness of tournaments and probabilistic models, which we expect to give rise to further fruitful research.