Nur wer mit geringen Mitteln Großes tut, hat es glücklich getroffen.Footnote 1

1 Introduction

This paper aims to connect two seemingly distant fields of academic research: scientometrics and sports economics. However, the approach is not unique, Kóczy and Strobel (2010) and Lehmann and Wohlrabe (2017) actually do the reverse by applying sports ranking methods to journals.

Following the pioneering work of Garfield (1979), a plethora of measures have been suggested, tested, and debated to quantify the influence of a scholar (Ravallion and Wagstaff 2011; Glänzel and Moed 2013; Wildgaard et al. 2014). Most academics have probably met with the h-index during the evaluation of grant proposals. Nonetheless, there is no consensus on what is the best bibliometric index, mainly due to the need to balance quantity (represented by the number of papers written) and quality (represented by the number of citations to those papers). This dilemma has motivated several articles on the axiomatisation of scientometric measures which can greatly help our understanding of their characteristics (Marchant 2009; Bouyssou and Marchant 2010; Kóczy and Nichifor 2013; Bouyssou and Marchant 2014; de la Vega and Volij 2018).

The economic analysis of team sports is often centred around the issue of competitive balance since the celebrated article of Rottenberg (1956). Any sporting contest establishes a hierarchy among the competitors by differentiating between winners and losers. Therefore, uncertainty of outcome has at least three different dimensions (Buzzacchi et al. 2003): (a) short-term (whether the ex ante winning probabilities of the playing teams are equal in a match); (b) medium-term (whether all teams have the same chance to be the champion in a given season); (c) long-term (how volatile is the relative standing of different clubs across some seasons). Obviously, the measurement of competitive balance is much more than a simple academic problem. For example, several antitrust decisions are based on the claim that fan interest decreases when the uncertainty of outcome declines, thus, public interest demands to maintain competitive balance (Szymanski 2003).

While there are numerous attempts to quantify the level of competition in sports leagues (Zimbalist 2002; Manasis et al. 2013; Manasis and Ntzoufras 2014), there exists much less literature on how to measure it in knockout tournaments (del Corral 2009; Considine and Gallagher 2018). However, several popular sports championships—the tennis Grand Slam tournaments, the final stages of the FIFA World Cups and UEFA European Championships in (association) football, the playoffs in the NBA, NFL, NHL—are designed in this format. A natural expectation of both the fans and the regulators can be to avoid the dominance of the same players or teams across several seasons.

The current paper aims to quantify long-term uncertainty of outcome in such tournaments. The main challenge is similar to the central problem of bibliometrics: What is the trade-off between quantity and quality? Does a team obtain a greater market share when it has won the title once or when it has never won but qualified for the semifinals four times? A further difficulty of the measurement can be that the teams participating in a given year may not participate the year after (Koning 2009).

We offer a solution to these challenges by using bibliometric indices. In particular, the Euclidean index (Perry and Reny 2016) and the rectangle index (Fenner et al. 2018; Levene et al. 2019) will be considered with various weights assigned for the achievements of the teams, analogous to the number of times a paper has been referred to. An axiomatic comparison will be provided to highlight the differences between the two measures. The Euclidean index rewards top achievements and is always increased when the performance improves. On the other hand, the rectangle index focuses on core performance. For example, if a team usually qualifies for the semifinals, it does not count whether it is eliminated in the Round of 16 or the quarterfinals in the next year. Since both concepts can be justified, we do not promote any of them over the other.

Several papers quantify uncertainty of outcome between consecutive seasons by using sophisticated statistical techniques. While our approach is much simpler, that can help its spread among the stakeholders as the popularity of scientometric indices shows: it is usually more attractive—although might be potentially misleading—to express a complex phenomenon in sports (or any other field) with a single number. The suggested framework can be understood with limited methodological knowledge, and one can easily “open the black box” behind the results. Anyway, measuring trends over time in a novel manner compared to the prior literature is able to reinforce or deny well-established findings which is an important issue in academic research, even though the conclusions of the analysis do not fundamentally differ from the findings of the “standard” literature.

The theoretical overview is followed by a real-world application. The proposed methodology will be used first to quantify the performance of clubs and national associations in the recent seasons of the UEFA Champions League, the most prestigious European football tournament. In particular, a rolling five-season window is chosen because both the UEFA club and country coefficients—that measure the results of individual clubs (for seeding in the Champions League) and the member associations (for determining the access list of the Champions League), respectively—take into account the matches played over the past five seasons of European competitions. Milanovic (2005) also examines 5-year periods as reasonable over which concentration can be observed and calculated.

After that, three types of competitive balance will be analysed at the level of clubs, between the countries, and within the countries. It seems that previous studies have not discussed competition at the level of national associations in the UEFA Champions League. Therefore, they have failed to identify if different clubs from the same country have dominated the tournament. We can grab even this aspect of reduced competition.

The rest of the paper is organised as follows. Section 2 presents the two bibliometric indices and their properties. The UEFA Champions League data and the details of our application are discussed in Sect. 3, followed by a concise overview of previous literature on competitive balance in European football and the analysis of the results in Sect. 4. Finally, Sect. 5 concludes.

2 Methodology: bibliometric indices

This section presents the axiomatic comparison of two bibliometric indices that will be used to quantify competitive balance. While it does not contain any new results, it might be useful for readers who are not familiar with the principles behind these measures.

Consider a championship consisting of n races, where each position in each race earns some non-negative points for the competitors. The points scored by a contestant are collected into the score vector \({\textbf{x}} = \left[ x_1, x_2, \dots , x_n \right] \) that is sorted in descending order, i.e. \(x_i \ge x_j\) for all \(1 \le i < j \le n\). The achievement of a contestant in the whole championship is measured by an aggregation function f that maps score vectors to the set of non-negative real numbers.

The score vector is equivalent to a citation vector of a scholar, consequently, a bibliometric index can be used for its evaluation. In the case of competitive balance, most papers use the simple aggregation function \(f({\textbf{x}}) = \sum _{i=1}^n x_i\), essentially counting the number of points for each competitor. However, this is not necessarily an appropriate approach in a knockout tournament. We consider two other bibliometric indices, in particular, the Euclidean index (Perry and Reny 2016) and the rectangle index (Fenner et al. 2018; Levene et al. 2019).

The Euclidean index \(E({\textbf{x}})\) of a score vector \({\textbf{x}} = \left[ x_1, x_2, \dots , x_n \right] \) is the Euclidean norm of the score vector:

$$\begin{aligned} E \left( {\textbf{x}} \right) = \sqrt{\sum _{i=1}^n x_i^2}. \end{aligned}$$

The rectangle index \(R({\textbf{x}})\) of a score vector \({\textbf{x}} = \left[ x_1, x_2, \dots , x_n \right] \) is the area of the largest rectangle that can fit under the score vector:

$$\begin{aligned} R \left( {\textbf{x}} \right) = \max _{1 \le i \le n} i x_i. \end{aligned}$$

If the largest rectangle is a square, the rectangle index is equivalent to the well-known h-index (Hirsch 2005), that is, the maximal number h of races where the contestant scored at least h points in each of them.

In order to better understand these measures, it is worth using an axiomatic approach. Some reasonable requirements for aggregation functions are the following:

  • Monotonicity: if score vector \({\textbf{x}} = \left[ x_1, x_2, \dots , x_n \right] \) is dominated by score vector \({\textbf{y}} = \left[ y_1, y_2, \dots , y_n \right] \), that is, \(x_i \le y_i\) holds for all \(1 \le i \le n\), then \(f({\textbf{x}}) \le f({\textbf{y}})\).

  • Independence: if \(f({\textbf{x}}) \le f({\textbf{y}})\) and a new, \(n+1\)th race is added to the championship where both competitors score the same number of points, then the ordinal ranking of their achievements does not change.

  • Depth relevance: if score vector \({\textbf{x}}\) is modified such that a new, \(n+1\)th race is added and the number of points \(x_i\) in the ith race is split into two parts between the ith and \(n+1\)th races, then the overall achievement should decrease.

  • Scale invariance: \(f({\textbf{x}}) \le f({\textbf{y}}) \iff f(c{\textbf{x}}) \le f(c{\textbf{y}})\) for any \(c > 0\).

  • Directional consistency: if \(f({\textbf{x}}) = f({\textbf{y}})\) holds for the score vectors \({\textbf{x}}\) and \({\textbf{y}}\), and both of them are shifted by the same score vector \({\textbf{d}}\) such that \(f({\textbf{x}} + {\textbf{d}}) = f({\textbf{y}} + {\textbf{d}})\), then \(f({\textbf{x}} + \lambda {\textbf{d}}) = f({\textbf{y}} + \lambda {\textbf{d}})\) for any \(\lambda > 1\).

  • Uniform citation: if the score vector \({\textbf{x}}\) is uniform, that is, \(x_1 = x_2 = \cdots = x_n\), then \(f({\textbf{x}}) = \sum _{i=1}^n x_i\).

  • Uniform equivalence: for any score vector \({\textbf{x}}\), there exists a uniform score vector \({\textbf{u}}\) dominated by \({\textbf{x}}\) such that \(f({\textbf{x}}) = f({\textbf{u}})\).

Monotonicity is a natural requirement, the achievement of a contestant with at least the same performance cannot be lower. Independence allows the addition of identical records for two competitors without changing their ranking. Depth relevance excludes the overall achievement to be maximised by spreading the number of points collected thinly across the races. In other words, this property rewards quality over quantity. Note that summing up the number of points does not satisfy depth relevance. According to scale invariance, multiplying the number of points by a positive scaling factor does not affect the order of two contestants. The h-index does not satisfy scale invariance (Perry and Reny 2016). Directional consistency means that if two tied competitors remain tied when their number of points are increased by the same amounts in decreasing order, they continue to remain equally ranked if this growth vector is multiplied by any positive constant. Uniform citation requires the overall achievement to be equal to the number of points when they are level across all races. Finally, according to uniform equivalence, the same number of points should be counted for each race in the overall achievement of a contestant.

An aggregation function f satisfies monotonicity, independence, depth relevance, scale invariance, and directional consistency if and only if it is equivalent to the Euclidean index (Perry and Reny 2016, Theorem 1). An aggregation function f meets monotonicity, uniform citation, and uniform equivalence if and only if it is the rectangle index (Levene et al. 2019, Theorem 4.1).

A stronger version of scale invariance is satisfied by both measures.

Proposition 1

The share of a competitor’s achievement in the total achievement under the Euclidean and the rectangle indices does not change if the scores are multiplied by the same positive number \(c > 0\).

Proof

The invariance is implied by \(E(c {\textbf{x}}) = c E({\textbf{x}})\) and \(R(c {\textbf{x}}) = c R({\textbf{x}})\) for any score vector \({\textbf{x}}\). \(\square \)

According to Proposition 1, the two approaches are influenced only by the ratio of the points to be scored in each race, but they do not depend on the absolute size of the points.

Consider a binary championship where there are only two types of positions, for example, a contestant either qualifies or does not qualify.

Proposition 2

The rankings of the competitors under the Euclidean and rectangle indices coincide in a binary championship.

Proof

Due to Proposition 1, it can be assumed without loss of generality that the points to be scored are 0 and 1. Consequently, the score vector of any competitor consists of k pieces of one and \(n-k\) pieces of zero. Then, \(E({\textbf{x}}) \le E({\textbf{y}})\) if and only if score vector \({\textbf{x}}\) does not contain more ones than score vector \({\textbf{y}}\). The same argument can be applied for the rectangle index. \(\square \)

The differences between the Euclidean and rectangle indices are highlighted by the fact that the latter measure fails to satisfy three properties used to characterise the former as the following examples show. However, the rectangle index remains scale invariant (Proposition 1) which is the main reason why we have chosen it instead of the h-index.

Fig. 1
figure 1

The rectangle index does not satisfy independence

Example 1

The rectangle index does not satisfy independence

Take the score vectors \({\textbf{x}} = \left[ 5, 1 \right] \) and \({\textbf{y}} = \left[ 3, 3 \right] \) of Players 1 and 2, plotted in Fig. 1.a. Since \(E({\textbf{x}}) = \sqrt{26} \approx 5.1\) and \(E({\textbf{y}}) = \sqrt{18} \approx 4.24\), the Euclidean index favours Player 1. Contrarily, \(R({\textbf{x}}) = 1 \times 5 = 5\) and \(R({\textbf{y}}) = 2 \times 3 = 6\), thus, the rectangle index prefers Player 2 to Player 1.

Consider a new, third race where both players score five points, that is, \({\textbf{x}}' = \left[ 5, 5, 1 \right] \) and \({\textbf{y}}' = \left[ 5, 3, 3 \right] \), as depicted in Fig. 1.b. Now,

$$\begin{aligned} E \left( {\textbf{x}}' \right) = \sqrt{51} \approx 7.14> E \left( {\textbf{y}}' \right) = \sqrt{43} \approx 6.56 \text {, and} \\ R \left( {\textbf{x}}' \right) = 2 \times 5 = 10 > R \left( {\textbf{y}}' \right) = 3 \times 3 = 9, \end{aligned}$$

hence, both measures rank Player 1 above Player 2. To summarise, the two players have the same performance in the additional race but their relative ranking is exchanged by the rectangle index. When Player 1 has two outstanding results of five points, this seems to be the rule rather than an exception as in the first case.

Fig. 2
figure 2

The rectangle index does not satisfy depth relevance

Example 2

The rectangle index does not satisfy depth relevance

Take the score vectors \({\textbf{x}} = \left[ 5, 2 \right] \) and \({\textbf{y}} = \left[ 3, 2, 2 \right] \) of Players 1 and 2, plotted in Fig. 2. Since \(R({\textbf{x}}) = 1 \times 5 = 5\) and \(R({\textbf{y}}) = 3 \times 2 = 6\) but both players have scored the same number of points in total (7), the rectangle index violates depth relevance. The rectangle index prefers better core performance after a race is split into two. On the other hand, \(E({\textbf{x}}) = \sqrt{29} \approx 5.39\) and \(E({\textbf{y}}) = \sqrt{17} \approx 4.12\), thus, the Euclidean index favours player 1.

Fig. 3
figure 3

The rectangle index does not satisfy directional consistency

Example 3

The rectangle index does not satisfy directional consistency

Take the score vectors \({\textbf{x}} = \left[ 3, 3, 0 \right] \) and \({\textbf{y}} = \left[ 2, 2, 2 \right] \) of Players 1 and 2, plotted in Figs. 3a, b, respectively. Add the score vector \({\textbf{d}} = \left[ 1, 0, 0 \right] \) to them. Since \(R({\textbf{x}}) = 2 \times 3 = 3 \times 2 = R({\textbf{y}})\) and \(R({\textbf{x}} + {\textbf{d}}) = 2 \times 3 = 3 \times 2 = R({\textbf{y}} + {\textbf{d}})\), directional consistency requires that \(R({\textbf{x}} + 4{\textbf{d}})\) equals \(R({\textbf{y}} + 4{\textbf{d}})\). But \(R({\textbf{x}} + 4{\textbf{d}}) = 1 \times 7 = 7\) and \(R({\textbf{y}} + 4{\textbf{d}}) = 1 \times 6 = 6\). The rectangle index disregards further results if the performance in one race becomes outstanding. On the other hand, the Euclidean index always prefers Player 1: \(E({\textbf{x}}) = \sqrt{18} > E({\textbf{y}}) = \sqrt{12}\), \(E({\textbf{x}} + {\textbf{d}}) = \sqrt{25} > E({\textbf{y}} + {\textbf{d}}) = \sqrt{17}\), and \(E({\textbf{x}} + 4{\textbf{d}}) = \sqrt{58} > E({\textbf{y}} + 4{\textbf{d}}) = \sqrt{44}\).

Table 1 Axiomatic comparison of two bibliometric indices

Table 1 summarises the axiomatic comparison of the two indices. They grab a different aspect of the contestant’s performance. The Euclidean index takes into account all points scored but rewards quality over quantity by summing up their squares. Contrarily, the rectangle index supports balanced achievements by ignoring unexpected peaks which are often due to mere luck or unusually favourable circumstances in sports.

The quantified achievements of the contestants can be used for at least two purposes: to measure their performance and to determine competitive balance in the championship. The first is revealed by the share of the total reward earned by a competitor. Competitive balance can be determined by evaluating the inequality in the achievements of the contestants, for example, according to the Herfindahl–Hirschman index or the Gini coefficient.

We do not say that any of the two approaches dominates the other. The choice should depend on the preferences of the decision-maker. However, if both measures indicate the same tendencies, this would be powerful evidence for a change in the corresponding direction. On the other hand, if the implications from the two underlying indices seem to contradict each other, one cannot be sure what happens in the competition.

3 Data and implementation

The UEFA Champions League, or simply the Champions League, is the most prestigious annual club football competition in Europe. The tournament format is fixed since the 2003/04 season (but will change from 2024/25 (UEFA 2022)): a round-robin group stage with eight groups of four—altogether 32—teams each such that the top two teams from each group qualify to the Round of 16 in the knockout phase.

The level of competition at the group stage is not very interesting because the Champions League slots are allocated in a deterministic way based on the ranking of UEFA member associations. For example, it is guaranteed that four clubs from the top four leagues of England, Germany, Italy, and Spain participate in the 2022/23 season. Csató (2022) provides more details on the Champions League qualification.

Tables 6 and 7 in the Appendix show all participants in the knockout stage of the Champions League over the 20 seasons played between 2003 and 2023. In each year, there is exactly one winner (W), while one team is eliminated in the final (F), two are eliminated in the semifinals (SF), four in the quarterfinals (QF), and eight in the Round of 16 (R16).

As all teams have a national affiliation and the qualification depends on the ranking of UEFA member countries, at least three types of competitive balance can be distinguished:

  • the overall competitive balance at the level of clubs;

  • the between-country competitive balance when only the national association of the teams is taken into consideration; and

  • the within-country competitive balance, that is, how stable is the set of teams coming from a given national association.

Analogously, the performances of clubs and countries are worth measuring separately.

Table 2 The weights assigned to the positions

They all can be quantified by the methodology proposed in Sect. 2. In particular, any season is regarded as a race, and weights are associated with each position (W, F, SF, QF, R16) in each season according to Table 2. Weighting method W1 strongly favours the top positions, but note that there are 16 teams in the Round of 16 compared to the unique winner. Weighting method W4 is extremal by counting only whether a team has qualified for the knockout stage. Weightings W2 and W3 represent a middle course between the other two. Milanovic (2005) uses a really sharp assignment of points to calculate Gini coefficients in the Champions League: 10 points for the winner, 6 points for the finalist, 2 points for the two semi-finalists, and 1 point for the four quarterfinalists.

Admittedly, our choice for the weights seems to be somewhat arbitrary. However, studying different sets of values and two bibliometric indices can indicate if the qualitative results are sensitive to this setting.

Both performance and competitive balance are assessed across subsequent seasons, hence an appropriate time window should be considered. As has been mentioned in the Introduction, a rolling 5-year window can be justified by the seeding policy of the UEFA.

The positions of team i (or country i) in a given period are collected into the score vector \({\textbf{x}}^{(i)}\) that is transformed into a real number \(f \left( {\textbf{x}}^{(i)} \right) \) by the Euclidean or rectangle index. After that, the “market share” of club i (\(1 \le i \le n\)) is calculated to directly obtain its performance:

$$\begin{aligned} s_i = \frac{f \left( {\textbf{x}}^{(i)} \right) }{\sum _{i=1}^n f \left( {\textbf{x}}^{(i)} \right) }, \end{aligned}$$
(1)

For competitive balance, the Herfindahl–Hirschman index (\( HHI \)) is computed as follows:

$$\begin{aligned} HHI = \sum _{i=1}^n s_i^2 = \sum _{i=1}^n \left( \frac{f \left( {\textbf{x}}^{(i)} \right) }{\sum _{i=1}^n f \left( {\textbf{x}}^{(i)} \right) } \right) ^2. \end{aligned}$$
(2)

A higher value of the \( HHI \) indicates a lower level of competitive balance.

Note that the minimum concentration is not zero in this hierarchical system since, e.g. there should be 16 teams in the Round of 16 in every year. However, this is not a problem because we will focus on relative changes.

Example 4

There are three Portuguese teams in our sample, Benfica, Porto, and Sporting CP. Let us concentrate on the five seasons from 2014/15 to 2018/19 and take the weighting scheme W2. The score vector of Benfica is \(\left[ 2,1,0,0,0 \right] \), the score vector of Porto is \(\left[ 2,2,1,1,0 \right] \), and the score vector of Sporting CP is \(\left[ 0,0,0,0,0 \right] \). Consequently, the Euclidean index of Benfica is \(E_B = \sqrt{5} \approx 2.24\), the rectangle index of Benfica is \(R_B = 1 \times 2 = 2\), the Euclidean index of Porto is \(E_P = \sqrt{10} \approx 3.16\), the rectangle index of Porto is \(R_B = 2 \times 2 = 4\), while both measures are zero for Sporting CP.

The within-country competitive balance is

$$\begin{aligned} \left( \frac{E_B}{E_B + E_P} \right) ^2 + \left( \frac{E_P}{E_B + E_P} \right) ^2 = \frac{5 + 10}{5 + 2 \sqrt{5} \sqrt{10} + 10} \approx 0.515 \end{aligned}$$

under the Euclidean index, and it is

$$\begin{aligned} \left( \frac{R_B}{R_B + R_P} \right) ^2 + \left( \frac{R_P}{R_B + R_P} \right) ^2 = \frac{4 + 16}{36} \approx 0.556 \end{aligned}$$

under the rectangle index.

The number of seasons and clubs considered determines the maximal possible number of “publications” to account for. Therefore, in the case of the rectangle index, it is advised to use weighting methods W2, W3, and W4 when there are only a few opportunities to score points, thus, there is no need to provide many points for the best positions. However, if the number of qualifications to the knockout stage can be high, the weighting methods W3 and W4 usually result in the same ranking of the clubs (countries) by simply counting the number of times the Round of 16 is reached.

Table 3 The total achievement of Porto over the 16 seasons

Example 5

Table 3 summarises the positions of Porto in the 20 Champions League seasons played between 2003 and 2023. Its achievement remains the same under weightings W2, W3, and W4 because the rectangle index is based on the 13 occasions when the club qualified to the Round of 16: the “natural” position of Porto is in the Round of 16. On the other hand, if weighting W1 is applied, the five participations in the quarterfinals will be counted.

Similarly, it would be misleading to use weighting method W1 with the Euclidean index because this implies that almost everything is determined by the top positions due to the squares in its mathematical formula.

Example 6

Consider Table 3. The Euclidean index of Porto under weighting W1 is \(\sqrt{16^2 + 4 \times 4^2 + 8 \times 1^2}\), that is, the influence of its single winning is 256 times larger than a qualification to the Round of 16. This seems to be unjustified.

4 Results

In the following, our empirical findings will be discussed. Section 4.1 overviews some papers that are connected to measuring competitive balance in European football, which is followed by the study of the club and country performances in Sect. 4.2 and the analysis of competitive balance in Sect. 4.3.

4.1 Related literature

Several papers have examined competitive balance in European football leagues (Koning 2000; Szymanski 2001; Goossens 2006). Pawlowski et al. (2010) investigate how the competitive situation in a domestic league is influenced by participation in the UEFA Champions League. Haan et al. (2012) study in a theoretical model how the growing importance of the Champions League and the increased international trade in talent affect competitive balance within national competitions and the quality differences between them. Their results reveal that the Champions League in itself reduces inequality, both nationally and internationally, except for the very small countries.

There exists more limited research on competitive balance in European competitions. Regarding the UEFA Champions League, Milanovic (2005) finds gradual deconcentration between 1963 and 1987, followed by a sharp reversal of this trend between 1988 and 2002. According to Koning (2009), success in the Champions League has become more persistent between the time windows 1980–1998 and 1999–2007, mainly due to allowing multiple teams per country to play. The author argues that competitive balance in European tournaments is better to analyse at the level of national associations because most fans will identify with a domestic club from their home country, even if it is not their favourite.

Plumley and Flint (2015) and Triguero-Ruiz and Avila-Cano (2023) study competitive balance in the group stage of the Champions League. Schokkaert and Swinnen (2016) confirm the result of Milanovic (2005): the same teams are more likely to qualify for the knockout stage of the Champions League compared to its predecessor European Cup prior to the 1992/93 season. However, the uncertainty of which club wins beyond this stage has increased in the competition. Bullough (2018) analyses representation, performance, and revenue distribution in the Champions League from the 2003/04 to the 2016/17 seasons and finds that the current structure significantly benefits a small proportion of clubs.

Ramchandani et al. (2023) examine the competitive balance of the UEFA Champions League before and after the implementation of the UEFA Financial Fair Play (FFP) regulations in 2011. Regarding the knockout stage, they simply calculate the number of unique clubs that have progressed to the knockout stage, as well as the number of occasions that clubs from a particular country played in the semifinals and the final, or won the tournament. Their analysis shows a decline in competitive balance in the post-FFP era. Even though it is also stated that the knockout stage has been dominated by clubs from the top five European leagues since 1992/93, this is not quantified by a particular metric.

Competitive balance may depend on many factors, including the score system (Haugen 2008), the scoring rate (Scarf et al. 2019), major rule changes (Kent et al. 2013), participation in other competitions (Moffat 2020), the tournament format (Csató 2020a, 2021; Lasek and Gagolewski 2018; McGarry and Schutz 1997; Scarf et al. 2009), the seeding procedure (Corona et al. 2019; Dagaev and Rudyak 2019), the number of competitors (Cairns 1987), as well as promotion and relegation (Noll 2002; Buzzacchi et al. 2003). The latter two, to some extent, are analogous to the changing set of Champions League participants between two subsequent seasons. Although these issues will not be discussed here, the consideration of the causes that govern the dynamics of competitive balance in the UEFA Champions League can be a promising direction for future research.

4.2 The performances of clubs and national associations

Table 4 Club performances between 2003 and 2023: the top five

First, we focus on the achievements of teams and countries. Table 4 shows the five highest-ranked clubs in the whole period from 2003/04 to 2022/23 if their performances are quantified by the two indices and the three weighting methods suggested. The top of the ranking is remarkably robust: the first three positions are almost always occupied by the Spanish clubs Barcelona and Real Madrid together with the German Bayern Munich. Real Madrid turns out to be the best team except for the rectangle index and weighting W3. They are followed by three English clubs: Arsenal, Chelsea, and Liverpool. While Arsenal often qualified for the Round of 16, its performance there remained modest, thus, it is favoured by a less sharp weighting. For instance, under the weighting scheme W2, Arsenal is only the 13th best team by the Euclidean index. Its achievement stands in stark contrast to the record of Liverpool, which was missing from the knockout stage of the Champions League in several seasons but achieved good results when qualifying for this phase.

Fig. 4
figure 4

Club performances in 5-year periods between 2003 and 2023. The year on the x-axis indicates the finishing year of the last season in the period, e.g. 2018 corresponds to the seasons from 2013/14 to 2017/18

Figure 4 compares the achievements of these six teams over the last five seasons in the corresponding year. According to all measures, the performance of Arsenal has gradually declined. Chelsea shows a similar decrease except for the last 5-year periods, while Real Madrid has been at its peak in 2018 since it has won in three subsequent seasons. However, note that formula (1) also depends on the achievements of the other teams. Barcelona (Bayern Munich) has been the best towards the beginning (middle) of our time window, and Liverpool was absent from the knockout stage of the UEFA Champions League between the seasons 2009/10 and 2016/17. Nonetheless, in this set of six clubs, it cannot be said that any team unambiguously dominated another one during the whole period considered.

Table 5 Country performances between 2003 and 2023: the top five

Figure 4 can be somewhat misleading as the worsening performance of Arsenal and Chelsea mainly reflects the high level of competition in the English Premier League: they have failed to qualify for the knockout stage of the Champions League because they could not have played in the group stage. Therefore, Table 5 turns to the comparison of national associations. The clubs of the five top leagues (England, France, Germany, Italy, Spain) clearly prevail in the Champions League. According to our measures, even their ranking is obvious except for the first place disputed by England and Spain, and the third position disputed by Germany and Italy.

Fig. 5
figure 5

Country performances in 5-year periods between 2003 and 2023. The year on the x-axis indicates the finishing year of the last season in the period, e.g. 2018 corresponds to the seasons from 2013/14 to 2017/18

However, Fig. 5 shows that clear trends can be observed in the achievements of the clubs from these leagues. In particular, the performance of English and Italian teams have decreased in the first half of the period, while German and Spanish clubs have become more competitive until the middle of the time window considered. However, the former dominance of Spanish clubs is threatened in recent years. In contrast to Fig. 4, the relative order of certain national associations seems to be robust: Spain is consistently better than Germany, England is consistently better than Italy, and France has the weakest league among the top five. Spain has probably outperformed Italy, too.

To conclude, the comparison of countries is easier than the comparison of individual clubs. While this is almost a straightforward implication since the fluctuations in the performances of teams are smoothed out at the level of national associations, it yields at least two important lessons. First, UEFA follows a reasonable policy by founding the Champions League qualification on the ranking of associations rather than the ranking of individual teams (Csató 2022). Second, the current use of UEFA club coefficients for seeding in the UEFA Champions League (Csató 2020b) and the second-tier competition UEFA Europa League is perhaps flawed; it would be better to draw the groups on the basis of country characteristics. For instance, Guyon (2015) recommends labelling all clubs by their finishing positions in their domestic leagues and quantifying their performances in this way instead of focusing separately on each club. This approach could be fairer for a team emerging from nil: even though it was one of the greatest sporting stories of all time when Leicester City became the champion in the English Premier League despite overwhelming odds (BBC 2016), the team had the second smallest UEFA club coefficient in the next season of the Champions League.

4.3 Competitive balance

Fig. 6
figure 6

Competition among the clubs in 5-year periods between 2003 and 2023. The year on the x-axis indicates the finishing year of the last season in the period, e.g. 2018 corresponds to the seasons from 2013/14 to 2017/18

Figure 6 presents the dynamics of the Herfindahl–Hirschman index over 5-year periods between 2003 and 2023 at the level of clubs. All six methods (three weighting schemes with two bibliometric indices) fluctuate without a clear trend.

Fig. 7
figure 7

Competition among the countries in 5-year periods between 2003 and 2023. The year on the x-axis indicates the finishing year of the last season in the period, e.g. 2018 corresponds to the seasons from 2013/14 to 2017/18

Figure 7 follows another approach by focusing on the competition between the countries. Except for the rectangle index under weighting W1, the other five measures uncover a deterioration of competitive balance at the level of national associations in the second half of the period.

Fig. 8
figure 8

Competition in 5-year periods between 2003 and 2023: the top five associations versus other countries. The year on the x-axis indicates the finishing year of the last season in the period, e.g. 2018 corresponds to the seasons from 2013/14 to 2017/18

Competitive balance can also be studied in the relation of big and small leagues. For instance, the 2019/20 season was the single one when only teams from the top five national associations qualified for the Round of 16. Indeed, Fig. 8 shows some evidence that the uncertainty of outcome has recently declined from this perspective.

Fig. 9
figure 9

Competition within the countries in 5-year periods between 2003 and 2023. The year on the x-axis indicates the finishing year of the last season in the period, e.g. 2018 corresponds to the seasons from 2013/14 to 2017/18

Finally, Fig. 9 analyses the third variant of competitive balance by focusing on the top five leagues, England (with 7 clubs in our dataset), France (6), Germany (10), Italy (8), and Spain (10). Here, there are some clear tendencies in the uncertainty of outcome. England and, recently, Italy have managed to improve competitive balance. The highest \( HHI \)s in France and Italy can be attributed to the dominance of one club, Juventus (the Italian champion between the seasons 2011/12 and 2019/20) and Paris Saint-Germain (the French champion since 2012/13 except for two seasons), respectively. While these observations concern the competitiveness of domestic leagues, it is highlighted from an unusual perspective by focusing on the international performance of the leading clubs.

5 Concluding remarks

This paper has aimed to demonstrate how bibliometric indices can be used to measure the level of competition in a knockout tournament. Although our approach involves the choice of two crucial variables, the scientometric index and the weighting method, most results for the UEFA Champions League seem to be remarkably robust concerning these parameters. Basically, neither the performance of clubs and countries nor the three types of competitive balance depend on them to a great extent.

Nevertheless, our results can be sensitive to the luck of the draw in the knockout stage: as the weighting system awards more points to the clubs as they progress in the tournament, uncertainty of outcome can decrease merely because strong clubs are drawn early against each other. This, together with the low scoring nature of the sport, might make the final result of a season relatively noisy. On the other hand, the public views only the outcome of the matches and is probably not satisfied if computer simulations do not show a deterioration in the level of competition but the same clubs almost always win in the real world.

The main findings of the work can be summarised as follows:

  • The performance of national associations varies less than the performance of individual clubs;

  • In the middle of the period considered, Spanish teams have gained prominence at the expense of English and Italian teams, but this trend has reversed in the last years;

  • Even though competitive balance has not changed at the level of clubs since 2003/04, there is some evidence for a recent decline at the level of countries, and, especially, in the relation of top five versus other leagues.

The relative stability of competitive balance in the last two decades supports the results of Schokkaert and Swinnen (2016) and can be explained by the qualification rules of the UEFA Champions League that allow multiple teams from the strongest leagues to compete. This movement partially contradicts the expected effects of free labour markets in European football, which should have increased the inequality of results among clubs (Milanovic 2005).

The current study may have important policy lessons for sports administrators. First, since the performances of national associations are more stable than the results of individual clubs, it would be better to build the seeding in the UEFA Champions League group stage upon association coefficients rather than club coefficients, accounting for the positions that the teams have achieved in their domestic leagues. This change would favour leagues where the competition within the country is stronger, and the set of teams qualifying for the Champions League is more volatile across the seasons. The methodology proposed here can even be used to determine the ranking of national associations for the UEFA access list.

Second, there is no clear relationship between the competitive balance in a national league and the competitiveness of its clubs in the UEFA Champions League: the aggregated performances of English and Italian teams have both declined in the middle of the period considered (Fig. 5), however, the competition within the countries has changed in the opposite way (Fig. 9). A possible explanation for the case of Italy can be that even though domination in the domestic league helps a club to buy better players, the decreased incentives to be efficient at the home front may be detrimental to its international performance.