Abstract
We argue for the application of bibliometric indices to quantify the long-term uncertainty of outcome in sports. The Euclidean index is proposed to reward quality over quantity, while the rectangle index can be an appropriate measure of core performance. Their differences are highlighted through an axiomatic analysis and several examples. Our approach also requires a weighting scheme to compare different achievements. The methodology is illustrated by studying the knockout stage of the UEFA Champions League in the 20 seasons played between 2003 and 2023: club and country performances as well as three types of competitive balance are considered. Measuring competition at the level of national associations is a novelty. All results are remarkably robust concerning the bibliometric index and the assigned weights. Since the performances of national associations are more stable than the results of individual clubs, it would be better to build the seeding in the UEFA Champions League group stage upon association coefficients adjusted for league finishing positions rather than club coefficients.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
“Nur wer mit geringen Mitteln Großes tut, hat es glücklich getroffen.Footnote 1”
1 Introduction
This paper aims to connect two seemingly distant fields of academic research: scientometrics and sports economics. However, the approach is not unique, Kóczy and Strobel (2010) and Lehmann and Wohlrabe (2017) actually do the reverse by applying sports ranking methods to journals.
Following the pioneering work of Garfield (1979), a plethora of measures have been suggested, tested, and debated to quantify the influence of a scholar (Ravallion and Wagstaff 2011; Glänzel and Moed 2013; Wildgaard et al. 2014). Most academics have probably met with the h-index during the evaluation of grant proposals. Nonetheless, there is no consensus on what is the best bibliometric index, mainly due to the need to balance quantity (represented by the number of papers written) and quality (represented by the number of citations to those papers). This dilemma has motivated several articles on the axiomatisation of scientometric measures which can greatly help our understanding of their characteristics (Marchant 2009; Bouyssou and Marchant 2010; Kóczy and Nichifor 2013; Bouyssou and Marchant 2014; de la Vega and Volij 2018).
The economic analysis of team sports is often centred around the issue of competitive balance since the celebrated article of Rottenberg (1956). Any sporting contest establishes a hierarchy among the competitors by differentiating between winners and losers. Therefore, uncertainty of outcome has at least three different dimensions (Buzzacchi et al. 2003): (a) short-term (whether the ex ante winning probabilities of the playing teams are equal in a match); (b) medium-term (whether all teams have the same chance to be the champion in a given season); (c) long-term (how volatile is the relative standing of different clubs across some seasons). Obviously, the measurement of competitive balance is much more than a simple academic problem. For example, several antitrust decisions are based on the claim that fan interest decreases when the uncertainty of outcome declines, thus, public interest demands to maintain competitive balance (Szymanski 2003).
While there are numerous attempts to quantify the level of competition in sports leagues (Zimbalist 2002; Manasis et al. 2013; Manasis and Ntzoufras 2014), there exists much less literature on how to measure it in knockout tournaments (del Corral 2009; Considine and Gallagher 2018). However, several popular sports championships—the tennis Grand Slam tournaments, the final stages of the FIFA World Cups and UEFA European Championships in (association) football, the playoffs in the NBA, NFL, NHL—are designed in this format. A natural expectation of both the fans and the regulators can be to avoid the dominance of the same players or teams across several seasons.
The current paper aims to quantify long-term uncertainty of outcome in such tournaments. The main challenge is similar to the central problem of bibliometrics: What is the trade-off between quantity and quality? Does a team obtain a greater market share when it has won the title once or when it has never won but qualified for the semifinals four times? A further difficulty of the measurement can be that the teams participating in a given year may not participate the year after (Koning 2009).
We offer a solution to these challenges by using bibliometric indices. In particular, the Euclidean index (Perry and Reny 2016) and the rectangle index (Fenner et al. 2018; Levene et al. 2019) will be considered with various weights assigned for the achievements of the teams, analogous to the number of times a paper has been referred to. An axiomatic comparison will be provided to highlight the differences between the two measures. The Euclidean index rewards top achievements and is always increased when the performance improves. On the other hand, the rectangle index focuses on core performance. For example, if a team usually qualifies for the semifinals, it does not count whether it is eliminated in the Round of 16 or the quarterfinals in the next year. Since both concepts can be justified, we do not promote any of them over the other.
Several papers quantify uncertainty of outcome between consecutive seasons by using sophisticated statistical techniques. While our approach is much simpler, that can help its spread among the stakeholders as the popularity of scientometric indices shows: it is usually more attractive—although might be potentially misleading—to express a complex phenomenon in sports (or any other field) with a single number. The suggested framework can be understood with limited methodological knowledge, and one can easily “open the black box” behind the results. Anyway, measuring trends over time in a novel manner compared to the prior literature is able to reinforce or deny well-established findings which is an important issue in academic research, even though the conclusions of the analysis do not fundamentally differ from the findings of the “standard” literature.
The theoretical overview is followed by a real-world application. The proposed methodology will be used first to quantify the performance of clubs and national associations in the recent seasons of the UEFA Champions League, the most prestigious European football tournament. In particular, a rolling five-season window is chosen because both the UEFA club and country coefficients—that measure the results of individual clubs (for seeding in the Champions League) and the member associations (for determining the access list of the Champions League), respectively—take into account the matches played over the past five seasons of European competitions. Milanovic (2005) also examines 5-year periods as reasonable over which concentration can be observed and calculated.
After that, three types of competitive balance will be analysed at the level of clubs, between the countries, and within the countries. It seems that previous studies have not discussed competition at the level of national associations in the UEFA Champions League. Therefore, they have failed to identify if different clubs from the same country have dominated the tournament. We can grab even this aspect of reduced competition.
The rest of the paper is organised as follows. Section 2 presents the two bibliometric indices and their properties. The UEFA Champions League data and the details of our application are discussed in Sect. 3, followed by a concise overview of previous literature on competitive balance in European football and the analysis of the results in Sect. 4. Finally, Sect. 5 concludes.
2 Methodology: bibliometric indices
This section presents the axiomatic comparison of two bibliometric indices that will be used to quantify competitive balance. While it does not contain any new results, it might be useful for readers who are not familiar with the principles behind these measures.
Consider a championship consisting of n races, where each position in each race earns some non-negative points for the competitors. The points scored by a contestant are collected into the score vector \({\textbf{x}} = \left[ x_1, x_2, \dots , x_n \right] \) that is sorted in descending order, i.e. \(x_i \ge x_j\) for all \(1 \le i < j \le n\). The achievement of a contestant in the whole championship is measured by an aggregation function f that maps score vectors to the set of non-negative real numbers.
The score vector is equivalent to a citation vector of a scholar, consequently, a bibliometric index can be used for its evaluation. In the case of competitive balance, most papers use the simple aggregation function \(f({\textbf{x}}) = \sum _{i=1}^n x_i\), essentially counting the number of points for each competitor. However, this is not necessarily an appropriate approach in a knockout tournament. We consider two other bibliometric indices, in particular, the Euclidean index (Perry and Reny 2016) and the rectangle index (Fenner et al. 2018; Levene et al. 2019).
The Euclidean index \(E({\textbf{x}})\) of a score vector \({\textbf{x}} = \left[ x_1, x_2, \dots , x_n \right] \) is the Euclidean norm of the score vector:
The rectangle index \(R({\textbf{x}})\) of a score vector \({\textbf{x}} = \left[ x_1, x_2, \dots , x_n \right] \) is the area of the largest rectangle that can fit under the score vector:
If the largest rectangle is a square, the rectangle index is equivalent to the well-known h-index (Hirsch 2005), that is, the maximal number h of races where the contestant scored at least h points in each of them.
In order to better understand these measures, it is worth using an axiomatic approach. Some reasonable requirements for aggregation functions are the following:
-
Monotonicity: if score vector \({\textbf{x}} = \left[ x_1, x_2, \dots , x_n \right] \) is dominated by score vector \({\textbf{y}} = \left[ y_1, y_2, \dots , y_n \right] \), that is, \(x_i \le y_i\) holds for all \(1 \le i \le n\), then \(f({\textbf{x}}) \le f({\textbf{y}})\).
-
Independence: if \(f({\textbf{x}}) \le f({\textbf{y}})\) and a new, \(n+1\)th race is added to the championship where both competitors score the same number of points, then the ordinal ranking of their achievements does not change.
-
Depth relevance: if score vector \({\textbf{x}}\) is modified such that a new, \(n+1\)th race is added and the number of points \(x_i\) in the ith race is split into two parts between the ith and \(n+1\)th races, then the overall achievement should decrease.
-
Scale invariance: \(f({\textbf{x}}) \le f({\textbf{y}}) \iff f(c{\textbf{x}}) \le f(c{\textbf{y}})\) for any \(c > 0\).
-
Directional consistency: if \(f({\textbf{x}}) = f({\textbf{y}})\) holds for the score vectors \({\textbf{x}}\) and \({\textbf{y}}\), and both of them are shifted by the same score vector \({\textbf{d}}\) such that \(f({\textbf{x}} + {\textbf{d}}) = f({\textbf{y}} + {\textbf{d}})\), then \(f({\textbf{x}} + \lambda {\textbf{d}}) = f({\textbf{y}} + \lambda {\textbf{d}})\) for any \(\lambda > 1\).
-
Uniform citation: if the score vector \({\textbf{x}}\) is uniform, that is, \(x_1 = x_2 = \cdots = x_n\), then \(f({\textbf{x}}) = \sum _{i=1}^n x_i\).
-
Uniform equivalence: for any score vector \({\textbf{x}}\), there exists a uniform score vector \({\textbf{u}}\) dominated by \({\textbf{x}}\) such that \(f({\textbf{x}}) = f({\textbf{u}})\).
Monotonicity is a natural requirement, the achievement of a contestant with at least the same performance cannot be lower. Independence allows the addition of identical records for two competitors without changing their ranking. Depth relevance excludes the overall achievement to be maximised by spreading the number of points collected thinly across the races. In other words, this property rewards quality over quantity. Note that summing up the number of points does not satisfy depth relevance. According to scale invariance, multiplying the number of points by a positive scaling factor does not affect the order of two contestants. The h-index does not satisfy scale invariance (Perry and Reny 2016). Directional consistency means that if two tied competitors remain tied when their number of points are increased by the same amounts in decreasing order, they continue to remain equally ranked if this growth vector is multiplied by any positive constant. Uniform citation requires the overall achievement to be equal to the number of points when they are level across all races. Finally, according to uniform equivalence, the same number of points should be counted for each race in the overall achievement of a contestant.
An aggregation function f satisfies monotonicity, independence, depth relevance, scale invariance, and directional consistency if and only if it is equivalent to the Euclidean index (Perry and Reny 2016, Theorem 1). An aggregation function f meets monotonicity, uniform citation, and uniform equivalence if and only if it is the rectangle index (Levene et al. 2019, Theorem 4.1).
A stronger version of scale invariance is satisfied by both measures.
Proposition 1
The share of a competitor’s achievement in the total achievement under the Euclidean and the rectangle indices does not change if the scores are multiplied by the same positive number \(c > 0\).
Proof
The invariance is implied by \(E(c {\textbf{x}}) = c E({\textbf{x}})\) and \(R(c {\textbf{x}}) = c R({\textbf{x}})\) for any score vector \({\textbf{x}}\). \(\square \)
According to Proposition 1, the two approaches are influenced only by the ratio of the points to be scored in each race, but they do not depend on the absolute size of the points.
Consider a binary championship where there are only two types of positions, for example, a contestant either qualifies or does not qualify.
Proposition 2
The rankings of the competitors under the Euclidean and rectangle indices coincide in a binary championship.
Proof
Due to Proposition 1, it can be assumed without loss of generality that the points to be scored are 0 and 1. Consequently, the score vector of any competitor consists of k pieces of one and \(n-k\) pieces of zero. Then, \(E({\textbf{x}}) \le E({\textbf{y}})\) if and only if score vector \({\textbf{x}}\) does not contain more ones than score vector \({\textbf{y}}\). The same argument can be applied for the rectangle index. \(\square \)
The differences between the Euclidean and rectangle indices are highlighted by the fact that the latter measure fails to satisfy three properties used to characterise the former as the following examples show. However, the rectangle index remains scale invariant (Proposition 1) which is the main reason why we have chosen it instead of the h-index.
Example 1
The rectangle index does not satisfy independence
Take the score vectors \({\textbf{x}} = \left[ 5, 1 \right] \) and \({\textbf{y}} = \left[ 3, 3 \right] \) of Players 1 and 2, plotted in Fig. 1.a. Since \(E({\textbf{x}}) = \sqrt{26} \approx 5.1\) and \(E({\textbf{y}}) = \sqrt{18} \approx 4.24\), the Euclidean index favours Player 1. Contrarily, \(R({\textbf{x}}) = 1 \times 5 = 5\) and \(R({\textbf{y}}) = 2 \times 3 = 6\), thus, the rectangle index prefers Player 2 to Player 1.
Consider a new, third race where both players score five points, that is, \({\textbf{x}}' = \left[ 5, 5, 1 \right] \) and \({\textbf{y}}' = \left[ 5, 3, 3 \right] \), as depicted in Fig. 1.b. Now,
hence, both measures rank Player 1 above Player 2. To summarise, the two players have the same performance in the additional race but their relative ranking is exchanged by the rectangle index. When Player 1 has two outstanding results of five points, this seems to be the rule rather than an exception as in the first case.
Example 2
The rectangle index does not satisfy depth relevance
Take the score vectors \({\textbf{x}} = \left[ 5, 2 \right] \) and \({\textbf{y}} = \left[ 3, 2, 2 \right] \) of Players 1 and 2, plotted in Fig. 2. Since \(R({\textbf{x}}) = 1 \times 5 = 5\) and \(R({\textbf{y}}) = 3 \times 2 = 6\) but both players have scored the same number of points in total (7), the rectangle index violates depth relevance. The rectangle index prefers better core performance after a race is split into two. On the other hand, \(E({\textbf{x}}) = \sqrt{29} \approx 5.39\) and \(E({\textbf{y}}) = \sqrt{17} \approx 4.12\), thus, the Euclidean index favours player 1.
Example 3
The rectangle index does not satisfy directional consistency
Take the score vectors \({\textbf{x}} = \left[ 3, 3, 0 \right] \) and \({\textbf{y}} = \left[ 2, 2, 2 \right] \) of Players 1 and 2, plotted in Figs. 3a, b, respectively. Add the score vector \({\textbf{d}} = \left[ 1, 0, 0 \right] \) to them. Since \(R({\textbf{x}}) = 2 \times 3 = 3 \times 2 = R({\textbf{y}})\) and \(R({\textbf{x}} + {\textbf{d}}) = 2 \times 3 = 3 \times 2 = R({\textbf{y}} + {\textbf{d}})\), directional consistency requires that \(R({\textbf{x}} + 4{\textbf{d}})\) equals \(R({\textbf{y}} + 4{\textbf{d}})\). But \(R({\textbf{x}} + 4{\textbf{d}}) = 1 \times 7 = 7\) and \(R({\textbf{y}} + 4{\textbf{d}}) = 1 \times 6 = 6\). The rectangle index disregards further results if the performance in one race becomes outstanding. On the other hand, the Euclidean index always prefers Player 1: \(E({\textbf{x}}) = \sqrt{18} > E({\textbf{y}}) = \sqrt{12}\), \(E({\textbf{x}} + {\textbf{d}}) = \sqrt{25} > E({\textbf{y}} + {\textbf{d}}) = \sqrt{17}\), and \(E({\textbf{x}} + 4{\textbf{d}}) = \sqrt{58} > E({\textbf{y}} + 4{\textbf{d}}) = \sqrt{44}\).
Table 1 summarises the axiomatic comparison of the two indices. They grab a different aspect of the contestant’s performance. The Euclidean index takes into account all points scored but rewards quality over quantity by summing up their squares. Contrarily, the rectangle index supports balanced achievements by ignoring unexpected peaks which are often due to mere luck or unusually favourable circumstances in sports.
The quantified achievements of the contestants can be used for at least two purposes: to measure their performance and to determine competitive balance in the championship. The first is revealed by the share of the total reward earned by a competitor. Competitive balance can be determined by evaluating the inequality in the achievements of the contestants, for example, according to the Herfindahl–Hirschman index or the Gini coefficient.
We do not say that any of the two approaches dominates the other. The choice should depend on the preferences of the decision-maker. However, if both measures indicate the same tendencies, this would be powerful evidence for a change in the corresponding direction. On the other hand, if the implications from the two underlying indices seem to contradict each other, one cannot be sure what happens in the competition.
3 Data and implementation
The UEFA Champions League, or simply the Champions League, is the most prestigious annual club football competition in Europe. The tournament format is fixed since the 2003/04 season (but will change from 2024/25 (UEFA 2022)): a round-robin group stage with eight groups of four—altogether 32—teams each such that the top two teams from each group qualify to the Round of 16 in the knockout phase.
The level of competition at the group stage is not very interesting because the Champions League slots are allocated in a deterministic way based on the ranking of UEFA member associations. For example, it is guaranteed that four clubs from the top four leagues of England, Germany, Italy, and Spain participate in the 2022/23 season. Csató (2022) provides more details on the Champions League qualification.
Tables 6 and 7 in the Appendix show all participants in the knockout stage of the Champions League over the 20 seasons played between 2003 and 2023. In each year, there is exactly one winner (W), while one team is eliminated in the final (F), two are eliminated in the semifinals (SF), four in the quarterfinals (QF), and eight in the Round of 16 (R16).
As all teams have a national affiliation and the qualification depends on the ranking of UEFA member countries, at least three types of competitive balance can be distinguished:
-
the overall competitive balance at the level of clubs;
-
the between-country competitive balance when only the national association of the teams is taken into consideration; and
-
the within-country competitive balance, that is, how stable is the set of teams coming from a given national association.
Analogously, the performances of clubs and countries are worth measuring separately.
They all can be quantified by the methodology proposed in Sect. 2. In particular, any season is regarded as a race, and weights are associated with each position (W, F, SF, QF, R16) in each season according to Table 2. Weighting method W1 strongly favours the top positions, but note that there are 16 teams in the Round of 16 compared to the unique winner. Weighting method W4 is extremal by counting only whether a team has qualified for the knockout stage. Weightings W2 and W3 represent a middle course between the other two. Milanovic (2005) uses a really sharp assignment of points to calculate Gini coefficients in the Champions League: 10 points for the winner, 6 points for the finalist, 2 points for the two semi-finalists, and 1 point for the four quarterfinalists.
Admittedly, our choice for the weights seems to be somewhat arbitrary. However, studying different sets of values and two bibliometric indices can indicate if the qualitative results are sensitive to this setting.
Both performance and competitive balance are assessed across subsequent seasons, hence an appropriate time window should be considered. As has been mentioned in the Introduction, a rolling 5-year window can be justified by the seeding policy of the UEFA.
The positions of team i (or country i) in a given period are collected into the score vector \({\textbf{x}}^{(i)}\) that is transformed into a real number \(f \left( {\textbf{x}}^{(i)} \right) \) by the Euclidean or rectangle index. After that, the “market share” of club i (\(1 \le i \le n\)) is calculated to directly obtain its performance:
For competitive balance, the Herfindahl–Hirschman index (\( HHI \)) is computed as follows:
A higher value of the \( HHI \) indicates a lower level of competitive balance.
Note that the minimum concentration is not zero in this hierarchical system since, e.g. there should be 16 teams in the Round of 16 in every year. However, this is not a problem because we will focus on relative changes.
Example 4
There are three Portuguese teams in our sample, Benfica, Porto, and Sporting CP. Let us concentrate on the five seasons from 2014/15 to 2018/19 and take the weighting scheme W2. The score vector of Benfica is \(\left[ 2,1,0,0,0 \right] \), the score vector of Porto is \(\left[ 2,2,1,1,0 \right] \), and the score vector of Sporting CP is \(\left[ 0,0,0,0,0 \right] \). Consequently, the Euclidean index of Benfica is \(E_B = \sqrt{5} \approx 2.24\), the rectangle index of Benfica is \(R_B = 1 \times 2 = 2\), the Euclidean index of Porto is \(E_P = \sqrt{10} \approx 3.16\), the rectangle index of Porto is \(R_B = 2 \times 2 = 4\), while both measures are zero for Sporting CP.
The within-country competitive balance is
under the Euclidean index, and it is
under the rectangle index.
The number of seasons and clubs considered determines the maximal possible number of “publications” to account for. Therefore, in the case of the rectangle index, it is advised to use weighting methods W2, W3, and W4 when there are only a few opportunities to score points, thus, there is no need to provide many points for the best positions. However, if the number of qualifications to the knockout stage can be high, the weighting methods W3 and W4 usually result in the same ranking of the clubs (countries) by simply counting the number of times the Round of 16 is reached.
Example 5
Table 3 summarises the positions of Porto in the 20 Champions League seasons played between 2003 and 2023. Its achievement remains the same under weightings W2, W3, and W4 because the rectangle index is based on the 13 occasions when the club qualified to the Round of 16: the “natural” position of Porto is in the Round of 16. On the other hand, if weighting W1 is applied, the five participations in the quarterfinals will be counted.
Similarly, it would be misleading to use weighting method W1 with the Euclidean index because this implies that almost everything is determined by the top positions due to the squares in its mathematical formula.
Example 6
Consider Table 3. The Euclidean index of Porto under weighting W1 is \(\sqrt{16^2 + 4 \times 4^2 + 8 \times 1^2}\), that is, the influence of its single winning is 256 times larger than a qualification to the Round of 16. This seems to be unjustified.
4 Results
In the following, our empirical findings will be discussed. Section 4.1 overviews some papers that are connected to measuring competitive balance in European football, which is followed by the study of the club and country performances in Sect. 4.2 and the analysis of competitive balance in Sect. 4.3.
4.1 Related literature
Several papers have examined competitive balance in European football leagues (Koning 2000; Szymanski 2001; Goossens 2006). Pawlowski et al. (2010) investigate how the competitive situation in a domestic league is influenced by participation in the UEFA Champions League. Haan et al. (2012) study in a theoretical model how the growing importance of the Champions League and the increased international trade in talent affect competitive balance within national competitions and the quality differences between them. Their results reveal that the Champions League in itself reduces inequality, both nationally and internationally, except for the very small countries.
There exists more limited research on competitive balance in European competitions. Regarding the UEFA Champions League, Milanovic (2005) finds gradual deconcentration between 1963 and 1987, followed by a sharp reversal of this trend between 1988 and 2002. According to Koning (2009), success in the Champions League has become more persistent between the time windows 1980–1998 and 1999–2007, mainly due to allowing multiple teams per country to play. The author argues that competitive balance in European tournaments is better to analyse at the level of national associations because most fans will identify with a domestic club from their home country, even if it is not their favourite.
Plumley and Flint (2015) and Triguero-Ruiz and Avila-Cano (2023) study competitive balance in the group stage of the Champions League. Schokkaert and Swinnen (2016) confirm the result of Milanovic (2005): the same teams are more likely to qualify for the knockout stage of the Champions League compared to its predecessor European Cup prior to the 1992/93 season. However, the uncertainty of which club wins beyond this stage has increased in the competition. Bullough (2018) analyses representation, performance, and revenue distribution in the Champions League from the 2003/04 to the 2016/17 seasons and finds that the current structure significantly benefits a small proportion of clubs.
Ramchandani et al. (2023) examine the competitive balance of the UEFA Champions League before and after the implementation of the UEFA Financial Fair Play (FFP) regulations in 2011. Regarding the knockout stage, they simply calculate the number of unique clubs that have progressed to the knockout stage, as well as the number of occasions that clubs from a particular country played in the semifinals and the final, or won the tournament. Their analysis shows a decline in competitive balance in the post-FFP era. Even though it is also stated that the knockout stage has been dominated by clubs from the top five European leagues since 1992/93, this is not quantified by a particular metric.
Competitive balance may depend on many factors, including the score system (Haugen 2008), the scoring rate (Scarf et al. 2019), major rule changes (Kent et al. 2013), participation in other competitions (Moffat 2020), the tournament format (Csató 2020a, 2021; Lasek and Gagolewski 2018; McGarry and Schutz 1997; Scarf et al. 2009), the seeding procedure (Corona et al. 2019; Dagaev and Rudyak 2019), the number of competitors (Cairns 1987), as well as promotion and relegation (Noll 2002; Buzzacchi et al. 2003). The latter two, to some extent, are analogous to the changing set of Champions League participants between two subsequent seasons. Although these issues will not be discussed here, the consideration of the causes that govern the dynamics of competitive balance in the UEFA Champions League can be a promising direction for future research.
4.2 The performances of clubs and national associations
First, we focus on the achievements of teams and countries. Table 4 shows the five highest-ranked clubs in the whole period from 2003/04 to 2022/23 if their performances are quantified by the two indices and the three weighting methods suggested. The top of the ranking is remarkably robust: the first three positions are almost always occupied by the Spanish clubs Barcelona and Real Madrid together with the German Bayern Munich. Real Madrid turns out to be the best team except for the rectangle index and weighting W3. They are followed by three English clubs: Arsenal, Chelsea, and Liverpool. While Arsenal often qualified for the Round of 16, its performance there remained modest, thus, it is favoured by a less sharp weighting. For instance, under the weighting scheme W2, Arsenal is only the 13th best team by the Euclidean index. Its achievement stands in stark contrast to the record of Liverpool, which was missing from the knockout stage of the Champions League in several seasons but achieved good results when qualifying for this phase.
Figure 4 compares the achievements of these six teams over the last five seasons in the corresponding year. According to all measures, the performance of Arsenal has gradually declined. Chelsea shows a similar decrease except for the last 5-year periods, while Real Madrid has been at its peak in 2018 since it has won in three subsequent seasons. However, note that formula (1) also depends on the achievements of the other teams. Barcelona (Bayern Munich) has been the best towards the beginning (middle) of our time window, and Liverpool was absent from the knockout stage of the UEFA Champions League between the seasons 2009/10 and 2016/17. Nonetheless, in this set of six clubs, it cannot be said that any team unambiguously dominated another one during the whole period considered.
Figure 4 can be somewhat misleading as the worsening performance of Arsenal and Chelsea mainly reflects the high level of competition in the English Premier League: they have failed to qualify for the knockout stage of the Champions League because they could not have played in the group stage. Therefore, Table 5 turns to the comparison of national associations. The clubs of the five top leagues (England, France, Germany, Italy, Spain) clearly prevail in the Champions League. According to our measures, even their ranking is obvious except for the first place disputed by England and Spain, and the third position disputed by Germany and Italy.
However, Fig. 5 shows that clear trends can be observed in the achievements of the clubs from these leagues. In particular, the performance of English and Italian teams have decreased in the first half of the period, while German and Spanish clubs have become more competitive until the middle of the time window considered. However, the former dominance of Spanish clubs is threatened in recent years. In contrast to Fig. 4, the relative order of certain national associations seems to be robust: Spain is consistently better than Germany, England is consistently better than Italy, and France has the weakest league among the top five. Spain has probably outperformed Italy, too.
To conclude, the comparison of countries is easier than the comparison of individual clubs. While this is almost a straightforward implication since the fluctuations in the performances of teams are smoothed out at the level of national associations, it yields at least two important lessons. First, UEFA follows a reasonable policy by founding the Champions League qualification on the ranking of associations rather than the ranking of individual teams (Csató 2022). Second, the current use of UEFA club coefficients for seeding in the UEFA Champions League (Csató 2020b) and the second-tier competition UEFA Europa League is perhaps flawed; it would be better to draw the groups on the basis of country characteristics. For instance, Guyon (2015) recommends labelling all clubs by their finishing positions in their domestic leagues and quantifying their performances in this way instead of focusing separately on each club. This approach could be fairer for a team emerging from nil: even though it was one of the greatest sporting stories of all time when Leicester City became the champion in the English Premier League despite overwhelming odds (BBC 2016), the team had the second smallest UEFA club coefficient in the next season of the Champions League.
4.3 Competitive balance
Figure 6 presents the dynamics of the Herfindahl–Hirschman index over 5-year periods between 2003 and 2023 at the level of clubs. All six methods (three weighting schemes with two bibliometric indices) fluctuate without a clear trend.
Figure 7 follows another approach by focusing on the competition between the countries. Except for the rectangle index under weighting W1, the other five measures uncover a deterioration of competitive balance at the level of national associations in the second half of the period.
Competitive balance can also be studied in the relation of big and small leagues. For instance, the 2019/20 season was the single one when only teams from the top five national associations qualified for the Round of 16. Indeed, Fig. 8 shows some evidence that the uncertainty of outcome has recently declined from this perspective.
Finally, Fig. 9 analyses the third variant of competitive balance by focusing on the top five leagues, England (with 7 clubs in our dataset), France (6), Germany (10), Italy (8), and Spain (10). Here, there are some clear tendencies in the uncertainty of outcome. England and, recently, Italy have managed to improve competitive balance. The highest \( HHI \)s in France and Italy can be attributed to the dominance of one club, Juventus (the Italian champion between the seasons 2011/12 and 2019/20) and Paris Saint-Germain (the French champion since 2012/13 except for two seasons), respectively. While these observations concern the competitiveness of domestic leagues, it is highlighted from an unusual perspective by focusing on the international performance of the leading clubs.
5 Concluding remarks
This paper has aimed to demonstrate how bibliometric indices can be used to measure the level of competition in a knockout tournament. Although our approach involves the choice of two crucial variables, the scientometric index and the weighting method, most results for the UEFA Champions League seem to be remarkably robust concerning these parameters. Basically, neither the performance of clubs and countries nor the three types of competitive balance depend on them to a great extent.
Nevertheless, our results can be sensitive to the luck of the draw in the knockout stage: as the weighting system awards more points to the clubs as they progress in the tournament, uncertainty of outcome can decrease merely because strong clubs are drawn early against each other. This, together with the low scoring nature of the sport, might make the final result of a season relatively noisy. On the other hand, the public views only the outcome of the matches and is probably not satisfied if computer simulations do not show a deterioration in the level of competition but the same clubs almost always win in the real world.
The main findings of the work can be summarised as follows:
-
The performance of national associations varies less than the performance of individual clubs;
-
In the middle of the period considered, Spanish teams have gained prominence at the expense of English and Italian teams, but this trend has reversed in the last years;
-
Even though competitive balance has not changed at the level of clubs since 2003/04, there is some evidence for a recent decline at the level of countries, and, especially, in the relation of top five versus other leagues.
The relative stability of competitive balance in the last two decades supports the results of Schokkaert and Swinnen (2016) and can be explained by the qualification rules of the UEFA Champions League that allow multiple teams from the strongest leagues to compete. This movement partially contradicts the expected effects of free labour markets in European football, which should have increased the inequality of results among clubs (Milanovic 2005).
The current study may have important policy lessons for sports administrators. First, since the performances of national associations are more stable than the results of individual clubs, it would be better to build the seeding in the UEFA Champions League group stage upon association coefficients rather than club coefficients, accounting for the positions that the teams have achieved in their domestic leagues. This change would favour leagues where the competition within the country is stronger, and the set of teams qualifying for the Champions League is more volatile across the seasons. The methodology proposed here can even be used to determine the ranking of national associations for the UEFA access list.
Second, there is no clear relationship between the competitive balance in a national league and the competitiveness of its clubs in the UEFA Champions League: the aggregated performances of English and Italian teams have both declined in the middle of the period considered (Fig. 5), however, the competition within the countries has changed in the opposite way (Fig. 9). A possible explanation for the case of Italy can be that even though domination in the domestic league helps a club to buy better players, the decreased incentives to be efficient at the home front may be detrimental to its international performance.
Notes
“Only he that does great things with small means has made a successful hit.” (Source: Carl von Clausewitz: On War, Book 7, Chapter 22—On the Culminating Point of Victory, translated by Colonel James John Graham, London, N. Trübner, 1873. http://clausewitz.com/readings/OnWar1873/TOC.htm).
References
BBC (2016) Leicester City win Premier League title after Tottenham draw at Chelsea, 2 May. https://www.bbc.com/sport/football/35988673
Bouyssou D, Marchant T (2010) Consistent bibliometric rankings of authors and of journals. J Informet 4(3):365–378
Bouyssou D, Marchant T (2014) An axiomatic approach to bibliometric rankings and indices. J Informet 8(3):449–477
Bullough S (2018) UEFA champions league revenues, performance and participation 2003–2004 to 2016–2017. Manag Sport Leis 23(1–2):139–156
Buzzacchi L, Szymanski S, Valletti TM (2003) Equality of opportunity and equality of outcome: open leagues, closed leagues and competitive balance. J Ind Compet Trade 3(3):167–186
Cairns JA (1987) Evaluating changes in league structure: the reorganization of the Scottish Football League. Appl Econ 19(2):259–275
Considine J, Gallagher L (2018) Competitive balance in a quasi-double knockout tournament. Appl Econ 50(18):2048–2055
Corona F, Forrest D, Tena JD et al (2019) Bayesian forecasting of UEFA Champions League under alternative seeding regimes. Int J Forecast 35(2):722–732
del Corral J (2009) Competitive balance and match uncertainty in Grand-Slam tennis: effects of seeding system, gender, and court surface. J Sports Econ 10(6):563–581
Csató L (2020) Optimal tournament design: Lessons from the men’s handball Champions League. J Sports Econ 21(8):848–868
Csató L (2020) The UEFA Champions League seeding is not strategy-proof since the 2015/16 season. Ann Oper Res 292(1):161–169
Csató L (2021) A simulation comparison of tournament designs for the World Men’s Handball Championships. Int Trans Oper Res 28(5):2377–2401
Csató L (2022) UEFA against the champions? An evaluation of the recent reform of the Champions League qualification. J Sports Econ 23(8):991–1016
Dagaev D, Rudyak V (2019) Seeding the UEFA Champions League participants: evaluation of the reform. J Quant Anal Sports 15(2):129–140
Fenner T, Harris M, Levene M et al (2018) A novel bibliometric index with a simple geometric interpretation. Plos ONE 13(7):e0200098
Garfield E (1979) Is citation analysis a legitimate evaluation tool? Scientometrics 1(4):359–375
Glänzel W, Moed HF (2013) Opinion paper: thoughts and facts on bibliometric indicators. Scientometrics 96(1):381–394
Goossens K (2006) Competitive balance in European football: comparison by adapting measures: national measure of seasonal imbalance and Top 3. Rivista di Diritto ed Economia dello Sport 2(2):77–122
Guyon J (2015) Champions League: How to solve the seeding problem. The New York Times 21 January. https://www.nytimes.com/2015/01/22/upshot/champions-league-how-to-solve-the-seeding-problem.html
Haan M, Koning R, van Witteloostuijn A (2012) The effects of institutional change in European soccer. Jahrbücher für Nationalökonomie und Statistik 232(3):318–335
Haugen KK (2008) Point score systems and competitive imbalance in professional soccer. J Sports Econ 9(2):191–210
Hirsch JE (2005) An index to quantify an individual’s scientific research output. Proc Natl Acad Sci 102(46):16569–16572
Kent RA, Caudill SB, Mixon FG Jr (2013) Rules changes and competitive balance in European professional soccer: evidence from an event study approach. Appl Econ Lett 20(11):1109–1112
Kóczy LA, Nichifor A (2013) The intellectual influence of economic journals: quality versus quantity. Econ Theor 52(3):863–884
Kóczy LÁ, Strobel M (2010) The world cup of economics journals: a ranking by a tournament method. IEHAS Discussion Papers 1018, Institute of Economics, Hungarian Academy of Sciences
Koning RH (2000) Balance in competition in Dutch soccer. J R Stat Soc Ser D (The Stat) 49(3):419–431
Koning RH (2009) Sport and measurement of competition. De Econ 157(2):229–249
Lasek J, Gagolewski M (2018) The efficacy of league formats in ranking teams. Stat Model 18(5–6):411–435
Lehmann R, Wohlrabe K (2017) Who is the ‘Journal Grand Master’? A new ranking based on the Elo rating system. J Informet 11(3):800–809
Levene M, Fenner T, Bar-Ilan J (2019) Characterisation of the \(\chi \)-index and the \(rec\)-index. Scientometrics 120(2):885–896
Manasis V, Ntzoufras I (2014) Between-seasons competitive balance in European football: review of existing and development of specially designed indices. J Quant Anal Sports 10(2):139–152
Manasis V, Avgerinou V, Ntzoufras I et al (2013) Quantification of competitive balance in European football: development of specially designed indices. IMA J Manag Math 24(3):363–375
Marchant T (2009) An axiomatic characterization of the ranking based on the h-index and some other bibliometric rankings of authors. Scientometrics 80(2):325–342
McGarry T, Schutz RW (1997) Efficacy of traditional sport tournament structures. J Oper Res Soc 48(1):65–74
Milanovic B (2005) Globalization and goals: Does soccer show the way? Review of International Political Economy 12(5):829–850
Moffat J (2020) The impact of participation in pan-European competition on domestic performance in association football. Eur Sport Manag Q 20(4):440–457
Noll RG (2002) The economics of promotion and relegation in sports leagues: the case of English football. J Sports Econ 3(2):169–203
Pawlowski T, Breuer C, Hovemann A (2010) Top clubs’ performance and the competitive situation in European domestic football competitions. J Sports Econ 11(2):186–202
Perry M, Reny PJ (2016) How to count citations if you must. Am Econ Rev 106(9):2722–2741
Plumley D, Flint SW (2015) The UEFA Champions League: Maintaining the status quo? Team Perform Manage 21(5/6):247–258
Ramchandani G, Plumley D, Mondal S et al (2023) ‘You can look, but don’t touch’: competitive balance and dominance in the UEFA Champions League. Soccer Soc 24(4):479–491
Ravallion M, Wagstaff A (2011) On measuring scholarly influence by citations. Scientometrics 88(1):321
Rottenberg S (1956) The baseball players’ labor market. J Polit Econ 64(3):242–258
Scarf P, Yusof MM, Bilbao M (2009) A numerical study of designs for sporting contests. Eur J Oper Res 198(1):190–198
Scarf P, Parma R, McHale I (2019) On outcome uncertainty and scoring rates in sport: the case of international rugby union. Eur J Oper Res 273(2):721–730
Schokkaert J, Swinnen J (2016) Uncertainty of outcome is higher in the Champions League than in the European Cup. J Sports Econ 17(2):115–147
Szymanski S (2001) Income inequality, competitive balance and the attractiveness of team sports: some evidence and a natural experiment from English soccer. Econ J 111(469):F69–F84
Szymanski S (2003) The economic design of sporting contests. J Econ Lit 41(4):1137–1187
Triguero-Ruiz F, Avila-Cano A (2023) On competitive balance in the group stage of the UEFA Champions League. Scot J Polit Econ 70(3):231–248
UEFA (2022) UEFA approves final format and access list for its club competitions as of the 2024/25 season, 10 May. https://www.uefa.com/returntoplay/news/0275-151c779310c3-b92bbf0d24f9-1000--uefa-approves-final-format-and-access-list-for-its-club-competi/
de la Vega CL, Volij O (2018) Ranking scholars: a measure representation. J. Informet. 12(2):510–517
Wildgaard L, Schneider JW, Larsen B (2014) A review of the characteristics of 108 author-level bibliometric indicators. Scientometrics 101(1):125–158
Zimbalist AS (2002) Competitive balance in sports leagues: An introduction. J Sports Econ 3(2):111–121
Acknowledgements
László Csató, the father of the first author has helped in carrying out the calculations. Six anonymous reviewers provided valuable comments and suggestions on an earlier draft. We are grateful to the Wikipedia community for collecting and structuring invaluable information on the sports tournaments discussed.
Funding
Open access funding provided by Corvinus University of Budapest. The research was supported by the National Research, Development and Innovation Office under Grants FK 145838 and PD 146055.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflicts of interest
The authors declare no conflicts of interest.
Human participants or animals
This article does not contain any studies with human participants or animals performed by any of the authors.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Csató, L., Petróczy, D.G. Bibliometric indices as a measure of performance and competitive balance in the knockout stage of the UEFA Champions League. Cent Eur J Oper Res 32, 961–988 (2024). https://doi.org/10.1007/s10100-023-00896-8
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10100-023-00896-8