1 Introduction

Soccer is undoubtedly one of the most popular sports competitions. In several regional contexts, it presents not only entertainment features but also socio-economic implications (see, e.g. Galariotis et al., 2018 for joint treatment of sport, business and financial performance of soccer teams, or the older contribution Neale, 1964). Such a relevance justifies the increasing data scientists attention and efforts in analysing and creating data sets related to soccer. A few examples are here discussed. In Hughes et al. (2012), the authors identify data-driven performance indicators for the different roles taken by the soccer players. An interesting review of the promising contributions of data science in the field is reported in Rein and Memmert (2016), where the complexity of the data management phases is also acknowledged. In the conclusive remarks of such a paper, one reads that “future soccer research will have to embrace a stronger multi-disciplinary approach”. The suggestion surely includes the idea of collecting and analysing data about players (e.g., in-game performances). In this respect, Filetti et al. (2017)’s authors monitor the technical–tactical and physical efficiency of Italian “Serie A” season 2013–2014’s players through a semi-automatic video analysis system. In Memmert et al. (2017), one can find an overview of the developments in the analysis of players’ positional data in 11 matches between Bayern Munich and FC Barcelona.

Interestingly, the recent systematic literature review presented in Goes et al. (2021) outlines challenges and successes of big data potentialities to support tactical performance. In this respect, some contributions propose a data science-based analysis of the constellation of the position of the ball and the players during a match for guessing the probability of scoring a goal (see, e.g. Link et al., 2016; Ric et al., 2017). An analogous perspective is presented in Gonçalves et al. (2017), where one can find the complex networks-based analysis of the interactions among the players during a match. In a very different context, Frick et al. (2010) analyse the probability of the so-called “premature termination of a contract” of teams’ head coach by employing a logit approach. In the context of the methodological tools used for dealing with data complexity, we mention Hassan et al. (2020), where the authors use neural networks to predict matches’ outcomes.

This paper enters the debate on data science for the scientific exploration of football by providing a detailed analysis of the Italian Football Championships, which is undoubtedly one of the most relevant football contexts in the EU under different perspectives. For example, regarding the economic relevance, we quote Kennedy and Kennedy (2012), where the authors wrote, “The first decade of the twenty-first century has witnessed market growth from €8 billion per annum in revenue to almost €16 billion per annum, powered by the so-called ‘big five’ leagues in England, Germany, Spain, Italy and France”. Furthermore, by looking at the attendance data in Worldfootball.net (2021b), one notice that Italy had 9,590,166 spectators in 2018/2019, scoring a place in the top ten’s leagues in the world.

In line with Goossens et al. (2012)—where the authors analyse and compare different regulations for the Belgian league championships – we here compare the official historical rankings of the Italian championships between 1930 and 2020 (by removing the possible presence of penalties, as we will see below) with the cases of two-points score and three-points score for the winning team in a match. In this respect, we enter the debate raised by Csató (2020), where the author deals with the rules of the UEFA Champion League by advancing doubts on their fairness. We also mention Cea et al. (2020), where the authors discuss the procedure employed by FIFA for ranking the national team. In this context, they present proper modifications of such a procedure for overcoming some inconsistencies of the ranking rules.

In entering the debate by means of a rank-size analysis, we are close to some relevant contributions in the literature. For example, Ausloos (2014a)’s author assesses the Union of European Football Associations (UEFA) affiliated teams performance between 2009 and 2014 with—among others—the Discrete Generalised Beta Distribution, a three parameters ranks-size law (see Mansilla et al., 2007). Specifically, Ausloos studies the yearly ranking of the teams as a function of their UEFA coefficient. Again on the UEFA data, we also mention Ausloos et al. (2014b) in which the authors discuss the possibility of finding dissipative structures, as in open systems acquiring (and losing) energy. In Ausloos et al. (2014a), the authors present the rank-size relationships for the International Federation of Association Football (FIFA) and UEFA rankings to assess ranking differences in terms of the FIFA’s and UEFA’s coefficients. More recently, Yoon and Sedaghat (2020)’s authors use the rank-size law to fit the ranked attendance data for the games in Major League Baseball (MLB); National Baseball Association (NBA), National Football League (NFL), and National Hockey League (NHL). Differently, in Malacarne and Mendes (2000), the authors state that “the goal distribution by goal-players is connected with an anomalous decay related to the Zipf-Mandelbrot law” in the main football league from Italy, England, Spain and Brazil and for the specific case of a couple of championships. The informative content of the rank-size laws is well illustrated in Rimmer and Johnston (1967), where a visual inspection of the rank-size relationship between cities’ population and their ranks in Australia is used to demonstrate the Victorian Football League’s influence beyond the boundaries of the state (see Fig. 2 in the quoted paper).

This paper investigates the teams and the championships in the Italian league by assessing the relationship between teams’ final ranks and scores. To this aim, we analyse all the Italian Football “Serie A” championships between 1930-2020. We use the final rankings and the teams’ scores to assess the rank-size regimes at the yearly championships level. Moreover, we implement an individual teams-based analysis by ranking their performance over the years and using the rank-size law’s parameters to compare their performances across the tournaments. So, in a nutshell, we analyse the presence of comparable features by using rank-size law’s parameters which have different interpretations when comparisons are made between championships or teams (from now on, in this paper, we refer to the analysis done per championship with the locution “by year”, and to the one done per team by means of “by team”). In this respect, the rank-size analysis allows creating a unified system based on the disaggregated data, hence pointing to the global features of the relationship between the ranked data in the light of the related scores. We also propose a cluster analysis of the rank-size laws based on a k-means algorithm to provide additional insights and capture similarities and deviations among championships and teams. Finally, we follow the approach used in Ficcadenti and Cerqueti (2017) to propose a measure of competitiveness within championships and by the team.

In line with the mentioned literature, we tested different laws; first of all, the Zipf-Mandelbrot one (ZML hereafter) presented in Mandelbrot (1953, 1961), as a generalization of the Zipf’s law Zipf (1949, 1935); then—and more satisfactorily, under a statistical perspective—we tested the Discrete Generalised Beta Distribution (DGBD hereafter). The range of applications where these laws play a crucial role is wide. Among the others, we mention Ficcadenti and Cerqueti (2017) where the ZML has been used to estimate the economic cost of earthquakes, Ficcadenti et al. (2019, 2020) for their investigation of rank-size relationships in corpora, Cerqueti and Ausloos (2015) where the Italian cities tax income distribution analysis is run through a rank-size approach, Dimitrova and Ausloos (2015), where the Bulgarian Urban system is studied across years using rank-size laws and Rotundo (2014); Ausloos (2013, 2014b) for their applications in the field of scientometrics. Referring to sport, it is worth mentioning Ausloos (2020) where the rank-size law of the official gains for bicycle teams in Tour de France is studied.

In our case, the Discrete Generalised Beta Distribution presents outstanding best fit performances. The analysis points to connections of the estimated curves’ parameters with the years, the rules in place for assigning points and teams’ presence in the “Serie A”. Besides, the clusterization of these parameters via the famous k-means algorithm (see Jain et al., 1999, for a detailed description of this method in the framework of the clustering procedures) highlights regularities and deviations in the characteristics of championships and teams—on the basis of the interpretation of the calibrated parameters. Such findings point the attention also to stylized facts related to soccer and its surrounding socio-economic environments, such as the relationship between competitiveness and economic capacity of the teams involved in the “Serie A” championships. In this light, we introduce and also discuss a competitiveness indicator of the teams at the individual championship level by considering the relative relevance of the sizes at high and low ranks.

The rest of the paper is organized as follows. Section 2 contains a description of the considered datasets. Section 3 contains a description of the employed methodology, with the statement of the rank-size laws; moreover, the implementation of the cluster analysis is also presented, along with the aggregation leading to the measurement of the competitiveness at low and high ranks. Section 4 presents the findings of the study critically, with a related discussion. Section 5 offers some conclusive remarks. We have relegated long tables on the explored datasets in the “Appendix”.

2 The datasets

This study puts together information regarding Italian football results collected from Worldfootball.net (2021a). Specifically, we download the final scores of the top Italian football leagues, whose winner is awarded the “Scudetto”. Namely, we have downloaded the final points obtained by each team competing in the so-called “Serie A”.

The dataset covers 88 championships, namely all those played between 1930 and 2020 (we indicate the year in which the challenge ends). Table 4 (in the appendix) contains a statistical summary of the final points scored per each championship. It is interesting to notice that the number of admitted teams to the “Serie A” has changed over the years. The season-ending in 1930 was the first to adopt the single group formula, with home and away matches. This regulation has not undergone any change except for the number of teams, mainly held with 16, 18, 20 or 21 teams. The number of teams presents in the league changed :

  • From 1930 till 1934, 18 teams

  • From 1935 till 1943, 16 teams

  • 1947, 20 teams

  • 1948, 21 teams

  • From 1949 till 1952, 20 teams

  • From 1953 till 1967, 18 teams

  • From 1968 till 1988, 16 teams

  • From 1989 till 2004, 18 teams

  • From 2005 till now 20 teams

In Table 5 (in the appendix) one finds a statistical summary of the scores obtained by each team at the end of each championship in which games have been played.Footnote 1

A few interventions on the raw data are needed to homogenise information across the years. In fact, on the website, the final results are reported in tables that contain the ranking of the teams at the end of the tournament, the teams’ names, the score (number of points) obtained by each team, the number of played, won, drawn and lost matches, scored and conceded goals along with their difference. Sometimes, score reductions are imposed by law; they penalise teams for specific illegal behaviours of managers and players. We create an additional column to incorporate penalisations and deduct the penalisation points from the reported scores. Some cases have been treated carefully in that penalisations might be responsible for relevant biases on the final ranking of the championships. For example, in 2006, Juventus sits at the last position, even if it has the highest score in the championship; such a severe penalisation is a consequence of a legal decision after the scandal Calciopoli, Commissione d’Appello Federale—Federazione Italiana Giuoco Calcio (2006). In this circumstance, we set the final score to zero but incorporate such information in the penalties’ column, where the official score is saved and considered as the magnitude of the sanction. So, for the case of Juventus in 2006, we discount a penalty of -91, which was the whole scored level without penalisation.

A methodological note is now needed. For each championship, each team’s final score is given by the sum of the number of the drawn matches and h times the number of the won matches. The value of h was 2 before 1994; after this year, due to a change of regulation, h was set to 3—as it is currently.

The analysis described in the next section is applied to three instances here identified with Pt, \(Pt_2\) and \(Pt_3\):

  • Pt is the case where the points scored by the teams in a year (Y) incorporate penalties and/or relegation deriving from legal disputes outcomes (as per the description above).

  • \(Pt_2\) regards the analysis made on the ranking that the teams would have reached if the points assigned for winning a match were still 2, as it was before 1994. Namely, the points are assigned using the old rule for all the tournaments after the one finished in 1994.

  • \(Pt_3\) regards the analysis on the ranking that the teams would have reached if the points assigned for winning a match had been 3, as it currently is. So, all the points obtained before 1994 are assigned using the current rule.

For an overview of the three considered instances at a championship and team level, see Tables 4 and 5.

3 Methodology

We run rank-size investigations on the Italian football datasets Pt, \(Pt_2\) and \(Pt_3\) described in the previous section. First, we provide the analysis at the championship level, namely an analysis of the tournaments disputed between 1930 and 2020. Per championship, the final points obtained by the teams are ranked in descending order and, the highest outcome is associated with the rank one, while the lowest one has the largest rank, corresponding to the number of teams competing in that specific championship. The case of the tournament ending in 1937 (instance Pt) is reported in Fig. 1a as an illustrative example. At this stage one should focus on the black dots only; in that year Bologna has won the “Scudetto” and Alessandria was the last in the final ranking. On the x and y-axis, the final ranks and the total points scored are respectively reported. Second, we analyse the datasets at the team level by taking the teams that have competed in the “Serie A” for a large enough number of the considered championships. To this aim, each team’s score obtained in each tournament is ranked in descending order. The highest result obtained among the championships disputed by the considered team has a rank equal to one. In contrast, the team’s lowest score is associated with the highest rank, corresponding to the number of tournaments disputed between 1930 and 2020. Specific teams have participated in a few championships over the years; therefore, they do not have enough observations to make a robust rank-size fit. Thus, as announced above, we set a threshold. Namely, the teams that have competed less than 16 times in the “Serie A” are dropped from the dataset, so the rank-size analysis is not performed on them. Such a threshold allows a meaningful implementation of the rank-size law’s best fits. The case of the SPALFootnote 2 (instance Pt) is presented in Fig. 1b as another illustrative example. SPAL has had its best performance in 2019 scoring 42 points and its worse in 2020 with 20 points. In the following, we refer to r as the team’s rank whose score is the size z.

In line with the aforementioned literature, a first best-fit tentative is done with the ZML rank-size curve (Mandelbrot, 1953, 1961) here reported for reference:

$$\begin{aligned} z = \frac{\phi }{(\theta + r)^\xi }, \end{aligned}$$
(1)

where \(\phi \), \(\theta \) and \(\xi \) are the non-negative parameters to be calibrated. However, the best-fit exercise through Eq. (1) does not provide statistically sounding results, that are then not shownFootnote 3

For all the considered datasets, scores and related ranks are much better represented by the DGBD (see, e.g., Naumis and Cocho, 2008; Martínez-Mekler et al., 2009), which exhibits outstanding capacity of fitting the championships outcomes, with high values of the goodness-of-fit parameters. The formulation of such a law is

$$\begin{aligned} z = \frac{\alpha (R + 1 - r)^\beta }{r^\gamma }, \end{aligned}$$
(2)

where \(\alpha , \beta \) and \(\gamma \) are the non-negative parameters to be calibrated, and \(R = \max (r)\) over the considered data sample.

The application of Eq. (2) and the mentioned data pre-processing lead to a dataset made of 28 teamsFootnote 4 (see Table 5 in the appendix).

The DGBD-based analysis at championship and team levels generate two sets of triplets representing the estimated \(\alpha , \beta \) and \(\gamma \), i.e. \({\hat{\alpha }}, {\hat{\beta }}\) and \({\hat{\gamma }}\), via the Trust Region Reflective algorithm (Branch et al., 1999) applied with boundaries conditions on the parameters, that are forced to be positive. Furthermore, a “brute-force” procedure is deployed to avoid local minima in estimating. A broad grid of starting values feeds the trust Region Reflective algorithm so that the \({\hat{\alpha }}, {\hat{\beta }}\) and \({\hat{\gamma }}\) minimizing the Root Mean Square Error (RMSE) are taken as starting points for the final estimation run (see Ficcadenti and Cerqueti, 2017, where a similar process is used).

The two sets of \({\hat{\alpha }}, {\hat{\beta }}\) and \({\hat{\gamma }}\) are stored in two tables, one for the estimations when year by year results are considered and another for the team by team analysis. Each of these reports the three instances Pt, \(Pt_2\) and \(Pt_3\).

As a further investigation and for all the considered instances, we cluster championships and teams through the obtained triplets. In particular, we employ a k-means clustering approach (see, e.g. Baker et al., 2020), which is undoubtedly one of the most popular clustering methods (for clustering, the calibrated DGBD parameters are suitably standardized). Through it, we want to identify two regimes in scored points distributions along with each championship and each team’s history. Therefore, we are interested in having \(k = 2\) to capture best and worst-performing teams in championships (e.g., first and last four teams for each championship), and teams’ glorious and undistinguished moments characterised by the best and worse years in terms of scored points. Signs of two regimes are found in the distributions of the estimated parameters, as we point out in Sect. 4. To ensure that \(k=2\) is a suitable choice, we make more formal consideration calculating the common Silhouette index (D’Urso and Maharaj, 2012; Kaufman and Rousseeuw, 2009) on each clustering by varying k, and we report the values in Table 1. The results allow to conclude in favour of the choice \(k=2\) apart for the case \(Pt_3\), when the analysis team by team is run. Only in that instance, \(k=3\) is favoured, but the difference is not material and the Silhouette value is still in an acceptable range for \(k=2\). Therefore, the employed clustering algorithm selects the two clusters’ centroids

$$\begin{aligned} \mu _1=(\alpha ^{(1)}, \beta ^{(1)}, \gamma ^{(1)}), \qquad \mu _2=(\alpha ^{(2)}, \beta ^{(2)}, \gamma ^{(2)}) \end{aligned}$$

that minimise the within-clusters sum-of-squares criterion:

$$\begin{aligned} \sum _{i=1}^{n}\min _{\mu _j \in {\mathbb {R}}^3:j = \{1,2\}}(||x_i - \mu _j||^2) \end{aligned}$$
(3)

where \(x_i=({\hat{\alpha }}_i,{\hat{\beta }}_i,{\hat{\gamma }}_i,)\) is the triple of the estimated parameters \(\alpha , \beta , \gamma \) in Eq. (2) for the \(i^{th}\) element of the considered sample, n is the cardinality of the sample and \(||x-\mu ||\) is the Euclidean distance between the three-dimensional vectors x and \(\mu \). More specifically, we use “k-means++” from the Python’s Scikit-learn (Pedregosa et al., 2011) for its capacity in minimizing the chances of getting into local minima through the optimization procedure, see Arthur and Vassilvitskii (2006). We here report a summary of the algorithm that can be summarised in three steps. The first one is to choose the initial \(\mu _1\) and \(\mu _2\) (this is the point where the starting centroid selection process has been changed with the “k-means++” variant, see Ostrovsky et al., 2013). After that, the algorithm consists of a loop between the next two steps. The second step assigns each team to a cluster using the criteria of the nearest centroid, while in the third one, new centroids are calculated averaging \({\hat{\alpha }}_i,{\hat{\beta }}_i,{\hat{\gamma }}_i\) belonging to the same clusters. Then, the difference between the old and the new centroids are computed, and the algorithm repeats these last two steps until the new centroids and the old ones do not vary significantly. A summary of the process is reported in the Algorithm 1.

Additionally, we use the rank-size relationship reported in Eq. (2) to transform the points into an indication of competitiveness within championships and per team. For all the analysed instances, we denote with A the area underlying the curve in Eq. (2) and bounded from below by the abscissae, i.e.:

$$\begin{aligned} A = \int _{\min (r)}^{R} {\hat{\alpha }} \frac{(R + 1 - r)^{{\hat{\beta }}}}{r^{{\hat{\gamma }}}} dr \end{aligned}$$
(4)

Moreover, we denote by \(\Gamma _L\) and \(\Gamma _H\) the proportion of the area of A given by the four lowest ranks and highest ranks, respectively, i.e.

$$\begin{aligned} \Gamma _{L} = \dfrac{\int _{1}^4 {\hat{\alpha }} \frac{(R + 1 - r)^{{\hat{\beta }}}}{r^{{\hat{\gamma }}}} dr}{A}, \qquad \Gamma _{H} = \dfrac{\int _{R-4}^R {\hat{\alpha }} \frac{(R + 1 - r)^{{\hat{\beta }}}}{r^{{\hat{\gamma }}}} dr}{A}. \end{aligned}$$
(5)

The values of \(\Gamma _{L}\) and \(\Gamma _{H}\) are calculated for Pt, \(Pt_2\) and \(Pt_3\) in both the cases “by year” and “by team”, providing a view of the competitiveness over the years and providing a relative measure of the capacity per each team. An exemplifying view of these indicators is reported in Fig. 1. For both the type of analysis, the areas are highlighted when the instance Pt is considered. Figure 1 shows the cases of the championship ended in 1937, and Fig. 1b gives a visual idea of the situation for the SPAL. One can inspect the colours of the areas to better understand the idea behind eqs. (4) and (5).

Fig. 1
figure 1

In the figure are reported the rank-size’s best fits run with Eq. (2). Furthermore, we report the elements necessary for the calculation of the competitive indexes \(\Gamma _{H}\), \(\Gamma _{L}\) reported in eqs. (4) and (5)

Table 1 The Silhouette index is calculated for each k-means cluster analysis run. The parameter k goes from 2 to 5, we report it for the analysis “by year” and “by team” in each instance Pt, \(Pt_2\) and \(Pt_3\)
Table 2 Summary of the best fit results through Eq. (2) run per year

4 Results and discussion

The rank-size analysis results from the Eq. (2) best fits lead to the identification of the calibrated parameters \({\hat{\alpha }}\), \({\hat{\beta }}\) and \({\hat{\gamma }}\), along with some goodness-of-fit quantities—\(R^2\) and RMSE in our context. The calibrated parameters, the goodness-of-fit measures, and the considered sample’s cardinality form two sets of distributions—one associated with the championships and the other with the teams. The main descriptive statistics of such distributions are reported in Tables 2 and 3, and Fig. 2. The locutions “by year” and “by team” reported in that figure are respectively associated with the analysis run at year (championship) and team level.

The parameter \(\alpha \) may be viewed as a proxy of the size at rank one so that a large (small) value of \(\alpha \) is associated with a large (small) value of the maximum score of the considered sample.

The parameter \(\beta \) describes how the curve decreases as the rank grows. If \(\beta \) is small, then the curve tends to capture the so-called “queen and harem effect” turning the concavity of the curve at the highest ranks (Ausloos, 2013, 2014b).

Table 3 Summary of the best fit results through Eq. (2) run per team
Fig. 2
figure 2

The figure reports the DGBD estimated parameters’ probability density (smoothed by a kernel density estimator) when one performs the best fits “by year” and “by team”

Fig. 3
figure 3

The clusters regimes are represented in terms of Eq. (2). Namely, the \({\hat{\alpha }},{\hat{\beta }}\) and \({\hat{\gamma }}\) belonging to the resulting clusters {0,1} have been averaged within the subdivisions to report two representative curves per each instance Pt, \(Pt_2\) and \(Pt_3\). The averaged parameters are indicated with \(\bar{{\hat{\alpha }}}\), \(\bar{{\hat{\beta }}}\) and \(\bar{{\hat{\gamma }}}\)

Fig. 4
figure 4

The parameters resulting from the best fit of Eq. (2) are reported, and the colours distinguish them considering the clusters

Fig. 5
figure 5

This figure reports information about variability of the official ranking of each team analysed, namely those having played at least 16 “Serie A” championships. The coloured areas represent the observations distributed around the media (black vertical line), namely those comprised between the 25th (Q1) and 75th percentiles (Q3), the interquartile range (\(IQR=Q3-Q1\)). The edges of the continuous lines respectively represent the data between \(Q1-1.5*IQR\) and \(Q3+1.5*IQR\). The rhombus represent the outliers positions

Fig. 6
figure 6

Empirical distribution of the DGBD’s estimated parameters divided by clusters when the analysis “by year” is run

Fig. 7
figure 7

Empirical distributions of the years belonging to the two clusters. Each bin covers nine years, starting from 1930 and ending in 2020

Fig. 8
figure 8

Time series of the areas subtended by the curve in Eq. (2) and captured with Eq. (5) using the parameters estimated in the analysis “by year”. The tournaments were not disputed between 1944 and 1946

Fig. 9
figure 9

This figure reports the results from Eq. (5) when the analysis “by team” is performed. The levels of the bar plots represent the areas subtended by the Eq. (2) when the first four (red) and the last four (blue) ranks are considered

Fig. 10
figure 10

This figure reports information about the scores’ variability for each analysed team, namely those having played at least 16 “Serie A” championships. The coloured areas represent the observations distributed around the mean (black vertical line), namely those comprised between the 25th (Q1) and 75th percentiles (Q3), the interquartile range (\(IQR=Q3-Q1\)). The edges of the black continuous lines respectively represent the data between \(Q1-1.5* IQR\) and \(Q3 + 1.5* IQR\). The rhombus are the outliers positions

Fig. 11
figure 11

This figure reports information about the years in which the considered teams have competed. It gives an indication of the points obtained via the dots’ size. Furthermore, the colour indicates the cluster to which it belongs

Fig. 12
figure 12

Empirical distribution of the DGBD’s estimated parameters divided by clusters when the analysis “by team” is run

The DGBD’s \(\gamma \) captures the deviation of the size of two consecutive ranks. Specifically, the difference between the sizes at rank r and \(r+1\) decreases as the value of \(\gamma \) grows. One can better visualize what is described here observing Fig. 3 and jointly comparing the curves with the parameters (reported in caption) that generated them, for this exercise, one can disregard the clusters. Besides, the relationship between parameters can be visually inspected in Fig. 4.

At a championship level, one can notice that the distribution of \({\hat{\alpha }}\) is skewed on higher values in the cases Pt and \(Pt_3\). This outcome highlights similarities between actual final rankings (Pt) and those resulting from the assignment of 3 points to won matches, with an asymmetry of the distribution of the maximum scores in the championships to its left tail. The positive skewness is associated with substantial values in the final rankings of the championships. In this respect, please refer to \({\hat{\alpha }}\) maxima, which is around 90 for Pt and \(Pt_3\) and around 60 in \(Pt_2\). Of course, such large values are registered for teams winning many competitions and in the cases where the points assigned to the won matches is 3, namely championships disputed from 1994-95; for this reason, we do not observe the same result for \(Pt_2\). Moreover, the skewness for Pt is much larger than that of \(Pt_3\). This finding agrees with the evidence that the dataset \(Pt_3\) contains generally larger values so that its highest realizations are closer to the mean. This is confirmed by the resulting means of Pt and \(Pt_3\)—around 32 and 41, respectively—and by the medians—about 30 and 38, respectively. The interpretation of \({\hat{\alpha }}\) is quite similar in the rank-size analysis at a team level, and the same arguments proposed above apply. We only notice that the discrepancies among Pt, \(Pt_2\) and \(Pt_3\) are less evident for the team case than for the championship one. This outcome explains the less evident 3 points regulation effects on the scores of a given team rather than on the rankings of the championships. In other words, some teams have spent most of the years in “Serie A” before 1995, for example, Padova, therefore the differences between Pt, \(Pt_2\) and \(Pt_3\) are mitigated.

The meaningfulness of the estimated parameter \({\hat{\gamma }}\) is evident if one compares the values obtained for the analysis “by team” in the instance Pt with those for \(Pt_2\) and \(Pt_3\) (see tables 3, 5 and figs. 2, 4). In particular, the mean value of \({\hat{\gamma }}\) is higher for Pt than for \(Pt_2\) and \(Pt_3\)—the means in the three cases are around 0.14, 0.09 and 0.1, respectively. This result describes a situation with homogeneous sizes with quite steep rank-size curves. The case “by year” behaviour remains stable across the instances Pt, \(Pt_2\) and \(Pt_3\) with a slightly higher mean than those of the team cases, manifesting the presence in the championship analysis of a few more points on the tails of the rank-size law. Furthermore, most of the considered teams have played under both the rules of 2 and 3 points per won game. Therefore, the championships played during the latter periods led to higher scores than the former ones, forcing \({\hat{\gamma }}\)’s to be more prominent in the Pt and smaller in \(Pt_2\) and \(Pt_3\)—where the points decay is smoother along the ranks.

The values of \({\hat{\beta }}\) present similar behaviours for the cases Pt and \(Pt_3\) when the analysis “by year” is performed. In contrast, when it is performed “by team”, a stronger similarity can be observed between the cases \(Pt_2\) and \(Pt_3\) confirming the teams’ performances regardless the rules in force. In both of cases, all the distributions of the \({\hat{\beta }}\) for Pt, \(Pt_2\) and \(Pt_3\) have positive skewness and present signs of bi-modality. So, if the points associated with low ranks are shallow and not that distanced, \({\hat{\beta }}\) tends to be smaller. For the cases of the analysis “by team”, \({\hat{\beta }}\) captures the capacity of the teams in the “Serie A”—specifically those having played at least in 16 championships—to perform relatively well (with respect to them-self) for several years and rather bad in the majority of the competitions. Indeed, Fig. 5 shows that most of the analysed teams have had their “glory moments”, being in the first five official positions of the competitions at least once and most of the teams spent their carriers in the central-low part of the rankings.

The analysis of the DGBD best fits’ parameters proves the presence of two regimes across the different instances. Its strongest evidence consists of bi-modality in parameters’ distributions which also strengthens the decision to use the k-means calibrated with two clusters, \(k=2\), further supported by the Silhouette index reported in Table 1. In figs. 6 and 7 the results of the cluster analysis “by year” are reported. The different distributions of the observations across clusters and along the years manifest interesting aspects. Apart from the first nine years (first bin in Fig. 7) and the last years, the clusters capture the two regimes mentioned above. The years belonged to different groups until, more recently, they split more equally between the clusters; this is particularly evident in Pt. Potential justification for that can be found in Özaydin and Donduran (2019) where the authors describe how the teams’ competitiveness grows together with the economic power. Similar arguments are detailed in Michie and Oughton (2004), where the authors state that “in Italy, there has been a marked deterioration in competitive balance since 1992 so that at the end of the period (2004) Italy had the highest degree of imbalance of the top 5 leagues” and connect this fact with the increase in revenues by the firms owning the soccer teams. Finally, in Nicoliello and Zampatti (2016) the impact of the Financial Fair Play regulation starting from 2014 is studied for the case of Italy. The need for a Financial Fair Play regulation constitute relevant proof of the competitive implications of economics or financial imbalances, see Masters (2014). In the instance Pt, the split years between clusters can be additionally affected by the changes in rules. For example, \({\hat{\alpha }}\) shows two picks (see Fig. 2), providing indications of the distinct behaviour of the winning teams, which got higher points for the more recent championships.

Another way for understanding the results from the cluster analysis done “by year” comes from a visual inspection of Fig. 3a. It reports the curves plotted by plugging in Eq. (2) the estimated parameters’ averages per cluster and instance. One notices that the differences are mostly present at high and low ranks, where the clusters are characterized. On the other hand, Fig. 4a, instance Pt, shows the signs of the change in rules with a less concentrated distribution of the dots with respect to the cases \(Pt_2\) and \(Pt_3\).

Concluding, in Figs. 8 and 9 we report the results from the analysis of the areas subtended by Eq. (2), namely the figures generated from Eq. (5). It is interesting to notice the presence of different regimes. There has been a drastic drop in the first years before 1950 for both \(\Gamma _{H}\) and \(\Gamma _{L}\) and then a recovery predominately obtained by the teams sitting at the first four positions of the championships until 1975-1980, see Fig. 8. The areas below the curve related to the last four positions in each championship did not recover enough to go back to the level occurred before the Second World War. In recent years, we can appreciate a stabilization for \(\Gamma _{L}\), but, regarding \(\Gamma _{H}\) the situation has deteriorated, further strengthening the idea of having weaker and weaker teams at the bottom of the rankings.

The results from the analysis “by teams” are driven by similar factors and lead to conclusions regarding teams performances. In the analysis run on the official rankings Pt, the teams that had less success and have played more recently belong to the same cluster (see figs. 51011). For example, one can notice that Brescia, Cagliari, Catania, Chievo and Lecce have a recent history in “Serie A” (Fig. 11) not that successful in terms of positions in the final ranks (Fig. 5) but still good for the points obtained (Fig. 10). On the other hand, teams like Fiorentina, Inter, Juventus, Napoli, Roma and Torino have a more successful history. Namely, they have got higher ranks (Fig. 5) and more points (Fig. 10). They have played more often in the “Serie A” (Fig. 11). The analysis of \(Pt_2\) and \(Pt_3\) are less affected by the time factor magnified by the official penalties applied and, more importantly, by the changes in rules. This is confirmed by the fact that changes in clusters are not that frequent when comparing the colours of the boxplots reported in Fig. 10, \(Pt_2\) and \(Pt_3\). Furthermore, these teams with a more successful history in “Serie A” tend to have lower levels of \(\Gamma _{L}\) and \(\Gamma _{H}\), as reported in Fig. 9. The presence of teams moving from one cluster to another when looking at Pt, \(Pt_2\) and \(Pt_3\) can be explained through three factors: (i) the change in rule for assigning points to won matches (from 2 points to 3 points for each victory), so that teams having played and won most of the matches recently (so getting 3 points per won match), are likely to stay in the same cluster for the instances Pt and \(Pt_3\), see for example the case of Livorno. (ii) The proportion of draws had during the different championships, so in different periods. In other words, the teams having scored draws often have the rank-size curve more characterised by the few times they performed well. In this context, the points assigned to the won games—2 or 3—play a relevant role in identifying the clusters. This can be visualised by observing the different behaviours of the curves and clusters at low ranks in Fig. 3b, e.g., for the instances Pt and \(Pt_3\). In this case, the distance between the two clusters at low ranks is more significant for \(Pt_3\) than Pt; therefore, there is a remarkable difference in the probabilities of falling in one cluster or another. (iii) In Pt, one has the penalties included in the final results, hence leading to more extreme values (figs 10 and 12).

5 Conclusions

The present paper deals with a rank-size analysis of the Italian football final ranking obtained by teams in the so-called “Serie A” in the championships disputed between 1930 and 2020. The parameters’ calibration procedure run with the DGBD, Eq. (2), presents an outstanding performance with a \(R^2\) ranging between 0.80 and 0.99. The DGBD’s parameters \(\alpha \), \(\beta \), and \(\gamma \) effectively capture relevant features related to the teams’ performance. The highest points scored by the competitors in the championships, in the case “by year”, and in the teams’ history for the case “by team”, are captured by \({\hat{\alpha }}\). The performance of the teams in the play-off area is captured by \({\hat{\beta }}\) and the decay as well as the concentration at highest rank is captured by \({\hat{\gamma }}\). The triplets estimated in the instances Pt, \(Pt_2\) and \(Pt_3\) successfully represent the conditions of teams and championships had over the years. The rank-size regimes and their meaning are evident once used to feed the k-means algorithm with \(k=2\). The results prove the relevance of historical phases of Italian football, suggesting a solid characterisation of them by the economic condition of the teams and the rules in place. More specifically, we consider the results connected with the literature regarding the teams’ economic power and competitiveness (see Michie and Oughton, 2004; Nicoliello and Zampatti, 2016; Özaydin and Donduran, 2019), and the change in rules that occurred in 1994-95. In details, the connection with the deterioration in competitive balance since the ‘90s can be associated with the change of the European Champion Clubs’ Cup tournament formula, occurred in 1992. The cup was initially a straight knockout tournament open only to the champions of Europe’s domestic leagues. The competition got its current name “UEFA Champions League” during the season ended in 1993, after that other changes occurred. For example, it was added a round-robin group stage, and later it was allowed multiple entrants from other countries than the whole EU group itself. Given that, the TV rights value and the business associated to soccer (e.g., appetite for sponsors) increased, mostly driven by the increasing number of matches and the spread of the competition across more countries. Such a phenomenon transformed the soccer dynamic in Italy as well as in other leagues. The teams at the top of the “Serie A” ranking got increased visibility and changes of increasing their earnings through the new business opportunities. With specific reference to the TV rights, the biggest change occurred during the season ending in 1995, namely when the contracts between UEFA and European Broadcasting Union ended. One can notice from Fig. 8, that around that period, the two \(\Gamma \)s change trend’s direction. Namely, the teams at low ranks have been increasing / holding their competitiveness (see \(\Gamma _L\)), while the others had the opposite (decreasing level of \(\Gamma _H\)).

With the analysis “by team” one captures the phenomenon for which many teams have had “glory years” entering in the “Serie A” and being able to have a “decent” performance for a few times, spending the rest of the permanence in floating in the middle of the ranking. The analysis of the areas performed via Eq. (5) further confirms that in recent years, the teams at the bottom of the rankings have performed worse and worse, underfeeding a general trend followed by the teams present at the top.