Introduction

The Human Development Index (HDI) is an international index used to provide additional information beyond the economic information provided by the Gross Domestic Product [37]. The HDI is defined as a general measure of human development in a given population, determined by socioeconomic status, life expectancy (i.e., health), per capita income (i.e., income), and formal education access (i.e., education) [37]. Similarly, HDI, developed by the United Nations Development Programme (UNEP), has been cited as one of the most powerful variables that play a relevant role, along with cultural factors, in promoting a favourable environment for sporting development [10].

From an ecological perspective, the interplay of subject-environment has been highlighted as an essential key to human development through several domains (e.g., cognitive, behavioural, social, motor) [8, 35]. Since athlete development is a non-linear process [19], different variables act together for its expressions, such as personal characteristics, motivational aspects, social/economic facilities, and cultural factors. So, support from different levels is required [19], meaning that the role of individual factors (i.e., anthropometric, physiological, technical-tactical, psychological) [20, 39], economic characteristics and training structure [13], and context-specific characteristics are relevant to sports participation and performance [3, 11, 28, 32].

Studying Brazilian swimming athletes, Gomes-Sentone et al. [16] reported that HDI, income, and education level were important social indicators for sports performance, and similar results were reported by Costa et al. [10] among soccer players. On the other hand, Santos et al. [30], studying junior, elite professionals, and masters athletes presented in the Athletics World Rankings (i.e., 100 m and 10,000 m running distances), found that countries with very high HDI were the most represented in the 100 m ranking, while countries with moderate/low HDI were the most represented in the 10,000 m ranking. Previous studies highlighted that environmental characteristics are also important predictors of sports participation [5].

In summary, the published studies highlighted those environmental characteristics were relevant in both the development and maintenance of athletic performance [11, 18, 25]. Thus, considering the context-specific characteristics between countries, it is possible to postulate that these differences can be associated with between-countries performance differences, which can lead one country to be more competitive than the others, achieving higher international performance and recognition [6, 33]. Furthermore, this scenario can lead these countries to receive more financial sports investments through their government and/or stakeholders/sponsors, allowing them to increase their visibility and success at the international level, due to better conditions for sports development and support for elite athletes.

This fact may illustrate the concept of the “Matthew effect” [1], which highlights that initial advantage tends to beget further advantage and disadvantage further disadvantage. Over time, these differences tend to create widening gaps between those who have more and those who have less [1]. The “Matthew effect” has been studied in a wide variety of contexts and institutional settings, such as sociology, education, biology, and economics [14, 22, 27]. In the context of sport, it is possible to observe this occurrence in the Brazilian setting, where differences between its regions tend to favour those with more favourable socioeconomic indicators [10, 16, 29]. These regions, which receive higher amounts of economic investments [26], tend to invest more in sports and talent development programs in a higher number of sports clubs, and competition events, which contributes to these regions, usually concentrating the highest number of high-elite athletes at a national level [34]. Furthermore, athletes identified as talented tend to move to these regions or are recruited by development programs and sports clubs from these regions, with the purpose to have success in the practice/modality [2].

For example, in endurance running, it was reported that socioeconomic variables (i.e., sports investment and gross domestic product) and competition venue were associated with countries’ likelihood of having athletes in the top 10 rankings, in the European continent [33]. Furthermore, in the Brazilian context, a relationship between population size and the State’s Gross Domestic Product with the number of athletes in the national ranking was observed [33]. Moreover, at an international level, information about the success of the African endurance runners has been extensively studied [17], but information regarding the relationship between countries’ HDI and the success in this modality is still not conclusive.

Thus, the purpose of this study was to investigate the frequency of countries represented in the TOP20 long-distance elite runner rankings from 1997 to 2020 whilst taking into account countries' HDI. We also aimed to verify if the “Matthew effect” can be observed regarding countries' representativeness in the ranking over 23 years, i.e., if having a high number of athletes in the ranking in the first years is associated with a high number of athletes in the following years. We hypothesized that the number of athletes in the first decade would be associated with the number of athletes in the last decade and that countries' HDI would be negatively associated with the number of athletes each country has in the ranking of the modality.

Methods

Study design and data source

The study used a cross-sectional design. All data were collected in November 2020 from the official results section of the Tilastopaja website (www.tilastopaja.eu/). All available results from the world's best half-marathon and marathon marks in official outdoor events, between 1997 and 2020, were compiled for both male and female sexes. Available information included: the athlete’s name, date of birth, sex, race time, citizenship, date of the competition, and venue. Athletes’ age was computed using the date of birth and the date of the competition.

For the present study, the sample included 1852 professional runners (883 female and 969 male), aged 18–41 years, who were ranked in the Senior World TOP20 half-marathon (403 female and 487 male) and marathon races (480 female and 482 male), for the years 1997–2020. To categorize athletes according to their performance, they were split, based on their ranking position, into three groups (“1st–3rd”; “4th–10th”; “ > 10th”), based on previous studies [24].

HDI data source

Further, information regarding the countries HDI was collected from the HDI Global 2014 (https://www.br.undp.org/content/brazil/pt/home/idh0/rankings/idh-global.html). Based on the United Nations Development Program (36), HDI was categorized as “low” (< 0.550), “medium” (≥ 0.550 and ≤ 0.699), “high” (≥ 0.700 and ≤ 0.799), and “very high” (≥ 0.800).

Determination of the “Matthew effect”

To identify the existence of the “Matthew effect”, information about the number of athletes ranked in the TOP20 per country was considered. Previous studies have shown a decrease in participation and presence in the ranking for European athletes, in comparison to African athletes in the last years [24]. However, taking into account the temporal interval available to be used in the present study and the number of athletes by country, we decided to cluster it into three different time intervals (“1997–2000”; “2001–2010”; 2011–2020”).

Statistical analysis

Descriptive statistics are presented as mean (standard deviation), and frequencies (%). The normality was tested by the Kolmogorov–Smirnov test. Chi-square test, followed by the Tukey test for multiple pairwise comparisons was performed in WinPepi to verify the association between athletes ranking position (1st–3rd; 4th–10th; > 10th) and countries’ HDI classification (low HDI; medium HDI; high HDI; very-high HDI), considering race distance for both sexes. Spearman correlation (r) was used to estimate the relationship between countries’ HDI and the total number of athletes per country across the year intervals and also consider the full range of years. The magnitude of the correlation was determined by the scale proposed by Batterham and Hopkins [4] as follows: r < 0.1, trivial; r = 0.1– < 0.3, small; r = 0.3– < 0.5, moderate; r = 0.5– < 0.7, strong; r = 0.7– < 0.9, very strong; r = 0.9 to < 1.0 almost perfect; and r = 1.0, perfect. Bootstrap results were performed based on 1000 bootstrap samples. Statistical Package for Social Sciences (SPSS), version 26®, was used, adopting a significance level of 95%.

Results

The athletes’ mean age was 27.5 (4.5) and 25.6 (3.6) years for women and men half-marathoners, and 28.4 (4.1) and 28.0 (4.0) years for marathoners of both sexes (women and men, respectively). The sample distribution, according to HDI, was: 50.1% (n = 927) from medium HDI countries, 24.8% (n = 459) from low HDI countries, 22.2% (n = 412) from very-high HDI, and 2.9% (n = 54) from high HDI countries. Figure 1 presents the athletes’ distribution per race distance, according to countries HDI. The majority of athletes were from countries with medium HDI, except female marathoners, where most of them were from very-high HDI (36%) and low HDI (30.4%).

Fig. 1
figure 1

Frequency of HDI countries represented in the TOP20 long-distance elite runners ranking

Table 1 presents the chi-square results. For the “1st–3rd” category, in both distances and for both sexes, the highest frequency was observed for countries classified as medium HDI, and this frequency was significantly higher compared to other countries classification. In general, countries with medium HDI were higher represented in the ranking, except for groups “4th–10th” and “ > 10th” among female marathoners, where the highest representativeness was observed for countries with a very high HDI.

Table 1 Chi-square test results for ranking position and HDI association

Table 2 and Fig. 2 show the Spearman correlation results for the association between countries’ HDI and the total number of athletes in the ranking across the years (intervals). Non-significant associations were observed between HDI and the total number of athletes, regardless of the year interval. A positive, moderate and significant association was found between the number of athletes in 1997–2000 and the number of athletes in 2001–2010 (r = 0.600; P < 0.001), as well as a trivial correlation between the number of athletes in 2001–2010 and the number of athletes in 2011–2020 (r = 0.29; P < 0.001).

Table 2 Spearman correlation results for the relationship between the variables
Fig. 2
figure 2

Correlation matrix results for the relationship between variables

Discussion

The purpose of this study was to investigate the frequency of countries represented in the TOP20 long-distance elite runners ranking during 1997–2020, taking into account countries' HDI, and to verify if the “Matthew effect” can be observed regarding countries’ representativeness in the ranking alongside the years. The main findings reveal that most of the endurance athletes were from countries with medium and/or low HDI in the last 20 years; (ii) most female marathoners were from countries with a very high HDI, followed by a low HDI, and (iii) a noncorrelation between HDI and number of athletes were showed, but a positive and significant association was verified for the number of athletes in different years.

Although most of the female athletes came from Kenya and Ethiopia, countries such as Japan, Romania, China, Russia, Germany, Great Britain, and Italy represent, together, approximately 28% of the ranking, which can explain the results found. It seems to have a consensus, in the available literature, regarding the role of economic and social factors in sports participation and high-level sports performance [7, 25]. At an international level, a previous study highlighted the role of the States in the Olympic success, of which 50% of this success can be associated with countries' HDI, population size, and political regime [6]. Similar results were observed by Thuany et al. [33], studying Brazilian elite endurance athletes, where the authors reported that population size and GDP were related to states' representativeness (determined by the number of athletes) in the national ranking of the modality. At the European level, economic proxy (i.e., sports investment) and place of competition (i.e., housing running events) seem to increase the chances of a runner being ranked among the 10 best athletes in endurance running [33].

A previous study, conducted by Santos et al. [30], identified a positive correlation between countries' HDI and the number of athletes in the IAAF (World Athletic) ranking from 2006 to 2016; however, in the present study, no significant association between HDI and the number of athletes in the ranking was observed. Disagreement between these results can be related to differences in sample characteristics (i.e., sex and competitive level) and years considered (2006–2016 vs. 1997–2020 in the present study). In addition, results from the present study can be related to the fact that most of the athletes in the TOP20 ranking, during the last 20 years, are from African countries, especially Kenya (47%, if considering the whole temporal range, and 50.8% if considering the last 10 years) and Ethiopia (22.4%, if considering the whole temporal range, and 38.2% when considering the last 10 years). Both countries do not have a large population size, nor even a high HDI, since they are ranked in the 143rd and 173rd positions, respectively (http://hdr.undp.org/).

However, available data indicated that in 2015, about 2.98% of the Kenyan adult population were unemployed, and about 37.1% of the population lived below the poverty line (https://data.worldbank.org/country/kenya). However, a different economic scenario was described by Onywera et al. [23], whose results showed that about 40% of the Kenyan population were unemployed, and at least 50% lived below the poverty line. These socioeconomic data do not reflect the country’s representativeness at an international level in endurance running sports, since most of the best athletes come from these countries classified as having low HDI, such as Kenya and Ethiopia [21]. Given the significant poverty conditions experienced by Kenyans, sports participation could be motivated by the possibility of economic empowerment [23]. However, another potential fact is that running is a relevant aspect of Kenyan life, being part of the country’s sporting culture [15].

These results reinforce the idea that sports performance is “country-specific” [12], and although Kenya is not a wealthy country, it is still one of the nations with the best endurance-running athletes. The hegemony of a country in a given sport is not a recent phenomenon (e.g., Sprinters—Jamaica; Soccer—Brazil; Basketball—EUA; Hockey—Canada), and the country’s success is associated with cultural and environmental characteristics (e.g., athletes development programs, number of competitions, number of clubs, sports investment), or even with the athletes perspective of social ascension trough the sport [23].

The “Matthew effect” was indirectly tested, and the study hypothesis was supported, showing a direct relationship between the numbers of athletes, across the year’s intervals. The results demonstrated that a high number of athletes in the ranking in the first decade was positively associated with the number of athletes in the second and third decades. Since 1968 (Mexico City Olympic Games), Kenya and Ethiopia have dominated the long-distance running events in Track and Field [21, 38]. The hegemony of African athletes among the best runners worldwide was associated with a plethora of factors, especially the genetic characteristics, environmental factors (i.e., altitude), morphological and physiological indicators [9, 23, 31], in addition to the motivational characteristics. Since participation in sport can potentially lead to better living conditions for athletes (in part due to possible economic empowerment, better access to facilities and care), this fact can contribute to a higher number of youth becoming interested in track and field as a potential career in these countries [15].

Increasing the number of potential athletes to achieve the elite status and, as a consequence, keeping these countries in the highest position in the ranking of the modality, i.e., the higher the number of athletes internationally ranked, the higher the sport practice increases in the country, and the higher the chances of the country keep its position in the ranking. It is interesting to note that, in this case, the “Matthew effect” is observed not because of higher economic conditions in a country, but instead having athletes well-ranked can be a positive stimulus for youth to take part in the modality, keeping and/or maintain the country position in the international athletic success.

There were some limitations in the current study. First, we considered only the TOP20 athletes around the world, which indicated that differences could be identified if other athlete’s categories were considered, or even through the use of a different ranking classification and/or distance (i.e., middle-running) and temporal range (i.e., before 1997). Second, to identify the “Matthew effect”, we only considered the last 20 years, and could be of relevance to investigate a longer period. Thirdly, the lack of time between the HDI (2014) and the sum of athletes in different years could be associated with an absence of significant association.

Conclusion

Half of the elite endurance athletes ranked in the TOP20 between 1997 and 2020, are from countries with a medium HDI, followed by a low HDI and a very high HDI. Instead of any significant association being observed between countries' HDI and athletes ranking positions, an illustration of the Matthew effect was observed, since a positive and significant relationship in the number of ranked athletes over the years was observed, where those countries with the highest number of athletes in the first decade were most represented in the subsequent decade. However, these results do not allow the generalization, and future studies should consider investigating a longer time interval, comprising years before 1997, and the relationship with other socioeconomic factors.