Pre-injury performance is most important for predicting the level of match participation after Achilles tendon ruptures in elite soccer players: a study using a machine learning classifier

Diniz, Pedro; Abreu, Mariana; Lacerda, Diogo; Martins, António; Pereira, Hélder; Ferreira, Frederico Castelo; Kerkhoffs, Gino MMJ; Fred, Ana

doi:10.1007/s00167-022-07082-4

Pre-injury performance is most important for predicting the level of match participation after Achilles tendon ruptures in elite soccer players: a study using a machine learning classifier

ANKLE
Published: 09 August 2022

Volume 30, pages 4225–4237, (2022)
Cite this article

Download PDF

Knee Surgery, Sports Traumatology, Arthroscopy Aims and scope

Pre-injury performance is most important for predicting the level of match participation after Achilles tendon ruptures in elite soccer players: a study using a machine learning classifier

Download PDF

Pedro Diniz ORCID: orcid.org/0000-0001-9234-7041^1,2,3,4,
Mariana Abreu^2,5,
Diogo Lacerda¹,
António Martins^1,4,
Hélder Pereira^6,7,8,
Frederico Castelo Ferreira^2,3,
Gino MMJ Kerkhoffs^9,10,11 &
…
Ana Fred^2,5

1896 Accesses
4 Citations
Explore all metrics

Abstract

Purpose

Achilles tendon ruptures (ATR) are career-threatening injuries in elite soccer players due to the decreased sports performance they commonly inflict. This study presents an exploratory data analysis of match participation before and after ATRs and an evaluation of the performance of a machine learning (ML) model based on pre-injury features to predict whether a player will return to a previous level of match participation.

Methods

The website transfermarkt.com was mined, between January and March of 2021, for relevant entries regarding soccer players who suffered an ATR while playing in first or second leagues. The difference between average minutes played per match (MPM) 1 year before injury and between 1 and 2 years after the injury was used to identify patterns in match participation after injury. Clustering analysis was performed using k-means clustering. Predictions of post-injury match participation were made using the XGBoost classification algorithm. The performance of this model was evaluated using the area under the receiver operating characteristic curve (AUROC) and Brier score loss (BSL).

Results

Two hundred and nine players were included in the study. Data from 32,853 matches was analysed. Exploratory data analysis revealed that forwards, midfielders and defenders increased match participation during the first year after injury, with goalkeepers still improving at 2 years. Players were grouped into four clusters regarding the difference between MPMs 1 year before injury and between 1 and 2 years after the injury. These groups ranged between a severe decrease (n = 34; − 59 ± 13 MPM), moderate decrease (n = 75; − 25 ± 8 MPM), maintenance (n = 70; 0 ± 8 MPM), or increase (n = 30; 32 ± 13 MPM). Regarding the predictive model, the average AUROC after cross-validation was 0.81 ± 0.10, and the BSL was 0.12, with the most important features relating to pre-injury match participation.

Conclusion

Most players take 1 year to reach peak match participation after an ATR. Good performance was attained using a ML classifier to predict the level of match participation following an ATR, with features related to pre-injury match participation displaying the highest importance.

Level of evidence

I.

The Mental Health of Elite Athletes: A Narrative Systematic Review

Article Open access 20 February 2016

The Biomechanics of the Track and Field Sprint Start: A Narrative Review

Article Open access 17 June 2019

The Importance of Muscular Strength in Athletic Performance

Article 02 February 2016

Introduction

Achilles tendon ruptures (ATR) are career-threatening injuries in elite soccer players. Unfortunately, despite a relatively high return to play (RTP) rate, 96%, according to Grassi et al. [11], 18% of players will not return to the same level of competition within two seasons following injury [34]. Furthermore, previous research has also shown that soccer players suffering from these injuries have their careers shortened, on average, by two seasons compared to matched controls [30].

Several studies reporting outcomes of ATRs in elite athletes are based on publicly available information [11, 13, 15, 24, 26, 30, 33, 34]. In soccer, one notable source is transfermarkt.com [10, 11, 16, 21, 34, 36], which has been considered accurate, regarding injury denomination and location, in 89% of cases [8, 16]. Although primarily aimed at aggregating player market values and transfer fees, it includes other valuable data for sports analytics, such as match results, player performance indicators (namely goals, assists, and fouls), and injury history. This database is publicly available and maintained by transfermarkt.com and its user community [32].

Artificial intelligence is a field that studies artificial agents that can mimic or surpass human-level intelligent tasks and has become increasingly popular in the past decade [7]. Machine Learning (ML) is a subset of artificial intelligence related to “advanced statistical techniques that use computer algorithms to model complex relationships between variables”, with these computer algorithms learning automatically from experience, i.e. data, without direct human intervention [20]. These algorithms rely on data analysis models to uncover hidden patterns and other meaningful insights from large datasets [28]. Among these algorithms, one can find both unsupervised and supervised learning methods [1]. Unsupervised learning is used when “labels” are unavailable [1], i.e. individual instances in the dataset are not categorized. These algorithms can organize individual instances according to naturally emerging patterns in the dataset, detect anomalous patterns and perform dimensionality reduction [1, 7]. Supervised learning is used when data are “labeled”, i.e. the algorithm is fed training data where individual instances—observations—and corresponding output values, obtained with human intervention, are known [1]. Regression and classification problems are the two main categories into which supervised learning can be divided [1, 7].

Despite recent advances in the characterization of consequences of ATRs for elite soccer players [10, 11, 30, 34], both an evaluation of how match participation evolves after injury and a set of prognostic tools to gauge the likelihood of return to the same level of play are still missing in the literature. In addition, previous studies of elite soccer players treated for ATRs have also been limited by their reduced number of cases under consideration [10, 30, 34], by being restricted to a single league [10, 30], or by missing performance measures besides the return to play at the same competitive level [11].

This study has a double objective. Firstly, an exploratory data analysis aims to inform athletes and staff how match participation evolves after ATRs. Secondly, it evaluates the performance of an ML model based on pre-injury features to predict whether a player would return to a similar level of match participation, together with a study of the most relevant features for this task.

Materials and methods

Player screening and selection

The website transfermarkt.com (Transfermarkt, Hamburg, Germany) was mined, between January and March of 2021, for relevant entries regarding soccer players who suffered an ATR while playing in first or second leagues.

A customized web scraper was developed using Scrapy [22]. Player screening and selection were carried out using the following scheme: fir. Firstly, a list of all first and second leagues across the world was manually compiled; secondly, team rosters for each team in each league, since season 2007/2008, were extracted to a list; finally, the injury data of each player in the list were retrieved. The resulting injury data were filtered for entries containing the string “Achilles tendon rupture” or “Achilles” combined with more than 90 days of absence. Another group of players with absence times of more than 90 days was built from the following strings: “calf”, “leg”, and “ankle tendon”.

Each entry was then evaluated independently by two researchers. Only players with club reports, press releases, or interviews mentioning a complete ATR were eligible for inclusion. A minimum follow-up of 24 months was also required. Due to the COVID-19 pandemic and ensuing match calendar rearranging, only injuries occurring before 31st March^t of 2018 were included. Players that suffered partial or focal tears of the Achilles, and players that suffered an ATR while playing for teams not in first or second leagues, or were unaffiliated with any team at the moment of injury, were excluded. Disagreements were settled by discussion with a third researcher on a case-by-case basis.

Data extraction and dataset handling

The transfermarkt.com website was also scraped for the following items: date of birth, height, preferred foot, playing position, club transfers (including projected market values and transfer fees), whether the player had played for the national team (at any time during the player’s career), date of clearance for unrestricted practice, and match participation data (as minutes on the playing field; for the season of injury, the preceding season, and the two seasons following injury). Specific match participation data included: minutes played, whether the player was in the starting team, whether the player did not play but sat on the bench, and the reason for not playing (medical injuries, coach choice, or other). Data were anonymized, pooled into a database, inspected, and formatted for consistency. In cases where players sustained bilateral ruptures, the first rupture was considered the index event.

Dealing with missing data

Missing data regarding minutes played per match were imputed using spline interpolation. In addition, missing values regarding categorical features related to match participation (reason for player absence from the playing field and whether the player was in the starting eleven) were imputed using backfilling. Of note, less than 0.01% of matches had missing information.

Feature engineering

The following features were computed from the available data: age at rupture, relative market value (obtained from the division of the player’s market value by the squad total market value), whether a re-rupture or a contra-lateral rupture happened, whether there were other preceding or following Achilles Tendon (AT) problems, date of the first official match participation following rupture, whether the player retired, changed clubs or was left without club within the 2 years following injury, minutes and matches played in the 24- (Year − 2) and 12-months preceding (Year − 1), and 12- (Year 0) and 24-months (Year 1) after injury. In addition, to account for discrepancies in playtime available, players’ data related to match participation was averaged by the number of matches played by the team in 30-, 90-, 120-, 180- and 360-day intervals.

Additional feature engineering was then performed, leading to the creation of the following features: the player’s market value multiplied by the average minutes played per match in Year − 1, the market value of the team multiplied by the player’s average minutes played per match in Year − 1, the difference in minutes played per match in Year − 1 and Year − 2, how many days had elapsed since the player joined the team when the injury happened and the number of months elapsed since the beginning of the season when the injury occurred.

Machine learning model development and calibration

Unsupervised and supervised machine learning models were trained and evaluated using the Python SciKit-Learn library on the Google Colab platform [2, 25].

The difference between average minutes played per match during Year 1 and Year − 1 (ΔMPM) was used to survey patterns in match participation after injury. Clustering analysis was performed using k-means clustering [1]. The optimal number of clusters was determined using the silhouette score [27], which varies between − 1 and + 1, and evaluates how similar data points are to their clusters compared to other clusters. A value of 0 represents overlapping clusters, and negative values signify that data points have been assigned to the wrong cluster. The silhouette score is frequently used to assess clustering quality, in the absence of a standard method in the research community [27]. Cluster stability was evaluated by repeatedly randomly dividing the main dataset into training and test datasets (number of repeats: 100; train/test split: 50/50) and measuring the similarity of the resulting clustering with the Adjusted Rand Index and Fowlkes–Mallows scores, using the main dataset cluster labels as ground truth.

The post-injury match participation level was predicted using the XGBoost classification algorithm [6, 12]. Continuous variables were scaled with standardization. Feature selection was performed using forward selection, in which the model is started with no features, and features are added sequentially and kept if results are improved. Model outputs were subjected to cross-validation using a ten k-fold strategy [19]. In a stratified ten k-fold cross-validation, 90% of the dataset is used to train, and 10% is used to evaluate the model. The procedure is repeated ten times, each with a different train/test split until the entire dataset has been used as the test set. The model was evaluated using the area under the receiver operating characteristic curve (AUROC) and Brier score loss. A representation of the machine learning processing pipeline can be found in Fig. 1.

Statistical analysis

Statistical analysis was performed using Python libraries Statsmodels and SciPy. Except otherwise specified, values are presented as means and standard deviation. Groups were compared using Student’s t-test, Kruskal–Wallis, or one-way ANOVA (depending on the number of groups and whether data followed a normal distribution). The assumption of normality was tested using the Shapiro–Wilk test. The Pearson’s correlation coefficient was used to explore potential correlations between variables. Statistical significance was set at p < 0.05. Sample size calculation was not performed for this study.

Results

The scraping process retrieved 748 entries. After applying exclusion criteria, 209 players were selected for analysis. Detailed information regarding the screening and selection process, with exclusion criteria, can be found in Fig. 2.

Player demographics and baseline characteristics

Data related to player demographics and baseline characteristics can be found in Table 1. The mean age at rupture was 28.2 ± 4.0 years (range 20–40).

Table 1 Player demographics and baseline characteristics

Full size table

Return to competition and career changes

Players were cleared for unrestricted practice after a mean of 223 ± 129 days (range 92–1553). The first post-injury match was played after a mean of 287 ± 136 days (range 106–825).

Fourteen players (6.7%) did not play any match after the AT injury and subsequently retired, with five more players retiring within 2 years after injury, for a total of 19 (9.1%). Three other players (1.4%) had their contracts expire and were left without a club sometime in the 2 years after injury. Thirteen players (6.2%) changed clubs within the 2 years following injury, with nine changing to teams playing below second league (4.3%).

Re-ruptures and other Achilles tendon issues

Ten players sustained re-ruptures (4.8%). These re-ruptures occurred after a mean of 621 ± 532 days after the index injury (153–1634). Six players (2.9%) sustained contra-lateral ruptures at some point in their careers. Eight players (3.8%) had a recording of previous AT problems, and 16 players (7.7%) had another time-loss injury (other than re-rupture or contra-lateral ATR) related to AT problems after the index injury.

Exploratory analysis of match participation data

Data from 32,853 matches were analysed. The average minutes played per match was 48 ± 25 in Year − 2, 46 ± 24 in Year − 1, 11 ± 13 in Year 0, and 32 ± 25 in Year 1. Players were in the squad in 64.1 ± 26.2% of games in Year − 2, 62.6 ± 25.0% in Year − 1, 17.9 ± 18.1% in Year 0, and 47.0 ± 29.4% in Year 1. Players were in the starting eleven in 53.5 ± 28.0% of games in Year − 2, 51.6 ± 27.0% in Year − 1, 12.0 ± 15.1% in Year 0, and 35.3 ± 28.7% in Year 1. These differences were statistically significant (p < 0.001) for all comparisons except between Year − 2 and Year − 1. A plot of average minutes played per match throughout the study time frame, computed in 30-day intervals for each playing position, can be seen in Fig. 3.

The Pearson’s correlation coefficient showed a small inverse correlation between days until clearance for unrestricted practice and the ΔMPM (r = − 0.2; 95% confidence interval − 0.33 to − 0.07; p < 0.01). A very small positive correlation was found between days elapsed since injury until first match played and the ΔMPM (r = 0.13; 95% confidence interval − 0.01 to 0.26; n.s.). After removal of outliers (those with values above 500 days; n = 18), this correlation was 0.2 (95% confidence interval 0.06–0.33; p < 0.01). Finally, a small positive correlation was also found between the number of days from clearance for unrestricted practice to first match played and the ΔMPM (r = 0.24; 95% confidence interval 0.11–0.36; p < 0.001).

Clustering analysis

The optimal number of clusters was four. The silhouette score was 0.55. The Adjusted Rand Index and Fowlkes–Mallows scores were 0.84 and 0.88, respectively. A plot of average minutes played per match for each cluster, computed in 30-day intervals, can be found in Fig. 4. The main characteristics of clusters and respective statistical comparisons can be found in Table 2.

Table 2 Main characteristics of clusters and statistical comparisons

Full size table

Prediction of post-injury match participation

Players were divided into two groups based on whether they suffered a decrease in match participation while comparing average minutes per match in Year 1 and Year − 1. Players were assigned to Group 1 if they showed a decrease larger than 15 min played per match, and this difference was more than 20% of the value in Year − 1 (decreased match participation, n = 103). Otherwise, they were assigned to Group 2 (maintenance or improvement of match participation, n = 106). These designations were used as classification labels to train a ML classification algorithm. A list of included features and relative feature importance can be found in Table 3. After cross-validation, the average model AUROC was 0.81 ± 0.10, and the Brier score loss was 0.12.

Table 3 Features included in the predictive model and their importance

Full size table

Discussion

The most important findings of this study were: most players gradually increased match participation during the first year after injury, with goalkeepers still improving after 2 years; and the ML classifier displayed good performance predicting whether a player would return to a similar, or even improved, level of match participation, with the most important features being related with pre-injury performance.

Plateauing of post-injury match participation occurred approximately 1 year after injury for forwards, midfielders, and defenders. Goalkeepers kept increasing playing time throughout the 2 years following injury, albeit at a slower rate. Of note, previous research has shown that outcomes after ATRs improve for at least 1 year after injury [3, 4], possibly due to a need to adapt to biomechanical changes in the lower limb resulting from tendon elongation [9]. Another critical aspect to consider is that psychological factors may be involved [29, 35], in which players need to regain confidence in their abilities and overcome the fear of re-injury.

Differences in match participation between Year − 1 and Year 1 were the subject of clustering analysis. A silhouette score of 0.55 was found for the optimum number of clusters, which denotes moderate cluster separability. In addition, good clustering stability was found through the Adjusted Rand Index and Fowlkes–Mallows scores, meaning that these clusters were relatively consistent, even when only subsections of the dataset were randomly evaluated.

Younger age has been previously recognized as a favourable prognostic factor after ATRs in soccer players [10]. However, this point is controversial since other studies have not found statistically significant differences regarding age in players with favourable versus unfavourable outcomes in soccer [34], American football [24], basketball [15], and baseball [26]. In this study, the average age was lower in clusters C and D (maintenance or improvement of match participation) than clusters A and B (decreased match participation).

The number of days the player has been with the team at the time of injury is a previously unrecognized prognostic factor in ATRs. In this study, it was found that players in Cluster A were with the team for a significantly longer time (1060.1 ± 1287.6 days) compared with the remaining cohort (p < 0.01). The longer time with the team (or since the last market transfer) may signal a different career context for these players. For example, their contracts may be near expiration, and prospects of joining another team are dim. Coincidentally, players in this cluster also retired within 2 years in a statistically significant higher proportion than the remaining cohort (26.5% versus 5.7%; p < 0.01).

Players in cluster D took a significantly longer time before playing their first official match compared with the remaining cohort (315 days ± 119 versus 264 days ± 159; p < 0.05), despite similar time intervals from injury to unrestricted practice (207 days ± 55 versus 222 days ± 140; p = 0.72). Therefore, it can be speculated that by allowing these players more time to recover, they made their comeback at a higher performance level—closer to the full recovery potential—which would be perceived as a superior recovery from injury, encouraging increased match participation. In addition, players in Cluster A (those with the most significant decrease in average minutes played per match in Year 1 compared to Year − 1) showed the shortest time until first match played. However, statistical correlations between days until unrestricted practice or first match played and the ΔMPM were small (albeit statistically significant). Further research is required to determine how a delayed return to competition may relate to improved outcomes after ATRs.

A ML classifier was trained, with an AUROC of 0.81 ± 0.10 after cross-validation, through careful feature engineering and selection, which translates as good discriminating performance [23]. The model’s performance was also evaluated regarding output probabilities using the Brier score loss, as it was deemed helpful for players and staff to gauge these against their individual beliefs and experiences. It should be noted that only pre-injury features were used to train the model, and no data regarding treatments was available. Of note, since features related to pre-injury match participation showed the highest feature importance, it can be inferred that the future level of match participation is related to the sporting context at the time of injury, directly or indirectly (e.g. a tendency for early RTP in high-performing players which may reflect negatively in match participation afterward).

The use of ML algorithms to predict sports injuries is a current trend in research [14, 17, 31], but practitioners should remain cautious regarding their use despite recent advances. There are ethical implications to consider [5], such as inadvertently hindering a player’s career through a wrongfully attributed worse prognosis. Model results may also be overly optimistic, either due to overfitting (when the model is fitted too close to a particular set of data and becomes unable to make good predictions in a generalized environment) or accidental data leakage (when information contained in the test set is wrongfully fed to the model during training). Nevertheless, the increasing accessibility and ease of use of ML tools and development frameworks offer an excellent opportunity to improve the care of musculoskeletal injuries, though researchers and clinicians should stay vigilant about its shortcomings.

The main limitation of this study is the inability to confirm the diagnosis. However, all included cases were manually double-checked using other sources by two researchers independently to avoid the inclusion of misclassified injuries. Other limitations are the unknown measurement accuracy of match participation data found on transfermarkt.com, the unavailability of treatment data, and the lack of a strictu sensu measure of player performance.

This study can guide the objectives and expectations of athletes and staff regarding how match participation evolves after an ATR, noting that it takes approximately 1 year to reach its peak (except for goalkeepers, who may keep improving for at least 2 years). In addition, the cluster of players with improved match performance showed a statistically significant increase in the number of days until first match played compared to the remaining cohort. Also, a small but statistically significant positive correlation was found between time until first match played and the ΔMPM. Finally, recent research has shown improved outcomes in patients undergoing slowed-down rehabilitation programs [18]. Thus, it may make sense to prioritize recovery of lower limb strength and sport-specific skills over an early return to competition.

Conclusion

Exploratory data analysis revealed that forwards, midfielders and defenders increased match participation during the first year after injury, with goalkeepers still improving at 2 years. Good performance was attained using a ML classifier to predict the level of match participation following an ATR, with features related to pre-injury match participation displaying the highest importance.

Abbreviations

AT:: Achilles tendon
ATR:: Achilles tendon rupture
AUROC:: Area under the receiver operating characteristic curve
ML:: Machine learning
RTP:: Return to play
ΔMPM:: Difference between average minutes played per match during Year 1 and Year − 1

References

Badillo S, Banfai B, Birzele F, Davydov II, Hutchinson L, Kam-Thong T, Siebourg-Polster J, Steiert B, Zhang JD (2020) An introduction to machine learning. Clin Pharmacol Ther 107:871–885
Article PubMed PubMed Central Google Scholar
Bisong E (2019) Building machine learning and deep learning models on google cloud platform: a comprehensive guide for beginners. Apress, Ottawa
Book Google Scholar
Carmont MR, Knutsson SB, Brorsson A, Karlsson J, Nilsson-Helander K (2022) The release of adhesions improves outcome following minimally invasive repair of Achilles tendon rupture. Knee Surg Sports Traumatol Arthrosc 30:1109–1117
Article PubMed Google Scholar
Carmont MR, Silbernagel KG, Edge A, Mei-Dan O, Karlsson J, Maffulli N (2013) Functional outcome of percutaneous achilles repair: improvements in achilles tendon total rupture score during the first year. Orthop J Sports Med. https://doi.org/10.1177/2325967113494584
Article PubMed PubMed Central Google Scholar
Char DS, Abràmoff MD, Feudtner C (2020) Identifying ethical considerations for machine learning healthcare applications. Am J Bioeth 20:7–17
Article PubMed PubMed Central Google Scholar
Chen T, Guestrin C (2016) XGBoost: a scalable tree boosting system. In: Krishnapuram B, Shah M (eds) KDD '16: Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, San Francisco, August 2016. Association for Computing Machinery, New York, pp 785–794
Choi RY, Coyner AS, Kalpathy-Cramer J, Chiang MF, Campbell JP (2020) Introduction to machine learning, neural networks, and deep learning. Transl Vis Sci Technol. https://doi.org/10.1167/tvst.9.2.14
Article PubMed PubMed Central Google Scholar
Della Villa F, Hägglund M, Della Villa S, Ekstrand J, Waldén M (2021) High rate of second ACL injury following ACL reconstruction in male professional footballers: an updated longitudinal analysis from 118 players in the UEFA Elite Club Injury Study. Br J Sports Med 55:1350–1357
Article PubMed Google Scholar
Diniz P, Pacheco J, Guerra-Pinto F, Pereira H, Ferreira FC, Kerkhoffs G (2020) Achilles tendon elongation after acute rupture: is it a problem? A systematic review. Knee Surg Sports Traumatol Arthrosc 28:4011–4030
Article PubMed Google Scholar
Grassi A, Macchiarola L, Filippini M, Lucidi GA, Della Villa F, Zaffagnini S (2020) Epidemiology of anterior cruciate ligament injury in italian first division soccer players. Sports Health 12:279–288
Article PubMed Google Scholar
Grassi A, Rossi G, D’Hooghe P, Aujla R, Mosca M, Samuelsson K, Zaffagnini S (2020) Eighty-two per cent of male professional football (soccer) players return to play at the previous level two seasons after Achilles tendon rupture treated with surgical repair. Br J Sports Med 54:480–486
Article PubMed Google Scholar
Hinterwimmer F, Lazic I, Langer S, Suren C, Charitou F, Hirschmann MT, Matziolis G, Seidl F, Pohlig F, Rueckert D, Burgkart R, von Eisenhart-Rothe R (2022) Prediction of complications and surgery duration in primary TKA with high accuracy using machine learning with arthroplasty-specific data. Knee Surg Sports Traumatol Arthrosc. https://doi.org/10.1007/s00167-022-06957-w
Article PubMed PubMed Central Google Scholar
Jack RA, Sochacki KR, Gardner SS, McCulloch PC, Lintner DM, Cosculluela PE, Varner KE, Harris JD (2017) Performance and return to sport after Achilles tendon repair in national football league players. Foot Ankle Int 38:1092–1099
Article PubMed Google Scholar
Karnuta JM, Luu BC, Haeberle HS, Saluan PM, Frangiamore SJ, Stearns KL, Farrow LD, Nwachukwu BU, Verma NN, Makhni EC, Schickendantz MS, Ramkumar PN (2020) Machine learning outperforms regression analysis to predict next-season major league baseball player injuries: epidemiology and validation of 13,982 player-years from performance and injury profile trends, 2000–2017. Orthop J Sports Med. https://doi.org/10.1177/2325967120963046
Article PubMed PubMed Central Google Scholar
Lemme NJ, Li NY, Kleiner JE, Tan S, DeFroda SF, Owens BD (2019) Epidemiology and video analysis of Achilles tendon ruptures in the National Basketball Association. Am J Sports Med 47:2360–2366
Article PubMed Google Scholar
Leventer L, Eek F, Hofstetter S, Lames M (2016) Injury patterns among elite football players: a media-based analysis over 6 seasons with emphasis on playing position. Int J Sports Med 37:898–908
Article CAS PubMed Google Scholar
Lövdal SS, Den Hartigh RJR, Azzopardi G (2021) Injury prediction in competitive runners with machine learning. Int J Sports Physiol Perform. https://doi.org/10.1123/ijspp.2020-0518
Article PubMed Google Scholar
Maffulli N, Gougoulias N, Maffulli GD, Oliva F, Migliorini F (2022) Slowed-down rehabilitation following percutaneous repair of Achilles tendon rupture. Foot Ankle Int 43:244–252
Article PubMed Google Scholar
Marcot BG, Hanea AM (2021) What is an optimal value of k in k-fold cross-validation in discrete Bayesian network analysis? Comput Stat 36:2009–2031
Article Google Scholar
Martin RK, Ley C, Pareek A, Groll A, Tischer T, Seil R (2022) Artificial intelligence and machine learning: an introduction for orthopaedic surgeons. Knee Surg Sports Traumatol Arthrosc 30:361–364
Article PubMed Google Scholar
Mazza D, Viglietta E, Monaco E, Iorio R, Marzilli F, Princi G, Massafra C, Ferretti A (2022) Impact of anterior cruciate ligament injury on European professional soccer players. Orthop J Sports Med. https://doi.org/10.1177/23259671221076865
Article PubMed PubMed Central Google Scholar
Nigam H, Biswas P (2021) From web scraping to web crawling. In: Choudhary A, Agrawal AP, Logeswaran R, Unhelkar B (eds) Applications of artificial intelligence and machine learning. Lecture notes in electrical engineering, vol 778. Springer, Singapore, pp 97–112
Olsson S, Akbarian E, Lind A, Razavian AS, Gordon M (2021) Automating classification of osteoarthritis according to Kellgren–Lawrence in the knee using deep learning in an unfiltered adult population. BMC Musculoskelet Disord. https://doi.org/10.1186/s12891-021-04722-7
Article PubMed PubMed Central Google Scholar
Parekh SG, Wray WH, Brimmo O, Sennett BJ, Wapner KL (2009) Epidemiology and outcomes of Achilles tendon ruptures in the National Football League. Foot Ankle Spec 2:283–286
Article PubMed Google Scholar
Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, Blondel M, Prettenhofer P, Weiss R, Dubourg V, Vanderplas J, Passos A, Cournapeau D, Brucher M, Perrot M, Duchesnay É (2011) Scikit-learn: machine learning in python. J Mach Learn Res 12:2825–2830
Google Scholar
Saltzman BM, Tetreault MW, Bohl DD, Tetreault D, Lee S, Bach BR (2017) Analysis of player statistics in major league baseball players before and after Achilles tendon repair. HSS J 13:108–118
Article PubMed PubMed Central Google Scholar
Shahapure KR, Nicholas C (2020) Cluster quality analysis using Silhouette Score. In: Geoffrey IW, Zhongfei Z, Vincent TS, Graham W, Michalis V, Longbing C (eds) IEEE DSAA’2020: The 7th IEEE international conference on data science and advanced analytics, Sydney, October 2020. Institute of Electrical and Electronics Engineers (IEEE), New York, pp 747–748
Google Scholar
Shamshirband S, Fathi M, Dehzangi A, Chronopoulos AT, Alinejad-Rokny H (2020) A review on deep learning approaches in healthcare systems: taxonomies, challenges, and open issues. J Biomed Inform. https://doi.org/10.1016/j.jbi.2020.103627
Article PubMed Google Scholar
Slagers AJ, van Veen E, Zwerver J, Geertzen JHB, Reininga IHF, van den Akker-Scheek I (2021) Psychological factors during rehabilitation of patients with Achilles or patellar tendinopathy: a cross-sectional study. Phys Ther Sport 50:145–152
Article PubMed Google Scholar
Sochacki KR, Jack RA II, Hirase T, McCulloch PC, Lintner DM, Varner KE, Cosculluela PE, Harris JD (2019) There is a high return to sport rate but with reduced career lengths after Achilles tendon repair in Major League Soccer players. J ISAKOS 4:15–20
Article Google Scholar
Taborri J, Molinaro L, Santospagnuolo A, Vetrano M, Vulpiani MC, Rossi S (2021) A machine-learning approach to measure the anterior cruciate ligament injury risk in female basketball players. Sensors (Basel). https://doi.org/10.3390/s21093141
Article Google Scholar
Transfermarkt.com (2021) Transfermarkt.com: Football Transfers, Rumours, Market Values, News and Statistics. http://www.transfermarkt.com. Accessed 31 Mar 2021
Trofa DP, Miller JC, Jang ES, Woode DR, Greisberg JK, Vosseller JT (2017) Professional athletes’ return to play and performance after operative repair of an Achilles tendon rupture. Am J Sports Med 45:2864–2871
Article PubMed Google Scholar
Trofa DP, Noback PC, Caldwell J-ME, Miller JC, Greisberg JK, Ahmad CS, Vosseller JT (2018) Professional soccer players’ return to play and performance after operative repair of Achilles tendon rupture. Orthop J Sports Med. https://doi.org/10.1177/2325967118810772
Article PubMed PubMed Central Google Scholar
Turner J, Malliaras P, Goulis J, Mc Auliffe S (2020) “It’s disappointing and it’s pretty frustrating, because it feels like it’s something that will never go away.” A qualitative study exploring individuals’ beliefs and experiences of Achilles tendinopathy. PLoS One. https://doi.org/10.1371/journal.pone.0233459
Article PubMed PubMed Central Google Scholar
Wilke J, Tenberg S, Groneberg D (2022) Prognostic factors of muscle injury in elite football players: a media-based, retrospective 5-year analysis. Phys Ther Sport 55:305–308
Article PubMed Google Scholar

Download references

Acknowledgements

This work is financed by national funds from FCT—Fundação para a Ciência e a Tecnologia, I.P., in the scope of the Project UIDB/04565/2020 and UIDP/04565/2020 of the Research Unit Institute for Bioengineering and Biosciences—iBB and the project LA/P/0140/2020 of the Associate Laboratory Institute for Health and Bioeconomy—i4HB.

Funding

None.

Author information

Authors and Affiliations

Department of Orthopaedic Surgery, Hospital de Sant’Ana, Rua de Benguela, 501, 2775-028, Parede, Portugal
Pedro Diniz, Diogo Lacerda & António Martins
Department of Bioengineering and iBB, Institute for Bioengineering and Biosciences, Instituto Superior Técnico, Universidade de Lisboa, Lisbon, Portugal
Pedro Diniz, Mariana Abreu, Frederico Castelo Ferreira & Ana Fred
Associate Laboratory i4HB, Institute for Health and Bioeconomy, Instituto Superior Técnico, Universidade de Lisboa, Lisbon, Portugal
Pedro Diniz & Frederico Castelo Ferreira
Fisiogaspar, Lisbon, Portugal
Pedro Diniz & António Martins
Instituto de Telecomunicações, Lisbon, Portugal
Mariana Abreu & Ana Fred
Orthopaedic Department, Centro Hospitalar Póvoa de Varzim, Vila do Conde, Portugal
Hélder Pereira
Ripoll y De Prado Sports Clinic: FIFA Medical Centre of Excellence, Murcia-Madrid, Spain
Hélder Pereira
University of Minho ICVS/3B’s-PT Government Associate Laboratory, Braga/Guimarães, Portugal
Hélder Pereira
Department of Orthopaedic Surgery, Amsterdam Movement Sciences, Amsterdam University Medical Centers, Amsterdam, The Netherlands
Gino MMJ Kerkhoffs
Academic Center for Evidence Based Sports Medicine (ACES), Amsterdam, The Netherlands
Gino MMJ Kerkhoffs
Amsterdam Collaboration for Health and Safety in Sports (ACHSS), Amsterdam, The Netherlands
Gino MMJ Kerkhoffs

Authors

Pedro Diniz
View author publications
You can also search for this author in PubMed Google Scholar
Mariana Abreu
View author publications
You can also search for this author in PubMed Google Scholar
Diogo Lacerda
View author publications
You can also search for this author in PubMed Google Scholar
António Martins
View author publications
You can also search for this author in PubMed Google Scholar
Hélder Pereira
View author publications
You can also search for this author in PubMed Google Scholar
Frederico Castelo Ferreira
View author publications
You can also search for this author in PubMed Google Scholar
Gino MMJ Kerkhoffs
View author publications
You can also search for this author in PubMed Google Scholar
Ana Fred
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

PD, FCF, GK, and AF designed the study; PD wrote the computer code to perform data extraction from a publicly available source; PD and DL screened and selected athletes for inclusion in the study; PD prepared the data, performed exploratory data analysis, and developed the predictive model with support from MA and AF; PD drafted the manuscript with input from MA, AM, and HP; FCF, GK and AF revised the final manuscript.

Corresponding author

Correspondence to Pedro Diniz.

Ethics declarations

Conflict of interest

The authors declare that they have no competing interests.

Ethical approval

Not applicable.

Informed consent

Not applicable.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Diniz, P., Abreu, M., Lacerda, D. et al. Pre-injury performance is most important for predicting the level of match participation after Achilles tendon ruptures in elite soccer players: a study using a machine learning classifier. Knee Surg Sports Traumatol Arthrosc 30, 4225–4237 (2022). https://doi.org/10.1007/s00167-022-07082-4

Download citation

Received: 11 April 2022
Accepted: 18 July 2022
Published: 09 August 2022
Issue Date: December 2022
DOI: https://doi.org/10.1007/s00167-022-07082-4

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Pre-injury performance is most important for predicting the level of match participation after Achilles tendon ruptures in elite soccer players: a study using a machine learning classifier

Abstract

Purpose

Methods

Results

Conclusion

Level of evidence

Similar content being viewed by others

The Mental Health of Elite Athletes: A Narrative Systematic Review

The Biomechanics of the Track and Field Sprint Start: A Narrative Review

The Importance of Muscular Strength in Athletic Performance

Introduction

Materials and methods

Player screening and selection

Data extraction and dataset handling

Dealing with missing data

Feature engineering

Machine learning model development and calibration

Statistical analysis

Results

Player demographics and baseline characteristics

Return to competition and career changes

Re-ruptures and other Achilles tendon issues

Exploratory analysis of match participation data

Clustering analysis

Prediction of post-injury match participation

Discussion

Conclusion

Abbreviations

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Ethical approval

Informed consent

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation