Predicting anterior cruciate ligament failure load with T2* relaxometry and machine learning as a prospective imaging biomarker for revision surgery

Flannery, Sean W.; Beveridge, Jillian E.; Proffen, Benedikt L.; Walsh, Edward G.; Kramer, Dennis E.; Murray, Martha M.; Kiapour, Ata M.; Fleming, Braden C.

doi:10.1038/s41598-023-30637-5

Predicting anterior cruciate ligament failure load with T₂* relaxometry and machine learning as a prospective imaging biomarker for revision surgery

Article
Open access
Published: 02 March 2023

Volume 13, article number 3524, (2023)
Cite this article

Download PDF

You have full access to this open access article

Scientific Reports

Predicting anterior cruciate ligament failure load with T₂* relaxometry and machine learning as a prospective imaging biomarker for revision surgery

Download PDF

Sean W. Flannery¹,
Jillian E. Beveridge¹,
Benedikt L. Proffen² &
Edward G. Walsh³ on behalf of
BEAR Trial Team,
Dennis E. Kramer²,
Martha M. Murray²,
Ata M. Kiapour² &
…
Braden C. Fleming¹

1092 Accesses
2 Citations
4 Altmetric
Explore all metrics

Abstract

Non-invasive methods to document healing anterior cruciate ligament (ACL) structural properties could potentially identify patients at risk for revision surgery. The objective was to evaluate machine learning models to predict ACL failure load from magnetic resonance images (MRI) and to determine if those predictions were related to revision surgery incidence. It was hypothesized that the optimal model would demonstrate a lower mean absolute error (MAE) than the benchmark linear regression model, and that patients with a lower estimated failure load would have higher revision incidence 2 years post-surgery. Support vector machine, random forest, AdaBoost, XGBoost, and linear regression models were trained using MRI T₂* relaxometry and ACL tensile testing data from minipigs (n = 65). The lowest MAE model was used to estimate ACL failure load for surgical patients at 9 months post-surgery (n = 46) and dichotomized into low and high score groups via Youden’s J statistic to compare revision incidence. Significance was set at alpha = 0.05. The random forest model decreased the failure load MAE by 55% (Wilcoxon signed-rank test: p = 0.01) versus the benchmark. The low score group had a higher revision incidence (21% vs. 5%; Chi-square test: p = 0.09). ACL structural property estimates via MRI may provide a biomarker for clinical decision making.

Application of machine learning-based multi-sequence MRI radiomics in diagnosing anterior cruciate ligament tears

Article Open access 31 January 2024

Risk factors for secondary meniscus tears can be accurately predicted through machine learning, creating a resource for patient education and intervention

Article 16 August 2022

Machine learning algorithm to predict anterior cruciate ligament revision demonstrates external validity

Article Open access 01 January 2022

Introduction

Anterior cruciate ligament (ACL) injuries affect over 120,000 people in the United States each year and are especially common in adolescents¹. Improved treatment options are needed as approximately one-third of adolescent patients will experience a graft rupture or contralateral ACL injury within 2 years post-surgery^2,3,4,5,6. Furthermore, ACL reconstruction (the current gold standard) does not significantly reduce the risk of posttraumatic osteoarthritis (PTOA) relative to non-operative management^7,8,9,10. At 20 years post-surgery, approximately 52% of ACL injured patients show radiographic signs of PTOA despite surgical treatment⁸. Successive surgeries, including revision, exacerbate PTOA risk¹¹.

The causal factors for these poor outcomes are a subject of debate. Risk for revision surgery has been associated with graft choice¹², age¹³, and incompletely restored joint kinematics^14,15,16, and neuromuscular control^14,15,16. A noninvasive method to evaluate the structural properties of the healing ACL or ACL graft, such as quantitative magnetic resonance imaging (qMRI), could provide insight into the mechanisms mediating reinjury and PTOA risk, guide clinical decision making, and aid in the development of improved surgical interventions (e.g., ACL restoration surgery) and rehabilitation strategies¹⁷.

Previously, measures obtained using qMRI (e.g., T₂* relaxation time, signal intensity, volume, volume of organized/disorganized collagen, etc.) have been shown to relate directly to the structural properties of the healing ACL or ACL graft^18,19,20, have been used to detect early soft tissue degeneration^21,22, and have been shown to correlate with contemporaneous clinical, functional, and patient-reported outcomes^{23,24,25,26,27,28}. Most recently, these qMRI measures were also shown to be associated with prospective functional outcomes following ACL restoration surgery²⁹. The catastrophic clinical consequences of ACL or graft failure have driven the development of methods to predict structural properties of the healing tissue from qMRI. While linear modeling methods represent the conventional approach^18,20,30,31, preliminary evidence suggests that machine learning techniques could provide substantial improvement to predictive performance relative to existing models as these methods have similarly shown promise in other magnetic resonance-based applications^32,33.

The goals of this study were: (1) to train candidate machine learning models to predict post-mortem ACL failure loads based on in vivo T₂* MR images from porcine subjects following ACL repair for which ground truth tensile failure load data were available and evaluate them relative to a benchmark linear model; and (2) to apply the best performing machine learning model to 9 month MRIs of a cohort of patients who were 2 years out from bridge-enhanced ACL restoration (BEAR) surgery and demonstrate the clinical relevance of the failure load estimates by comparing them to the incidence of revision surgery in this cohort. It was hypothesized that: (1) at least one of the machine learning models would demonstrate lower mean absolute error (MAE) on the porcine failure load data than the benchmark linear model (Aim 1); and (2) human patients with a lower estimated failure load using the machine learning model with the lowest MAE would have a higher incidence of revision surgery within 2 years post-surgery (Aim 2).

Results

Model development

The top 3 features for the machine learning models were limb type (surgical or contralateral intact), first quartile of T₂* relaxation time, and sub-volume proportion 4 (Fig. 1). These features were determined by RFE-CV and included in the subsequent machine learning models.

The benchmark linear regression model, which included limb type, CSA, and sub-volume proportions 1 and 4 as features, demonstrated the highest mean absolute error of all the models evaluated (Table 1). The lowest error was found using the random forest regression model. On the test set, the random forest model demonstrated a 55% (95% CI 44–78%) decrease in mean absolute error relative to the baseline linear regression model (p = 0.01; Fig. 2).

Table 1 Test set mean absolute error (MAE) and 95% confidence interval (CI) of the benchmark linear model (LM) and selected machine learning models (SVM support vector machine, RF random forest).

Full size table

Clinical pilot assessment of model performance in predicting reinjury risk

The random forest model was then applied to the initial BEAR III patient cohort data. Based on the Youden’s J statistic, the optimal score threshold for separating the low and high failure load groups was 0.81 (Fig. 3a). At this threshold, the false positive rate was 0.48, and the true positive rate was 0.83 (Fig. 3b). For patients in the high failure load group, the incidence of revision surgery was 4.5%. For the patients in the low failure load group, the incidence was 20.8%; a 362% increase in risk for a revision surgery (likelihood ratio Chi-square test p = 0.09).

In contrast, the baseline linear model had an optimal dichotomization threshold of 0.33 as determined by Youden’s J statistic. At this threshold, the false positive rate was 0.45, and the true positive rate was 1.0 (Supplementary Table 1). For the patients in the high failure load group, the incidence of revision was 0%, and for patients in the low failure load group, the incidence was 25% (likelihood ratio Chi-square test p = 0.01).

Discussion

All the machine learning models evaluated in this study showed decreased mean absolute errors relative to the benchmark linear regression model by 55% for predicting the tensile ACL failure load from T₂* relaxometry data when applied to the porcine test data set (Aim 1). Of the machine learning models, the random forest regression model showed the lowest mean absolute error and was therefore used in the clinical assessment (Aim 2). When applying the random forest model to the 9-month clinical imaging data, the predicted failure loads obtained showed potential for determining which patients subsequently required revision surgery by two years as the failure rate for the high failure load group was less than 5% and that of the low failure load group was 21%.

When selecting features to include in the machine learning models, limb type (surgical or contralateral intact) was the most important feature, followed by Q1 T₂* and sub-volume proportion 4. It was not surprising that limb type scored highly on feature importance given that feature importance tends to inflate the importance of dichotomous variables³⁴. Nevertheless, limb type was considered an important variable to include in the analysis, independent of the feature importance results. Q1 T₂* and sub-volume proportion 4 may be considered important because they relate to the most and least organized tissue within the healing ligament, respectively³⁰. It has previously been shown that the quantities of the most and least organized tissue within the ACL are drivers of its structural properties³⁰. Two of the features used in the baseline linear regression model were also found to be in the top 3 for feature importance (limb type and sub-volume proportion 4). Another variable used in the linear regression model, CSA, was found to be the fourth most important feature.

The best machine learning model was a random forest regression model that used limb type, Q1 T₂*, and sub-volume proportion 4 as features. The estimated failure load predictions using the random forest approach showed a large decrease in mean absolute error (55%) in the porcine test set data compared to the benchmark linear regression that was statistically significant (p = 0.01). It is likely that this decrease is clinically meaningful, as the random forest model reduced the estimated failure load error from 31% (for the linear regression model) to 10% of the mean true failure load. Furthermore, when the model predictions were plotted versus the true failure loads, it was observed that the benchmark linear regression model overestimated low failure loads on the porcine test set, while it predicted similar values at high failure loads. Overestimating low failure loads could potentially be a limiting factor for the clinical translation of the linear model because patients with low failure loads would theoretically be the most vulnerable to reinjury. In contrast, the random forest model was more consistent across the failure load distribution.

When making estimated failure load predictions on the BEAR III patients using T₂* relaxometry (Aim 2), a large increase in revision rate was observed for patients in the low failure load score group compared to patients in the high failure load score group. In the high failure load score group, 1 out of 22 patients required revision surgery. In the low failure load score group, 5 out of 24 patients required revision surgery. Due to the small sample size of the clinical cohort, this difference was not found to be statistically significant, though suggests a trend (p < 0.09). These findings are similar to that of the linear model, in which 0 out of 22 high failure load score patients and 6 out of 24 low failure load score patients required revision. Given that this difference was dependent on the classification of a single patient, and that the overall incidence of revision was low, more data are needed to further assess the clinical utility of the machine learning model.

Previously, we developed models predicting porcine ACL structural properties, specifically the maximum failure load, yield load, and stiffness of the ACL^18,20,30,31. However, translating these structural property prediction models to human patients and establishing their clinical relevance was necessary. In one of our studies, we showed that estimated failure load from Constructive Interference in Steady State scans (using a different prediction model) was significantly associated with functional outcomes for patients after the ACL restoration procedure²⁹. To our knowledge, the present analysis is the first to show that the T₂* relaxometry-based structural property predictions are associated with the incidence of revision surgery and is a critical step towards translating the large body of preclinical work that we have pursued. To this end, these findings provide proof-of-concept evidence that estimating ACL failure load post-surgery using a random forest machine learning model to analyze T₂* relaxometry images may serve as a clinically relevant biomarker to prospectively evaluate ACL surgery revision risk. This biomarker could potentially be used in the clinic to inform return-to-sport decisions and to tailor rehabilitation strategies. Future studies could be designed to evaluate the use of these failure load estimates to alter the course of healing in patients with low predicted failure loads. In research settings, noninvasively measuring the structural properties of the ACL using T₂* relaxometry would also be a valuable tool for many research applications, for example, assessing new surgical interventions against the gold standard treatments.

There are a few study limitations to consider. First, model performance has yet to be assessed on ACL reconstructed patients. ACL reconstruction is a more common procedure, and application of this imaging approach to healing ACL grafts will be pursued in future work. As previously noted, feature importance is known to inflate the importance of dichotomous variables, such as limb type. However, for the purposes of feature selection, the magnitudes of relative feature importance values were less important than the ranking. Furthermore, limb type would be considered an important feature to include regardless of feature importance value due to differences between the healthy and repaired ligament. In addition, model specific feature importance may vary to some extent from model to model. For this reason, the main purpose of the RFE-CV procedure was to serve as a dimension reduction technique to reduce the risk of overfitting the machine learning models. For obvious ethical reasons, in vivo ground truth ACL failure load data were not available for human patients. As a result, a direct validation of the model failure load predictions could not be performed on patients, and thus were derived from porcine data, which were then normalized and scaled for human use. Therefore, the clinical impact of the estimated failure load scores was assessed by relating model predictions of failure load at 9 months (the time at which patients were permitted to return-to-sport^35,36) to revision surgery rates by 2 years. Furthermore, by converting the failure load prediction to a 0–1 score, the metric was normalized to the range of possible estimated failure load values, reducing the importance of the actual absolute value. This approach also has the benefit of providing a more easily interpretable metric for clinical use. Another limitation was that the porcine data used for the initial model training were acquired with a 6-channel coil, rather than the 15-channel coil used for the human data. Because the 6-channel coil has a lower signal-to-noise ratio than the 15-channel coil, its use would potentially make the machine learning prediction model more robust by acting as a form of noise augmentation^37,38. Finally, the overall incidence of revision in the BEAR III cohort study was low (6 out of 46), which may explain the relatively high false positive rate. The incidence of revision surgery was chosen as a benchmark for comparison in the human study because it is a discrete outcome and clinically highly relevant. However, there are other clinically relevant outcomes that the model should be compared to, such as incidence of PTOA. This could be assessed as long-term follow up data become available from the clinical trial analyzed in the present study. The sample sizes for the animal and clinical studies were based on samples of convenience as the current analysis leveraged existing data sets. Nonetheless, the sample size was large enough to show that the predicted failure load score from T₂* images may be related to risk of reinjury and provides proof-of-concept evidence that it could serve as a biomarker to predict ACL reinjury. Future studies involving a larger independent test cohort are needed to fully validate the method.

In conclusion, the best machine learning model (random forest regression) predicted the ACL failure load in porcine subjects with lower error and lower bias than the benchmark linear regression model. The application of the random forest regression model to human T₂* data at 9 months post-surgery shows a potential relationship with the incidence of revision surgery out to 2 years, where patients with a higher estimated failure load score showed a decreased incidence of revision surgery, though more data are needed to support these clinical pilot results. Estimating structural properties of the ACL post-surgery may therefore be a relevant biomarker for both clinical and research applications.

Methods

Model development

Porcine data

Data were acquired from two previous studies of ACL transection followed by surgical repair with the BEAR implant in Yucatan minipigs (Fig. 4A)^30,31. These studies were performed to meet the ARRIVE guidelines and were IACUC approved. In the first study, paired MRI and tensile failure load data of the ACL repaired limb and contralateral uninjured limb were available at 12 (n = 13) and 24 weeks post-surgery (n = 16)^30,31. In the second study, the same data were available at 24 weeks post-surgery (n = 36)^31,39. In total, paired MRI and tensile failure load data were available for 65 minipigs.

For the pig studies, MRI T₂* relaxometry scans were acquired using a 3 T magnet (PRISMA, Siemens, Erlangen, Germany) with a 6-channel flexible coil (Flexcoil; Siemens), and a gradient echo 4-echo sequence (FA = 12°; TR = 29 ms; TE₁ = 2.8 ms, TE₂ = 7.9 ms, TE₃ = 13 ms, TE₄ = 18 ms; FOV = 160 mm; 512 × 512 acquisition matrix, voxel size of 0.3125 mm × 0.3125 mm × 0.8 mm)^30,40. Both the surgical and contralateral knees were imaged. Following in vivo imaging of both knees, the animals were euthanized, and the hind limbs were harvested and frozen at − 20 °C until mechanical testing^30,31. The porcine data were randomly split and stratified by subject into an 80% training (n = 52) set and a 20% test (n = 13) set. Given the small sample size, a cross validation within the training set was performed as it is more robust than an independent validation set³⁸.

MR image processing

The ACL was manually segmented from the image volume by one segmenter with 4 years of experience (SWF) using commercial imaging software (Mimics Research 19.0; Materialise, Leuven, Belgium). Voxel-wise T₂* relaxation times were calculated by fitting an exponential decay curve to the per echo signal intensities⁴¹. Based on our previous studies, the following MR variables were extracted for potential inclusion in the prediction models: median T₂*, mean T₂*, standard deviation T₂*, skew T₂*, first quartile (Q1) T₂*, third quartile (Q3) T₂*, average cross-sectional area (CSA), and sub-volume proportions (Prop 1–4)^18,30. Average CSA was estimated as ligament volume divided by ligament length. Four sub-volume proportions were identified by binning voxels by relaxation time (0–12.5 ms, 12.5–25 ms, 25 ms, 25–37.5 ms, 37.5–50 ms) and dividing by the total ligament volume. All these variables were selected because they are either ligament size-invariant or ligament size-normalized. It was expected that these variables would scale more readily from pigs to humans. All variables were standardized by z-scoring prior to use by the machine learning models.

Biomechanical testing

The porcine hind limbs were thawed to room temperature prior to biomechanical testing. The joints were dissected down to the femur-ACL-tibia complex, then the femur and tibia were potted in PVC pipes and secured with a urethane resin (SmoothCast; Smooth-On, Macungie, PA). Each sample was mounted for mechanical testing in a servo-hydraulic material testing system (MTS 810, Eden Prairie, MN) such that the long axis of the ACL was aligned with the tensile load vector. Tensile loading was then applied at a rate of 20 mm/min until failure³¹. Failure load, yield load, and stiffness were calculated for each specimen as previously described³⁰. Given the high correlation between these three variables^18,20,30, only failure load was evaluated in the present study (Fig. 5).

Machine learning model development

Multiple machine learning regression models were developed and compared to predict the ACL failure load: support vector machine (SVM), random forest (RF), AdaBoost, and XGBoost (Table 2)³⁸. A multivariate linear regression model (LM) was trained as the performance benchmark. Model selection was influenced, in part, by the sample size available for training and evaluation³⁸. The SVM regressor was trained by fitting a hyperplane to the data that maximized instances between the margins and minimized violations outside of the margins. Random forest, AdaBoost, and XGBoost were variations of decision tree ensembles. The random forest model was a parallel ensemble of decision trees trained on random samples of data selected with replacement (i.e., bagging)³⁸. AdaBoost and XGBoost both combined multiple weak models sequentially to create a single strong learner (i.e., boosting)³⁸. During this process, more weight was placed on difficult instances so that subsequent learners improved predictions on those instances. XGBoost also added regularization (L1, L2, pruning) to reduce the risk of overfitting³⁸.

Table 2 Initial hyperparameter search space (SVM support vector machine, RF random forest, RBF radial basis function, MSE mean squared error, DART dropouts meet multiple additive regression trees).

Full size table

Feature selection

In addition to the MR imaging variables extracted, limb type (surgical or contralateral intact) was also included as a potential variable. The benchmark linear model used sub-volume proportions 1 and 4, CSA, and limb type as features. This linear model was an evolution of a previously published linear model relating T₂* relaxation times to the structural properties³⁰ but was based on size-normalized features in the current study. Features for the machine learning models were selected via recursive feature elimination with cross-validation (RFE-CV)^42,43. In the RFE-CV procedure, an estimator was initially fitted on the training set using all possible features and fivefold cross-validation. Cross-validation ensured that the feature importance estimation was robust to the data split used, by iteratively withholding a different subset of the data for validation³⁸. After cross-validation, the least important feature was eliminated, and the process was repeated with the minimum number of features at the finish set to 3. This threshold was selected because of the rapid decay in feature importance levels beyond 3 features.

Model optimization and selection

Each machine learning model was optimized by first performing a random search fivefold cross-validation (RS-CV) to narrow the possible hyperparameter space (Table 2). Next, a grid search cross-validation (GS-CV) was performed in the reduced hyperparameter space. This approach was more computationally efficient than performing GS-CV on the full hyperparameter space³⁸. After model optimization, final performance of each machine learning model was assessed by making predictions on the withheld test set. The best (lowest error) machine learning model was compared to the benchmark linear model using the mean absolute error values from these test set predictions. The statistical significance of the difference in mean absolute error was assessed with a Wilcoxon signed-rank test.

Clinical pilot assessment of model performance in predicting reinjury risk

Data set

MRI T₂* data were acquired from patients enrolled in the ongoing, IRB-approved BEAR III clinical trial (NCT03348995) and who were at least 2 years post-surgery (Fig. 4B)⁴⁴. The BEAR III trial was approved by the Institutional Review Boards of Boston Children’s Hospital and Rhode Island Hospital, and all subjects granted their informed consent. Eligible patients received ACL restoration surgery within 50 days of ACL injury using the BEAR implant (Boston Children’s Hospital, Boston MA), a scaffold composed of bovine-derived extracellular matrix proteins^45,46. The inclusion criteria were male and female patients, 12–80 years of age, a first-time complete mid-substance ACL tear or partial tear accompanied by a positive pivot shift. Patients were included if they had at least 5% of ACL length remaining on the tibia, as determined via MRI. Subjects who tore their contralateral ACL, re-tore their ipsilateral ACL, or had missing data at any timepoint were excluded from the present analysis. Two-year data were available for 49 subjects (Fig. 4B; 25 female, 24 male; mean age 21 [range 13–47] years)⁴⁴. Three patients missed their 9-month MRI exams, and no patients were lost to follow up, making 46 subjects available for the present analysis. During the 2-year post-surgery period, a total of 6 ACL injuries requiring revision surgery occurred.

MR imaging was performed 9 months post-surgery using either a 3 T Prisma (Siemens, Erlangen, Germany) or a 3 T Tim Trio (Siemens) scanner with a 15-channel transmit/receive coil (Siemens). On the Prisma (Brown University; n = 10), T₂* relaxometry scans were acquired with FA = 12°; TR = 29 ms; TE₁ = 2.5 ms; TE₂ = 6.9 ms; TE₃ = 11.2 ms; TE₄ = 15.6 ms; TE₅ = 20 ms; TE₆ = 24.4 ms; FOV = 160 mm; 384 × 384 acquisition matrix with voxel size 0.4167 mm × 0.4167 mm × 0.8 mm, acquisition time 11 min 57 s. On the Tim Trio (Boston Children’s Hospital Waltham, n = 36), T₂* relaxometry scans were acquired with FA = 12°; TR = 29 ms; TE₁ = 3.4 ms; TE₂ = 6.9 ms; TE₃ = 11.2 ms; TE₄ = 15.6 ms; TE₅ = 20 ms; TE₆ = 24.4 ms; FOV = 160 mm; 384 × 384 acquisition matrix with voxel size 0.4167 mm × 0.4167 mm × 0.8 mm, acquisition time 11 min 59 s. Due to the acquisition parameter differences between scanners imposed by the differing gradient hardware performance limits, all acquisitions were harmonized prior to analysis by z-scoring⁴⁷. Details regarding the MRI harmonization procedure have been previously described⁴⁷.

Clinical application of the optimal machine learning model

The machine learning model with the lowest MAE as determined in Aim 1 was used to estimate the failure load of the healing ACL at 9 months post-surgery for the patients treated with the BEAR implant. Prior to making predictions, each continuous feature in the MR data that was included in the model was standardized by z-scoring with the mean and standard deviation of the same feature from pigs used as the scaling reference. This standardization step was necessary to ensure the patient imaging data were on the same scale as the porcine data that the model was trained on.

Failure load predictions were converted to a score from 0–1 (Eq. 1; F_failure = estimated failure load of subject, F_min = lowest estimated failure load in dataset, F_max = highest estimated failure load dataset). A score threshold was objectively determined using the Youden’s J statistic (Eq. 2; TP = true positive, TN = true negative, FN = false negative, FP = false positive)⁴⁸. Patients with a predicted failure load score less than or equal to a score threshold were assigned to the low failure load group (i.e., at high risk for reinjury), and those that were greater than the score threshold assigned to the high failure load group (i.e., at lower risk for reinjury). For a dichotomous variable, the optimal threshold was located where the J statistic was maximized⁴⁸. The difference in revision rates within 2 years of surgery between the MRI predicted low and high failure load groups was then assessed with a likelihood ratio Chi-square test. The level of significance was set at alpha = 0.05 for all statistical tests.

$$Estimated \; Failure \; Load \; Score= \frac{{F}_{failure}-{F}_{min}}{{F}_{max}-{F}_{min}},$$

(1)

$$J= \frac{TP}{TP+FN}+\frac{TN}{TN+FP}-1.$$

(2)

Data availability

Please contact the corresponding author for data access.

References

Gornitzky, A. L. et al. Sport-specific yearly risk and incidence of anterior cruciate ligament tears in high school athletes: A systematic review and meta-analysis. Am. J. Sports Med. 44, 2716–2723. https://doi.org/10.1177/0363546515617742 (2016).
Article PubMed Google Scholar
Morgan, M. D., Salmon, L. J., Waller, A., Roe, J. P. & Pinczewski, L. A. Fifteen-year survival of endoscopic anterior cruciate ligament reconstruction in patients aged 18 years and younger. Am. J. Sports Med. 44, 384–392. https://doi.org/10.1177/0363546515623032 (2016).
Article PubMed Google Scholar
Salmon, L. J. et al. 20-year outcomes of anterior cruciate ligament reconstruction with hamstring tendon autograft: The catastrophic effect of age and posterior tibial slope. Am. J. Sports Med. 46, 531–543. https://doi.org/10.1177/0363546517741497 (2018).
Article PubMed Google Scholar
Webster, K. E. & Feller, J. A. Exploring the high reinjury rate in younger patients undergoing anterior cruciate ligament reconstruction. Am. J. Sports Med. 44, 2827–2832. https://doi.org/10.1177/0363546516651845 (2016).
Article PubMed Google Scholar
Paterno, M. V., Rauh, M. J., Schmitt, L. C., Ford, K. R. & Hewett, T. E. Incidence of second ACL injuries 2 years after primary ACL reconstruction and return to sport. Am. J. Sports Med. 42, 1567–1573. https://doi.org/10.1177/0363546514530088 (2014).
Article PubMed PubMed Central Google Scholar
Webster, K. E., Feller, J. A., Leigh, W. B. & Richmond, A. K. Younger patients are at increased risk for graft rupture and contralateral injury after anterior cruciate ligament reconstruction. Am. J. Sports Med. 42, 641–647. https://doi.org/10.1177/0363546513517540 (2014).
Article PubMed Google Scholar
Wellsandt, E., Failla, M. J., Axe, M. J. & Snyder-Mackler, L. Does anterior cruciate ligament reconstruction improve functional and radiographic outcomes over nonoperative management 5 years after injury?. Am. J. Sports Med. 46, 2103–2112. https://doi.org/10.1177/0363546518782698 (2018).
Article PubMed PubMed Central Google Scholar
Cinque, M. E., Dornan, G. J., Chahla, J., Moatshe, G. & LaPrade, R. F. High rates of osteoarthritis develop after anterior cruciate ligament surgery: An analysis of 4108 patients. Am. J. Sports Med. 46, 2011–2019. https://doi.org/10.1177/0363546517730072 (2018).
Article PubMed Google Scholar
Ajuied, A. et al. Anterior cruciate ligament injury and radiologic progression of knee osteoarthritis: A systematic review and meta-analysis. Am. J. Sports Med. 42, 2242–2252. https://doi.org/10.1177/0363546513508376 (2014).
Article PubMed Google Scholar
Delince, P. & Ghafil, D. Anterior cruciate ligament tears: Conservative or surgical treatment? A critical review of the literature. Knee Surg. Sports Traumatol. Arthrosc. 20, 48–61. https://doi.org/10.1007/s00167-011-1614-x (2011).
Article PubMed Google Scholar
Jones, M. H. & Spindler, K. P. Risk factors for radiographic joint space narrowing and patient reported outcomes of post-traumatic osteoarthritis after ACL reconstruction: Data from the MOON cohort. J. Orthop. Res. 35, 1366–1374. https://doi.org/10.1002/jor.23557 (2017).
Article PubMed PubMed Central Google Scholar
Group, M. & Group, M. Effect of graft choice on the outcome of revision anterior cruciate ligament reconstruction in the Multicenter ACL Revision Study (MARS) Cohort. Am. J. Sports Med. 42, 2301–2310. https://doi.org/10.1177/0363546514549005 (2014).
Article Google Scholar
Wiggins, A. J. et al. Risk of secondary injury in younger athletes after anterior cruciate ligament reconstruction: A systematic review and meta-analysis. Am. J. Sports Med. 44, 1861–1876. https://doi.org/10.1177/0363546515621554 (2016).
Article PubMed PubMed Central Google Scholar
Trojani, C. et al. Causes for failure of ACL reconstruction and influence of meniscectomies after revision. Knee Surg. Sports Traumatol. Arthrosc. 19, 196–201. https://doi.org/10.1007/s00167-010-1201-6 (2011).
Article PubMed Google Scholar
Magnussen, R. A. et al. Effect of high-grade preoperative knee laxity on 6-year anterior cruciate ligament reconstruction outcomes. Am. J. Sports Med. 46, 2865–2872. https://doi.org/10.1177/0363546518793881 (2018).
Article PubMed PubMed Central Google Scholar
Paterno, M. V. et al. Biomechanical measures during landing and postural stability predict second anterior cruciate ligament injury after anterior cruciate ligament reconstruction and return to sport. Am. J. Sports Med. 38, 1968–1978. https://doi.org/10.1177/0363546510376053 (2010).
Article PubMed PubMed Central Google Scholar
DeFroda, S. F., Fadale, P. D., Owens, B. D. & Fleming, B. C. The role of magnetic resonance imaging in evaluating post-operative ACL reconstruction healing and graft mechanical properties: A new criterion for return to play?. Phys. Sportsmed. 49, 123–129. https://doi.org/10.1080/00913847.2020.1820846 (2021).
Article PubMed Google Scholar
Biercevicz, A. M. et al. T2* MR relaxometry and ligament volume are associated with the structural properties of the healing ACL. J. Orthop. Res. 32, 492–499. https://doi.org/10.1002/jor.22563 (2014).
Article PubMed Google Scholar
Biercevicz, A. M., Proffen, B. L., Murray, M. M., Walsh, E. G. & Fleming, B. C. T2* relaxometry and volume predict semi-quantitative histological scoring of an ACL bridge-enhanced primary repair in a porcine model. J. Orthop. Res. 33, 1180–1187. https://doi.org/10.1002/jor.22874 (2015).
Article PubMed PubMed Central Google Scholar
Biercevicz, A. M., Miranda, D. L., Machan, J. T., Murray, M. M. & Fleming, B. C. In situ, noninvasive, T2*-weighted MRI-derived parameters predict ex vivo structural properties of an anterior cruciate ligament reconstruction or bioenhanced primary repair in a porcine model. Am. J. Sports Med. 41, 560–566. https://doi.org/10.1177/0363546512472978 (2013).
Article PubMed PubMed Central Google Scholar
Kajabi, A. W. et al. Multiparametric MR imaging reveals early cartilage degeneration at 2 and 8 weeks after ACL transection in a rabbit model. J. Orthop. Res. 38, 1974–1986. https://doi.org/10.1002/jor.24644 (2020).
Article CAS PubMed Google Scholar
Chu, C. R. et al. Visualizing pre-osteoarthritis: Integrating MRI UTE-T2* with mechanics and biology to combat osteoarthritis-The 2019 Elizabeth Winston Lanier Kappa Delta Award. J. Orthop. Res. 39, 1585–1595. https://doi.org/10.1002/jor.25045 (2021).
Article PubMed Google Scholar
Titchenal, M. R. et al. Cartilage subsurface changes to magnetic resonance imaging UTE-T2* 2 years after anterior cruciate ligament reconstruction correlate with walking mechanics associated with knee osteoarthritis. Am. J. Sports Med. 46, 565–572. https://doi.org/10.1177/0363546517743969 (2018).
Article PubMed PubMed Central Google Scholar
Williams, A. A., Titchenal, M. R., Andriacchi, T. P. & Chu, C. R. MRI UTE-T2* profile characteristics correlate to walking mechanics and patient reported outcomes 2 years after ACL reconstruction. Osteoarthr. Cartil. 26, 569–579. https://doi.org/10.1016/j.joca.2018.01.012 (2018).
Article CAS Google Scholar
Biercevicz, A. M. et al. MRI volume and signal intensity of ACL graft predict clinical, functional, and patient-oriented outcome measures after ACL reconstruction. Am. J. Sports Med. 43, 693–699. https://doi.org/10.1177/0363546514561435 (2015).
Article PubMed Google Scholar
Pfeiffer, S. J. et al. Association of jump-landing biomechanics with tibiofemoral articular cartilage composition 12 months after ACL reconstruction. Orthop. J. Sports Med. 9, 23259671211016424. https://doi.org/10.1177/23259671211016424 (2021).
Article PubMed PubMed Central Google Scholar
Lansdown, D. A. et al. Quantitative imaging of anterior cruciate ligament (ACL) graft demonstrates longitudinal compositional changes and relationships with clinical outcomes at 2 years after ACL reconstruction. J. Orthop. Res. 38, 1289–1295. https://doi.org/10.1002/jor.24572 (2020).
Article PubMed PubMed Central Google Scholar
Kiapour, A. M. et al. Anatomical features of the tibial plateau predict outcomes of ACL reconstruction within 7 years after surgery. Am. J. Sports Med. 47, 303–311. https://doi.org/10.1177/0363546518823556 (2019).
Article PubMed PubMed Central Google Scholar
Flannery, S. W. et al. Early MRI-based quantitative outcomes are associated with a positive functional performance trajectory from 6- to 24-months post-ACL surgery. Knee Surg. Sports Traumatol. Arthrosc. https://doi.org/10.1007/s00167-022-07000-8 (2022).
Article PubMed Google Scholar
Beveridge, J. E. et al. Magnetic resonance measurements of tissue quantity and quality using T₂* relaxometry predict temporal changes in the biomechanical properties of the healing ACL. J. Orthop. Res. 36, 1701–1709. https://doi.org/10.1002/jor.23830 (2018).
Article CAS PubMed Google Scholar
Beveridge, J. E. et al. Cartilage damage is related to ACL stiffness in a porcine model of ACL repair. J. Orthop. Res. 37, 2249–2257. https://doi.org/10.1002/jor.24381 (2019).
Article PubMed PubMed Central Google Scholar
Peng, W. K. Clustering Nuclear Magnetic Resonance: Machine learning assistive rapid two-dimensional relaxometry mapping. Eng. Rep. 3, e12383. https://doi.org/10.1002/eng2.12383 (2021).
Article Google Scholar
Peng, W. K., Ng, T. T. & Loh, T. P. Machine learning assistive rapid, label-free molecular phenotyping of blood with two-dimensional NMR correlational spectroscopy. Commun. Biol. 3, 1–10. https://doi.org/10.1038/s42003-020-01262-z (2022).
Article CAS Google Scholar
Strobl, C., Boulesteix, A. L., Zeileis, A. & Hothorn, T. Bias in random forest variable importance measures: Illustrations, sources and a solution. BMC Bioinform. 8, 25. https://doi.org/10.1186/1471-2105-8-25 (2007).
Article CAS Google Scholar
Barnett, S. C. et al. Earlier resolution of symptoms and return of function after bridge-enhanced anterior cruciate ligament repair as compared with anterior cruciate ligament reconstruction. Orthop. J. Sports Med. 9, 23259671211052530. https://doi.org/10.1177/23259671211052530 (2021).
Article PubMed PubMed Central Google Scholar
Sanborn, R. M. et al. Psychological readiness to return to sport at 6 months is higher after bridge-enhanced ACL restoration than autograft ACL reconstruction: Results of a prospective randomized clinical trial. Orthop. J. Sports Med. 10, 23259671211070544. https://doi.org/10.1177/23259671211070542 (2022).
Article PubMed PubMed Central Google Scholar
McRobbie, D. W., Moore, E. A., Graves, M. J. & Prince, M. R. MRI from Picture to Proton. 2nd ed (Cambridge University Press, 2007).
Geron, A. Hands-On Machine Learning with Scikit-Learn, Keras and TensorFlow. 819 (O'Reilly Media, Inc., 2019).
Flannery, S. W. et al. Assessing meniscus integrity post-ACL repair with MRI T2* relaxometry. Trans. Orthop. Res. Soc. 64 (2018).
Beveridge, J. E., Walsh, E. G., Murray, M. M. & Fleming, B. C. Sensitivity of ACL volume and T2* relaxation time to magnetic resonance imaging scan conditions. J. Biomech. 56, 117–121. https://doi.org/10.1016/j.jbiomech.2017.03.010 (2017).
Article PubMed PubMed Central Google Scholar
Brown, R. W., Cheng, Y.-C. N., Haacke, E. M., Thompson, M. R. & Venkatesan, R. Magnetic Resonance Imaging: Physical Principles and Sequence Design. 2nd ed. 113–186 (Wiley, 2014).
Guyon, I., Weston, J., Barnhill, S. & Vapnik, V. Gene selection for cancer classification using support vector machines. Mach. Learn. 46, 39–422 (2002).
Article MATH Google Scholar
Granitto, P. M., Furlanello, C., Biasioli, F. & Gasperi, F. Recursive feature elimination with random forest for PTR-MS analysis of agroindustrial products. Chemometr. Intell. Lab. Syst. 83, 83–90. https://doi.org/10.1016/j.chemolab.2006.01.007 (2006).
Article CAS Google Scholar
Sanborn, R. M. et al. Preoperative risk factors of subsequent ipsilateral ACL revision surgery following an ACL restoration procedure. Am. J. Sports Med. 51, 49–57. https://doi.org/10.1177/03635465221137873 (2023).
Article PubMed Google Scholar
Perrone, G. S. et al. Bench-to-bedside: Bridge-enhanced anterior cruciate ligament repair. J. Orthop. Res. 35, 2606–2612. https://doi.org/10.1002/jor.23632 (2017).
Article ADS PubMed PubMed Central Google Scholar
Proffen, B. L., Perrone, G., Roberts, G. & Murray, M. M. Bridge-enhanced ACL repair: A review of the science and the pathway through FDA investigational device approval. Ann. Biomed. Eng. 43, 805–818. https://doi.org/10.1007/s10439-015-1257-z (2015).
Article PubMed PubMed Central Google Scholar
Flannery, S. W. et al. Reproducibility and post-acquisition correction methods for quantitative magnetic resonance imaging of the anterior cruciate ligament (ACL). J. Orthop. Res. 40, 2908–2913. https://doi.org/10.1002/jor.25319 (2022).
Article PubMed Google Scholar
Youden, W. J. Index for rating diagnostic tests. Cancer Epidemiol. Biomark. Prev. 3, 32–35. https://doi.org/10.1002/1097-0142 (1950).
Article CAS Google Scholar

Download references

Acknowledgements

This study was funded by the National Institutes of Health (R01-AR065462, R01-AR056834, K99/R00-AR069094, P30-GM122732, P20‐GM103645), the Translational Research Program at Boston Children’s Hospital, the Children’s Hospital Orthopaedic Surgery Foundation, the Children’s Hospital Sports Medicine Foundation, the Football Players Health Study at Harvard University, RIH Orthopedic Foundation, and the Lucy Lippitt Endowment of Brown University. The preclinical work was performed at the Center for Animal Resources and Education (CARE) under the veterinary leadership of Drs. Lara Helwig, DVM, DACLAM, and James S. Harper, DVM, and the clinical work was made possible by the BEAR Trial Team. We thank our biostatistician, Gary Badger, for his help with the statistical methods. We thank Kristina Pelkola at Boston Children’s Hospital and Lynn Fanella at the Brown MRI Research Facility for running the MRI acquisitions for this study. Last, but not least, we thank Dr. Joseph Crisco for his editorial feedback.

Author information

A list of authors and their affiliations appears at the end of the paper.

Authors and Affiliations

Department of Orthopaedics, Warren Alpert Medical School of Brown University/Rhode Island Hospital, Coro West, Suite 402, 1 Hoppin St, Providence, RI, 02903, USA
Sean W. Flannery, Jillian E. Beveridge, Brett D. Owens, Paul D. Fadale, Michael J. Hulstyn, Meggin Q. Costa, Cynthia Chrostek, Braden C. Fleming & Braden C. Fleming
Division of Sports Medicine, Department of Orthopaedic Surgery, Boston Children’s Hospital, Harvard Medical School, Boston, MA, USA
Benedikt L. Proffen, Kirsten Ecklund, Lyle J. Micheli, Ryan M. Sanborn, Nicholas J. Sant, Yi-Meng Yen, Benedikt L. Proffen, Dennis E. Kramer, Martha M. Murray, Ata M. Kiapour, Dennis E. Kramer, Martha M. Murray & Ata M. Kiapour
Department of Neuroscience, Division of Biology and Medicine, Brown University, Providence, RI, USA
Edward G. Walsh

Authors

Sean W. Flannery
View author publications
You can also search for this author in PubMed Google Scholar
Jillian E. Beveridge
View author publications
You can also search for this author in PubMed Google Scholar
Benedikt L. Proffen
View author publications
You can also search for this author in PubMed Google Scholar
Edward G. Walsh
View author publications
You can also search for this author in PubMed Google Scholar
Dennis E. Kramer
View author publications
You can also search for this author in PubMed Google Scholar
Martha M. Murray
View author publications
You can also search for this author in PubMed Google Scholar
Ata M. Kiapour
View author publications
You can also search for this author in PubMed Google Scholar
Braden C. Fleming
View author publications
You can also search for this author in PubMed Google Scholar

Consortia

BEAR Trial Team

Kirsten Ecklund
, Lyle J. Micheli
, Brett D. Owens
, Paul D. Fadale
, Michael J. Hulstyn
, Meggin Q. Costa
, Cynthia Chrostek
, Ryan M. Sanborn
, Nicholas J. Sant
, Yi-Meng Yen
, Benedikt L. Proffen
, Dennis E. Kramer
, Martha M. Murray
, Ata M. Kiapour
& Braden C. Fleming

Contributions

All authors have made substantial contributions to the study design, acquisition, analysis and/or interpretation of the data, have contributed to drafting of the manuscript, and have read and approved the final submitted version.

Corresponding author

Correspondence to Braden C. Fleming.

Ethics declarations

Competing interests

MMM is a founder and equity holder in Miach Orthopaedics, Inc, which was formed to commercialize the BEAR scaffold. AMK is a paid consultant of Miach Orthopaedics, and on the editorial board for BMC Musculoskeletal Disorders. BCF is a founder of Miach Orthopaedics, and the spouse of MMM who has the previously stated conflicts. He is also an associate editor for the American Journal of Sports Medicine. BLP is an equity holder and paid consultant of Miach Orthopaedics. DEK is a paid consultant for Miach Orthopaedics, Johnson & Johnson, and receives surgical support from Kairos Surgical. EGW is a co-founder of Theromics, Inc. On the BEAR Trial Team, NJS is a paid consultant of Miach Orthopaedics, YMY receives educational payments from Kairos Surgical, PDF and MJH have received travel support from Arthrex, and BDO receives royalties from Linvatec Corp, and consulting fees from Linvatec, DePuy Synthes Products, and Vericel. The remaining authors declare no potential conflicts of interest. All conflicts of interest are managed by the individuals’ respective institutions.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Table 1.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Flannery, S.W., Beveridge, J.E., Proffen, B.L. et al. Predicting anterior cruciate ligament failure load with T₂* relaxometry and machine learning as a prospective imaging biomarker for revision surgery. Sci Rep 13, 3524 (2023). https://doi.org/10.1038/s41598-023-30637-5

Download citation

Received: 06 June 2022
Accepted: 27 February 2023
Published: 02 March 2023
DOI: https://doi.org/10.1038/s41598-023-30637-5
Springer Nature Limited

Predicting anterior cruciate ligament failure load with T2* relaxometry and machine learning as a prospective imaging biomarker for revision surgery

Abstract

Similar content being viewed by others

Application of machine learning-based multi-sequence MRI radiomics in diagnosing anterior cruciate ligament tears

Risk factors for secondary meniscus tears can be accurately predicted through machine learning, creating a resource for patient education and intervention

Machine learning algorithm to predict anterior cruciate ligament revision demonstrates external validity

Introduction

Results

Model development

Clinical pilot assessment of model performance in predicting reinjury risk

Discussion

Methods

Model development

Porcine data

MR image processing

Biomechanical testing

Machine learning model development

Feature selection

Model optimization and selection

Clinical pilot assessment of model performance in predicting reinjury risk

Data set

Clinical application of the optimal machine learning model

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Consortia

BEAR Trial Team

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Supplementary Table 1.

Rights and permissions

About this article

Cite this article

Share this article

Search

Navigation

Predicting anterior cruciate ligament failure load with T₂* relaxometry and machine learning as a prospective imaging biomarker for revision surgery