Introduction

Anterior cruciate ligament reconstruction (ACLR) with the patient’s own autograft tissue (hamstring or bone patellar tendon bone) is the most common surgical technique to treat ACL complete rupture. Allografts are commonly used in revision and multiligament surgery [30]; albeit, in the last decades, their use also in primary ACLR is growing [1, 13, 36].

Despite the advantages of allografts in terms of reduced surgical time and donor site morbidity, they are not commonly accepted for primary ACLR, being thought to be associated with increased infectious risk and poorer clinical outcomes.

However, more recent systematic reviews have shown that non-irradiated and non-chemically treated allografts produce comparable results to autografts in terms of patient satisfaction, knee functional score and clinical failure rate [13, 15, 22, 36,37,38], although they are still thought to have a greater risk of re-rupture, particularly in young and active patients [9].

Beyond these data, given the paucity in some countries of allogenic material and the related costs, the use of allograft in primary ACLR is under discussion, especially analyzing the possible advantages to return to sport [13]. More recently, the European Society of Sports Traumatology Knee Surgery and Arthroscopy (ESSKA) has dedicated a whole initiative to evaluate the availability, awareness and cost-effectiveness of allograft tissue in joint reconstruction in Europe [13, 29].

While knee functional score and clinical failure rate after allograft ACLR have been investigated and compared, an accurate and complete radiological evaluation of the possible differences between autograft and allograft in term of maturation and integration of neo-ligament is still lacking [8, 11]. These objective parameters are known to be strongly correlated to the long-term stability of the knee and, therefore, could give additional information to direct the surgeons’ choice [19]. Although not commonly adopted during postoperative follow-up, magnetic resonance imaging (MRI) is the preferred and most commonly used technique to morphologically evaluate graft remodeling [10].

The primary aim of this prospective, single-center randomized single-blinded study was to evaluate radiologically the maturity of the neo-ACL tendon graft after 6 and 12 months from surgery. The secondary aim was to evaluate the clinical-functional recovery of daily activities (work and recreation), integrating these data with the radiological results, to provide recommendation on the use of allograft in primary ACLR. Moreover, the patients were clinically re-evaluated at 5-year follow-up, in order to assess any clinical differences between patients treated by either autograft or allograft ACL reconstruction and to estimate the predictive value of radiological parameters for mid-term clinical outcomes. The thorough radiological evaluation in the first phases of graft maturation, accompanied by simultaneous and long-term clinical evaluations, bring additional information to the existing one about the suitability of using allograft in primary ACLR as a reasonable alternative to hamstring autografts.

Materials and methods

This single-center, interventional, prospective, randomized, controlled, single-blinded study was approved by the Ethics Committee (27/INT/2014) and performed according to the standards EN ISO 14155:1, EN ISO 14155:2, Declaration of Helsinki and Good Clinical Practices (GCP).

Patient enrollment

Patients with a primary and unilateral ACL complete tear with indication to ACLR were considered eligible for the study. Fifty patients who met the inclusion criteria were asked to sign the informed consent and were randomly assigned into one of the two study groups: Autograft Group (Auto, 25 patients) and Allograft Group (Allo, 25 patients). Inclusion criteria were radiological and clinical diagnosis of complete primary and unilateral ACL tear, male or female, age between 18 and 40 years old, Tegner activity level pre-injury > 5, time from injury between 90 and 270 days. Any previous knee surgery, chondral lesion ICRS grade 2 or higher, complete meniscal injury, osteoarthritis even at initial phase, immune-mediated diseases, systemic chronic inflammatory disorders or knee/ankle infections, cardiac, pulmonary, neurological or other diseases that may increase the operative risk, use of drugs associated to increased risk of tendinitis and tendon ruptures, pregnancy status or breast feeding were considered as exclusion criteria.

The mean age was 27 ± 7 y/o in Auto group and 30 ± 6 y/o in Allo group. The first one was composed by 21 men (88%) and 3 women (12%), whereas the second by 23 men (92%) and 2 women (8%). There were 42% left and 58% right knees, and 36% left and 64% right in group Auto and Allo, respectively. No statistically significant differences in term of population’s characteristics were observed between the two groups.

Surgical procedure

All the procedures were performed by two experienced surgeons with the same single-bundle transtibial arthroscopic technique, preserving the ACL remnant tissue. When present, associated injuries were treated first to confirm full adherence to the inclusion/exclusion criteria.

Semitendinosus and gracilis tendons were used for all the patients of the Auto group.

For the patients of the Allo group, 17/25 (68%) of the allograft used were peroneal tendons, 8/25 (32%) posterior tibialis tendons. All the implanted allografts were non-irradiated fresh-frozen tissues, obtained from a single certified tissue bank.

Both autografts and allografts, doubled through the loop of the suspension system, were fixed to femoral surface by TightRope® metal plate and to the tibial bone by a bioabsorbable interference screw (Genesys Matryx®, ConMed Corp) [3].

Rehabilitation program

No knee braces were prescribed during the postoperative period.

The patients of both groups followed the same rehabilitation protocol, with an initial partial load and progressive recovery of the range of motion accompanied by isometric muscle strengthening. Water exercises, exercise bike and proprioceptive gymnastics were then introduced, while muscle strengthening continued with closed kinetic chain exercises followed by open kinetic chain exercises. Unidirectional running was included during the third month; running with changes of direction, leaps and recovery of the sporting gesture during the fifth month. Return to sport was accorded at the end of the eighth/ninth month, after recovery of quadricipital trophism and excellent proprioceptive control [21].

Clinical evaluation

All the patients were clinically assessed by Lysholm score [5], Tegner Activity Scale [4], modified Cincinnati Knee Rating System [28] and Visual Analog Scale (VAS) [33] at the time of enrollment (T0) and then at 6 months (T1), at 12 months (T2) after surgery. At about 5 years from surgery (T3) Lysholm score, Tegner Activity Scale, modified Cincinnati knee rating system and ACL-RSI [32] were collected.

MRI evaluation

All the patients underwent MRI at T0, T1 and T2. The MRI scans were performed with a 1.5 Tesla scanner (MAGNETOM®Avanto, Siemens Medical Solution, Erlangen, Germany) following a standardized image acquisition protocol (Table 1). Each knee was scanned in slightly flexed position (about 10°) with a dedicated extremity coil.

Table 1 MRI study protocol. An oblique STIR (para-sagittal) additionally to the conventional MRI protocol was included to better evaluate graft signal and its degree of edema

Image analysis was focused on several parameters: The signal intensity (SI) of ACL graft was evaluated by calculating the signal-to-noise quotient (SNQ) [31]. The SI of the graft was measured at three different regions of interest (upper, middle and lower third), and the mean was normalized by measuring the SI of the distal quadriceps tendon (QT) (Fig. 1).

Fig. 1
figure 1

MRI image showing the measurements used for SNQ calculation. The signal intensity (SI) of ACL graft was calculated using the signal-to-noise quotient (SNQ). The SI of the graft was measured at three different regions of interest (upper, middle and lower third), and the mean was normalized by measuring the SI of the distal quadriceps tendon (QT) “Media”= Mean, “Deviazione” = Standard Deviation 

The background signal was measured to finally calculate the SNQ as follows:

$$SNQ = \frac{{Graft~SI - QT~SI}}{{Background~SI}}$$

The SNQ is capable of normalizing the SI of each graft in this study; therefore, in this study, the SNQ value was used to quantify the signal intensity emitted by the neo-ligament. The SNQ evaluates the tendon graft maturation, with low values indicating a greater maturation of the graft. Position of the tibial tunnel, orientation of the neo-ACL estimated using the sagittal angle, angle between the ACL and the Blumensaat line, edematous condition of the graft based on the short tau inversion recovery (STIR) sequence, using three different grades (grade I: normal signal, grade II: possible edema, grade III: 100% of edema), and mean width of the graft, were also measured in all sequences at three different sites (distal, intermediate, proximal) (Fig. 2).

Fig. 2
figure 2

Methods used to measure the mean width of the graft on sagittal MRI at femoral, intraarticular and tibial level

MRI was evaluated by a senior radiologist who, as well as the clinician in charge of patients’ evaluation, was blinded to the patients’ group allocation.

Statistical analysis

Analysis was performed using R software version 4.0.3 (R Core Team, Wien, Austria).

Quantitative variables were described using the mean and standard deviation, whereas qualitative variables were reported with absolute and percentage frequencies.

Differences in categorical variables between groups were assessed by Chi-square test. For continuous variables, data distribution was assessed by Shapiro–Wilk test. In case of normally distributed data, one-way ANOVA (with Bonferroni’s post-test) or Student’s t test were applied, conversely Mann–Whitney and Kruskall–Wallis test (with Dunn’s post hoc test) were used. Two-way ANOVA with Bonferroni’s post-test was applied to assess the influence of two variables on the same parameter. Correlation tests were performed according to Pearson’s or Spearman’s method, depending on data distribution. Linear regression models were used to test the influence of multiple variables on continuous data, while logistic regression models were applied to categorical outcomes.

The sample size of the study was calculated from the SNQ index. Assuming a standard deviation (SD) of the SNQ value of 3, a relevant difference between the two treatment groups in favor of the autograft group of 3 [8], a test power (1-β) of 90% and a two-tailed alpha error of 5%, a sample size of 22 subjects per group was obtained.

Considering a possible loss of subjects in the follow-up of around 10%, 25 patients were enrolled in each treatment group.

A value of p < 0.05 was considered statistically significant [7].

Results

None of the patients had adverse events neither in the perioperative nor in the immediate postoperative period.

Clinical results

In both groups, the most common associated lesion with ACL injury was lateral meniscus tear: five patients in the Auto group (all treated by partial lateral meniscectomy) and nine patients in Allo group (eight treated with partial lateral meniscectomy, one treated with out-in suture repair). Medial meniscus tear was found in two patients of Auto group (one treated with partial medial meniscectomy, one treated with out-in suture repair) and five patients of Allo group (one treated with partial medial meniscectomy, four treated with out-in suture repair).

One patient of the Auto group had a neo-ACL re-rupture because he resumed the full sport activity too early (4 months after surgery) without medical authorization and therefore he was excluded from the study. Likewise, two more patients were excluded from the study because of the neo-ACL re-rupture, one of the Auto group and one of the Allo group, 37 and 30 months from surgery, respectively. Moreover, two patients of the Allo group were excluded during the first year after surgery for reasons unrelated to the study.

Therefore, 47 patients out of the 50 initially enrolled were available at 6 and 12 months follow-up and 45 for the last follow-up analysis (63 ± 7 months).

Other minor complications occurred in both groups but they did not lead to the exclusion of these patients from the study (Supplementary Table 1).

All the clinical and functional scores used for patient evaluation showed no significant differences between the two groups at any time points (Supplementary Table 2).

At the last follow-up, 44 out of 45 patients were satisfied with the surgical outcome and they declared they would undergo the same treatment. Fourty-one patients (22 of the Auto group and 19 of the Allo group) reported knee stability in every situation, whereas 4 patients (one of the Auto group and three of the Allo group) had occasional instability during intense activities, such as demanding sports and/or heavy weightlifting. No patient complained about knee instability during daily activities.

The Lysholm score improved over time, with no differences between the study groups (n.s.; interaction n.s.) (Fig. 3A). The same behaviour was observed for the Cincinnati score (Fig. 3B) and Tegner activity scale (Fig. 3C). Tegner Actvity Scale showed particularly relevant reductions at 6-month follow-up, due to the activity restrictions in both groups during the rehabilitation period.

Fig. 3
figure 3

Functional, pain and activity scores at the different time points in Allograft and Autograft groups. Lysholm Score (A); Cincinnati Score (B); Tegner Activity Scale (C); Visual Analogue Score-Pain (D). With the exception of VAS that only improved at 12-month follow-up, significant differences were observed for all the other parameters between either baseline levels and/or 6-month follow-up with respect to each time point. No difference was observed between the two groups. *p < 0.05, ***p < 0.001 vs baseline; #p < 0.05, ##p < 0.01, ###p < 0.001 vs 6 months

VAS showed a decrease during the first year in both groups, reaching a significant difference at 12 months (p = 0.014), with new non-significant increases at 60-month follow-up with respect to 12-month (Fig. 3D).

The ACL-RSI collected 60 months after surgery did not differ between patients in the autograft and allograft groups (n.s.) (Fig. 4).

Fig. 4
figure 4

ACL-RSI score at the 60-month follow-up in the Allo and Auto groups, showing no difference between the two groups

Radiological results

At 6 months, the mean SNQ index was significantly higher in the Auto group (12.9 ± 8.6) than in the Allo group (7.9 ± 6.3) (p = 0.038), whereas at 12 months the values were comparable (9.8 ± 7.1 and 10.4 ± 8.0 for the Auto and Allo group, respectively) (Fig. 5).

Fig. 5
figure 5

Comparison of the SNQ score between Allo and Auto groups. Data are presented as mean ± SD; *p < 0.05. At 6 months the mean SNQ index was significantly higher in the Auto group than in the Allo group (p = 0.038), whereas at 12 months the values were comparable. The two-way ANOVA analysis indicated a different behavior during time depending on the graft type

The two-way ANOVA analysis showed that interaction of the two variables was significant (p = 0.022), indicating a different behaviour during time depending on the graft type. Indeed while autograft seems to reach a definitive level at 6 months, allograft SNQ slightly increase at 12 months after surgery, despite this difference was not statistically significant.

All the other MRI parameters analyzed are reported in Table 2.

Table 2 Full list of MRI measurements at femoral and tibial site, with corresponding p-values for autograft and allograft groups

The presence of edema at 12 months was more frequent in patients treated with allograft (15/22) compared to those treated with autograft (13/25), but this difference was not significant (n.s.).

Correlations between SNQ and PROMs

SNQ score at 6 months did not correlate with Tegner activity scale, ACL-RSI, Cincinnati, VAS or Lysholm score at any of the analysed time-points. The 12-month SNQ values significantly correlate with 60-month Cincinnati (p = 0.003, r = 0.442), Lysholm (p = 0.027, r =  − 0.334) and Tegner activity scale (p = 0.018, r =  − 0.357) (Fig. 6). No correlation was found between 12-month SNQ values and VAS or ACL-RSI 60-month scores.

Fig. 6
figure 6

Correlations between SNQ at 12 months and Cincinnati score (A), Lysholm score (B) and Tegner Activity Scale (C) at 60 months. While the SNQ score at 6 months did not correlate with any of the clinical and functional score, the 12-month SNQ values significantly correlated with 60-month Cincinnati (p = 0.003, r = 0.442), Lysholm (p = 0.027, r =  − 0.334) and Tegner activity scale (p = 0.018, r =  − 0.357)

The attempt to predict 60-month outcomes considering age, sex, graft type and 12-month SNQ confirmed the role of this radiological score for Tegner activity scale (p = 0.017) and Cincinnati score (p = 0.030) but not for Lysholm, where male gender played the most significant role in predicting better results (p = 0.027). No association was observed between the variables considered and VAS or ACL-RSI recorded at 60 months.

Association between radiological observations and complication/failures, laxity and graft edema

At 6 months after surgery, patients who then faced complications or re-ruptures had a median SNQ of 11.1 (interquartile range—IQR: 6.6–16.2) compared to the 7.3 (IQR: 4.7–12.6) of patients who did not experienced these events. Nevertheless, this difference was not statistically significant (n.s.), possibly due to the limited number of patients with complications and re-ruptures (n = 3). No association was observed considering 12-month SNQ values.

Reported instability at 60 months was not associated with SNQ at 6 or 12 months.

The presence of graft edema at 12 months was associated with reduced sagittal graft inclination (p = 0.020) and femoral tunnel width (p = 0.026), but not with SNQ value at 6 months.

Femoral and tibial graft width, Blumensaat angle, coronal and sagittal graft inclination angles at 6 months, demonstrated no significant association with the 12- and 60-month Lysholm, TAS and VAS with respect to the two types of graft.

Discussion

The most important finding of the present study was a different graft maturation time in the two groups, with allografts being more reactive in the first 6 months but with an opposite trend at 12 months, when autografts appeared more mature. Interestingly, the 12-month graft maturation values significantly correlated with Cincinnati score, Lysholm score and Tegner activity scale evaluated at 60-month follow-up. The currently accepted view is that the hamstring autograft detached from its tibial insertion, as used in ACLR, undergoes a process of “ligamentization”, showing an initial phase of avascular necrosis with hypocellularity followed by cellular proliferation with high revascularization [14, 25]. This process, causing changes in graft water content, determines the variation of SNQ values on MRI images. A significant difference in the 6-month SNQ values between the two groups was observed, with autograft patients presenting higher SNQ compared to allograft patients. This difference progressively narrowed during time, with similar SNQ scores between the groups at 12 months. The SNQ mean values for allograft showed a slight increase during time, although not statistically significant, suggesting a slowdown in the maturation process. This finding is in line with previous ones showing higher SNQ for allografts at later follow-ups [11, 19]. The literature presents controversial data on graft maturation trend. While some studies showed a progressive and persistent increase in the SNQ values of allograft and autograft from 6 to 12 months [24], others reported an increase in the SNQ for both allograft and autograft at 6 months, followed by a subsequent SNQ decrease [6, 17]. These discrepancies may be related to the differences in the MRI protocols and image evaluation methods as reported by a systematic literature review that concludes that SNQ is still ineffective to anticipate clinical outcomes after ACLR [35].

Several studies examined the association between the graft SNQ and the clinical and functional outcomes [2, 12, 16, 17, 19, 20, 26, 27]. Some researchers [12, 17, 26] found no correlation during the first year after surgery comparing SNQ values with objective measurements obtained by KT-1000 arthrometer, while others reported a significant association between graft SNQ and long-term clinical outcomes [16, 27], similarly to what observed in the present study.

Only three studies reported a positive correlation between graft SNQ and clinical results [2, 18, 19]. In particular, Li et al. [19] reported that good graft maturity at 6-month MRI could strongly predict superior clinical results at 1-year follow-up. They also hypothesized the use of SNQ predictive value to improve the functional outcomes and to early detect patients who might need further surgical procedures. However, this study did not confirm this hypothesis, as the SNQ at 6 months did not correlate with any of the clinical scores at any of the time-points.

Interestingly, 12-month SNQ significantly correlated with Cincinnati score, Lysholm score and Tegner activity scale at 60-month follow-up, whereas none of the other MRI parameters did. It is possible that the use of pure axial, sagittal and coronal plane to perform such measurements may have negatively influenced these parameters, as it is well known that the graft course is wavy, and measurements may be not fully reliable when performed on standard planes. Further, this study did not highlight any significant association between postoperative complications/failures and 12-month SNQ values.

The secondary aim of this study was to evaluate the suitability of the allograft use in primary ACLR by comparing the clinical-functional recovery of daily and sport activities of patients who underwent this procedure either by autograft or allograft. The evidence provided by old studies showing that allograft ligamentization and integration into host tissue requires a longer time if compared with autograft [26, 34], together with the higher costs of allograft and the poor availability in many countries of allogenic grafts still make autograft the first choice for primary ACLR [23, 34]. Nevertheless, some studies showed no significant clinical and radiological differences between autograft and allograft in primary ACLR [13, 15, 22, 36,37,38] at the medium-long-term in young active patients.

In this study, all the patients were treated with the same surgical technique and rehabilitation protocol. All the parameters assessed significantly improved over time, with no statistically significant difference between the two groups. The 6-month time point showed lower level of activity in both Auto and Allo group, with a significant improvement at 1-year follow-up and maintained up to 5 years.

ACL-RSI score collected at the last follow-up showed no significant difference between the two groups, indicating a comparable subjective confidence in the operated knee. Similarly, no difference in term of rate and time form surgery of retear events was observed between the two groups.

The satisfaction of the whole study population (44 out of 45 patients) meant as the willingness to repeat the same surgery and postoperative rehabilitation protocol to achieve the same functional results, confirmed the effectiveness of the ACLR, regardless of the type of graft.

Study limitations include the subjectivity of all the postoperative clinical and functional evaluations, although the literature confirms that the subjective confidence in the operated knee and the return to daily and sport activities are reliable indicators of good surgical outcomes. A further limitation of the study is the lack of correlation between the radiological parameters and specific sports activities. It would be interesting in the future to restrict the same type of in a more homogeneous population of athletes practicing a specific high-contact sport.

At the time of last follow-up COVID-19 restrictions made not advisable to ask patients to undergo a MRI evaluation. The Authors’ intention would be to repeat MRI exams to evaluate further variations in radiological parameters (especially SNQ) several years after surgery.

Conclusion

The graft maturation time in the two groups was different, with allografts being faster in the first 6 months but appearing less mature than autograft at 12 months.

The significant correlation found between 12-month SNQ with Cincinnati score, Lysholm score and Tegner activity scale collected after 60 months from surgery suggests that MRI can provide an objective evaluation of the graft integration process as well as useful information to predict the best time to return to sport, allowing a more individualized postoperative management.

From a clinical perspective, the findings of this study support the suitability of using allograft in selected cases of primary ACLR as a reasonable alternative to hamstring autografts, given the comparable functional outcomes at short and mid-term follow-up.