Introduction

Knee osteoarthritis (OA) is a common joint disease in older patients that may cause severe pain and lead to an increasing financial burden and a reduced quality of life [1]. Although knee OA may involve any one or all 3 compartments, up to 30% of patients have evidence of only single compartmental degeneration [2].

The optimal surgical treatment for OA of the knee remains in question. To date, several surgical solutions have been proposed to address OA, such as TKA (total knee arthroplasty), UKA (unicompartmental knee arthroplasty), HTO (high tibial osteotomy), BCA (bicompartmental knee arthroplasty), BIU (bi-unicompartmental knee arthroplasty), or KJD (knee joint distraction), disagreements often exist between surgeons regarding the best choice of procedure and the best decision for patients with OA [3].

For example, although TKA is a primary surgical treatment for OA, as many as 15-20% of patients are dissatisfied with their surgical outcome [4,5,6]. Good or excellent long-term results with high patient satisfaction were reported for UKA, but its survival has been found to be inferior to that of TKA [7,8,9,10]. Unlike TKA and UKA, HTO accomplishes the reconstruction of joint function by correcting varus malalignment; however, some HTOs may need a conversion to TKA due to the progression of OA, and TKA following HTO has worse outcomes and higher complications [11,12,13]. Compared to TKA, BCA is a less invasive procedure but might have a relevant change in the leg axis and poor long-term survivorship [14, 15]. BIU demonstrates good functional outcomes, but data on its long-term outcomes remain limited [16]. KJD is a new surgical joint-preserving treatment that also appears to be associated with joint tissue regeneration, but relevant evidence is sparse [17]. Hence, for surgical interventions for knee OA, there is high variation in treatment choice and little robust evidence to guide selection.

The purpose of the present study was to perform a systematic review and network meta-analysis (NMA) of randomized control trials (RCTs) comparing different surgical treatments (TKA, UKA, HTO, BCA, BIU, and KJD) and assessing their complications, revisions, reoperations, and functional outcomes.

Table 1 Study Characteristics

Materials and methods

Study selection

This NMA is reported following the standards proposed by PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) guidelines [18]. This study was registered with Research Registry (reviewregistry1395). Two researchers independently performed the literature search, and any discrepancies were reappraised and arbitrated by a third investigator. The titles and abstracts were screened, and the full-text was reviewed whenever necessary to evaluate the eligibility of each study. The search was performed in PubMed, Embase, and the Cochrane Library Database through March 20, 2022. The search terms included: total knee replacement, unicompartmental joint replacement, high tibial osteotomy, bicompartmental knee arthroplasty, bi-unicompartmental knee arthroplasty, and knee joint distraction. Furthermore, the corresponding references of all included studies were manually screened according to the selection criteria for consideration of inclusion in the study. There was no restriction regarding the publication date.

Eligibility criteria

We included all multiarm RCTs of human subjects that compared complications, revisions, reoperations, and functional outcomes among TKA, UKA, HTO, BCA, BIU, or KJD with a minimum 1-year follow-up.

The exclusion criteria were as follows: (1) nonrandomized and nonclinical studies, (2) single-arm clinical trials, (3) case reports and series, (4) conference abstracts, and (5) non-English language publications.

Data extraction and risk of bias assessment

All relevant information regarding the study characteristics was collected: study design, population characteristics, surgical approaches, indications, prosthesis, clinical outcomes, and follow-up time points. Two independent reviewers extracted data using a predetermined data sheet and evaluated the risk of bias using the revised Cochrane risk of bias tool for randomized trials (ROB 2) [19]. In cases of disagreement, a third reviewer was consulted to reach an agreement. The clinical outcomes evaluated included complications, revisions, reoperations, and functional outcomes. The complications were defined as any issues related to the surgical procedure that might require readmission, reoperation, additional treatment or longer hospital stays. Postoperative death from any relevant cause was also considered as a complication. Revisions of UKA, HTO, BCA, BIU and KJD were defined as failures equating to revision to TKA. The reoperations were defined as any unscheduled operations resulting from surgical site complications, including irrigation and wound revision, debridement, implant revision, open reduction internal fixation, and others.

Fig. 1
figure 1

PRISMA flow diagram

Fig. 2
figure 2

a-c Network geometry of different surgical interventions for comparisons of complications (a), revisions (b), and reoperations (c)

Table 2 Function Outcomes
Fig. 3
figure 3

NMA results for complications. The treatments are compared in a forest plot. The vertical reference line centered at 1 indicates statistical equivalence

Fig. 4
figure 4

NMA results for revisions. The treatments are compared in a forest plot. The vertical reference line centered at 1 indicates statistical equivalence

Statistical analysis

For dichotomous outcomes, the relative effect sizes were reported as odds ratios (ORs) with 95% confidence intervals (CIs). When the 95% CI of the OR contained 1, the comparison was considered to have no statistical significance. For direct comparisons, a conventional meta-analysis was conducted to synthesize the results using random-effects models as sensitivity analyses (Supplementary Figs. 13). An NMA using a frequentist approach with a random effects model was used to estimate direct and indirect comparisons. NMA aims to test whether superiority exists for one of the comparator interventions. The potential inconsistency between the indirect and direct comparisons was inferred by global inconsistency, local inconsistency (a node-splitting approach), and loop inconsistency. Heterogeneity was quantified using the tau value, and P < 0.05 was considered statistically significant. A global network diagram was used for each prespecified outcome to demonstrate direct comparisons between interventions. A comparative hierarchy was obtained by calculating the relative ranking probabilities between the effects of all interventions for the target outcomes and SUCRA (the surface under the cumulative ranking curve). The SUCRA value showed the percentage of procedural efficacy and safety of each treatment and ranged from 0 to 100%. Hence, the larger the SUCRA value, the higher the rank of the intervention, indicating generally a better or worse effect. The contribution plot for the network is summarized in Supplementary Fig. 4, and the size of each circle is proportional to the weight attached to each direct summary effect for the estimation of each network summary effect. The comparison-adjusted funnel plot was used to assess the possibility of publication bias (Supplementary Fig. 5). All statistical analyses were performed using Stata 14 software (StataCorp LP, USA).

Results

Literature review and risk of bias assessment

The initial literature search identified a total of 2916 studies. Then, 2883 studies were excluded after screening the titles and abstracts. Twenty-one studies were included in this review [20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40]. Due to different durations of follow-up or clinical outcomes, studies 8–10 (one RCT) [27,28,29], 14–15 (one RCT) [33, 34] and 19–21 (two RCTs) [38,39,40] were all included to avoid omitting any clinical outcome. Finally, 21 studies were included in this review, with a total of 1749 patients. There were 709 patients treated using TKA, 711 treated using UKA, 153 treated using HTO, 91 treated using BCA, 42 treated using BIU, and 43 treated using KJD. The details of the literature review are presented in Fig. 1. The characteristics of the included studies are summarized in Table 1.

Fig. 5
figure 5

NMA results for reoperations. The treatments are compared in a forest plot. The vertical reference line centered at 1 indicates statistical equivalence

Fig. 6
figure 6

Ranking of different surgical interventions based on rank probabilities (a, c, e) and cumulative probabilities (b, d, f)

A network diagram summarizing the NMA geometry involving complications, revisions and reoperations is presented in Fig. 2A–C. In 14 studies [20,21,22,23, 25, 26, 29, 31,32,33, 35,36,37,38], the authors reported complications, and, in 13 studies [20,21,22,23, 25,26,27, 31, 33, 35,36,37,38], the authors reported revisions and reoperations. The results of the NMA, including the OR with 95% CIs, are reported in Figs. 3, 4 and 5, and the rank probabilities and cumulative probabilities are plotted in Fig. 6. Because an NMA involving functional outcomes was not possible due to the heterogeneity or deficiency of the functional data, a systematic review was performed for functional outcomes in all studies. The risk of bias of the RCTs is depicted in Fig. 7. Five studies (23.8%) showed some concerns, and 9 studies (42.9%) had a high risk for overall bias. In those RCTs, the majority of risk of biases arose from the randomization process, where it was unclear if the allocation sequence was random and concealed. All NMAs showed that heterogeneity and inconsistency were low.

Fig. 7
figure 7

Risk of bias of the included studies (ROB2 bias assessment). a: each domain of studies with high, low, or unclear risk of bias and concerns regarding applicability; b: the proportions of studies with high, low, or unclear risk of bias and concerns regarding applicability

Complications

Complications were reported in 14 studies [20,21,22,23, 25, 26, 29, 31,32,33, 35,36,37,38]. The NMA revealed that the likelihood of complications for each method was as follows: KJD (99.8%), BCA (68.0%), BIU (47.7%), TKA (41.6%), HTO (40.0%), and UKA (3.0%) (Fig. 6A and B). These findings indicate KJD had the greatest probability of postoperative complications, followed by BCA, while UKA had the lowest probability of postoperative complications. There were statistically significant differences in the incidence of complications when performing the following comparisons: KJD vs. TKA, UKA, HTO, BCA, and BIU, UKA vs. TKA, and BCA vs. UKA (Fig. 3). The above NMA results are consistent with direct comparisons among different interventions (Fig. 3 and Supplementary Fig. 1).

Revisions and reoperations

Revisions were reported in 13 studies [20,21,22,23, 25,26,27, 31, 33, 35,36,37,38]. The NMA SUCRA rankings indicating the likelihood of revisions were as follows: KJD (73.5%), BCA (71.3%), HTO (55.8%), BIU (41.7%), UKA (33.9%), and TKA (23.8%) (Fig. 6C and D), but there was no difference among all treatments (Fig. 4). No differences in the risk of revision among different treatments were observed with direct-comparison meta-analysis and NMA (Fig. 4 and Supplementary Fig. 2).

Reoperations were also reported in 13 studies [20,21,22,23, 25,26,27, 31, 33, 35,36,37,38]. The NMA SUCRA rankings indicating the likelihood of reoperations were as follows: KJD (94.8%), BCA (69.9%), HTO (52.0%), BIU (37.8%), TKA (24.3%), and UKA (21.2%) (Fig. 6E and F). There were statistically significant differences in the incidence of reoperations when comparing KJD with TKA, UKA, and HTO (Fig. 5). The above NMA results are consistent with the direct comparison among the different interventions (Fig. 5 and Supplementary Fig. 3).

Functional outcomes

Functional outcomes were reported in all studies. The follow-up durations of these RCTs ranged from was 1 to 15 years. The majority of comparisons among various treatments showed no difference for functional outcomes in Table 2. Compared with TKA, only one RCT (1/8) definitely showed that UKA had a better functional enhancement after 3 years of follow-up [20] and three RCTs (3/8) revealed that UKA had a superior ROM (range of motion) [20, 21, 29]. For UKA vs. HTO, the functional outcomes were not different in all 3 RCTs [30,31,32]. During a follow-up period of 1 to 10 years, BCA and BIU resulted in similar clinical and functional scores as TKA in 5 studies [33,34,35,36,37]. KJD was noninferior to TKA and HTO in the primary functional outcomes in 3 studies [38,39,40], but a high incidence of pin track infection associated with KJD was found.

Discussion

To date, there has been no consensus regarding the best surgical option for symptomatic knee osteoarthritis. To assist in providing robust evidence to guide surgical selection for surgeons and patients, the current NMA of 21 studies involving 1749 participants compared 6 surgical interventions (TKA, UKA, HTO, BCA, BIU, and KJD) and revealed that UKA and TKA are better options to treat symptomatic knee osteoarthritis, according to SUCRA rankings. KJD has the highest incidence of complications, revisions, and reoperations, thereby limiting its application. To our knowledge, this is the first NMA to compare the impacts of six surgical methods on the treatment of symptomatic knee osteoarthritis.

Some similar findings were also observed in other assessments of syntheses studies:(1) Most evidence-based studies showed that UKA is a better option than TKA, such as Wilson et al. who reported that UKA had significantly shorter operating times and hospital stays, fewer complications, and quicker recovery, but the revision rates for TKA were low [41]; Migliorini et al. and Arirachakaran et al. also showed that UKA reported reduced survivorship but better clinical and functional performances than TKA [42, 43]; Chawla’s meta-analysis indicated that revisions of medial UKA and lateral UKA occur at an annual rate of 2.18 and 2.31-fold that of TKA, respectively [44]; Only one meta-analysis showed that there were no statistically significant differences between UKA and TKA in terms of function scores, complications and survivorships, but they still supported the routine use of UKA for OA because of shorter hospital stay, faster recovery and less need for rehabilitation [45]; and Tripathy et al. reported that patients with UKA are better able to forget about their artificial joint than patients with TKA [46]. The reason why the rate of revision of TKA is lower than that of UKA is controversial. Revision of unsatisfactory TKA has a lower frequency because the reason for unsatisfactory results is also not known, but replacing with another TKA is risky when the reason for revisions is unknown and the solution is the same as the original. Most unsatisfactory UKA can be revised with TKA, so the relatively high revision rate of UKA can also be explained. In addition, UKA also has an obvious drawback in which the revised UKA to TKA had inferior outcomes compared to those of the primary TKA; hence, primary TKA may be a preferable procedure to UKA for patients for whom UKA and TKA are both applicable [47,48,49]. (2) Several meta-analyses demonstrated that UKA resulted in better clinical outcomes, greater improvement in physical activity levels, and fewer postoperative complications than HTO [50,51,52], and HTO is more appropriate for younger patients whereas UKA is appropriate for older patients [52, 53]; The Postoperative rate of revision and complications did not differ significantly between UKA and HTO [54]. Nevertheless, TKA following HTO provides similar clinical outcomes compared to TKR without previous HTO, but the conversion process of HTO to TKR is technically challenging [13, 55, 56]. (3) The majority of meta-analyses showed that BCA did not prove to be an equivalent alternative to TKA in knee OA due to more postoperative complications and poorer long-term survivorship [15, 57, 58]. (4) Wada’s and Takahashi’s systematic reviews reported that bi-UKA or KJD is a feasible and viable surgical option for knee OA in carefully selected patients, but long-term outcomes remain limited [17]. Moreover, 74% of KJD patients had complications, and 56% experienced pin tract infections [38].

Combined with the aforementioned literature review and the analysis of this NMA, most surgical methods for patients with OA have little difference in postoperative functional recovery of the knee joint; hence, patients and surgeons should pay more attention to complications, revisions, and reoperations when multiple options are available. According to our analysis, UKA is the preferred treatment for medial compartment OA patients with appropriate indications, followed by TKA. HTO is suitable for younger patients. Because there is little evidence for the applications of BCA, BIU and KJD, their indications need to be strictly controlled. Clinical decision-making for surgical options of patients with OA is meant to be informed by the best medical evidence, clinical judgment, surgeon’s experience and patient requirements. Treatment options should be carefully considered for each patient in accordance with their individual requirements and actual condition.

Limitations

Our study has some limitations. First, to improve the quality of this NMA, only RCTs were included, which resulted in a small number of studies. There were few RCTs included, which might reduce the robustness of our conclusions, especially for BIU and KJD evaluations. Second, it is difficult to conduct an ideal RCT as a result of clinical ethics. The overall allocation process was unclear in most studies, leading to a moderate to high risk of selection bias. Third, these included RCTs varied in inclusion criteria, population demographics, end assessment points and methods, and varying degrees of statistical adjustment. Fourth, implant survival and revision rates are likely to be affected by the experience of the operating surgeon and hospital performing the procedure. Fifth, because only BIU was involved in lateral compartment OA, stratified analysis was not performed according to the number of affected compartments. Sixth, the conclusion is limited by the selection of reviewed publications and individual variation of surgical indications for patients. The strengths of this NMA are the comprehensive nature of the literature search along with the strict eligibility criteria, and only high-quality RCTs were included in the analysis.

Conclusions

This study may provide evidence to support informed shared decision-making in the care of patients with knee OA. Based on our analysis, any surgical intervention is noninferior to other treatments in functional outcomes, but UKA and TKA are better options to treat symptomatic knee osteoarthritis according to SUCRA rankings by comparing complications, revisions, and reoperations. KJD is an imperfect option for treating OA. It needs to be emphasized that other treatment options should be carefully considered for each patient in accordance with their individual requirements and actual condition. Moreover, further well-designed and large-scale clinical trials and systemic reviews are required to confirm our findings.