Background

The load distribution at the physiological tibiofemoral joint is usually inconsistent [1, 2] because the load of the medial compartment accounts for more than 60% of the joint load as a result of varus malalignment [3]. Therefore, the medial compartment is susceptible to lesions in the early stages of osteoarthritis, and this susceptibility is partial caused by the additional force [4].

High tibial osteotomy (HTO) is now considered effective surgical treatments for medial compartment arthritis [5, 6], which can maintain the integrity of the knee, reduce pain, and extend joint life, and HTO has been gaining attention in recent years [7]. The principle of high tibial osteotomy is to correct varus limb alignment so that the excessive pressure is transferred from the medial compartment to the relatively healthy lateral compartment.

Medial open-wedge (OW) and lateral closed wedge (CW) HTO, during which a wedge-shaped cut is made in the medial and lateral parts of the tibia [8], respectively, and internal fixation with a plate and screws is then performed at the end [2, 9], are two of the most commonly used surgical methods among young and highly active patients.

To evaluate these two surgical methods, numerous studies have compared the radiographic changes and clinical results after OWHTO and CWHTO [6, 10,11,12,13,14]. The choice of osteotomy site, anatomical femorotibial angle (AFTA), hip-knee-ankle angle (HKA), leg length, patellar height, posterior tibial slope, and correction accuracy are among the ongoing controversial issues [15,16,17,18]. As individual studies may not be able to provide sufficient data on their own, the effect of OWHTO and CWHTO should be assessed objectively using pooled analysis.

Although several meta-analyses [14, 19, 20] were performed earlier, most evaluated only postoperative outcomes. A preoperative comparative analysis is also quite important, which could make results more persuasive and accurate. The posterior tibial slope (PTS) was measured by two different methods (measuring line: posterior tibial cortex or tibial mechanical axis) in some studies comparing OWHTO and CWHTO [16, 18, 21,22,23,24], but some meta-analyses [14, 19, 20] pooled the results of different methods together, which may be not completely accurate. Additionally, numerous recently published studies [23,24,25,26,27] have longer follow-up times or present different directions for evaluating the effects of these two surgical methods. Therefore, regarding the current dilemma, the purpose of this meta-analysis is to examine changes in radiological variables and clinical outcomes between OWHTO and CWHTO which have ongoing controversial issues in numerous quantitative clinical studies and to assist surgeons in determining the appropriate method according to the patient condition. This study hypothesizes that OWHTO is better than CWHTO in clinical outcomes, that there are no differences in the function of correction between OWHTO and CWHTO, that posterior tibial slope increases after OWHTO and decreases after CWHTO, and that patellar height decreases after OWHTO and increases after CWHTO.

Methods

This meta-analysis was performed in strict accordance with the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) Statement [28].

Search strategy

PubMed, Embase, and the Cochrane library were searched systematically, using English, to identify relevant studies published between Jan 1, 1999, and Feb 2, 2018. The complete search terms used in these three databases were tibial osteotomy [All Fields] AND high tibial osteotomy [All Fields] OR open wedge osteotomy [All Fields] OR open tibial osteotomy [All Fields] OR closed wedge high tibial osteotomy [All Fields] OR closed wedge osteotomy [All Fields] OR closed tibial osteotomy [All Fields]. Additionally, manual searches utilizing the reference lists of in included studies were performed to obtain articles neglected by searching the databases as mentioned above.

Assessment of study eligibility

The research question and eligibility criteria were determined a priori. The inclusion criteria were (1) studies presenting a comparison of results between OWHTO and CWHTO, (2) studies presenting at least one result for radiographic and clinical outcomes, and (3) clinical studies written in English and with a level of evidence of I–IV. Postoperative indicators less than 1 year of follow-up were divided into subgroups for analysis to reduce heterogeneity. For multiple articles containing the same sample population, the one with the relatively larger sample size or longer follow-up time was included in our investigation, and the others were used as a reference.

The exclusion criteria were (1) studies such as case reports, book chapters, review articles, summaries of experience, and cadaver studies; (2) animal or cell studies; (3) studies involving participations who had rheumatoid arthritis or former infection in the knee; and (4) studies presenting data from original articles that cannot be expressed as the mean ± standard deviation.

Data extraction

The following information was extracted from each study: the first author’s surname; publication year; country of origin; participant characteristics (number, age, and gender); operated knees; trial duration; type of internal fixation; clinical outcomes, including range of motion (ROM), Hospital for Special Surgery knee score (HSS), American Knee Society Score (KSS), Visual Analog Scale pain score (VAS), Lysholm score and the total of the Western Ontario and McMaster University osteoarthritis (WOMAC) index; and radiographic results, including anatomical femorotibial angle (AFTA), hip-knee-ankle angle (HKA), mechanical axis deviation (MAD), mechanical medial proximal tibia angle (MMPTA), posterior tibial slope angle (PTSA), Caton-Deschamps index (CDI), Insall-Salvati Index (ISI), and Blackburne-Peel index (BPI). All processes of data extraction were performed by two reviewers (Xiangyun Cheng & Fanxiao Liu) repeatedly. Any discrepancies reached a consensus by discussion with an arbitrator (Fei Xiong).

Quality assessment

Quality assessment was performed for each included study using the “assessing risk of bias” table [29] for randomized controlled trial (RCT) studies and using the MINORS (Methodological Index for Nonrandomized Studies) checklist for nRCT articles. Discrepancies were resolved through discussion among the researchers (Xiangyun Cheng, Fanxiao Liu and Fei Xiong). Additionally, the quality of evidence for radiographic results and clinical outcomes was evaluated using the Grading of Recommendations Assessment, Development, and Evaluation (GRADE) approach [20].

Statistical analysis

For data measurement, the standardized mean difference (SMD) with a 95% confidence interval (CI) was pooled using Stata 12.0 (Stata Corp, College Station, TX, USA). According to the Cochrane Handbook, statistical homogeneity, clinical homogeneity, and methodological homogeneity were evaluating using I2 [28, 29]. If p > 0.1 and, I2 < 50%, considering the heterogeneity is small, a fixed effects model was performed. Otherwise, the causes of heterogeneity were investigated using subgroup analysis and sensitivity analysis [14]. If the cause could not be identified, a random effects model was used. Subgroup analyses were performed based on the preoperative and postoperative results of the two groups (OWHTO and CWHTO), the data from RCTs and nRCTs, follow-up (more than 1 year). Kappa values were used to evaluate the degree of agreement between the two authors as follows: fair 0.40 to 0.59, good 0.60 to 0.79, and excellent 0.8 or more [29].

Results

Results of the search and study selection

After screening the titles and abstracts, 2708 of 2759 records (2748 retrieved from the 3 databases and 11 identified from references) were excluded. Subsequently, after downloading and identifying the full text, 3 articles with unavailable data, 8 duplicates, 3 cadaver studies, and 3 reviews were excluded (Fig. 1). Eventually, 28 articles [10,11,12,13, 15,16,17,18, 21,22,23,24,25,26,27, 30,31,32,33,34,35,36,37,38,39,40,41,42] published between 1999 and 2018 met the inclusion criteria and were included in the meta-analysis.

Fig. 1
figure 1

Flow diagram of included and excluded studies

Basic characteristics of included studies

The basic information of the included studies is summarized in Table 1. Of these articles, eight articles [10, 11, 13, 24, 26, 27, 30, 31] were randomized controlled trials (RCTs) and 20 articles [12, 15,16,17,18, 21,22,23, 25, 32,33,34,35,36,37,38,39,40,41,42] were nRCTs. Two studies [27, 30] used the WOMAC score, and only one study used the Lysholm score [13]. The duration of follow-up in these studies ranged from 6 months to 13 years. These studies included a total of 2788 participants (1370 in OWHTO and 1438 in CWHTO) involving 2840 knees (1384 in OWHTO and 1456 in CWHTO).

Table 1 The main information of the included RCT and prospective studies

Risk of bias

The risk of bias, which is used to assess RCTs, is presented in Fig. 2. The MINORS score assessing nRCTs was 15 ± 2, and their levels of evidence were III or IV (Table 2). The kappa values regarding the evaluation of the risk of bias in RCTs and nRCTs were 0.821 and 0.803, respectively, indicating the excellent degree of agreement between the two researchers (Xiangyun Cheng & Fanxiao Liu).

Fig. 2
figure 2

Risk of bias of included in randomized controlled trials. +, no bias; –, bias; ?, bias unknown

Fig. 3
figure 3

Preoperative comparison between two groups for PTS

Fig. 4
figure 4

Postoperative comparison between two groups for PTS

Table 2 The MINORS score of the relevant studies (NRCT)

Clinical outcomes

Preoperative HSS, VAS, and KSS were not statistically significant between the OWHTO and CWHTO groups (p > 0.05). Similarly, postoperative comparisons of the two groups showed no statistically significant differences (p > 0.05). However, for these three indicators, the postoperative scores were significantly better than the preoperative scores. The postoperative ROM difference between the two groups was not statistically significant (p > 0.05). OWHTO increased the length of the leg, while CWHTO decreased the condition (p < 0.05).

Radiological outcomes

Comparison of the AFTA between the CWHTO and OWHTO groups showed no statistically significant difference (p > 0.05) either at preoperation (SMD 0.04, 95% CI − 0.66 to 0.74) or postoperation (SMD 0.08, 95% CI − 0.23 to − 0.37) (Table 3). However, the postoperation AFTA was better than the preoperation AFTA (p < 0.05) for both the CWHTO and OWHTO groups, which demonstrated that the OWHTO and CWHTO surgeries were similar in function of correction. Furthermore, similar results of the H-K-A angle between the two methods were found (Table 4).

Table 3 Radiological and clinical results of the preoperative comparison and postoperative comparison between OWHTO and CWHTO
Table 4 Radiological and clinical results comparing preoperative and postoperative outcomes in OWHTO or CWHTO group

For PTS, whether using the posterior tibial cortex or the tibia mechanical axis as a reference line, there were significant differences (p < 0.05) between the CWHTO and OWHTO groups at postoperation. Meanwhile, the PTS increased (SMD − 0.71, 95% CI − 1.04 to − 0.37, p < 0.05) after OWHTO but decreased significantly after CWHTO (SMD 0.72, 95% CI 0.35 to 1.08, p < 0.05) (Figs. 3 and 4).

The patellar height, as evaluated by three indicators (BPI, CDI, and ISI), decreased significantly after OWHTO (p < 0.05), except for one subgroup of nRCTs in ISI (Table 3 and Table 4). However, the patellar height after CWHTO demonstrated no significant difference between postoperation and preoperation (p > 0.05).

All results of the preoperative comparison and postoperative comparison in the two groups are presented in Table 3. The comparison of preoperative and postoperative outcomes of each group is shown in Table 4. Considering that some included articles (follow-ups: less 1 year) may negatively skew the results, an analysis of subgroups of follow-ups more than 1 year was performed, and the results of subgroups were consistent with the overall results comparing postoperative outcomes of CWHTO and OWHTO (Table 5 and Table 6).

Table 5 Radiological results of the preoperative comparison and postoperative comparison between OWHTO and CWHTO after excluding the results of the follow-up within 1 year
Table 6 Preoperative and postoperative imaging outcomes of each group after excluding the results of the follow-up within 1 year

Discussion

This meta-analysis of 28 studies indicates that there are no differences (p > 0.05) in the function of correction (AFTA and HKA) and main clinical outcomes (HSS, KSS, and VAS) between OWHTO and CWHTO, that PTS increases after OWHTO but decreases after CWHTO (p < 0.05), and that patellar height (CDI, BPI, and ISI) decreases (p < 0.05) after OWHTO but does not change after CWHTO (p > 0.05).

The medial compartment receives a greater load (approximately 60%) in healthy knees than the lateral compartment [43, 44]. If the tibia has some degree of varus deformity, the pressure on the cartilage in the medial compartment will significantly increase beyond the range of tolerance, triggering cartilage wear and inflammation and resulting in medial osteoarthritis [4, 45]. Historically, HTO was first reported by Jackson et al. [46] in the 1960s. The original intention of HTO was to correct tibial varus deformity to properly transfer some stress to the lateral compartment, significantly reducing the pressure in the medial compartment and effectively preventing cartilage wear and relieving pain symptoms. After 50 years of development, HTO has evolved into a much safer, more accurate and more effective surgical procedure for patients with medial compartmental arthritis [37].

Previously, a clinical study including 39 HTO cases [47] presented a reliable result with the survival rate of 82% at 12-year follow-up. If seven patients with bicompartmental osteoarthritis receiving HTO were excluded in this study, it was foreseeable that these results would become better. Therefore, Berman et al. [47] emphasized that considerably strict indications and careful preoperative imaging assessments are critical for HTO. Currently, the best indications for HTO are relatively young, active patients with medial knee osteoarthritis or with a tibial varus greater than 5° [47,48,49]. Emphatically, most cases included in this meta-analysis are in line with these indications for HTO.

To evaluate the controversial clinical effects between OWHTO and CWHTO, we compared the outcomes according to the included articles [10, 13, 24, 27, 30, 31]. As the results imply, the postoperative results of VAS, HSS, KSS, Lytolom score, and WOMAC were better than the preoperative results in both groups (p < 0.05), which indicated that both surgical methods were effective. However, there was no significant difference comparing the postoperative results of CWHTO and OWHTO (p > 0.05). Additionally, it should be emphasized that no significant change was found in the range of motion after OWHTO or CWHTO (p > 0.05) in this study. Similarly, following up for a minimum of 3 years and 1 year postoperatively, Song et al. [21] and Kim et al. [24] both found that mean maximal flexion was not significantly changed after OWHTO or CWHTO. Furthermore, when evaluating the influence of HTO technique on the performance and results of total knee arthroplasty (TKA) at a mean follow-up of 2 years, Filho et al. [36] indicated that post-TKA range of motion was not different between OWHTO and CWHTO.

Regarding the comparison of radiological results, the correction angle evaluated by H-K-A and AFTA (built between the mechanical axis and anatomical axis of the tibia and femur, respectively), is one of the most important indicators that can influence surgical outcomes [24]. In this study, postoperative H-K-A or AFTA did not show significant differences between OWHTO and CWHTO (p > 0.05), while the postoperation H-K-A or AFTA was better (p < 0.05) than the preoperation values in both groups, which demonstrated that the correction functions of OWHTO and CWHTO were similar. Three previous studies [13, 41, 44] showed that both CWHTO and OWHTO could provide acceptable correction, which was consistent with our results. Considering the degree of correction, Aglietti et al. [50] indicated that the functional effect of surgery would be maintained at a follow-up of 10 years when keeping the valgus angle (AFTA) at 8 to 15° after HTO. The significant under-correction caused by the incompletely closed gap in CWHTO or the bone deficit in OWHTO could not reduce the medial tibial plateau load, leading to the loss of osteotomy angle and recurrence of varus. Additionally, overcorrection would cause an excessive lateral compartment load, accelerating degeneration of the articular cartilage [51, 52].

The original goal of HTO was to correct deformities of the coronal plane, but unexpected events, the changes in the PTS, occurring in the sagittal position are always inevitable [53, 54]. Our results showed that PTS increases after OWHTO (p < 0.05) and decreases (p > 0.05) after CWHTO, which is consistent with several comparative studies [16, 18, 23, 24, 26]. Multiple biomechanical and clinical studies attempted to identify the reasons for changes in PTS after HTO [33, 40, 44, 55]. First, surgeons cannot release the posterior soft tissues sufficiently [12], considering to protect muscles and posterior vessels [16, 22]. The other reason is that the triangular shape of the proximal tibia with the apex directed anteriorly makes the center of rotation angulation slightly posterior in the sagittal plane [8, 37, 56]. The demonstrations above cause a less opened posterior gap after OWHTO and a less touched area of the posterior bone after CWHTO.

Flexion and extension activities can be affected by unintended alterations of PTS in the sagittal plane [44, 55]. Second, a linear relationship between tibial slope and tibial translation during unilateral weight-bearing was shown: the greater the angle of slope was, the greater the anterior translation in knees was [12, 55]; therefore, an increased anterior tibial translation after OWHTO can aggravate the load of the anterior cruciate ligament (ACL) [43].

To avoid the undesirable results caused by PTS, many researchers and clinicians have proposed many novel surgical techniques. Nerhus et al. [26] pointed out that PTS after OWHTO increased by a small amount or did not increase at all if the fixation plate was placed close to the posteromedial corner. The osteotomy height and the opening gap at the posteromedial cortex should be higher and broader than those at the tibial tuberosity, which can reduce or avoid changes of PTS pointed out by Giffin et al. [55]. Careful preoperative planning is also quite essential for reducing or preventing changes of PTS in HTOs [26, 38].

Three indicators (BPI, CDI, and ISI) were used to evaluate changes in the patella height. OWHTO reduced the patella height (p < 0.05), while statistically significant effects on the patellar height were not found after CWHTO (p > 0.05). Using the measurement of CDI, some researchers reported that patellar height decreases by 9 to 16% after OWHTO compared with that before surgery [15, 27, 30, 42], which was similar to our results. However, consistent with our pooled results, Song et al. [21] indicated that patellar height remained unchanged after CWHTO.

The reason for patellar decreasing after OWHTO is that the opening wedge of the medial tibia prolongs the proximal tibia, resulting in a reduction in the height of the patellar tendon attachment site, thus causing a decrease in patellar height. This result occurs because CWHTO circumvents the drawbacks of OWHTO that the patellar height has not changed significantly. Tigani et al. [42] indicated that, according to the measurement of 40 knees after OWHTO, patellar lowering would become significant if the correction of the knee axis exceeded 15°. However, when the tibial tuberosity is left and attached to the proximal tibia [38], the patellar height can maintain unchanged after OWHTO, and the osteotomy site still has excellent blood perfusion due to the cancellous bone surface and is therefore very conducive to bone healing [42].

The main limitations of this systematic review and meta-analysis originate from the data pooled from the included articles. RCTs and nRCTs were both included when comparing OWHTO and CWHTO, especially having a greater caseload in the nRCTs, which may likely lead to bias. Nevertheless, the MINORS scores were acceptable when evaluating the quality of nRCTs, and the results of RCTs and nRCTs were processed respectively in order to reduce heterogeneity. Additionally, the follow-up periods and the internal fixation were diverse, which may affect the final results of the two surgical methods. In some subgroups, we did not find enough data, such as for ISI and BPI.

Conclusion

This meta-analysis indicates that compared with CWHTO, OWHTO increases the posterior slope, decreases the patellar height, and provides a similar accuracy of correction; however, CWHTO led to a deceased posterior slope and an unchanged patellar height. Therefore, programs should be personalized and customized for the specific situation of each patient.