Introduction

While the overall effectiveness of total knee arthroplasty (TKA) is largely unquestionable, the best surgical approach is still to be determined. The medial parapatellar approach, introduced by Von Langenbeck [1], is still regarded as the standard. The main benefit of this approach is that it provides the best exposure of knee surfaces. On the other hand, it has been criticized for introducing massive damage to the articular capsule, patellar and quadriceps tendons, extensor apparatus, soft tissue and vascular structures. Minimally invasive approaches to TKA have been evolved for the purpose of preserving the extensor mechanism as much as possible [2, 3]. The first minimally invasive approach for knee arthroplasty was described by Repicci and Eberle [4] for implantation of a unicompartmental prosthesis. Then came techniques extending to TKA. The mini-subvastus (MSV) approach was introduced by Hoffman et al. [5] in 1991 as a way to minimize extensor damage and preserve the vascular supply to the patella. Furthermore, this approach avoided eversion of the patella, thereby reducing the risk of tendon and cartilage damage [2, 3]. In 1997, Engh et al. [6] described the mini-midvastus (MMV) approach. In addition to the advantages of the MSV, this approach provided better exposure to all knee structures. Finally, Tria et al. [7] introduced the quadriceps-sparing (QS) approach in 2003 and Scudieri et al. the mini-medial parapatellar (MMPP) approach in 2004 [8]. The latter is a shortened version of the MPP. To our knowledge, no study has compared data from all these approaches to establish the best approach for TKA. Hence, the purpose of the present study was to perform a Bayesian network meta-analysis to compare these approaches and determine the most effective. We focused on perioperative data, clinical and functional outcomes.

Materials and methods

Search strategy

The present Bayesian network meta-analysis was carried out according to the PRISMA extension statement for reporting systematic reviews incorporating network meta-analyses of healthcare interventions [9]. To orient the literature search, the following features were defined:

  • P (population) end-stage knee joint disease;

  • I (intervention) total knee arthroplasty;

  • C (comparison) MPP, MMV, MSV, QS, MMPP;

  • O (outcomes) perioperative data, functional outcomes, clinical scores.

Data source

Two authors (FM and JE) independently performed the initial search. In October 2019, the main databases were accessed: PubMed, Google Scholar, Embase and Scopus. The following keywords were used in combination: total knee arthroplasty, total knee replacement, prosthesis, medial, quadriceps sparing, midvastus, subvastus, mini-medal parapatellar, KSS, KSFS, range of motion, flexion, straight leg raises, hospital duration. All pertinent titles and abstracts were screened, and if matching the topic, the full text was accessed. Bibliographies of the included studies were also cross-referenced. Disagreements between the authors were debated and mutually solved.

Eligibility criteria

All clinical trials comparing two or more approaches for primary TKA were considered for inclusion. According to the Oxford Centre for Evidence-Based Medicine, only articles with levels I and III evidence were considered for the present study. Articles were limited to English, German, Italian, French and Spanish. Case series, case reports, letters, expert opinions and editorials were excluded. No differences concerning the type of implants were made; only the surgical approach was pivotal for inclusion. Missing data under our outcomes of interest warranted exclusion. Disagreements were debated and mutually solved.

Outcomes of interest

Two authors (FM and JE) independently screened all articles resulting from the search. For each approach, study generalities and patient demographics were noted: type of study, number of procedures, duration of follow-up, surgical approach(es), percentage of female and osteoarthritic (OA) patients, mean age and body mass index (BMI). Data concerning the following outcomes of interest were collected: perioperative data (duration of surgery and hospitalization, total estimated blood loss), functional outcomes (range of motion (ROM), flexion, straight leg raise (SLR) [10]) and clinical scores (visual analogic scale (VAS) for pain, the Knee Society Score (KSS) and its related function subscale (KSFS) [11]).

Methodological quality assessment

The methodological quality of the present meta-analysis was evaluated using the PEDro appraisal score (http://www.pedro.org.au/english/downloads/pedro-scale/), which has been validated for this type of study [12]. The PEDro score is an aggregate of dichotomically assigned points given to studies based on the presence or absence of specific endpoints such as eligibility criteria, allocation, baseline comparability, blinding, follow-up, type of analysis, point estimates and variability. Values > 6 points are considered satisfactory.

Statistical analysis

The statistical analyses were performed by the senior author (FM). The baseline comparability was evaluated through the analysis of variance (ANOVA) test. Values of P > 0.5 were considered satisfactory to verify comparability. The statistical analysis was performed through the STATA software/MP 14.1 (Stata Corporation, College Station, TX). A Bayesian hierarchical random-effects model analysis was adopted for all the comparisons. We referred to the generic inverse variance statistic method for continuous data analysis with standardized mean difference effect measure. The edge network plot was performed to analyse connections, contribution weights between studies, and to detect direct and indirect comparisons. To evaluate loop-specific inconsistency, heterogeneity and related inconsistency factor (IF), the if test was performed. To evaluate for overall inconsistency, the equation for global linearity via the Wald test was used. If the P value was > 0.05, the null hypothesis could not be rejected and the consistency assumption could be accepted at the overall level of each treatment. The interval plot was performed to rank the estimated effect (EE) of the endpoints between them. Confidence and percentile intervals (CI and PrI) were each set at 95%. The funnel plot was performed to assess the risk of publication bias for each comparison.

Results

Identification of eligible studies

A total of 1715 articles were obtained from the initial search. Of them, 605 were duplicates. A further 377 were rejected because of poor levels of evidence. Another 86 articles were excluded because of language barriers: Chinese, Polish, unknown. Another 534 articles were excluded because they did not report quantitative data under the outcomes of interest. Thirty-eight were excluded because of uncertain data or incomplete results and 23 due to source of publication bias or excessive heterogeneous results. This last operation left 52 articles for review: 34 randomized clinical trials (RCTs), ten prospective cohort studies (PCS), eight retrospective cohort studies (RCS). The flow chart of the literature search is shown in Fig. 1.

Fig. 1
figure 1

Flow chart of the literature search

Methodological quality assessment

All included articles stated clearly their eligibility criteria and demonstrated satisfactory baseline comparability. A random and concealed allocation was performed in 32% of the studies. 8% performed a single blinding, and 15% had blinded assessors. Only a third of the included studies performed an adequate follow-up. Almost all the articles performed an adequate analysis, and the intention to treat was satisfied. In the end, the overall PEDro score was 7.42, detecting an optimal quality for the methodological assessment. The PEDro scores across the studies are shown in Table 1.

Table 1 Demographic baseline of the included studies

Patient demographics

Data from 4533 patients were collected. The mean follow-up was 20.38 (range 3 to 109) months. In the MVV group, a total of 880 patients were analysed. 96% suffered from OA, and 67% were female. Their mean age was 68.64 ± 3.5 years, and mean BMI was 29.46 ± 1.7 kg/m2. In the MPP group, a total of 2026 patients were analysed. 98% suffered from OA, and 69% were female. The mean age was 66.27 ± 7.9 years, and mean BMI was 29.29 ± 1.9 kg/m2. In the MSV group, a total of 604 patients were analysed. 90% suffered from OA, and 70% were female. The mean age was 67.72 ± 3.1 years, and the mean BMI was 29.61 ± 1.5 kg/m2. In the MMPP group, a total of 660 patients were analysed. 100% suffered from OA, and 66% were female. The mean age was 66.31 ± 2.5 years, and mean BMI was 28.00 ± 2.4 kg/m2. In the QS group, a total of 474 patients were analysed. 99% suffered from OA, and 73% were female. The mean age was 70.13 ± 1.9 years, and mean BMI was 29.03 ± 1.4 kg/m2. Among the studies, with regard to diagnosis, gender, age and BMI, adequate baseline comparability was detected (P = 0.7, P = 0.8, P = 0.9, P = 0.8, respectively). Patient demographics for the included studies are shown in Table 1.

Outcomes of interest

Concerning perioperative data, the MSV approach demonstrated the lowest duration of hospitalization (EE: − 5.63, 95% CI: − 6.48 to − 4.79), followed by the MPP approach (EE: − 2.99, 95% CI: − 4.61 to − 1.38). The MMPP approach reported the highest duration of hospitalization (EE: − 0.92, 95% CI: − 2.03 to 0.18). The test for overall inconsistency scored P = 0.2. The MPP approach demonstrated the lowest value of total estimated blood loss (EE: 363.84, 95% CI: 286.42 to 441.26) followed by the MSV approach (EE: 820.47, 95% CI: 737.19 to 903.75). The QS approach reported the greatest value of total estimated blood loss (EE: 957.31, 95% CI: 801.26 to 1113.37). The test for overall inconsistency scored P = 0.9. The MPP approach demonstrated the shortest surgical duration (EE: − 74.68, 95% CI: − 81.14 to − 68.21) followed by the MPP approach (EE: − 1.58, 95% CI: − 12.64 to 9.48). The QS approach reported the longest surgical duration (EE: 18.72, 95% CI: 5.31 to 32.13). The test for overall inconsistency scored P = 0.9. The network results concerning the endpoint perioperative data are shown in Fig. 2.

Fig. 2
figure 2

Results of the network comparison perioperative data

Regarding the functional outcomes, the MSV approach detected the greatest degree of flexion (EE: 24.94, 95% CI: 17.94 to 31.94) followed by the MPP approach (EE: 13.37, 95% CI: − 0.42 to 27.15). The MMV approach reported the lowest degree of flexion (EE: 0.07, 95% CI: − 13.41 to 13.55). The test for overall inconsistency scored P = 0.3. The MSV approach detected the greatest ROM (EE: 24.94, 95% CI: 20.15 to 29.73) followed by the MPP approach (EE: 7.34, 95% CI: 0.17 to 14.50). The QS approach reported the lowest ROM (EE: 0.19, 95% CI: − 7.11 to 7.48). The test for overall inconsistency scored P = 0.5. The MSV approach demonstrated the shortest SLR (EE: − 11.64, 95% CI: − 27.26 to 3.98) followed by the MMPP approach (EE: − 7.14, 95% CI: − 29.50 to 15.22). The MPP approach reported the longest SLR (EE: 14.64, 95% CI: − 14.11 to 43.38). The test for overall inconsistency scored P = 0.6. The network results concerning the endpoint functional outcomes are shown in Fig. 3.

Fig. 3
figure 3

Results of the network comparison functional outcomes

Concerning the clinical scores, the MSV approach demonstrated the highest KSFS scores (EE: 23.47, 95% CI: 13.38 to 33.56) followed by the MMV approach (EE: 17.06, 95% CI: − 8.00 to 42.11). The QS approach reported the lowest value of KSFS (EE: 0.70, 95% CI: − 11.64 to 13.03). The test for overall inconsistency scored P = 0.7. The MSV approach demonstrated the highest KSS scores (EE: 88.96, 95% CI: 76.25 to 101.68) followed by the QS approach (EE: 11.29, 95% CI: − 9.55 to 32.13). The MMV approach reported the lowest KSS scores (EE: − 21.29, 95% CI: − 46.99 to 4.41). The test for overall inconsistency scored P = 0.4. The MSV approach demonstrated the lowest VAS pain score (EE: − 2.33, 95% CI: − 3.37 to − 2.30) followed by the QS approach (EE: − 1.03, 95% CI: − 3.08 to 1.02). The MMV approach reported the highest value of VAS (EE: 0.50, 95% CI: − 1.10 to 2.10). The test for overall inconsistency scored P = 0.05. The network results concerning the endpoint clinical scores are shown in Fig. 4.

Fig. 4
figure 4

Results of the network comparison clinical scores

Discussion

The main results of this Bayesian network meta-analysis encourage to perform the minimally invasive subvastus approach for total knee arthroplasty. Concerning clinical scores, functional outcomes and length of the hospital stay, the MSV approach outperformed all other approaches. Surgical duration and total estimated blood loss were lower in the MPP approach. The transitivity between studies was always satisfied, and the equation for global linearity via the Wald test found no statistically significant inconsistency among the studies, attesting reliability of the present results.

The intention of the minimally invasive approaches is to provide quicker recovery after TKA by preventing damage to the extensor mechanism. However, the MPP demonstrated a very short length of the hospital stay compared to the other minimally invasive techniques. Regarding perioperative outcomes, the QS approach reported the longest surgical duration and the highest value of total estimated blood loss. This can be explained by the reduced joint exposure of minimally invasive surgeries compared with the traditional approach. Peersman et al. [63], in a retrospective cohort study including 6489 patients, found correlation between an increased surgical duration and an augmented risk of surgical site infection during TKA. Reduced visibility can considerably complicate component installation, prolong the learning curve and generate skin sloughs [52, 64]. The QS approach especially requires more attention to retractor positioning, since they damage bones (particularly when osteoporotic) and soft tissue [13, 30]. The advantage of the QS approach is that it can be easily converted to MMPP or MPP [13].

Concerning clinical scores (KSS, KSFS, VAS), the MSV approach performed better overall. The equation for global linearity via the Wald test evidenced no statistically significant inconsistency among the studies. Thus, the assumption of transitivity can be accepted. The MMV reported the lowest value of KSS and VAS, while the QS the lowest KSFS. Concerning functional outcomes (SLR, ROM, flexion), the MSV approach performed better overall. No statistically significant inconsistency was found through the equation for global linearity via the Wald test; therefore, transitivity between the studies is assumed. As expected, the MPP evidenced the longest SLR, while the MMV the lowest degree of flexion and the QS the lowest ROM.

One of the purposes of minimally invasive TKA is to reduce damage to the quadriceps tendon, in order to guarantee quicker recovery of the extensor mechanism function. The straight leg raise (SLR) is used to assess quadriceps restoration after TKA [65, 66]. SLR times were longer in the traditional MPP group and strongly improved in the other approaches, especially in the MSV approach, confirming reduced damage to the extensor mechanism in these approaches. Several studies have tried to quantify quadriceps destruction in minimally invasive TKA versus the traditional MPP using levels biomarkers indicative of muscle damage (e.g. creatine kinase, interleukin-6, myoglobin). However, their results are contrasting and controversial [67,68,69].

Comparing the traditional MPP to the other approaches for TKA, we highlight multiple advantages and disadvantages. First, all the other approaches provide a minimally invasive surgery. Second, they aim to preserve patellar vascularization [68], which can reduce the occurrence of patellar fractures, avascular necrosis, subluxations, dislocations, component loosening and rates of anterior knee pain [5, 7]. Third, they are supposed to promote quicker recovery of the quadriceps function and decreased post-operative pain [70, 71]. The MSV, along with the MMV, preserves the vastus medialis insertion [47, 72] and potentially reduces the risk of VMO denervation [71, 73]. Pagnano et al. [74] in a cadaveric study demonstrated that the VMO tendon inserts mostly down to the mild pole of the patella, thus proving that the MSV is the only approach able to preserve the VMO insertion on the patella. Furthermore, the MSV proximally avoids an incision to the descending genicular artery branches (musculoarticular branch) [75, 76]. Fourth, the MSV and MMV almost never require a lateral release [72, 77]. Lateral retinacular release during TKA is not fully understood. It is meant to improve patellar tracking, but can also damage patellar vascularization and reduce joint stability [78, 79]. However, due to difficult execution, a longer learning curve and the need for special instruments, it has not been very popular [13, 80]. The difficulty of execution can result in ligament–patellar maltracking, increased rates of polyethylene wear, loosening, imbalance and instability [81, 82]. To assist the surgeon, the use of a mobile window can facilitate exposure of knee surfaces and dedicated instrumentation should be considered [28, 42]. Not surprisingly, previous studies confirm that the MPP exposure is related to an optimal component positioning [35, 83, 84]. This has discouraged many surgeons from performing minimally invasive TKAs, and the MPP remains the most common approach for TKA. Indeed, our results evidenced that the MPP exposure required less surgical duration and lower estimated blood loss. However, we hypothesize that these results are strongly influenced by the learning curve.

This Bayesian network meta-analysis has several limitations. As previously mentioned, due to reduced visibility and augmented difficulties of installation, minimally invasive TKAs can result in implant malposition. Notwithstanding, implant positioning has not been evaluated, thus representing an important limitation of this study. Implant malposition relates to instability, loosening and consequent joint failure. Further studies should clarify this important endpoint. Another notable limitation of the present study is its lack of analysis for complications. This is due to a lack of data in the included studies under these endpoints. Further studies should implement analyses of complications and evaluate the feasibility of minimally invasive surgeries, especially when it comes to obese patients and patients with previous knee surgeries (e.g. high tibial osteotomies). Points of strength in this Bayesian network meta-analysis are the comprehensive nature of the literature search and the optimal baseline comparability, along with the high number of enrolled studies. To the best of our knowledge, this study represents the first study comparing multiple surgical approaches for TKA. Data from the present network analysis provide evidence in favour of the mini-subvastus approach for total knee arthroplasty. However, the present study represents a data statistical elaboration and, therefore, must be interpreted with caution.

Conclusion

According to the main findings of the present study, the mini-subvastus approach for total knee arthroplasty demonstrated superior overall compared to the other approaches. Orthopaedic surgeons should consider this approach in the light of the evidence and limitations of this Bayesian network meta-analysis.