Introduction

Knee osteoarthritis (OA) is common [6, 94]. Knee OA impairs joint function and quality of life, limiting physical activities and patient independency [71, 73, 115]. Total knee arthroplasty (TKA) is advocated for end-stage knee OA [46, 74, 85]. Both mobile (MB) and fixed (FB) bearing implants are available for primary TKA [1, 49]. FB implants were introduced first, and still represent the most common type of TKA [21, 84]. The polyethylene inlay of FB implants is secured on the tibial plateau. On the other hand, MB implants allow rotation of the polyethylene inlay around its longitudinal axis, miming the physiological kinematics of the knee and promoting a wider range of motion [22, 34, 93]. Previous evidence suggested that MB may promote greater outcomes in functional scores and complications [40, 44, 45, 75, 95]. However, the difference was minimal, and whether mobile bearing provide better outcomes remains controversial [5, 16, 38, 46, 63, 82, 96, 100, 103, 111]. Several randomized clinical trials (RCTs), which have not been yet considered in any previous meta-analyses, have recently been recently published [8, 24, 28, 59, 93, 105, 107, 123]. An update of current evidence could clarify whether MB implants promote greater outcomes to FB in TKA in terms of outcome and complication rate. This meta-analysis compared MB versus FB for primary TKA in terms of implant positioning, patient reported outcome measures (PROMs), and complications. It was hypothesised that MB promotes better outcomes than FB implants in primary TKA.

Materials and methods

Eligibility criteria

All the clinical investigations comparing mobile versus fixed bearing for primary TKA were considered. Only randomized clinical trials (RCTs) with level I to II of evidence, according to Oxford Centre of Evidence-Based Medicine [47], were considered. Only articles in English, German, Italian, French, and Spanish were eligible. Only studies published in peer reviewed journals with accessible full-text article were considered. Only studies which clearly stated the number of included procedures with a minimum of 8 months follow-up were considered. Reviews, opinions, letters, and editorials were not considered. Animals, in vitro, biomechanics, computational, and cadaveric studies were not eligible. All studies investigating the efficacy of experimental rehabilitation protocols were also not included. Studies reporting revision surgeries were also excluded from the analysis.

Search strategy

This meta-analysis was conducted according to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses: the 2020 PRISMA statement [89]. The PICODT algorithm was preliminary pointed out:

  • P (Population): end-stage knee osteoarthritis;

  • I (Intervention): TKA;

  • C (Comparison): Mb versus Fb;

  • O (Outcomes): implant alignment, surgical duration, range of motion, PROMs, complications;

  • D (Design); RCT;

  • T (Follow-up): minimum 8 months.

In February 2022, the following databases were accessed: Pubmed, Web of Science, Google Scholar, Embase. The search was limited to RCTs, with no time constrains. The following keywords were used in combination using the Boolean operator AND/OR: knee, osteoarthritis, total, arthroplasty, replacement, prosthesis, implant, mobile bearing, fixed bearing, patient reported outcome measures, PROMs, function, efficacy, complication, revision, reoperation, pain, outcome.

Selection and data collection

Two authors (F.C. and K.E.) independently performed the database search. All the resulting titles were screened and if suitable, the abstract was accessed. The full-text of the abstracts which matched the topic was accessed. A cross reference of the bibliography of the full-text articles were also screened for inclusion. All disagreements between the authors were debated and, if necessary, solved by a third author (NM).

Data items

Two authors (F.C. and K.E.) independently performed data extraction.

The following data at baseline were extracted:

  • Generalities of the study: name of the first author, year of publication and journal, length of the follow-up, number of patients, percentage of women (%), body mass index (BMI).

The following data at baseline and at last follow-up were extracted:

  • Range of motion (ROM);

  • PROMs: Oxford Knee Score (OKS), Western Ontario and McMaster Universities Osteoarthritis Index (WOMAC), visual analogue scale (VAS), Knee Society Score (KSS), and relate function (KSFS and pain (KSPS) subscales.

    The following data at last follow-up were collected:

  • Implant alignment: tibial slope, delta angle, alpha femoral component angle, gamma femoral component angle, beta tibial component angle, tibiofemoral alignment angle, posterior condylar offset, radiolucent lines;

  • Surgical duration;

  • Complications: anterior knee pain (AKP), revision, aseptic loosening, fractures, deep vein thrombosis (DVT).

Study risk of bias assessment

The risk of bias was valuated using the software Review Manager 5.3 (The Nordic Cochrane Collaboration, Copenhagen). The risk of bias was evaluated, based on the guidelines in the Cochrane Handbook for Systematic Reviews of Interventions [27], by the two reviewers (F.C. and K.E.). The following endpoints were evaluated: selection, detection, performance, attrition, reporting, and other bias. To assess the overall risk of publication bias, the funnel plot of the most commonly reported outcome was performed. The funnel plot charted the standard error (SE) of the Log Odd Ratio (LogOR) versus its OR. The degree of asymmetry of the plot is directly proportional to the degree of bias. To assess the risk of bias of each included studies, the risk of bias graph was performed.

Statistical analysis and synthesis methods

The statistical analyses were performed by the main author (F.M.). For descriptive statistics, the IBM SPSS software (version 25) was used. The mean difference and standard deviation were adopted. The T test was performed to assess baseline comparability, with values of P > 0.1 considered satisfactory. For the meta-analyses, the software Review Manager 5.3 (The Nordic Cochrane Collaboration, Copenhagen) was used. For continuous data, the inverse variance method with mean difference (MD) effect measure was used. For binary data, the Mantel–Haenszel method with odd ratio (OR) effect measure was used. The confidence interval (CI) was set at 0.95 in all the comparison. Heterogeneity was assessed using \(\chi\)2 and Higgins-I2 tests. If \(\chi\)2 > 0.05, no statistically significant heterogeneity was found. If \(\chi\)2 < 0.05 and Higgins-I2 > 60% high heterogeneity was found. A fixed model effect was used as default. In case of high heterogeneity, a random model was used. Overall values of P < 0.05 were considered statistically significant.

Results

Study selection

The literature search resulted in 414 articles. After removal of duplicates (N = 200), a further 140 articles were not eligible for the following reasons: study design (N = 78), language limitation (N = 17), short follow-up (N = 19), lacking quantitative data under the endpoints of interest (N = 26). Finally, 74 comparative studies were included. The results of the literature search are shown in Fig. 1.

Fig. 1
figure 1

Flow chart of the literature search

Risk of publication bias

The funnel plot of the most commonly reported outcome (revision) was performed to assess the risk of publication bias. The plot evidenced very good symmetry, with optimal distribution of the estimated effects of the included studies. The Egger’s test score was P = 0.6, attesting a low risk of publication bias (Fig. 2).

Fig. 2
figure 2

Funnel plot

Study risk of bias assessment

Given the randomized design of the included studies, the risk of selection bias was low. The risk of detection bias was low to moderate, as was the risk of attrition and reporting biases. The risk of other bias was also low to moderate. Concluding, the quality of the methodological assessment was good. The Cochrane risk of bias graph is shown in Fig. 3.

Fig. 3
figure 3

Methodological quality assessment

Study characteristics and results of individual studies

Data from 11,116 procedures were retrieved. 69% (7670 of 11,116 patients) were women. The mean follow-up was 58.8 (7.5 to 315.6) months. The mean age was 67.5 ± 5.9 years, the mean BMI was 28.6 ± 2.3 kg/m2. Comparability was found at baseline concerning the mean age, mean BMI, female, ROM, KSS, OKS, KSS pain, WOMAC, VAS, and KSS function. Generalities and patient baseline of the included studies are shown in greater detail in Table 1, the baseline comparability between the two groups at baseline in Table 2.

Table 1 Generalities and patient baseline of the included studies
Table 2 Baseline comparability of the two groups

Results of syntheses

Eighteen studies (3827 procedures) were included in the comparison of ROM [4, 9, 10, 12, 14, 18, 23, 25, 41, 54, 59, 60, 62, 70, 75, 118]. The MB group demonstrated greater ROM (MD 1.58; 95% CI 0.22 to 2.93; P = 0.02; Fig. 4).

Fig. 4
figure 4

Forest plot of the comparison: ROM (IV: inverse variance; CI: confidence interval). The square represents the effect of each single study. The horizontal line represents the confidence interval of each study. The vertical line “0” represent the no effect threshold. The diamond represents the final effect of the overall analysis

Thirty-one studies (5094 procedures) were included in the comparison of the KSS score [4, 10, 12, 13, 17, 18, 25, 32, 33, 39, 41, 42, 48, 54, 57, 59, 60, 62, 66, 70, 75, 83, 87, 91, 95, 96, 104, 106, 107, 110, 118]. The MB evidenced greater KSS score (MD 1.23; 95% CI 0.85 to 1.61; P < 0.0001; Fig. 5).

Fig. 5
figure 5

Forest plot of the comparison: KSS score (IV: inverse variance; CI: confidence interval). The square represents the effect of each single study. The horizontal line represents the confidence interval of each study. The vertical line “0” represent the no effect threshold. The diamond represents the final effect of the overall analysis

Thirty-two studies (6489 procedures) were included in the comparison of rate of deep infection [4, 7, 13, 14, 17, 23, 25, 33, 35, 36, 39, 41, 42, 55,56,57, 60,61,62,63,64, 67, 70, 72, 76, 95, 104, 113, 122].The MB group evidenced a greater rate of deep infections (OR 1.64; 95% CI 1.07 to 2.52; P = 0.02; Fig. 6).

Fig. 6
figure 6

Forest plot of the comparison: rate of deep infection (M–H: Mantel–Haenszel; CI: confidence interval). The square represents the effect of each single study. The horizontal line represents the confidence interval of each study. The vertical line “0” represent the no effect threshold. The diamond represents the final effect of the overall analysis

No difference was found in implant positioning: tibial slope, delta angle, alpha femoral component angle, gamma femoral component angle, beta tibial component angle, tibiofemoral alignment angle, posterior condylar offset, radiolucent lines. No difference was found in duration of the surgical procedure. No differences were found in the following PROMs: OKS, WOMAC, VAS, function and pain subscales of the KSS score. No difference were found in the rate of the following complications: AKP, revision, aseptic loosening, fractures, DVT.

Discussion

The main finding of the present study was that the MB implants performed in a similar fashion to FB implants for TKA. The analyses evidenced greater KSS, ROM, and rate of the deep infection in MB implants. However, though statistically significant, their clinical relevance is likely limited. Concerning the KSS score, its overall difference between the two implants does not overcome their minimal clinically important difference (MCID), which has been estimated between 6/100 and 9/100 [40, 52, 68, 69, 112]. A formal MCID for the ROM has not yet been estimated. However, given its minimal difference, the clinical relevance of this finding is dubious. Marques et al. [75] conducted a RCT on 99 patients. After 1-year follow-up, a statistically significant increase in ROM and KSS was found in the MB group. After 4-year follow-up no difference was found between the MB and FB group. Kim et al. [59] conducted a RTC on 291 patients, with a follow-up period of 27 years. No differences were found in ROM and KSS between the two groups. Powell et al. [95] analysed 167 patients at 10 years of follow-up, with no statistically significant difference in KSS between MB and FB groups. However, a trend was seen with higher mean scores over the years for the MB group. Given the minimal difference between the two groups in ROM and KSS, the clinical relevance of these findings was dubious. A slight improvement of PROMs was not necessarily associated with a functional advantage [15]. The minimal functional improvement may be explained by greater axial rotation promoted by the MB implants [29, 53, 99]. Amaro et al. [8] evaluating kinematic differences in 64 patients, found that axial rotation was higher in the MB group after 1 year, but disappeared at 2-year follow-up. A histological study showed the development of fibrotic tissue in the synovial membrane and infrapatellar fat pad after a TKA [3]. This produces a hardening effect that may minimize the kinematic differences between the MB and FB groups [8]. MB actively corrects the rotational femoral offset while standing, improving stepping and squatting [46]. However, this difference is not clinically relevant [88]. Moreover, different types of MB implants have different kinematics during stepping and squatting [46], and the final clinical outcome of MB can be influenced by the brand. A long-term study comparing different types of MB and FB implants could be useful to further understand the real benefits of different type of prostheses.

The rate of deep infection was strongly influenced by the study by Breeman et al. [17], which weighted 17.6% on the final effect. Indeed, when conducting the analyses without those data [17], the rate of deep infection is similar between the two groups. Nevertheless, the authors evidenced no difference between the two implants in terms of infections in their study [17]. Indeed, a deep infection was present in 12 of 276 patients in the MB group, and in 6 of 263 patients in the FB group [17]. Some limitations that may have influence our results should be discussed. The authors conducted a multicentre study involving 116 surgeons [17, 99]. Surgeon experience and approaches, implants design and post-operative protocols were not considered.

No differences in radiographic alignment were shown in the present study. Only one study showed a radiographic difference in patellar translation [104]. A tendency to increase patellar translation in the MB group was also evidenced in the present study. In MB implants, the rotation of the tibial component and the variable position of the tibial relative to the femoral implant can affect patellar tracking [90]. However, other meta-analyses comparing patellar translation did not evidence any differences between MB and FB implants [86, 111].

The MB design has been introduced to better simulate knee kinematics, reducing contact stresses, aseptic loosening, and polyethylene wear [20]. The self-alignment promoted by the MB implants compensates the physiological tibial and the femoral component offset [30]. The latter has been hypothesized to improve the conformity between femoral component and mobile insert during stepping and squatting, thus reducing contact pressure and loosening of polyethylene wear [46]. However, this study was unable to identify differences between the two implants, in contest with previous evidence [59, 62, 105]. Though there is less wear at the femoral condyle interface in MB than in FB implant, the former produce additional wear at the surface of metallic tibial implant, which may explain the similarity in the rate of overall wear [105]. Only one study [37] showed a higher rate of aseptic loosening in the MB group. The risk was higher only in certain models. In the MB implants, the geometry of tibial component is such that the shortening of the keel and the under-face texture increase the risk for micromotion and aseptic loosening [26, 53, 65, 101, 104, 108].

This study certainly has limitations. The analyses were conducted irrespective of the surgical exposure and approach. In the present study, both minimally and standard invasive techniques were included. Surgical exposure may influence outcomes, and minimally invasive surgery performed by experienced surgeon may offer short- and mid-term clinical and functional benefits over the conventional exposure [78]. Moreover, the surgical approach may influence the clinical outcomes. A recent network meta-analysis demonstrated that the mini-subvastus approach outperformed all other approaches (mini-medial parapatellar, midvastus, quadriceps sparring) [77]. Patellar retaining or resurfacing has not been investigated, and may represent a further limitation [80]. Different inlay designs (posterior stabilized, cruciate/bicruciate retaining) were not considered as separate. A previous meta-analysis demonstrated no difference in the outcome between the posterior stabilized versus cruciate retaining [81], while no study which compared MB versus FB using bicruciate retaining implants were included in the present study. The manufacturer of the implants was often biased. MB implants are more sensitive to soft tissue release and optimal gap balancing over flexion and extension. Differently, in FB implants planned resection following the anatomical landmarks (anteroposterior and trans-epicondylar axis) can be performed [79]. Few authors appropriately described the surgical protocol, and further subgroups comparisons were not possible. This may generate bias and increase heterogeneity. The conclusion of the present meta-analysis should be considered with these limitations. Results of the present study indicated that bearing in TKA, whether mobile or fixed, does not influence the clinical outcome.

Conclusion

There is no evidence to support that MP implants promote better outcomes compared to FB implants in primary TKA. The analyses evidenced greater KSS, ROM, and greater rate of the deep infection in MB implants. However, though statistically significant, their clinical relevance is limited. Further clinical trials are required.