Introduction

Patellofemoral arthroplasty (PFA) for treatment of isolated patellofemoral osteoarthritis (OA) remains until today a controversial subject due to inconsistent results found throughout the existing literature [20, 26]. Patient selection, surgical technique, as well as implant choice have a direct effect on clinical outcomes. Historically, the first patellofemoral joint replacement was a vitallium cell patella cap designed by McKeever in 1955 [31]. Nowadays, PFA implant designs can be divided into two larger groups: inlay and onlay PFA.

First generation inlay designs, such as the Richard and Lubinus prosthesis, introduced back in 1979 [8], replaced only the worn cartilage, leaving the subchondral bone untouched. Short-term outcomes, were however not promising, with a low rate of patient satisfaction, but a high conversion rate to total knee arthroplasty (TKA) [9, 45, 47]. The second-generation, or onlay design, was introduced in the 1990s. Contrary to the first-generation inlay designs, the onlay trochlea prosthesis completely replaced the anterior compartment of the knee, providing a possibility of correcting trochlea rotation or for dysplasia [46].

Due to potential complications of onlay designs, such as patellar catching or anterior notching, overstuffing, alongside an increased bone loss when compared to inlay designs, new generation inlay trochlea implants have been introduced [18, 21, 28, 34]. These implants aim to reproduce the complex kinematics of the patellofemoral joint with less mechanical and patellofemoral complications, increased implant stability and no alteration to the soft tissue tension or extensor mechanism [11, 13, 15, 16, 41].

Up-to-date studies, which report or compare clinical or functional outcomes, complication rates, revision or conversion rates, as well as progression of OA between different trochlea designs, are limited. Hence, the aim of this study is to report the clinical and functional outcomes, complication rates, implant survivorship and the progression of the tibiofemoral OA, after inlay or onlay PFA. Comparison of different implant types and models, where it is possible, also represents one of the objectives. The extended information provided from this systematic review will help physicians improve the patients’ management, functional, clinical outcomes and, therefore, patient satisfaction.

Materials and methods

A systematic literature search following PRISMA guidelines [37] was conducted on PubMed, Scopus, Embase and Cochrane databases to identify possible relevant studies, published from the inception of these databases until 11.11.2022. The study protocol has been registered and approved by Prospero (CRD42022330285). The search strategy can be found in Additional Material 9. Randomized control trials (RCTs), case series, case control studies and cohort studies, written in English or German, and published in peer-reviewed journals after 2010, were included in the title and abstract screening of this review. Not original studies, case reports, simulation studies, systematic reviews, or studies that included patients who underwent TKA or unicompartmental arthroplasty (UKA) of the medial or lateral compartment of the knee, were excluded. In a second step, full text analysis was performed by two authors. Articles that assessed functional and/or clinical outcomes, patient-reported outcomes (PROMs) (i.e. Knee Society Score [KSS], Oxford Knee Score [OKS], Western Ontario and McMaster Universities Arthritis Index [WOMAC], Knee Injury and Osteoarthritis Outcome Score [KOOS], American Knee Society Score [AKSS], Visual Analog scale [VAS], Hungerford and Kenna Score [HKS], International Knee Documentation Committee Score [IKDC], International Knee Society Score [IKS], Anterior Knee Pain Score [AKP], etc.), radiographic progression of OA, complication rates, implant survival rates, pain, as well as conversion to TKA rates in patients treated with PFA, using inlay or onlay trochlea designs, were included. Additionally, only articles presenting their results in numerical data form were considered. Finally, surgical technique studies, abstract only studies, studies reporting outcomes after PFA with additional UKA, robotic PFA, or reporting outcomes of the patellar components of TKA, or comparing PFA with TKA, as well as studies which did not report preoperative data, have been also excluded. In case of discrepancy regarding eligibility criteria a third author was consulted.

Quality assessment

In order to assess the quality of the included studies, the Methodological Index for Non-Randomized Studies (MINORS) for non-comparative and comparative clinical intervention studies was used [44]. The global ideal score for non-comparative studies was 16 and for comparative studies 24. The level of evidence of each included study war also reported. With the sole purpose of improving the systematic review’s quality, articles which did not meet a score of at least 11 for non-comparative studies or at least 16 for comparative studies according to MINORS have been excluded.

Data extraction

Title, author names, study design, year and journal of publication, abstract, level of evidence, follow-up time, design of the trochlea implant, clinical outcomes, functional outcomes, revision rates, complication rates, conversion to TKA rates, progression of OA, as well as reported pain levels and PROMs were extracted into a Microsoft Excel spreadsheet (MS Microsoft, USA).

Statistical analysis

Continuous variables were described with means and standards deviations or medians and ranges. Categorical variables were reported with absolute and relative frequencies. A p < 0.05 was considered statistically significant.

Results

The literature search identified 404 publications in the initial screening process. Twenty-nine of them met all the inclusion criteria following the selection process. A detailed overview of the process is shown in Fig. 1. Median MINORS for non-comparative studies value was 12.5 (range 11–14), and for comparative studies 20.1 (range 17–24). Results from a total number of 1,761 patients were evaluated (median age at surgery 53 years, range 22–92 years). The reported median body mass index (BMI) was 26.4 (range 20–50.8). Several scores (OKS, KSS, KOOS, WOMAC, VAS, IKDC, AKP, HKS, IKS, Hospital for Special Surgery Patellofemoral Score [HSS-PF], University of California Los Angeles Score [UCLA], Short Form-36 Items [SF-36], Short Form-12 Items [SF-12], Melbourne Knee Score, Lysholm, Tegner, Kujala, Bartlett), alongside postoperative range of motion (ROM), implant survivorship, rate of complications, conversion to TKA and progression of OA, were used to evaluate clinical and functional outcomes. The detailed characteristics of the included studies are presented in Table 1.

Fig. 1
figure 1

Flowchart of the study selection process according to the PRISMA 2020 statement: an updated guideline for reporting systematic reviews [37]. LCS low contact stress, TKA total knee arthroplasty, PROMs patient-reported outcome measures, PFA patellofemoral arthroplasty, UKA unicondylar knee arthroplasty

Table 1 Overview of reported patients

In terms of OKS, 13 included studies, have reported improved postoperative scores, when compared to preoperative ones [1,2,3,4, 14, 19, 24, 32, 33, 36, 38, 42, 48]. No difference was observed between inlay and onlay implants, in terms of OKS [16]. Although, a couple of studies, which do not mention p values or confidence intervals do exist [14, 33], the overwhelming majority of the findings qualify as statistically significant (p < 0.05). Patients have been followed at short, short to medium, medium and long terms. Collected data can be found in Table 2.

Table 2 Overview of reported OKS

When discussing WOMAC, seven studies state that both inlay and onlay designs yield improved postoperative scores [1, 16, 17, 22, 23, 32, 40]. Feucht et al. also directly compared WOMAC scores, between onlay and inlay designs at a median follow-up of two years. There was no difference between the reported scores in the two groups [16]. WOMAC scores were reported at medium- and long-term follow-ups. All reported results are statistically significant (p < 0.05). Collected data can be found in Table 3.

Table 3 Overview of reported WOMAC

In the case of ROM, 12 studies were identified for data extraction. The majority of the studies reported an increase in the postoperative ROM when compared to preoperative values [1, 12, 19, 24, 27, 35, 38, 43, 50]. Contrary to the majority, Al-Hadithy et al. reported no change in ROM, when comparing preoperative values to 12-months follow-up ones [4]. Furthermore, Ajnin et al. actually reported a decrease in ROM values at a median follow-up of 65 months (range: 30–119), when compared to preoperative values [2]. No studies were found which directly compared ROM values between onlay and inlay designs. ROM was reported preoperatively and postoperatively at short-, short-to-medium-, medium and long-term follow-ups. Collected data can be found in Table 4.

Table 4 Overview of reported ROM

Regarding KSS, almost all of the nine analysed studies reported an increase in both postoperative clinical/objective scores and functional scores, when compared to preoperative ones [5, 7, 12, 19, 30, 35, 43, 50]. Both currently circulating variants of KSS were used (KSS 1989 and KSS 2011). With the notable exceptions of Morris et al. [35], who did not mention the statistical significance and Bernard et al. [7], who did present his findings as statistically non-significant, the remaining majority of analysed studies reported their findings as statistically significant (p < 0.05). The KSS scores were reported preoperatively and postoperatively at short to medium, medium and long-term follow-ups. Collected data can be found in Table 5.

Table 5 Overview of reported KSS

In addition, various other PROMs were reported. AKSS was reported by three studies, at short and long-terms intervals [1, 25, 43]. They have found improved scores postoperatively when compared with preoperative ones. Tegner score was reported by five studies [7, 12, 23, 30, 40]. Furthermore, Kujala score [42, 48, 50], Lysholm score [16, 42, 50], KOOS [1, 12, 38], SF-12 and SF-36 [1, 19, 38] UCLA [12, 30], MKS [19, 43], HKS [3], IKS [10], IKDC [23], AKP [10] and HSS-PF [50] were also presented. A small difference in Lysholm score values between inlay and onlay designs has been reported at a median follow-up period of 25.5 months (range not given), with the inlay group scoring slightly higher (66 ± 11 vs. 57 ± 22) [16]. With the notable exception of Mofidi et al. [33] all other authors present their findings as statistically significant (p < 0.05). The collected scores were reported preoperatively and postoperatively at short to medium, medium and long-term follow-ups. Collected data can be found in Table 6.

Table 6 Overview of reported PROMs

Nine out of ten identified studies have reported a statistically significant reduction in perceived pain (p < 0.05) [5, 16, 22, 23, 25, 30, 40, 48, 50]. When comparing onlay designs with inlay ones, Feucht et al. showed that although both groups exhibited the same mean postoperative VAS value (4 ± 3), the mean preoperative VAS value was much higher in the onlay group (8 ± 2), when compared to the inlay group (6 ± 2) [16]. Scores have been reported preoperatively and postoperatively at short, short to medium, medium and long-term follow-ups. Collected data can be found in Table 7.

Table 7 Overview of reported VAS

In the case of reported complications, complication rates and implant survivorship, the present findings tend to exhibit a high degree of heterogeneity. In total, 19 studies were identified [1,2,3, 5, 7, 10, 19, 22, 24, 25, 30, 32, 36, 38, 40, 42, 43, 49, 50]. Complication rates varied greatly among analysed studies, from as low as 0.0% [25, 36] to as high as 35.3% [49] or even 41.2% [50]. The most commonly reported complication was patellar maltracking, followed closely by anterior knee pain [1,2,3, 5, 7, 10, 19, 22, 24, 30, 32, 38, 42, 43, 49, 50]. Reported revision rates also exhibited an elevated degree of heterogeneity between them, with some studies stating low revision rates of 3.8% [42] or 3.9% [24], and others reporting high revision rates such as 50.0% [22] or even 55.0% [5]. No studies directly compared the type and rate of complications, or the rate of revisions between onlay and inlay type of prostheses. Results were reported postoperatively at short-to-medium-, medium- and long-term follow-ups. Collected data can be found in Table 8.

Table 8 Overview of reported complications and revision rate

In terms of progression of OA and conversion to TKA, an elevated variance in findings has been noted between the 23 analysed studies [1,2,3,4, 6, 7, 10, 12, 16, 19, 22,23,24,25, 30, 32, 36, 38, 40, 42, 43, 48, 49]. Reported rates of OA progression varied from as low as 0.0% [22, 23] and 3.9% [24, 49] at 24 and 60 months follow-up, up to 32.2% [12] and even 53.3% [16] at a median follow-up period of 48 months (range: 24–72 months) and 25.5 months (range not given) follow-ups. When comparing the rate of OA progression between inlay and onlay designs, Feucht et al. found a notable difference, with the reported rate of OA progression being 0.0% in the inlay group and 53.3% in the onlay group [16]. In the case of conversion to TKA, the reported rates of conversion vary from 0.0% [38] and 0.8% [30], at a reported median of 24.1 months (range: 6–34 months) and 84 months (range: 24–142 months) follow-ups, up to 27.8% [25] and 30.0% [10], at 6 and 20 years follow-ups. No difference in reported rates of conversion to TKA has been noted [16]. Results were analysed at short-, short-to-medium-, medium- and long-term follow-ups. Collected data can be found in Table 9.

Table 9 Overview of reported progression of OA

Discussion

The main finding of the present review was that both onlay and inlay PFA yield satisfactory clinical and functional outcomes at short-, medium- and long-term follow-ups. No difference between designs has been described, although only one study from Feucht et al. [16] directly compared onlay and inlay designs using WOMAC and Lysholm scores, which presented a small and statistically non-significant difference in favour of the inlay design. Both designs improved pain postoperatively and no difference between them in terms of postoperative VAS has been noted, although the onlay group presented a higher preoperative VAS [16].

Regarding complication rates, implant survivorship and revision rates, the studies presented a high degree of heterogeneity between them. The most common complication described was the patella maltracking, followed closely by anterior knee pain.

One interesting finding of the study pertains to the progression of OA in the tibiofemoral compartment. When comparing the inlay and onlay trochlea designs Feucht et al. [16] found a statistically significant difference, in favour of the inlay group.

There are several systematic reviews in the literature, which report on PROMs and survivorship of the patellofemoral arthroplasty [29, 39, 46]. However, this is the first systematic review, which undertakes such a comprehensive analysis of postoperative outcomes. Additionally, none of them presents the results of both onlay designs and the new inlay designs. Pisanu et al. noted satisfactory results at short to mid-term follow-ups, and a 10 years survivorship of 90% with onlay designs, whereas inlay type of prosthesis showed disappointing results, with high rates of complications and failures [39]. This might be due the fact that included studies had reported results of first generation inlay designs only.

The systematic review of Lonner et al. based on the Australian National Joint Registry also described a 5-year cumulative revision rate of more than 20% in the case of inlay, and less than 10% when discussing onlay [29]. This is also because only first generation inlay designs were analysed. Progression of tibiofemoral OA after a successful PFA was found as the most common reason for failure [29].

This study has several limitations. First of all, the lack of more than one available studies in the literature, which directly compared the new inlay type of trochlea prosthesis, with the onlay design. Furthermore, no available RCTs pertaining to this subject have been found in the current literature. Another weakness is the retrospective type of the majority of the included studies, which could have led to an unknown selection bias. Another important aspect is that there are no studies reporting at mid- and long-term follow ups regarding the new inlay type of prosthesis, meaning that safe conclusions, with regards to the clinical and functional outcomes, and the survivorship of this type of prosthesis should be drawn with all due caution. Lastly, many authors were consultants for the companies designing the type of prostheses investigated in the respective studies, which might have led to a conflict of interest.

This systematic review provides physicians with valuable information to improve patient management, functional and clinical outcomes, and increase patient satisfaction.

Conclusion

There is no difference in functional or clinical outcomes after PFA between the new inlay and the onlay designs, with both presenting an improvement in most of the scores that were used. A higher rate of OA progression was observed in the onlay design group.