Background

Total knee arthroplasty (TKA) is a standard surgical treatment for advanced-stage knee osteoarthritis [1]. Regarding the method used to evaluate its outcome, physician-derived scores had an important role in the past. Nevertheless, inconsistency was found between physician- and patient-derived scores [2]. In addition, several reports revealed that patient satisfaction after TKA is not high [3,4,5]. Therefore, patient-reported outcome measures (PROMs), such as the Western Ontario and McMaster Universities Osteoarthritis Index (WOMAC) [6], Knee Injury and Osteoarthritis Outcome Score (KOOS) [7], and Oxford Knee Score (OKS) [8], are used frequently to evaluate postoperative outcomes after TKA.

The Knee Society also revised the evaluation method for TKA to PROMs in 2011, and it was launched as the Knee Society Scoring System 2011 (KSS-2011) [9, 10]. This scoring system includes questionnaires related to the category of satisfaction, which directly evaluates patient satisfaction after TKA. Furthermore, questionnaires about the knee condition when more active patients enjoy leisure activities and sports were included to daily activities.

The Forgotten Joint Score-12 (FJS-12) is a PROM developed in 2012 [11]. Good joints are considered “no awareness,” and the FJS-12 is useful to evaluate joint awareness. FJS-12 correlated with PROMs associated with knee joints, such as WOMAC, KOOS, and OKS [11,12,13,14,15], and it is effective for outcome evaluation after TKA. The FJS-12 has an advantage in that it has 12 questions and fewer ceiling and floor effects. Nevertheless, the relationship between FJS-12 and KSS-2011, which are the most recently developed PROMs used to evaluate postoperative TKA performance, has not been validated. In particular, because KSS-2011 is the only PROM that contains a “patient satisfaction” category, the relationship between FJS-12 and subcategories of KSS-2011, including “patient satisfaction,” should be evaluated.

In addition, joint noise frequently occurs after TKA. Nam et al. [16] reported that noise after TKA is related to residual symptoms, whereas Kuriyama et al. [17] reported that noise was not correlated with patient satisfaction after TKA. Noise may be related to joint awareness after TKA. Nevertheless, no enough information is available on whether the noise after TKA is related to patient-reported outcomes, including joint awareness.

The aim of this study is to reveal how joint awareness correlates with knee symptoms, functions, postoperative satisfaction, and noise perception after TKA. To assess these questions, the relationship between FJS-12 and KSS-2011 and the perception of noise were evaluated. Furthermore, the relationship between PROMs (FJS-12, KSS-2011, and noise perception) and factors, such as TKA mechanism and range of movement (ROM), was assessed.

Methods

A total of 476 patients (598 knees) who underwent primary TKA from January 2007 to November 2017 at our hospital and related institutions and for whom ≥1 year had elapsed after TKA were enrolled in this study. After excluding patients who died or whose implants were removed owing to loosening or infection, the questionnaire survey, including FJS-12 and KSS-2011, was mailed to 451 patients (566 knees). We used the validated Japanese versions of the questionnaires and signed license agreements with the copyright owners.In addition, the noise in the knee after TKA was assessed via a questionnaire asking if the patients feel any noise during the activity of daily life. Noise scoring was defined on the basis of a 5-point scale: never feel (4 points), feel almost never (3 points), feel seldom (2 points), feel sometimes (1 point), and feel mostly (0 points). This questionnaire used the same expression as that of FJS-12. In patients who underwent bilateral surgery, the responses for two knees on each side were obtained. Valid responses for FJS-12, KSS, and noises were obtained from 289 patients (372 knees, questionnaire collection rate, 65.7%).From the medical records, age at operation, body mass index (BMI) at the time of the survey, and preoperative and postoperative ROM were examined. Hence, in 225 (295 knees) patients, valid data on all items of FJS-12, KSS2011, noise, preoperative ROM, and postoperative ROM were obtained.This study was approved by the institutional review board of our institution (approval number: 4681).

The 11 models of TKA implant used in this research were classified based on the mechanism as posterior stabilized (PS; 91), cruciate retaining (CR; 78), cruciate sacrificed (without post-cam mechanism) (CS; 102), and bi-cruciate stabilized (BCS; 24). The models used were Legion® PS (39 knees; Smith & Nephew, Memphis, TN, USA), Legion® CR (11 knees; Smith & Nephew), NexGen® LPS (31 knees; Zimmer, Warsaw, IN, USA), NexGen® CR (12 knees; Zimmer), Genesis II® PS (19 knees; Smith & Nephew), Genesis II® CR (31 knees; Smith & Nephew), Persona® CR (12 knees; Zimmer), Advance® CR (14 knees; Wright, Memphis, TN, USA), LCS® rotation platform CS (81 knees; Depuy), GMK Sphere® CS (19 knees; Medacta, Strada Regina, Switzerland), and Journey II® BCS (24 knees; Smith & Nephew). The TKA models were chosen by the surgeons’ preferences at operation.

Statistical analysis

For statistical examination, Cronbach’s α was used to evaluate the internal consistency of FJS-12, and > 0.9 indicated that the scale is reliable. The Shapiro–Wilk test was used to evaluate normal distribution. The ceiling or floor effects were considered to be present if more than 15% of the respondents achieved the highest or lowest possible score [13, 18]. Because the Shapiro–Wilk test revealed that FJS-12 and KSS-2011 did not have normal distribution, the correlation between FJS-12 and KSS-2011 was examined using Spearman’s correlation coefficient. Noise was examined also for correlation with FJS-12 or KSS-2011. In addition, FJS-12, KSS-2011, and noise examined the correlation between the patient characteristics (age at operation and BMI at questionnaire collection) and ROM pre- and postoperatively. Regarding the TKA mechanism, differences among the mechanisms in patient characteristics, ROM, FJS-12, KSS-2011, and noise were examined using analysis of variance (ANOVA). Regarding ANOVA, power analysis was performed and detection power was calculated. Multiple comparisons were performed with the Steel–Dwass test. JMP Pro 14.0.0 was used in statistical processing, and the rejection area was set to 5%.

Results

There were 180 female and 45 male subjects. The right side was 163 and the left side was 132. The demographic data for the subject’s characteristics are found in Table 1.

Table 1 Characteristics for validation sample

FJS-12 had an average score of 49.8 [standard deviation (SD), 28.4]. The ceiling effect was 15.6%, and the floor effect was 11.9% (Table 2). Regarding the average value of each of the 12 items, question 8 (“when you are standing up from a low-sitting position?”) had the worst score (average, 3.67 points). Regarding response rate, several patients (14.5%) did not answer question 12 (“when you are doing your favorite sport?”; Table 3). The Cronbach’s α value was 0.947. KSS-2011 had an average of 119.3 (SD, 32.4). The ceiling effect, floor effect, and the results of each item are found in Tables 2 and 4. A strong correlation was found between FJS-12 and the total scores of KSS-2011 (0.70; 0.65–0.75; p < 0.001 in 95% confidence interval; Fig. 1). The correlation between FJS-12 and the subscores of KSS-2011 by category is found in Table 4. A positive correlation was observed in all items, and correlation coefficients of approximately 0.60 were observed for “symptoms,” “patient satisfaction,” “functional activities,” and “standard activities.” Noise had weak correlations with FJS-12 (0.28; p < 0.001) and KSS-2011 (0.20; p < 0.001). Age, BMI, and ROM had also no effect on FJS-12, KSS-2011, and noise (Table 5).

Table 2 Postoperative Results of FJS-12, KSS-2011, and Noise
Table 3 Results of Forgotten Joint Score—12
Table 4 Postoperative KSS-2011 Subscores and Correlations to FJS-12
Fig. 1
figure 1

The graph reveals the relationship between the Forgotten Joint Score—12 (FJS-12) and the 2011 Knee Society Score (KSS-2011). Linear regression analysis revealed a strong correlation between FJS-12 and KSS-2011 scores with a correlation coefficient at 0.70 (95% confidence interval, 0.65–0.75; P < 0.001)

Table 5 Correlation Coefficients Among FJS-12, KSS-2011, and Noise

Moreover, FJS-12, KSS-2011, postoperative ROM, and noise were examined based on the mechanisms (CS, CR, PS, and BCS), and no remarkable differences were observed in FJS-12. Nevertheless, a remarkable difference was found in KSS-2011, postoperative ROM, and noise. Among the four mechanisms in KSS-2011 and postoperative ROM, BCS had the best performance, whereas noise had the lowest score. Multiple comparisons revealed that BCS had remarkably better KSS-2011 scores than PS, greater ROM than most of the other mechanisms, but lower noise score than CR (Table 6). Power analysis revealed more than 85% statistical power in detecting the difference among the groups with this sample number in each category of examination.

Table 6 Results of ANOVA by Mechanism of TKA

Discussion

This study revealed that FJS-12 and KSS-2011 had a strong correlation in patients who underwent TKA, confirming the validation of the two most recent scoring system with each other. Although noise was hypothesized to have an effect on joint awareness, only a weak correlation was found with FJS-12 and KSS-2011. In fact, BCS had the worst score in noise, but the best score in KSS-2011, and the greatest ROM. Noise is suggested to have a limited effect on joint awareness and clinical outcomes. Residual symptoms and joint functions rather than noise perception might be important for joint awareness after TKA.

Of the KSS-2011 subscores, correlation coefficients of approximately 0.60 were found for “symptoms,” “patient satisfaction,” and “functional activities” for FJS-12. It became clear that joint awareness was correlated with postoperative pain and activities of daily living. Although FJS-12 does not directly question patient satisfaction, it also can evaluate patient satisfaction because it also is associated with patient satisfaction from KSS-2011, which directly asks for satisfaction. In addition, as the functional activities of KSS-2011 are divided into subcategories (“walking and standing,” “standard activities,” “advanced activities,” and “discretional activities”), our study indicated a significant correlation between FJS-12 and standard activities. Therefore, joint awareness was influenced by the difficulty level of routine activities, such as walking on an uneven surface, turning or pivoting, climbing up or down a stairs, rising from a low chair, and stepping to the side. FJS-12 is a simple questionnaire with only 12 items and has few ceiling and floor effects. Therefore, FJS-12 is a useful tool to evaluate residual symptoms, daily activity performance, and patient satisfaction after TKA.

Noise was expected to be correlated with joint awareness. Nevertheless, it had only a weak correlation with FJS-12 and KSS-2011. Although the patients with TKA are aware of a noise, some studies have revealed that it has minimal influence on patient satisfaction if the functional aspects, such as ROM, are good [17]. The previous study was done with a TKA model using a unique tricondylar mechanism; therefore, the noise perception influence on PROMs for patients with conventional TKA models needed to be elucidated. In this study of TKA mechanisms, a remarkable difference was found between KSS-2011 and postoperative ROM, and the BCS results were good. A remarkable difference was confirmed regarding noise, and the BCS score was the lowest. Nam et al. [16] reported that noise was recognized in 27% of the patients and the likelihood of noise generation was different among the TKA mechanisms; PS design was the greatest, followed by rotating-platform, sex-specific, and CR factors. Furthermore, patient-recognized noise was reportedly associated with residual symptoms, such as difficulty in getting in and out of cars, limp, stiffness, or swelling. The inconsistency of the findings between the previous studies and our study can be caused by the difference in assessing noise frequency. The previous studies categorized subjects into two groups based on with or without noise, whereas our study used an ordinal scale. From our findings, the noise score had remarkable correlations both with FJS-12 and KSS-2011, but the correlation coefficients were small. It was suggested that even if noise was recognized, patient satisfaction postoperatively would be high if functions, such as the ROM, were good. Therefore, noise itself had a limited effect on joint function, overall satisfaction, and joint awareness after TKA. Nevertheless, it should be encouraged to find better implant design and surgical techniques to reduce the noise with high function after TKA.

This study had several limitations. First, there was a question with low response rate in FJS-12. As reported in previous studies [19, 20], the response rate for item 12 was particularly low. It may be reasonable to consider that some patients who underwent TKA were relatively inactive. Nevertheless, FJS-12 has been confirmed to be valid with a few missing items [11]. In addition, the collection rate of scores was low (65.7%). The reasons for this are as follows: in the collection process of the questionnaires by mail, we found some patients with newly confirmed death, unresponsiveness owing to dementia, and serious disabilities other than knee disabilities. In addition, some patients relocated and returned invalid responses. The process of eliminating the invalid responses would improve the response reliability even though it would reduce the response rate. Furthermore, the effect of bias owing to the low responsive rate on the results is estimated to be small because it is a cross-sectional study that examines the correlation within one sample and the final number of respondents was over 200.Second, this was a retrospective cross-sectional study, and there were time variations in data collection in the answers of FJS-12. In fact, some studies have found that the FJS-12 scores change over time [21]. The differences in questionnaire sampling time might have influenced the results. Nevertheless, we recruited patients for whom at least 1 year had passed postoperatively. Third, various TKA models had been used. Although the mechanisms can be categorized into four systems, several models remained within the same category. Furthermore, preoperative patient conditions were not matched among the groups. Therefore, this study did not conclude that BCS was superior in clinical outcomes than in any other TKA mechanisms. We did not attempt to investigate which mechanism had the best performance. The point of this analysis was to show that some TKA models exhibit greater noise generation while showing higher KSS-2011 score. Thus, this finding suggested that noise perception has a limited effect on clinical outcomes. Lastly, the noise score used in this study has not been validated previously. Because there is no score as a reference to evaluate the noise perception, the same questionnaire expression method as the FJS-12 was used to evaluate noise.

Conclusions

A strong correlation was found between FJS-12 and KSS-2011. In addition, FJS-12 was associated with the subcategories of KSS-2011 for symptoms, patient satisfaction, and standard activities. Noise did not have a remarkable association with joint awareness or clinical outcomes as assessed by FJS-12, KSS-2011, or ROM. Some knees exhibited greater postoperative ROM and KSS-2011 with more frequent noise perception.