Introduction

Knee osteoarthritis (KOA), characterised by degeneration of the articular cartilage in the knee joints causing crepitus and the growth of osteophytes in addition to stiffness and pain in the joint, eventually results in impairment of movement [1]. Age, sex and obesity are the most important risk factors for KOA; other factors such as lack of exercise, genetic predisposition, bone density, occupational injury, previous trauma and activities involving repeated knee bending can also contribute to the development of this condition [2]. The prevalence of KOA is increasing due to improved life expectancy and increase in obesity [3, 4].

The prevalence of KOA is higher than that of hip osteoarthritis (OA) [3]. In 2017, there were approximately 303.1 million people estimated to have hip and knee OA globally [5]). The prevalence rate was 3,754.2 of cases per 100,000 people. This represents a 9.3% increase from 1990 to 2017 [5]. The prevalence of KOA is increasing globally as well as in Sri Lanka. In Sri Lanka, the age-standardized prevalence of clinical KOA among females, estimated based on population statistics of the 2012 census, was 21.8% (95% CI: 21.7–21.9%) [6].

Because of the high prevalence of the condition and possible movement impairments the disease can result in, it is important to identify, assess and monitor the progress of the condition. Internationally, several tools are available for assessing the condition of a patient with KOA such as the Knee Injury and Osteoarthritis Outcome Score (KOOS) [7]. Among them, the Western Ontario and McMaster Universities Osteoarthritis Index (WOMAC) and the Intermittent and Constant Osteoarthritis Pain measure (ICOAP) are commonly used by orthopaedic doctors and therapists [8]. However, in Sri Lanka, no validated disease-specific instrument is available. Sri Lanka has a rapidly ageing population [9], and age-related musculoskeletal conditions and their subsequent health burden due to complications are expected to rise. In such a context, instruments that can assess the condition and its functional impairments within a relatively short time, and that do not require special training to administer and interpret are beneficial.

KOOS evaluates both short- and long-term consequences of knee injury and consequences of primary OA. It assesses the patient’s opinion about their knee and associated problems subjectively. The scale is available in about 49 languages [10] and has been validated for several orthopaedic interventions [11, 12].

This study aimed to validate KOOS among a Sinhala-speaking Sri Lankan population which will enable researchers and clinicians to obtain a reliable assessment of orthopaedic conditions and monitor response to treatment.

Methods

This study was carried out in two phases. Phase I consisted of translation of the scale from English to Sinhala. Phase II consisted of field testing the Sinhala version for reliability and validity.

Translation and cultural adaptation procedure

The English KOOS scale was translated into Sinhala as per the guidelines laid down by American Academy of Orthopaedic Surgeons (AAOS) [13]. The original English version was translated into Sinhala by two independent translators. One of them was familiar with the concepts aiming for clinical equivalence, whereas the other, a naive translator, was unaware of the concepts to ensure linguistic accuracy. The Sinhala version was synthesized by reconciling differences between the two translations, resulting in a common version. The synthesized version was back translated to English by two independent translators who were fluent in both English and Sinhala and who were unaware of the original scale. The back translation was compared with the original for consistency. An expert committee, consisting of medical professionals, a methodologist and translators, reviewed all translations and resolved discrepancies, producing a pre-final version. Four items in the scale were deemed not acceptable during reviewing as the activities indicated were not culturally relevant. These items were: item A7 - getting in and out of a car; item A9- putting on socks/stockings; item A11 - taking off socks/stockings; and item A13 - getting in and out of a bath tub. The items were replaced with functionally similar items that were relevant to the Sri Lankan culture. For example, “putting on socks/stockings” was changed to “wearing a trouser/skirt by bending the knee”; “taking off socks/stockings” was changed to “removing a trouser/skirt by bending the knee” “Getting in and out of a car” was replaced by “getting in and out of a vehicle/car”; and “having bath in a bathtub” was changed to “having a bath by getting water from a bucket”. The pre-final version was pretested on 30 individuals. No further issues were observed during the back translation and pre-testing.

Study design, setting and participants

A cross-sectional study was conducted at outpatient clinics and in-patient units of the Departments of Physical Medicine at the National Hospital, Kandy; Rheumatology and Rehabilitation Hospital, Ragama: and District General Hospital, Ampara in Sri Lanka.

A convenient sample of individuals aged 30 to 70 years with KOA, who had been referred for physiotherapy, were invited for the study. All KOA patients referred for physiotherapy were diagnosed by physicians using clinical and/or radiologic confirmation. Those who had undergone knee surgery, including arthroscopy, within the past 12 months, received intra-articular steroid injections within the past six months, who were diagnosed with other types of arthritis, fibromyalgia, congenital deformities of the lower limb, those who were not conversant in Sinhala and unable to give consent were excluded. A second group of participants aged 30 to 70 years receiving curative and/or rehabilitative services for conditions other than KOA was also administered the scale as a control group. These participants were selected through a screening procedure that involved answering questions about their medical history, including any prior history of osteoarthritis, joint issues, or knee pain. Both groups consisted of male and female Sinhala-speaking participants.

Sample size

KOOS scale has 42 items. Therefore, for a 10:1 ratio, a sample size of 420 was required [14]. Assuming a non-response rate of 10%, 467 participants were required for the validation of the scale. A further 100 participants without KOA were recruited as a control group. Debates regarding the ideal sample size for factor analysis are abound in the field. According to Guliford and Cattell, a minimum sample size for confirmatory factor analysis (CFA) is 200 and 250, respectively [15]. Based on that, a sample size of 467 was deemed adequate.

Study instruments

Socio-demographic characteristics and anthropometric data were obtained through a questionnaire and measurements.

KOOS scale

The KOOS-Sinhala scale had 42 items divided into five subscales: KOOS pain, KOOS symptoms, activities of daily living (KOOS ADL), function in sport and recreation (KOOS Sport/Rec), and knee–related quality of life (KOOS QoL). All items were scored on a five-point Likert scale ranging from 0 to 4; zero indicating no problem and four indicating extreme problems. Scores were converted to a scale from 0 to 100, with zero for extreme knee problems and 100 for no knee problems as per the guidelines for scoring given for the original English version.

Short form − 36 health survey questionnaire (SF-36)

The validated Sinhala SF-36 questionnaire was used to assess Health-Related Quality of Life [16]. It measures eight aspects of quality of life across eight subscales: physical functioning, role limitations due to physical health, bodily pain, general health, vitality, social functioning, role limitations due to emotional problems, and mental health. Each item is scored on a 0 to 100 scale where the lowest and highest possible scores are 0 and 100, respectively. The items in the relevant scale were averaged to obtain the scale scores. A high score denotes a better health state.

Data collection

The paper-based scales and questionnaires were distributed among the consenting participants who were able to read and write in Sinhala in the participating hospitals after details of the study were explained to them. The researchers interfered minimally in the completion of the questionnaires but were available to answer any queries the participants had. Data were coded and stored in password protected electronic formats.

A subsample of 50 individuals from the KOA patient group who visited the rehabilitation clinic after 2 weeks were randomly selected and re-administered the KOOS and SF-36 under the same conditions to assess test-retest reliability.

Data analysis

Summary statistics of socio-demographic characteristics are presented using means, frequencies and percentages. It was hypothesized that KOA patients will score significantly lower for KOOS and SF-36 subscales compared to participants without KOA. A p-value < 0.05 was considered significant. Reliability and validity of the instrument was assessed using the indices given below.

Floor and ceiling effects

Floor and ceiling values were considered acceptable if < 15% of the participants scored either the lowest score or the highest possible score, respectively [17].

Internal consistency reliability

Internal consistency of individual subscales was measured using Cronbach’s alpha. A Cronbach’s alpha value of 0.7–0.9 is indicative of a good internal consistency of the instrument [18].

Test-retest reliability

ICC was calculated by using a two-way mixed effects model to assess absolute agreement between the two measures. Based on the 95% confidence interval of the ICC estimate, values less than 0.5, between 0.5 and 0.75, between 0.75 and 0.9, and greater than 0.90 are indicative of poor, moderate, good, and excellent reliability, respectively [19].

Standard error of measurement (SEM)

SEM has been defined as the amount of variation or spread in measurement errors for a test [20]. It is a parameter for the amount of measurement error present in an instrument and is an indicator of the reliability of the instrument [21]. The SEM is calculated by the following formula.

$${SEM}\equiv{SD}\sqrt{1-ICC}$$

Minimum detectable change (MDC)

MDC is the change in the instrument’s score beyond measurement error [22]. It provides a value for the minimum change that needs to be observed to be confident that the observed change is real and not a product of measurement error [21]. The MDC was calculated using the following formula [22].

$$\text{MDC}=1.96 \ {*}\:\sqrt{2}\ {*}\:\text{SEM}$$

Validity

Construct validity

Construct validity of the instrument was assessed by testing a priori set of hypotheses about the correlation between the KOOS subscales and the SF-36 scale using the Pearson correlation coefficient. It was expected that the scales that measured similar constructs will demonstrate strong correlations. A correlation coefficient |0.68| was considered strong, between |0.36| and |0.67| as moderate, and <|0.35| as weak [23]. We hypothesized a strong convergence between KOOS pain score and SF-36 bodily pain score; and a moderate convergence between all other subscales of KOOS and SF-36 Physical Function score.

For divergent relations, we hypothesized that all subscales of the KOOS scale except KOOS QoL subscale will poorly or negatively correlate with SF- 36 emotional wellbeing scale and a weak positive correlation between KOOS QoL subscale and SF-36 emotional well-being scale.

Factorial Validity

Factorial validity was tested for a five-factor model using confirmatory factor analysis (CFA). The model fit was evaluated with several goodness-of-fit indices: Comparative Fit Index (CFI), Tucker Lewis Index (TLI), Standardized Root Mean Squared Residual (SRMR) and Root Mean Squared Error of Approximation (RMSEA). The standardized factor loading (standardized regression weight), modification indices (MI), and squared multiple correlation (R2) were used as indicators to select items to be retained in the CFA model. A value of more than 0.90 for CFI and TLI, Chi-square/degrees of freedom of less than 3 and a RMSEA value of < 0.08, and a SRMR < 0.08 were considered to indicate a good model fit [24, 25].

All analyses were performed using SPSS version 22 and CFA done using Lavaan package in R software [26].

Results

A total of 462 patients with KOA completed the scale. Forty-seven patients with incomplete responses were excluded. Thus, data from 415 patients were included in the study. The second group consisted of 100 participants without KOA. Of these participants, 79 were included in the study and the rest were excluded due to incomplete responses.

Missing baseline data

Seventy two of 17,430 items (0.41%) were missing for the KOOS data and 125 of 14,940 items (0.83%) were missing for the SF-36 data among the KOA patients (first group). Thirteen of 3,318 items were missing for the KOOS data and 0 of 2,844 items (0.83%) were missing for the SF-36 data among the participants without KOA (second group). Values were missing at random. A subsample of data without missing values was used for the CFA.

Sociodemographic characteristics of participants

Table 1 presents the characteristics of the two participant groups. The mean age (± sd) of the KOA subjects was 54.9 (± 9.2) years. Majority of the respondents were female (55.1%) and Sinhalese (88.1%). The mean (± sd) BMI was 26.3 (± 3.3) and the mean (± sd) duration of KOA was 3.0 (± 1.6) years. Majority of the participants were homemakers (31.9%).

Table 1 Sociodemographic characteristics of the study population

The mean (± sd) age of the control group was 49.2 (± 8.0) years. Majority of the participants were female (78.5%) and Sinhalese (87.3%). Less than one third (30.4%) of the participants were homemakers and 30.4% were retired. The mean (± sd) BMI was 26.9 (± 2.7).

The mean KOOS and SF-36 scores of KOA patients were significantly lower than those of participants without KOA across all subscales (Table 2).

Table 2 Comparison of mean scores of KOOS between participants with and without KOA

Floor and ceiling effects

There were no floor/ceiling effects for any of the KOOS subscales among participants with KOA. For participants with no KOA, there were no floor effects in any of the KOOS subscales; ceiling effects were reported in ADL Function (20.3%), and Sports/Recreation Function (29.1%) subscales.

Reliability

Internal consistency reliability

The Cronbach’s alpha values for all five subscales ranged from 0.73 to 0.96 which suggests acceptable internal consistency. In KOOS QoL subscale, the overall alpha increases to 0.88 if Q2 (have you modified your lifestyle to avoid potentially damaging activities to your knee?) was deleted (Table 3). Other subscales had optimum internal consistency and could not be further improved.

Table 3 Reliability coefficients for the KOOS subscales

Test-retest reliability

A subsample of 50 KOA patients completed the KOOS-Sinhala again within 10 to 14 days. The ICC for all subscales were above 0.90 demonstrating excellent temporal stability. The SEM ranged between 1.07 and 2.26 and the MDC ranged between 2.96 and 6.26 (Table 4).

Table 4 Test-retest reliability of KOOS subscales

Construct validity

KOOS Pain subscale moderately correlated with SF-36 bodily pain scores (r = 0.41) (Table 5). The correlation between all KOOS subscales and SF-36 physical function scores had a weak positive correlation; there was no correlation between SF-36 emotional wellbeing score and KOOS QoL subscale (Table 5).

Table 5 Correlation between KOOS subscales and SF-36 scores

Factorial Validity

The CFA supported a five-factor model (CFI 0.95, TLI 0.95, RMSEA 0.08, SRMR 0.07). All factor loadings were significant except for item S4 which was negative (Fig. 1).

Fig. 1
figure 1

KOOS 42 items: CFA model; n = 404

Discussion

The validation and adaptation of the Sinhala KOOS scale has demonstrated good psychometric properties. The reliability of the instrument, tested in terms of internal consistency, test-retest reliability, SEM and MDC was good. The internal consistency assessed using Cronbach’s alpha was above 0.7 for all subscales of KOOS indicating excellent reliability of the scale. Test-retest reliability was high which indicates stability of scores over short time periods (14 days); the SEM was low for all sub scales. SF-36 questionnaire was used to assess construct validity which had small but yet positive correlations among corresponding subscales of Sinhala KOOS. Structural validity, assessed using a five-factor CFA model, showed good model-fit.

High Cronbach’s alpha coefficients obtained for all subscales confirmed that the KOOS subscales are internally consistent, and items within a subscale correlate with each other. The internal consistency of the QoL subscale is within acceptable range but could be improved from 0.83 to 0.88 by removing item Q2. Cronbach’s alpha coefficients were comparable to the coefficients obtained in the validation of the Swedish (0.71–0.95) [28], Singapore English (0.70–0.92) [29], French (0.76–0.93) [30], Portuguese (0.77–0.95) [31], Indonesian (0.84–0.97) [32], Finnish (0.79–0.96) [33], and Malaysian (0.78–0.95) [34] versions. The Dutch version was validated in several different groups of patients, the Cronbach’s alpha values for mild OA ranged from 0.71 to 0.94, for moderate OA from 0.83–0.97, for severe OA from 0.56 to 0.98, six months after a total knee replacement (TKA) from 0.74 to 0.94 and for patients who had undergone a revision of the primary TKA from 0.78 to 0.95. Among those Cronbach’s alpha values, the symptoms subscale for severe KOA was 0.56 [35]. Xie et al., and Salavati et al., reported Cronbach’s alpha values less than 0.70 for the pain subscale and QoL among KOA patients and patients with knee injuries [29, 36].

The test–retest reliability was excellent with the intraclass correlation coefficients being > 0.9 for all subscales indicating satisfactory temporal stability of the KOOS Sinhala version. This is similar to other cultural adaptations of the KOOS: the Urdu version had an ICC ranging from 0.967 to 0.986 [37]; the Arabic version an ICC ranging from 0.875 to 0.957 [38]; and the Portuguese version had an ICC ranging from 0.82 to 0.94 [31].

The SEM (range 1.07 and 2.26) and the MDC (range 2.96 and 6.26) of the Sinhala KOOS scale are small showing the scale’s ability to detect even small changes not due to random errors with time. These values indicate a high level of measurement precision and a relatively small detectable change threshold. The SEM for the mild and moderate OA groups in de Groot et al.’s study ranged from 5.2 to 9.0 and from 5.8 to 11.6, respectively; the SEM values for post-revision total knee replacement patients ranged from 7.2 to 24.6 [35]. The French KOOS showed good to excellent reproducibility with ICC values ranging from 0.755 to 0.914 [30]. The Persian version in patients with isolated meniscus injury had SEM values between 5.68 and 9.24 and a MDC value of 25.61 [11]. The SEM values reported for the Finnish version of the KOOS ranged from 6.0 to 12.2, the lowest being for the symptom subscale and the highest for the sport and recreation function subscale; the individual level MDC ranged between 2.2 and 3.4, the lowest (16.6) being for the symptom subscale and highest (33.8) for the sport and recreation function subscale, while at the group level, the MDC ranged between 2.2 and 4.4 [33].

Item-subscale correlation was satisfactory. Similar findings have been reported in the Portuguese KOOS validation [31]. Xie et al., (2006) reported correlations exceeding 0.40 for all 42 items in the KOOS subscales in the Singapore English and Chinese versions with the exception of 4 items and 13 items, respectively [29]. In the Persian KOOS, Spearman’s correlation coefficient exceeded 0.40 for all subscales except all items of the symptom subscale and one item of the QoL subscale [36].

Many studies have reported a moderate correlation between the KOOS subscales and SF-36 subscales measuring similar constructs [30, 31, 39]. In our study, KOOS pain scores moderately correlated with SF-36 bodily pain and physical health scores. The correlation between KOOS pain scores and other SF-36 subscale scores were statistically significant but poor. Similarly, moderate correlations were observed between KOOS symptoms scores and SF-36 physical health and bodily pain scores; KOOS ADL scores and SF-36 bodily pain scores; and KOOS sports scores and SF-36 physical health scores. Difference in the time periods considered in SF-36 (4 weeks) and KOOS (1 week) could be a reason for the poor correlation between the subscales. It is likely that the hospitalized patients may have experienced a certain level of difficulty evaluating their ability to do certain knee-related functions like jumping, climbing stairs etc.

The SF-36 emotional wellbeing subscale poorly correlated with KOOS symptoms subscale; it did not correlate with other KOOS subscales including KOOS QoL. SF-36 emotional wellbeing scale assesses mental health and KOOS QoL assesses knee-specific quality of life; this difference may have been responsible for the poor correlation between the two scales.

Results of CFA showed that the KOOS Sinhala version with all items retained the original five-factor model. The Malaysian KOOS was the first validation which used CFA to test factor structure [34]. The goodness-of-fit indices of the final model of the Malaysian KOOS with only 26 items were CFI = 0.93, TLI = 0.92, IFI = 0.93 and chi- square/degree of freedom = 2.18 and RMSEA = 0.07. Items s2, s4, s5, p1, p2, p4, p6, a1, a2, a3, a5, a9, a10, a11, a16 and q1 were eliminated from the Malaysian KOOS.

We acknowledge several limitations of this study. We did not compare Sinhala KOOS with other scales such as Western Ontario and McMaster Universities Arthritis Index (WOMAC), Pain numeric scale or Visual Analog Scale. Lack of a validated, specific knee scale in Sinhala is a limitation of this study; poor correlations between subscales of the Sinhala KOOS scale and the Sinhala SF-36 subscales suggests that SF-36 may not have been the best tool to use to assess construct validity. The selection of KOA patients was based on a physician diagnosis which probably was based on a radiologic diagnosis; even if it was not, clinical diagnoses of experienced physicians is unlikely to have affected the performance of the scale. As most of the study participants were in an advanced stage of the disease, this study possibly may not represent the entire disease spectrum of KOA. Radiography which is considered the gold standard for diagnosing osteoarthritis was not used when recruiting patients. The lack of radiologic confirmation is a limitation as it may have led to misclassification of participants.

Conclusions

The Sinhala version of the KOOS scale is a reliable and valid instrument to measure the outcomes of KOA progression. The Sinhala version of the KOOS scale can be used as a valid, self-reporting, disease-specific instrument in a Sinhala speaking population. Further research is suggested to assess its usefulness in monitoring progression of disease.