Introduction

During the last decades, patient-reported outcomes have become central in reporting the treatment effect and patients’ satisfaction following treatment of joint disorders [1]. Large-scale or national databases have been established in several countries worldwide, and patient-reported outcomes are commonly a central part of the registers to monitor quality of patient care [2].

In contrast to generic patient-reported measurement outcomes, body region-specific patient-reported outcome measurements can capture the patient perceived ability to perform specific activities indicatory for body regions such as the hip. This is often combined with questions covering health related quality of lifeFootnote 1 [3]. In orthopedics, body region-specific patient-reported outcomes are used to capture the current status of the patient or as repeated questionnaires to describe the changes in a patients self-report status. These data are then often used in combination with radiological and functional measurements for clinical decision making.

The Hip Disability and Osteoarthritis Outcome Score (HOOS) is widely used body region-specific score used to investigate the effect following treatments of osteoarthritis and other hip related disabilities and is one of several internationally recommended patient-reported instruments to be used following hip replacement [4]. In 2019, a short form of HOOS named HOOS-12 was developed using item-response theory with the aim to reduce the respondent burden of patients especially in the clinical setting including scoring for joint replacement registries [5].

The interpretation of results from patient-reported outcome measurements such as the HOOS and HOOS-12 can be improved using large-scale normative data from randomly selected reference populations. Patient-reported outcomes scores are frequently reported to be influenced by age, sex, weight, social status, and comorbidity [6,7,8,9,10]. At present, reference data from the HOOS has been established from a single geographical region of southern Sweden and in a selected US population [7, 11]. No reference data are available for its short form HOOS-12. Accurate and large-scale reference data are essential to facilitate interpretation of HOOS and HOOS-12 scores and may be used as reference data for comparative study designs and in future power calculations.

The aim of the present study was to establish reference values for the Hip Disability and Osteoarthritis Outcome Score (HOOS) five subscales and its short form HOOS-12 based on a randomly selected population-based sample of adult Danes.

Methods

Study design

The study was designed as a population record-based sample, including a national representative sample of all citizens of Denmark.

At birth or emigration to Denmark, a Civil Registration Number (CPR) is given to all residents and registered in the Civil Registration System. The CPR number is mandatory by law for all citizens in Denmark. Prospective information regarding emigration and death is recorded in this registry [12]. The Civil Registration System includes individual information of the entire population of Denmark [12]. The CPR register was used to establish the study’s national and randomly selected sample of all Danish citizens. A secure digital mailing system (E-boks) is mandatory for almost all Danish citizens and is based on the Danish Civil Registration System.

The Danish Data Protection Agency approved the study (J. nr. 2021 Id: 114). The reporting of the study complies with the Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) Statement [13].

Data retrieval

Included were Danish citizens 18 years and older. Excluded were those without online contact information (E-boks). An invitation to participate in the project was mailed to all participants using the mandatory online system E-boks. The invitation included an online link to the HOOS questionnaire and contact information for the research group in case of questions. Before the study started, the invitation letter, HOOS, and the E-boks system were piloted on a small group of 10 participants. After finishing the HOOS questionnaire, participants were also asked to submit their height and weight. Furthermore, one supplemental question was asked: Within the last 5 years, have you been in contact with a health professional due to a hip problem? Answer: yes/no.

In case of no response within 14 days, participants received a second request to participate.

Study population

A representative sample of 10,000 citizens of Denmark was derived from the Danish Civil Registration System. By 2021, the population of Denmark consisted of 5.8 million citizens.

The sample was selected based on seven predefined age groups (18–29, 30–39, 40–49, 50–59, 60–69, 70–79, 80 +) and an equal sex distribution across all groups. A sample of 10,000 citizens was defined and contacted to allow adequate power for subgroup analyses based on both age and gender. We expected a response rate of about 30% yielding approximately 200 citizens in each age and gender group.

The Hip Disability and Osteoarthritis Outcome Score (HOOS) and its short form HOOS-12

The Hip Disability and Osteoarthritis Outcome Score (HOOS) is a patient-reported and body region-specific questionnaire that was introduced in 2003 with the aim of assessing patients’ experiences regarding their hip and associated problems [14]. It is a hip-specific, patient-administrated questionnaire which include 40 items across five subscales. These five subscales evaluate pain, symptoms, function of daily living (ADL), sport and recreation function (sport), and quality of life (QOL) [14]. The HOOS is available in more than 25 languages [14]. The HOOS is free to use and can be downloaded at www.koos.nu.

The short form HOOS-12 was developed using item response analysis and is calculated based on 12 items from the full HOOS questionnaire [5]. The outcome of HOOS-12 is reported in three subscales; pain, function and QOL, along with a aggregated overall hip impact summary score [5].

HOOS and HOOS-12 scores are calculated based on standardized scoring algorithms and presented as a score between 0 and 100 for each respective subscale [14]. A score of 100 indicates the best possible results and 0 the worst outcome [14].

The two scoring manuals are available at www.koos.nu.

Statistical analysis

The HOOS and HOOS-12 scores are given as mean, median, standard deviation (SD), 95% confidence intervals (CI), minimum, maximum, and number for the overall group and for each age and gender group. In accordance with the HOOS scoring manual [14], scores were not calculated if the number of missing items was more than 50% in a subscale.

Continuous variables are reported by mean and standard deviation (SD) and categorical variables by frequencies. A two-way analysis of variance (ANOVA) was used to test the difference between predefined age and gender groups. If significant ANOVA factors or interactions were found, multiple pairwise analyses with post hoc test (Bonferroni) corrections were used.

One-way ANOVA was used to test the difference between subscales scores and reporting of hip problems (yes/no) and between subscales scores and BMI groups (18–24.9, 25–29.9, 30–34.9, 35–39.9, 40–44.9 and > 45), where BMI > 40 indicate the super obese. If a significant ANOVA factor was found, multiple pairwise analyses with post hoc test (Bonferroni) corrections were used.

Responders and non-responders were checked for age difference by the unpaired t test and for gender difference by the Chi-square test.

A P value of < 0.05 was considered significant. The statistical analysis was performed by STATA (version 27).

Thresholds suggested to represent a clinically relevant improvement range from 7 to 36 for the HOOS subscales, and from 15 to 28 for the HOOS-12 subscales [15,16,17,18]. In this paper, we considered thresholds of 8 for all HOOS subscales, and 15 for all HOOS-12 subscales and impact score, to represent a clinically relevant difference between age, sex and BMI groups.

Results

Of the 9997 nationally representative persons identified, 1093 did not have a valid E-boks and could not be contacted. A total of 2277/8904 (26%) contacted participants completed the HOOS questionnaire, 947 women (41.6%) and 1330 men (58.4%). Detailed information regarding the full study population is presented in Fig. 1.

Fig. 1
figure 1

Detailed information regarding the full study population. N = number

HOOS and HOOS-12 reference scores

The HOOS and HOOS-12 subscale scores for the total sample and for the seven predefined age- and gender groups are presented in Tables 1 and 2, respectively.

Table 1 HOOS and HOOS-12 subscale scores for the total sample
Table 2 HOOS and HOOS-12 subscale scores by age

Age- and gender specific scores for the HOOS and HOOS-12

The age- and gender specific HOOS and HOOS-12 subscale scores are presented in Fig. 2 and Table 2, respectively. Women reported statistically significant worse scores for three of the five HOOS subscales (pain mean difference 2.6, 95% CI 0.9–4.2), (symptoms mean difference 3.3 95% CI 1.7–5.0), (QOL mean difference 3.7 95% CI 1.8–5.6) and for the HOOS-12 subscale scores (pain mean difference 2.9 95% CI 1.2–4.6), (QOL mean difference 3.5 95% CI 1.8–5.6) and for the HOOS-12 impact score (mean difference 2.6 95% CI 0.9–4.3).

Fig. 2
figure 2

HOOS subscales divided by age groups and gender

Comparing the youngest and oldest age groups a significant difference were observed in four of the five HOOS subscales (pain mean difference 7.2 95% CI 0.4–14.0), (ADL mean difference 11.4 95% CI 4.9–17.8), (sport and recreation function mean difference 17.7 95% CI 9.0–26.4), (QOL mean difference 10.1 points 95% CI 2.0–18.2) and for the HOOS-12 (pain mean difference 7.3 95% CI 0.1–14.5), (ADL mean difference 12.9 95% CI 6.0–19.8), (QOL mean difference 10.1 95% CI 2.0–18.1) and (impact score mean difference 10.2 95% CI 3.1–17.4).

Excluding patients reporting a hip complaint reduced the difference between age groups in three of the five HOOS (ADL mean difference 8.6 95% CI 3.1–14.1), (sport and recreation function mean difference 13.6 95% CI 5.7–21.5), (QOL mean difference 8.2 points 95% CI 1.4–15.1.

Impact of hip complaint within 5 years on HOOS/HOOS-12 scores

A total of 377 participants (17%) reported a hip complaint within the last 5 years. Of the 377 participants reporting a hip complaint, most were 50 years of age or older (75%), and most were women (67%). Reporting a hip complaint within the past 5 years was associated with worse HOOS scores compared to participants without reporting hip complaints: (pain mean difference 25.4 95% CI 23.6–27.3), (symptoms mean difference 25.2 95% CI 23.3–27.1), (ADL mean difference 22.1 95% CI 20.3–23.9), (sport mean difference 29.0 95% CI 26.5–31.5) and (QOL mean difference 34.6 95% CI 32.5–36.7). Similar differences between groups were found when using the HOOS-12 subscales and the impact score: (pain mean difference 27.3 95% CI 25.4–29.3), (function mean difference 23.3 95% CI 21.3–25.2), (QOL mean difference 34.6 95% CI 32.5–36.7) and (impact score mean difference 28.4 95% CI 26.1–30.8).

Impact of BMI on HOOS and HOOS-12 scores

The mean self-reported BMI was 26.8 (± 11.5). Nineteen percent (N = 428) of participants reported a BMI of 30 or higher. Markedly worse scores were found in participants reporting a BMI over 40, Table 3.

Table 3 HOOS and HOOS-12 subscale scores by BMI groups

Discussion

This study is the first to report HOOS and HOOS-12 reference values from a nationally representative sample. We report reference data across age groups from 18 years of age and older in 10-year strata and subdivided into men and women and the impact of BMI. These reference values will improve the interpretation of HOOS and HOOS-12 collected in clinical practice and research.

HOOS/HOOS-12 and age

This study indicated that the use of age-specific HOOS scores could be warranted. We found statistically significant worse HOOS/HOOS-12 scores with increasing age. Our findings are in line with other studies reporting worse HOOS scores with increasing age[7, 11, 19]. This supports the use of age-strata when interpreting outcomes from clinical practice across age groups. The impact of age differences may affect how the reference values are used in clinical practice and in research. For an individual patient, the appropriate age-strata would be useful, as expected treatment results are likely not to increase above the level of age-specific HOOS reference scores. For use in research where the specific patient-population spans several age-strata it may be useful to check the difference in reference values across the relevant age-strata, and consider if this difference is of clinical relevance, or not. Prevalence of hip complaints increase with older age, indicating that the effect of hip complains are imbedded in the age-related differences in the HOOS subscale scores. However, when removing participants with hip complaints the difference between the youngest and oldest age group decrease 2–4 points across the different subscales.

HOOS/HOOS-12 and sex

This study indicates that the use of sex specific HOOS subscales is not needed. In this study we found significantly worse HOOS and HOOS-12 outcomes on three of the five HOOS subscales for women compared to men. Although statistically significant differences, the current body of knowledge do not indicate these differences of 2.6–3.7 points are clinically important[15, 16, 18, 20]. Findings were similar across the 7 predefined age groups.

HOOS/HOOS-12 and BMI

We found that being super obese (BMI > 40) was associated with substantially worse HOOS and HOOS-12 scores. The difference between normal weight and the super obese were > 12 points on the HOOS/HOOS-12 subscales indicating likely clinically relevant differences between groups that needs to be considered when using HOOS scores for interpreting patient outcomes. These results are in line with Raja et al. [11] who reported a comparable association between higher BMI and worse HOOS score in a selected US population. Reference data for the super obsese may assist in guiding health care workers and patients when considering the expected level of HOOS and HOOS-12 scores following treatment for a hip problem.

Use of HOOS reference data in clinical and research settings

The use of HOOS reference data in the orthopedic setting is twofold. When used in the clinic for evaluation of a specific treatment, reference data may help indicate the expected improvement and provide a treatment goal. Age specific reference values may be of particular relevance when for example treating an older woman with a hip fracture, and BMI-specific reference values may be of particular relevance to indicate the expected improvement for a superobese or obese patient having total hip replacement. Reference values are also usable when applied to groups of patients in orthopedic research. When HOOS or HOOS-12 is considered as the primary outcome in a future study, reference values support the initial planning of the study by indicating optimal mean scores following treatment.

Strengths and limitations

A strength of this study is the randomly selected national age and gender representative sampling base, with more than 2200 responders making this study the largest HOOS/HOOS-12 reference material available. Missing responses were spread across age and gender subgroups minimizing the risk of a strong selection bias. Responders and non-responders did not differ on age and gender, it is not possible to test if non-responders were different on other variables.

Conclusion

This study is based on a national randomly selected population and provides sex and age-specific reference values of the Hip Disability and Osteoarthritis Outcome Score (HOOS) and its short form HOOS-12. We found that older patients and patients with a BMI more than 40 have clinically relevant worse HOOS and HOOS-12 scores. Using age- and BMI-specific reference values may therefore be indicated when evaluating potential for improvement and post-treatment results in these patient groups.