Background

Osteoarthritis is one of the fastest growing chronic health problems worldwide [1, 2]. According to the 2015 Global Burden of Disease Study, osteoarthritis accounted for 3.9% of years lived with disability worldwide in 2015, up from 2.5% in 2010, and was the 13th highest contributor to global disability [1, 2]. Knee osteoarthritis consistently accounts for approximately 85% of the burden attributable to osteoarthritis [1, 2]. Osteoarthritis also imposes a significant economic burden, with the total annual costs estimated to be $A8.6 billion in Australia [3], £20.9 billion in the UK [4], and $US142.1 billion in the US [5]. The majority of these costs are attributable to ambulatory and inpatient care, including surgery and lost work productivity [3, 4].

Excess weight is an important modifiable risk factor for the onset and progression of knee osteoarthritis [6], and there is strong evidence that weight loss interventions reduce pain and disability in overweight patients with knee osteoarthritis [7, 8]. Consequently, international clinical practice guidelines recommend all patients with knee osteoarthritis who are overweight receive support to lose weight [9,10,11]. Typically, these treatments are delivered using clinical face-to-face models of care [12]. While such clinical models produce moderate effects on weight loss, pain, and physical function [7, 8], only 22% of patients with knee osteoarthritis referred for orthopaedic consultation at a large Australian public hospital report receiving weight loss care [13], possibly due to limitations in service delivery and patient access to care. Arguably more scalable delivery options, using remotely delivered approaches, such as telephone-based support, can maximise the reach of weight loss care and are more cost-effective to support weight loss in this patient group. While telephone-based behavioural interventions targeting weight loss are used routinely in the general populations, the cost-effectiveness of referring patients with knee osteoarthritis to these is unknown.

Given the scarce resources in healthcare, policy-makers are increasingly requiring evidence of economic value for healthcare interventions to make informed decisions about how to allocate resources [14]. Therefore, undertaking economic evaluations of knee osteoarthritis management approaches is important. Recently, we conducted a randomised controlled trial (RCT) using an existing non-disease specific telephone-based weight management and healthy lifestyle service for patients with knee osteoarthritis who are overweight or obese [15]. The primary objective of the intervention was to reduce knee pain intensity, by reducing weight. The RCT found no between-group differences in knee pain intensity, nor weight [15]. Conducting a cost-effectiveness analysis is recommended in all trials, irrespective of their clinical effect [14]. This recommendation is based on cost-effectiveness analyses considering the joint distribution of differences in cost and clinical effect and thereby is able to show that an intervention is cost-effective when neither cost nor clinical effect differences are individually significant [14]. Cost-effectiveness analyses estimate the cost (saved or spent) per unit of effect gained. Such estimates can support healthcare policy and decision-makers choose which interventions should be implemented for specific health outcomes given the available resources [14]. The purpose of this study is to undertake an economic evaluation of the aforementioned RCT, compared to usual care.

Methods

Study participants and design

The economic evaluation was conducted alongside a pragmatic parallel group RCT, which was part of cohort multiple RCT [16]. Full details of the study design has been described in the paper presenting the clinical results of the trial [15] and in the study protocol [17, 18]. The trial was prospectively registered (ACTRN12615000490572). The Hunter New England Health Human Research Ethics Committee (13/12/11/5.18) and the University of Newcastle Human Research Ethics Committee (H-2015-0043) approved the RCT.

Patients on a waiting list for an outpatient orthopaedic consultation for their knee osteoarthritis at the John Hunter Hospital in NSW, Australia, were invited to participate. Patients were assessed for eligibility during a telephone assessment and eligible patients were randomised into either the intervention or usual care control group (1:1 ratio).

Inclusion criteria were: primary complaint of pain due to knee osteoarthritis lasting longer than 3 months; 18 years or older; overweight or obese (body mass index (BMI) ≥27 kg/m2 and < 40 kg/m2); average knee pain intensity ≥3 out of 10 on a 0–10 numeric rating scale (NRS) over the past week, or moderate level of interference in activities of daily living (adaptation of item 8 of SF36); and access to a telephone. Exclusion criteria were: known or suspected serious pathology as the underlying cause of their knee pain (e.g. fracture; cancer, inflammatory arthritis; gout; or infection); previous obesity surgery; currently participating in any prescribed or commercial weight loss program; knee surgery in the last 6 months or planned surgery in the next 6 months; unable to comply with the study protocol that requires them to adapt meals or exercise due to non-independent living arrangements; medical or physical impairment precluding safe participation in exercise such as uncontrolled hypertension; and unable to speak or read English sufficiently to complete study procedures. Recruitment for the trial occurred from May 19 to June 30, 2015, and follow-up concluded January 26, 2016.

Interventions

The intervention included two components. First, brief advice and education about the benefits of weight loss and physical activity for knee osteoarthritis were provided over the telephone immediately after randomisation. Second, intervention participants were informed about the NSW Get Healthy Information and Coaching Service (GHS) (www.gethealthynsw.com.au) [19], and referred to the service for weight loss support. The GHS is an existing government funded telephone-based health coaching service developed to support adults of the general population to make sustained healthy lifestyle improvements. Targets include diet, physical activity and achieving a healthy weight, and if suitable, referral to smoking cessation services [19]. The GHS provides 10 individually tailored coaching calls, centered on national dietary and physical activity guidelines [20, 21], delivered over a 6-month period by university qualified health professionals [19]. Participation in the intervention did not affect the patients’ place on the waiting list for orthopaedic consultation.

Participants in the control group remained on the ‘usual care pathway’ (i.e. on the waiting list to have an orthopaedic consultation and could progress to consultation if scheduled or surgery if recommended by the orthopaedic department) and took part in data collection during the 6-month intervention period. No other active intervention was provided as part of the study, however; no restrictions were placed upon the use of other health services. Control participants were informed that a face-to-face clinical appointment was available in 6 months with a study physiotherapist (CW).

Measures

All measures were collected by self-reported questionnaires at baseline, six weeks and 26 weeks (see Additional file 1: Appendix 1 for questionnaires). The baseline questionnaire was completed by telephone. Week 6 and week 26 surveys were completed via telephone or mailed in the post as per participant preference.

Utility measure

We measured utility using health-related quality of life, assessed using the 12-item Short Form Health Survey version 2 [22]. Participants’ SF-6D [23] health states were converted into utility scores using the British tariff [24]. QALYs were calculated by multiplying the duration of time spent in a health state by the participants’ utility score and were linearly interpolated between measurement points.

Clinical measures of effect

Primary outcome

Knee pain intensity was assessed using an 11-point NRS. Participants were asked to rate their “average knee pain intensity over the past week”, where 0 represents ‘no pain’ and 10 represents ‘the worst possible pain’ [25].

Secondary outcomes

Disability was assessed using the Western Ontario and McMaster Universities Osteoarthritis Index (WOMAC) [26]. The total WOMAC score ranges from 0 to 96, with higher scores indicating greater disability. Weight (kg) was assessed via participant self-report and BMI was calculated as weight/height squared (kg/m2) [27] using self-reported weight and height.

Cost measures

Costs were converted to Australian dollars in 2016 using the consumer price index [28]. As the follow-up of the trial was 26 weeks, discounting of costs was not necessary [29].

Intervention costs were estimated using a micro-costing approach and included the cost of delivering the telephone brief advice at baseline and the cost of the GHS coaching calls. The cost to deliver the brief advice was calculated by estimating the development and operational costs of the call and the estimated wages for the telephone interviewer to provide the brief advice (estimated average time 5 min). The cost to provide the GHS coaching calls was provided by the GHS [30] and multiplied by the number of calls each participant received. The GHS reported the number of health coaching calls participants received directly to the research team.

Healthcare utilisation costs were calculated from a patient reported healthcare utilisation inventory and included any healthcare services or medications used for knee pain (independent of the intervention costs). Participants were asked to recall the type of healthcare provided and the number of sessions attended as well as all medications used for their knee pain during the past six weeks, within the six and 26 weeks follow-up participant surveys. Healthcare services were priced according to Australian standard costs or professional organisations if this data was unavailable [31,32,33]. Medications were priced using unit costs from the Australian pharmaceutical benefits scheme [34] or online Australian pharmacy websites if this data was unavailable. To gain an estimate of the cost of healthcare utilisation over the entire 26-week period, the average of the week six and week 26 costs per patient was interpolated, assuming linearity.

Absenteeism was measured by participant recall of the total number of sickness absence days from paid work due to knee pain during the past six weeks, within the six and 26 weeks follow-up participant surveys. The ‘Human Capital Approach’ [29] was used to estimate absenteeism costs which involved multiplying each participant’s total number of sickness absence days off by the Australian Bureau of Statistic’s reported age and gender specific national average hourly income [27, 34]. To gain an estimate of the cost of absenteeism over the entire 26-week period, the average of the week six and week 26 costs per patient was interpolated, assuming linearity.

Statistical analysis

Data were analysed in STATA (V13, Stata Corp). The sample size was based on the primary clinical measure of effect [15]. Analyses were performed according to the intention-to-treat principle. Descriptive statistics were used to compare baseline characteristics between the intervention and control group participants. Missing data for all effect and cost measures were imputed using Multiple Imputation by Chained Equations [35]. The imputation model included variables related to the “missingness” of data and those that predicted outcome variables, imputations were stratified by treatment group. Variables in the model included baseline education level, employment status, Aboriginal and/or Torres Strait Islander status, age, country of origin, gender, and duration of knee pain. Ten different datasets were created (loss-of-efficiency < 5%) [35]. These separate datasets were analysed as indicated below, after which pooled estimates were calculated using Rubin’s rules [36].

Mean cost differences between study groups were calculated for total and disaggregated costs. The cost measures were adjusted for the confounders of baseline knee pain intensity, baseline duration of knee pain, baseline BMI and number of days on the waiting list for orthopaedic consultation because the addition of these confounders to the regression model changed the cost differences by more than 10%. Total cost (ΔC) and effect (ΔE) differences were estimated using seemingly unrelated regression analyses, adjusted for baseline values as well as other potential baseline prognostic factors (knee pain intensity, duration of knee pain, BMI and number of days on the waiting list for orthopaedic consultation, obtained from hospital records) [37]. Seemingly unrelated regression is advantageous because possible correlation between the two regression equations (i.e., one for ΔC and one ΔE) can be accounted for [37].

Incremental cost-effectiveness ratios (ICERs) were calculated by dividing the adjusted difference in total costs between both groups by the difference in effects (i.e. ΔC/ΔE). Bias-corrected and accelerated bootstrapping (5000 replications) was used to estimate 95%CIs around cost differences and the uncertainty surrounding the ICERs. Uncertainty surrounding the ICERs was illustrated graphically using cost-effectiveness planes [29]. Cost-effectiveness acceptability curves, which consider the joint uncertainty of costs and effects, were used to graphically represent the intervention’s probability of cost-effectiveness in comparison with usual care at different values of willingness-to-pay [29].

The primary analysis was conducted from the societal perspective, which included all of the cost categories.

Sensitivity analysis

We performed a per-protocol sensitivity analysis from the societal perspective that included only participants that completed at least six telephone GHS coaching calls in the intervention group (n = 20 participants).

Secondary analysis: Healthcare perspective

A secondary analysis was performed from the healthcare perspective, which excluded absenteeism costs.

Results

A total of 120 patients were randomised into the study (Fig. 1). Baseline participant characteristics were similar between groups (Table 1). Eleven participants in the intervention group and three in the control group were lost to follow-up. At 26 weeks, complete effect data was obtained from between 70 and 82% of participants (QALYs 70%, knee pain intensity 82%, disability 79%, weight 81%, BMI 81%). For cost data, complete data was obtained from 48% of participants at 26 weeks. As a consequence, between 18 and 30% of effect data and 52% of cost data were imputed.

Fig. 1
figure 1

Flow diagram of trial participants

Table 1 Baseline characteristics of the study populationa

Utility and clinical measures of effect

There were no differences found between groups for QALYs (Mean difference 0.00, 95%CI: -0.02 to 0.02), knee pain intensity (Mean difference 0.64, 95%CI: -0.49 to 1.77), disability (Mean difference 0.80, 95%CI: -6.68 to 8.47), weight (Mean difference − 0.02, 95%CI: -3.46 to 3.42), and BMI (Mean difference 0.11, 95%CI: -1.16 to 1.39) (Table 2).

Table 2 Differences in pooled mean costs and effects (95% CI), incremental cost-effectiveness ratios, and the distribution of incremental cost-effect pairs around the quadrants of the cost-effectiveness planes

Costs

The average number of GHS coaching calls to intervention participants was 4.7 (Standard deviation 4.6). The mean intervention costs were $622 (Standard error 80) per participant (Table 3). An overview of the unit costs and sources is reported in Table 4.

Table 3 Mean costs per participant in the intervention and control groups, and unadjusted and adjusted mean cost differences between study groups during the 6-month follow-up period (based on the imputed dataset)
Table 4 Unit costs used for valuing resource use

From the societal perspective, mean cost differences between groups (intervention minus control) were 493 (95%CI: -3513 to 5363) for healthcare costs, $-32 (95%CI: -73 to 13) for medication costs, and $125 (95%CI: -151 to 486) for absenteeism costs. The total mean difference in societal costs was $1197 (95%CI: -2887 to 6106) (Table 3). From the healthcare perspective, total mean difference between groups (intervention minus control) was $-1071 (95%CI: -5910 to 2931).

Societal perspective: Cost-utility

For QALYs, most of the incremental cost effect-pairs were located in the northwest quadrant (37.5%), indicating the intervention was on average more costly and less effective than usual care (Fig. 2 (1a)). The ICER for QALYs was 581,828 indicating that one QALY gained in the intervention group was associated with a societal cost of $581,828 as compared with the control group (Table 2). This large ICER is due to the large difference in cost (Mean difference 1197 (95%CI -2962 to 6139) and very small effect on QALYs (Mean difference 0.00, 95%CI: -0.02 to 0.02). The cost-effectiveness acceptability curve for QALYs in Fig. 2 (2a) indicates the probability of the intervention being cost-effective in comparison to usual care was 0.36 at a willingness-to-pay of $0/unit of effect gained and the probability remained about the same irrespective of the willingness-to-pay.

Fig. 2
figure 2

Primary analysis (societal perspective): Cost-effectiveness planes indicating the uncertainty around the incremental cost-effectiveness ratios (Fig. 2 (1)) and cost-effectiveness acceptability curves indicating the probability of cost-effectiveness for different values ($) of willingness-to-pay per unit of effect gained from the societal perspective (Fig. 2 (2)) for QALY (a), pain intensity (b), disability (c), weight (d), and BMI (e) (based on the imputed dataset)

Societal perspective: Cost-effectiveness

For all clinical measures of effect, most of the incremental cost effectiveness-pairs were located in the northwest quadrant (Table 2, Fig. 2 (1b-1e)), indicating that the intervention on average achieved poorer outcomes at a higher cost compared to usual practice. Figure 2 (2b-2e) presents the cost-effectiveness acceptability curves for knee pain intensity, disability, weight, and BMI.

For clinical measures of effect, the probability of the intervention being cost-effective in comparison to usual care was 0.35 at a willingness-to-pay of $0/unit of effect gained. For disability, weight, and BMI the probability remained about the same irrespective of the willingness-to-pay (Fig. 2 (2c-2e)). For knee pain intensity, this probability decreased with increasing values of willingness-to-pay (Fig. 2 (2b)).

Societal perspective: Sensitivity analysis

Results of the sensitivity analysis can be found in Table 2. In brief, for QALYs, the probability of cost-effectiveness was 0.63 at a willingness-to-pay of $0 per QALY gained (Fig. 3 (b)). For QALYs the probability of cost-effectiveness remained about the same irrespective of the willingness-to-pay.

Fig. 3
figure 3

Sensitivity analysis: Cost-effectiveness plane indicating the uncertainty around the incremental cost-effectiveness ratios (a) and cost-effectiveness acceptability curves (b) indicating the probability of cost-effectiveness for different values ($) of willingness-to-pay per unit of effect gained for QALY

Healthcare perspective: Cost-utility

The ICER for QALY was 387,820 indicating that one QALY gained was associated with a cost of $387,820 compared with the control group (Table 2). The probability of the intervention being cost-effective in comparison to usual care was 0.40 at a willingness-to-pay of $0/unit of effect gained remained about the same irrespective of the willingness-to-pay (Fig. 4 (b)).

Fig. 4
figure 4

Secondary analysis (healthcare perspective): Cost-effectiveness planes indicating the uncertainty around the incremental cost-effectiveness ratios (a) and cost-effectiveness acceptability curves (b) indicating the probability of cost-effectiveness for different values ($) of willingness-to-pay per unit of effect gained from the healthcare perspective for QALY

Discussion

We found that referral to a telephone-based weight management and healthy lifestyle service was not cost-effective from a societal perspective for patients with knee osteoarthritis who are overweight or obese, compared with usual care. The maximum probability of the intervention being cost-effective was low (≤0.40) for all outcomes for both the societal and healthcare perspectives, irrespective of the willingness-to-pay. The findings from the sensitivity analysis found the intervention had a slightly higher probability of cost-effectiveness compared to the main analysis, however the probability was still relatively low (i.e. 0.63 at a willingness-to-pay of $0/QALY) and remained the same regardless of willingness-to-pay.

To our knowledge, there are no other economic evaluations of telephone-based interventions for patients with knee osteoarthritis, hampering comparisons to similar interventions. A recent study assessed the cost-effectiveness of a 6-week multidisciplinary face-to-face treatment program compared with a telephone-based program for patients with osteoarthritis [38]. In this study, all patients received in-depth education about osteoarthritis, pain management, physical activity and diet, with the overall goal to enhance self-management skills [38]. Patients in the face-to-face group received six therapeutic groups session (2–4 h each), whereas the telephone group received only two face-to-face group sessions (2–2.5 h each) and four individual telephone contacts (15–30 min each). The study found that from a societal perspective the face-to-face treatment was more likely to be cost-effective when QALYs were the measure of benefit at 1-year follow-up [38]. Together with the findings from our current study, these results suggest that telephone-based care for patients with osteoarthritis may not be a cost-effective management approach. Since many patients with osteoarthritis do not receive recommended treatments via clinical models of care [13, 39], understanding why telephone-based interventions are reported to be as effective as face-to-face interventions but not cost-effective, is an important consideration to inform how best to provide care to this patient group.

An important strength of the present study is the pragmatic trial design, which enabled us to evaluate the intervention under ‘real world’ circumstances. This facilitates the generalisability of the results and allows decision-makers to use these results to help guide future healthcare interventions. A second strength is the use of contemporary statistical analysis methods. Multiple imputation was used to avoid loss of power due to sample size reduction and inefficiency associated with complete-case analyses and is regarded as a more valid way to deal with missing data than naïve imputation techniques, such as mean imputation or last observation carried forward [40]. Seemingly unrelated regression analyses were used for analysing the cost and effect components of the cost-effectiveness analysis. This method was used, instead of two separate regression analyses (i.e. one for costs and one for effects) or a net benefit framework, as it allowed us to adjust for various potential confounders that may not be the same for costs and effects, while simultaneously accounting for the possible correlations between costs and effects [37]. Bootstrapping techniques were used allowing for an estimation of the mean difference in costs as well as the joint uncertainty of costs and effects, while dealing with the right skewed nature of cost data.

This study is not without limitations. Firstly, the sample size calculation was based on the primary clinical measure of effect. While there have been a range of techniques proposed to estimate sample size based on economic endpoints [41,42,43], sample size calculations are usually performed based on the primary outcomes of the study [14, 44]. This is because cost data are right skewed and so require larger sample sizes than needed for trial outcomes to detect significant differences, which may not be feasible [14, 45]. Moreover, when performing a sample size calculation for economic endpoints a number of parameters need to be specified in advance e.g. cost measures, variance parameters of effectiveness measures, and incremental cost-effectiveness ratios, many of which are difficult to predict a priori [41,42,43]. As a result, economic evaluations conducted alongside clinical trials are typically underpowered [44] and should be interpreted with caution [45]. However, as economic evaluations are about estimation rather than formal hypothesis testing they still provide valuable information even when underpowered [29]. A second limitation is the rate of missing data at 26 weeks, between 18 and 30% for effect measures and 52% for cost data is high, however, not dissimilar to those in other economic evaluations [46]. Multiple imputation was used to account for the missing data, which is recommended over complete case analyses, still the results from this study should be treated with caution. Another limitation is that the self-reported cost data was interpolated to gain an estimate of costs over the 26-week intervention period as the recall period for participant’s healthcare utilisation and absenteeism was six weeks. Although it would be preferable for the recall period to cover the complete duration of follow-up we chose a shorter recall period to reduce participant recall bias. Lastly, presenteeism costs were not included, (i.e. reduced productivity while at work) which is known to be an issue reported by patients with chronic disabling pain [47].

There is a need for more information about the cost-effectiveness of lifestyle interventions for osteoarthritis. Although this study indicates that the use of a generic non-disease specific telephone-based service is not cost-effective for overweight and obese patients with knee osteoarthritis, the current evidence suggests existing models of care delivery are unable to provide recommended care to the large number of patients with knee osteoarthritis [13]. More research into how to provide scalable models of care that are cost-effective is needed. A potential way forward is to develop and test a range of scalable modes of weight loss care delivery e.g. telephone, online platforms (website, email), and mobile apps, and determine how these work together, or not, to deliver effective recommended care to patients. Importantly these various models of care delivery need to focus on delivering the same care e.g. focus on diet, exercise, to ensure that the studies are testing the mode of delivery, not the intervention components.

Interestingly, the National Institute for Health and Clinical Excellence guidelines for the management of osteoarthritis [9] refer to general obesity management guidelines for weight loss care for these patients [48], and not disease-specific models of care. Based on the results of our study, recommending general non-disease specific weight loss interventions may not be appropriate, or cost-effective for these patients. A key feature of our study and that of other osteoarthritis telephone interventions is that they only provide support over a relatively short period (six weeks to six months). However, other general weight loss programs occur over a much longer time frame. Better understanding about how the key ingredients for telephone services like dose and relevant components (e.g. exercise, weight loss, education) affect cost-effectiveness may provide more insight about the true value of telephone-based approaches for osteoarthritis.

Conclusions

Our findings suggest that referral to a telephone-based weight management and healthy lifestyle service is not cost-effective compared with usual care for overweight and obese patients with knee osteoarthritis. These findings apply to QALYs, knee pain intensity, disability, weight, or BMI, from the societal and healthcare system perspectives.