Background

Total hip arthroplasty (THA) has been heralded as the ‘operation of the Century’ [1]. Large improvements in mobility and patient-reported outcomes are typically observed with changes far exceeding the small-to-moderate effects seen with non-surgical interventions for arthritis [2,3,4]. These changes are often experienced in the early subacute phase and improvements are evident regardless of underlying aetiology [3, 5, 6]. Such is the success of the operation, annual increases in the volume of surgery in Australia – as it has elsewhere - have been recorded by the Australian Orthopaedic Association National Joint Replacement Registry (AOANJRR) since 2003 [7]. Presently, over 36,000 THA procedures are undertaken annually in Australia with the majority (~ 60%) performed in the private sector [7]. In 2016, 87,733 primary THA procedures were performed in the United Kingdom (UK) [8]. In the United States (US), it is estimated that up to 562,000 THA cases may be undertaken annually by 2030 [9].

Following surgery, physiotherapy-led exercise-based rehabilitation is often prescribed [10]. Evidence in support of such prescription is apparent when physiotherapy-led programs are compared to no or minimal intervention following THA. A recent systematic review of randomized trials concluded that physiotherapy-led exercise programs yielded better gait-related outcomes post-THA when compared to no intervention [10]. Interestingly, consistent with the evidence concerning the more intensive clinic-based rehabilitation programs following total knee arthroplasty (TKA) [11,12,13,14,15], clinic-based one-to-one or group-based programs have not been shown to be superior over monitored or unmonitored home-based programs for the recovery of mobility, function and health-related quality of life after THA [10, 16, 17]. Further, the evidence-base lacks randomized trials comparing inpatient-based rehabilitation with less intensive programs. This gap is important to address as utilization of inpatient rehabilitation is common and it is a comparatively costly pathway [18,19,20]. Recent Australian data from the private sector indicate the median referral rate per surgeon to inpatient rehabilitation is 29% (range 0–100%) [21]. Inpatient utilization rates vary internationally [22, 23], with reported rates as high as 81% (Japan) and 53% (US) and as low as 0% (Spain, Turkey) and 3% (UK) [24].

Only one randomized controlled trial (RCT) has been published aimed at determining the efficacy of inpatient therapy over a non-inpatient alternative following THA [25]. Eighteen days of inpatient rehabilitation was not more effective nor cost-effective compared to eigth domiciliary (physiotherapy) visits following THA or TKA (combined cohort).

Randomized trials are difficult to undertake in general, but they are particularly difficult to perform in the private healthcare sector where consumer and provider preferences are strong drivers of care delivery [26, 27]. This may partly explain why much of the evidence in this space - showing no superiority of the inpatient pathway - is predominantly provided by retrospective and prospective observational studies and administrative datasets [19, 28,29,30].

In light of the difficulty in performing an RCT, this study, using a propensity score (PS) matched cohort, aimed to determine whether privately insured THA patients who received inpatient rehabilitation, compared to those who did not, had better patient-reported joint and health scores across the first 12 months post-surgery. Informed by the high- and low-level evidence in this area, we hypothesized that the inpatient pathway would not procure superior patient-reported recovery to a level that was clinically meaningful over the alternative.

Methods

This study was nested within a national, prospective observational study involving 19 hospitals across five States [31] (ClinicalTrials.gov Identifier: NCT01899443). We have previously reported the results of a related PS matched cohort aimed at determining the effectiveness of a rehabilitation pathway involving inpatient rehabilitation compared to one not involving inpatient rehabilitation following TKA [32]. The PS is the probability of treatment assignment conditional on observed baseline characteristics [33,34,35,36,37,38]. Here, matching people on their probability of being allocated to inpatient therapy addresses treatment selection bias based on known patient characteristics.

Ethical approval and funding source

Ethical approval was provided by multiple ethics committees (see Declarations). Informed participant consent was obtained. The HCF Research Foundation funded the study, but had no role in study design or data analysis.

Sampling and hospital recruitment

Hospitals contributing to the AOANJRR [7] performing over 275 TKA or THA procedures in 2012, were eligible to participate in the larger study. Related publications describe the sampling procedure which procured 10 public and nine private hospitals [31, 32, 39].

Patient enrolment and eligibility

Patient enrolment and eligibility were identical to that described in an aforementioned study [32]. People awaiting arthroplasty secondary to osteoarthritis between August 2013–December 2014 were screened by site coordinators during their pre-admission review. Eligibility criteria for the larger study included primary surgery, no further arthroplasty planned within 3-months, and the ability to understand the protocol and be followed-up by telephone for 1 year. For this nested study, being privately insured was an additional inclusion criterion. Publicly insured patients were explicitly excluded because the few referred to inpatient rehabilitation (20/317, 6.3%) were primarily referred based on a clinical or social need (14/20). To reduce treatment selection bias and improve generalizability to uncomplicated THA, we also subsequently excluded the privately insured patients referred to inpatient rehabilitation due to slow progress, inadequate social supports (e.g living alone or without the help of an able carer), and those with conditions that would alter their rehabilitation pathway (e.g bilateral surgery or significant complication (defined below) up to 90-days post-surgery).

Data collection

Patient-level variables collected prior to surgery (baseline) included weight, height, insurance status, comorbid conditions including requirement for daily medication, education level, presence of other lower limb or back conditions impairing mobility and patient-reported outcomes measures (PROM). Additional patient-specific data via clinician-completed forms were collected by site coordinators - American Society of Anesthesiologists (ASA) scores, complications, length of stay (LOS), discharge destination, and reason for referral to inpatient rehabilitation if relevant. Patient-reported outcomes, including surveys, ongoing rehabilitation details, complication, and time-off-work (patient, carer) were obtained by researchers by telephone at 35, 90 and 365-days post-surgery. For quality control, clinician-collected data were re-abstracted from the medical record by researchers; significant patient-reported complications were verified by treating hospitals, surgeons or general practitioners.

Complications

Acute complications were stratified according to the Clavien-Dindo Classification whereby the code applied (1–5) is based on the type of care deviation necessitated by the complication [40]. People experiencing the more severe complications (coded ≥3) (e.g pulmonary emboli or dislocation), were excluded. Those experiencing a less severe complication, but one potentially affecting longer-term recovery (e.g neuropraxia) were also excluded. Patients experiencing any surgery-related complication requiring re-admission or any unrelated health event requiring hospital admission up to 90 days post-surgery, were also excluded.

Outcomes

The Oxford Hip Score (OHS) [41] and the Euroqol Visual Analogue Scale for health ‘today’ (EQVAS) [42, 43], were the primary outcomes. The EQVAS was obtained using a VAS at baseline and verbally at the time of follow-up as the methods have been shown to be equivalent in people awaiting arthroplasty [42].

Rehabilitation charges were determined from the perspective of the provider and the methodology is described elsewhere [32]. Table 1 summarizes the unit costs (AUD) applied for each rehabilitation ‘type’ encountered. Total provider charges for each participant were calculated based on the length of stay in a rehabilitation facility (where applicable), and the type and frequency of community-based therapy received up to 10-weeks post-surgery, including clinic, domiciliary, supervised hydrotherapy, and day hospital visits. Individuals in the ‘No inpatient rehabilitation’ group who also had no community-based rehabilitation were assigned $0 rehabilitation provider charges.

Table 1 Unit charges assigned to each rehabilitation ‘type’ encountered

Analysis

The projected sample size for the larger study was ~ 2000. Assuming our recruitment pattern was consistent with patterns observed in the AOANJRR [7], we anticipated the definitive sample would comprise 40% THA recipients (n = 800), and most of those (60%) (n = 480) would be private insured. We conservatively predicted, therefore, our sample for this nested study would exceed 128 people (64 in each treatment group) which is what would be required (80% power, 0.05 significance level) to detect a minimal important between-group difference of 5 points (0.55 of the baseline SD) [41] in the OHS. A sample of 17 would be sufficient to detect a minimally clinically important improvement of 23 (1.36 SD [42, 43] baseline mean) for the EQVAS.

We assessed normality of the data prior to analysis. Mean, median and percentages were used to characterize the cohorts. Covariates of the unmatched cohorts were assessed using independent t-tests and χ2, whilst covariates and outcomes in the matched (paired) cohorts were assessed using paired t-tests, Wilcoxon signed-rank tests and McNemar tests [33, 34].

Propensity score determination and subsequent matching were determined as follows:

i) PSs for the prediction of referral to inpatient rehabilitation were estimated for all eligible patients using unconditional logistic regression [33,34,35]. Here, the outcome was treatment group and model covariates included variables thought to influence treatment assignment prior to surgery or the primary outcomes [33,34,35]. Covariates included baseline Oxford and EQVAS scores, age, body mass index (weight [kg]/height [m]2), gender, ASA scores, comorbidity (any condition of the cardiovascular, renal, respiratory, endocrine, liver, central and peripheral nervous systems, and excluding musculoskeletal conditions), other lower limb or back problems limiting mobility, surgical approach (‘anterior’ or ‘other’), and education level (Yr 8 or below, Yrs 9–10, Yrs 11–12, Degree qualified). We also included any measured variable that was found to be significantly different between the treatment groups prior to matching. For this reason, employment status was also used for matching.

ii) We assessed the distribution of the PSs across the two treatment groups using visual inspection and descriptive statistics to ensure there was sufficient overlap (area of common support [33,34,35]) in probabilities to receive inpatient rehabilitation.

iii) Using caliper matching without replacement [32, 33] we then used the PSs to match each patient who received inpatient rehabilitation in a 1:1 ratio with one who did not. A caliper distance of an absolute difference in PS of 0.05 was employed, equating to 0.28 of the pooled SD of the PSs for the unmatched cohorts. Successful matching was indicated by achieving a standardized mean difference (SMD) [34] between the groups < 0.20 for any given variable (differences < 0.1 are considered negligible [36]; differences up to 0.25 are considered tolerable [35]), and an absence of statistical significant difference (p <  0.05).

Variables not included in the matching were ‘hospital’ and ‘surgeon’. The inclusion of these caused non-convergence of the estimate in the PS calculation owing to high correlation between surgeon or hospital and treatment group. To assess the influence of ‘hospital’ or ‘surgeon’ on primary outcomes, we performed multivariable linear regression analyses incorporating these variables.

As treatment allocation was not random, we considered the experience of a second arthroplasty (different joint) or later joint-related complication after 90 days as a potential between-group confounder of the primary outcomes. Thus, we conducted sensitivity analyses on the 365-day outcomes excluding persons who experienced such events.

We also performed sensitivity analysis using Rosenbaum bounds [37, 38] on the matched groups to determine the possible effect of unmeasured confounders on the conclusiveness of our results. A sensitivity parameter ‘Gamma’ was defined in the Rosenbaum bounds analysis. The Gamma indirectly informs how large the effect of the unmeasured confounder would need to be to change the conclusiveness of the matched results. The larger the gamma the more robust the result to an unobserved confounder. A Gamma of 1.2, for example, indicates that an unmeasured confounder would need to increase the likelihood of a patient being assigned to the inpatient group by a factor of 1.2 before the conclusiveness of the results are threatened [36, 44].

All analyses were conducted using SAS V9.4 (SAS Institute Inc. Cary, NC, USA) and the psens function in the rbounds package of R. No imputation was undertaken for missing data.

Results

Figure 1 summarizes cohort ascertainment. From an eligible sample of 431 privately insured primary, unilateral THA patients from 12 hospitals, 123 pairs were matched.

Fig. 1
figure 1

Cohort ascertainment

Unmatched cohort

In the unmatched cohort, a PS was estimable for 403 patients. The mean (SD) propensity score for the sample was 0.34 (0.18). Assessment of the overlap of the PSs across the unmatched groups revealed sufficient common support between the two treatment groups [Inpatient Group, n = 136 - mean 0.44 (0.18), min 0.06, max 0.85; Non-inpatient Group, n = 267 – mean 0.29 (0.16), min 0.02, max 0.85].

Unmatched versus matched cohorts

In the unmatched cohort, the inpatient rehabilitation group was older, had more males, had higher levels of comorbidity, and worse baseline OHSs (Table 2). Unadjusted comparisons between the two groups revealed significantly worse health scores at every time point and worse OHS at 90 days for the Inpatient Group (Additional file 1). Matching lowered the SMD for all but one covariate (it went from 0.10 to 0.11); the largest SMD was 0.14 (Table 3). Acute-care LOS was significantly longer in the Inpatient Group in both the unmatched (6.2 (2.2) vs 4.6 (1.7) days) and matched (5.9 (2.1) vs 5.0 (2.0) days) conditions.

Table 2 Characteristics of total hip arthroplasty recipients: Pre-matching
Table 3 Characteristics of total hip arthroplasty recipients: Post-matching

Outcomes – Matched cohort only

35- and 90-days: Table 4 summarizes the comparisons between the matched groups. The Inpatient Group did not have superior outcomes at 35- or 90-days. Time-off-work outcomes were not significantly different. Median rehabilitation provider charges were significantly higher in the Inpatient Group regardless of whether inpatient provider charges were included in the calculation. The Inpatient Group received a greater number of physiotherapy sessions in the community (day hospital as well as clinic) [median difference 5 (IQR 9,0); Inpatient Group - 6 (IQR 2,10), Non-Inpatient Group 0 (IQR 0,3)] (Table 5). A greater proportion of the Inpatient Group were receiving community-based rehabilitation beyond week 10 (23.9% [95%CI 17.1–32.4] vs 12.2% [7.2–20.2]).

Table 4 Outcomes at 35, 90 and 365 days in the matched cohorts
Table 5 Profile of community-based supervised rehabilitation received amongst those who received it

365-days: The Inpatient Group did not have superior outcomes at 365 days; the median OHS in both groups was the maximum score possible (48). Sensitivity analyses for the 365-day outcomes involving 114 pairs, yielded the same results (Additional file 2). In the analyses incorporating hospital and surgeon, neither significantly influenced the primary outcomes at any point (Additional file 3). Sensitivity analysis exploring the Rosenbaum bounds for the possible effect of unmeasured confounders yielded gamma values from 1.1 to 6.0 (Additional file 4).

Discussion

Results from a related PS matched cohort involving uncomplicated TKA recipients indicated that the inpatient rehabilitation pathway, compared to the non-inpatient pathway, did not provide superior patient-reported outcomes up to 1-year post-surgery [32]. The current study suggests the same is true for THA recipients. Importantly, the median between-group differences for the OHS and EQVAS were much smaller than the differences estimated to be clinically important. Further, the inpatient rehabilitation pathway was associated with far higher rehabilitation provider charges, and similar time-off -work outcomes for patients and carers. Overall, our findings align with the one aforementioned RCT conducted in this area involving hip arthroplasty [25], and highlight the impressive levels of patient-reported recovery typically seen after this surgery regardless of the rehabilitation pathway taken.

Our study has several strengths. The use of propensity-based scoring has allowed us to report the comparison between two rehabilitation pathways in simple, absolute terms of the outcomes whilst keeping measured covariates balanced between the groups. Most importantly, acceptable matching across all our measured covariates was achieved indicating that any differences in the outcomes observed are not attributable to confounding resulting from differences in these variables. Further, we removed other sources of ‘noise’ through our strict eligibility criteria, thus, between-group differences in the rate of complications or other major health events are not a source of bias. Finally, our cohort was derived nationally, thus, the characteristics of the unmatched and matched samples should reasonably reflect private patients who undergo THA in Australia.

Despite its advantages, a limitation of the PS method is that bias from unmeasured covariates that may be influencing treatment allocation or outcomes is not eliminated [33,34,35,36,37]. Here, surgeons and hospitals could be viewed as possible confounders. Our supplementary regression analyses, however, allowed us to adjust for the effect of these with neither having a significant effect on outcomes. We did not match on prosthesis type (as there were too many), though we did adjust for what may be considered more important (surgical approach). There is still a possibility that we have not accounted for all unknown confounders, but we think it unlikely we missed important ones. Our sensitivity analyses on the possible effect of unmeasured confounders indicate that an unknown confounder would need to change the probability of being allocated to inpatient rehabilitation by at least a factor of 1.3 for five of our outcomes in order to change the statistical significance of the results. Further, if an unmeasured confounder changed a result from non-significant to significant, the effect could still favor discharge directly home. Given we have included many of the known potential confounders in our PS analysis and supplemented this with multiple regressions including ‘surgeon’ and ‘hospital’, we contend the important variables are covered. Further, we are not aware of any other unmeasured confounder that the literature points to that would lead us to change our conclusions. That our results concord with the one RCT conducted in this area [25] and several observational studies [28,29,30], it would appear we have not overlooked major confounding influences.

A potential limitation of our study is the lack of physical performance measures [45]. It remains to be seen if inpatient rehabilitation improves gait or strength more so than rehabilitation provided in other settings. We note gait outcomes have not been shown to be superior in supervised community-based settings compared to home programs when conducted within 8-weeks of surgery [10, 17]. It is possible the same is true for inpatient rehabilitation.

An important caveat is that our results apply to those who have had an uncomplicated recovery and to those for whom discharge destination was a choice. We contend then that the inpatient rehabilitation pathway remains an important option for those with complications, limited social supports, and poor progress acutely. Based on the criteria used here, we estimate that a small minority (approximately 14%) of privately insured patients may require inpatient rehabilitation based on these three conditions (9.1, 4.1 and 1.2% respectively) (Fig. 1). We note, however, recent data from the US suggest discharge to inpatient rehabilitation facilities following THA is associated with increased morbidity [23]. A reasonable approach, therefore, would be to avoid referral to inpatient facilities unless alternatives are exhausted.

A final important caveat is that our cost data pertain only to the first 90 days after surgery and only include rehabilitation provider charges. It is likely that there were continued cost differences beyond this time given a greater proportion of the Inpatient Group were still receiving community-based therapy beyond 10 weeks.

Conclusions

In conclusion, in the absence of sufficient high-level evidence to date, our study provides robust and useful signals concerning the comparative effectiveness and provider charges of a pathway that includes inpatient rehabilitation after uncomplicated THA. Consistent with the one published RCT in this space [25], our data indicate a rehabilitation pathway incorporating inpatient rehabilitation is not likely to be cost-effective for uncomplicated THA recipients. Our data and data elsewhere support the use of community-based and home-based programs to secure impressive levels of patient-reported recovery.