Introduction

Total hip replacement (THR) is a common operation with low revision rates [1], excellent patient-reported outcome measures (PROMs) [2] and low mortality [3]. Efforts to improve outcomes focus on specific factors, such as implants [4] and thromboprophylaxis [5]. Surgical approach is a relatively simple way to effect outcomes. Although attention has been given to surgical approach over the last decade (e.g. use of mini-incisions, and more recently the anterior approach) [6, 7], there is a lack of well-designed studies comparing outcomes when using different hip approaches.

For THR, the hip can be approached anteriorly, laterally, posteriorly or by detaching the greater trochanter (trans-trochanteric). Each approach can be performed through a limited (minimally invasive) incision/s, although this is extremely rare for the trans-trochanteric, leaving seven common approaches. The anterior approach is intermuscular, the lateral passes through the major hip abductors and the posterior through the short external rotators. The type and extent of soft tissue damage and bleeding caused by each approach differs and thus influences outcome [8,9,10,11].

Using data from the National Joint Registry (NJR) for England, Wales, Northern Ireland and the Isle of Man, we compared implant survivorship, PROMs and post-operative mortality between the seven common surgical approaches used for primary THR.

Methods

Our initial cohort comprised 890,681 linked THRs, performed between 1 April 2003 and 31 December 2016 [12]. We excluded 21,549 (2·4%) where data was collected on version 1 of the minimum data set collection form (MDS v1) that combined the anterior and anterolateral approach. Of the remaining 869,132, we selected 800,555 where osteoarthritis (OA) was stated as an indication for surgery but, further, sequentially excluded any that reported, in addition to OA, fractured neck of femur (1074), fractured acetabulum (209) and previous arthrodesis or failed hemiarthroplasty (253), leaving 799,019 for initial analysis (Fig. 1), with a maximum potential follow-up of 13.75 years.

Fig. 1
figure 1

Study selection criteria. Abbreviations: OA = osteoarthritis; MoM = metal-on-metal

Surgical approach was grouped as (i) posterior, (ii) lateral, anterolateral, Hardinge (an eponymous name for a variant of the lateral approach), (iii) anterior or other and (iv) trans-trochanteric (overriding the others). These groups were further subdivided according to whether or not minimally invasive surgery (defined as skin incision lengths of less than 10 cm) was used, giving 8 potential subgroups. There was a secular increase in the use of the posterior approach (Additional file 1: Table S1 and Fig. S1). In total, 40,552 (5·1%) of the operations used minimally invasive surgery; the proportions of minimally invasive cases per year have decreased and plateaued (Additional file 1: Fig. S2).

Additional file 1: Table S2 summarises the characteristics of those in each approach group, including the type of implant used. As body mass index (BMI) was not collected in the early years of the registry (MDS v1) and data completeness for BMI improved over time, the data on this variable were not available for all patients. Metal-on-metal (MoM) hip replacements (including resurfacings) are strongly associated with higher failure rates [13] and have now largely fallen into disuse, so we focussed on the remaining 724,165 (90·6%) non-MoM hip replacements. Finally, as only 261 of these THRs underwent minimally invasive trans-trochanteric surgery, this group was excluded, leaving 723,904 for analysis (Fig. 1). Demographics for the final 7 groups are shown in Table 1.

Table 1 Numbers in the approach groups with demographic information

Revision and short-term mortality

Survival analyses were used to compare the revision rates and 90-day mortality between groups. The unit of analysis was the THR. Although 5967 had been implanted as part of a bilateral pair, these were retained in both sets of analyses as they did not necessarily have the same approach. For revision, time was measured from the date of the primary to the date of first revision, if any, censoring at the end of 2016 or at death if that occurred first. For mortality, time was measured from the date of the primary to death, censoring at 90 days or at the end of 2016 if that was earlier.

Confounders for both sets of analyses were age at primary, sex, ‘risk group’ (i.e. had additional indications to OA for surgery), American Society of Anesthesiologists (ASA) grade and THR fixation. Year of primary, femoral head size and BMI (values < 10 and > 60 kg/m2 were excluded as likely to be invalid) were also considered.

For mortality, other known confounders were thromboprophylaxis and anaesthetic [3]. Also included was a measure of area deprivation, the Index of Multiple Deprivation (IMD), obtained via linkage to Hospital Episode Statistics (HES) inpatient records (for patients with National Health Service (NHS)-funded procedures in England). The IMD is an overall ranking of the patients’ ‘Super Output Area’ of residence, rank 1 being the most deprived area, 32,482 the least; we grouped the patients according to the quintile of their area rank.

A series of Cox ‘proportional hazards’ (PH) regression models were used to compare outcomes between groups, with stratification to allow subgroups of confounder variables to have different baseline hazard rates. For revision, the analyses were supplemented with Flexible Parametric Survival Modelling (FPM) [14]. The latter gave the analyses more scope and allowed exploration of the temporal changes in the effects of confounder variables as previously utilised in NJR analyses [13].

Patient-reported outcome measures (PROMs)

The NHS PROM programme has collected PROMs for unilateral THRs in England since 1 April 2009 [15]. Patients complete validated PROM questionnaires 2 weeks before their primary operation (Q1) and approximately 6 months afterwards (Q2). Each questionnaire includes (a) the Oxford Hip Score (OHS) [16], derived from 12 questions asking about pain and mobility (over the previous 4 weeks); (b) the EuroQol-5D (EQ-5D) Health Scale [17], a visual analogue scale representing the patient’s health status on the day (0 = worst imaginable; 100 = best imaginable) and (c) the EQ-5D-3L Index (3 response options for mobility, self-care, performing usual activities, pain/discomfort and anxiety/depression). For (c), we looked at the responses to each of the 5 questions, rather than the composite ‘Index’. We also report on additional questions asked at Q2 about self-reported post-operative problems.

The NJR was linked to the PROM dataset via HES identifiers. PROM data were incomplete. Where multiple PROMs had been collected, the best ‘episode matched rank’ was used.

Statistical methods included regression analyses for OHS at Q2 with covariate adjustment for differences at Q1 (using fractional polynomials to linearise the relationship between Q2 and Q1), logistic regression (binary target variable) and ordered logistic (proportional odds) models for ordered categories. Non-parametric analyses (Kruskal-Wallis test) were used for complementary analyses where distributional assumptions were not met satisfactorily and followed by pairwise comparisons with Dunn’s method.

Throughout, the ‘conventional’ posterior approach (i.e. without minimally invasive surgery) group was used as the reference; further comparisons were made between minimally invasive surgery ‘yes’ vs. ‘no’ within other main approach groups. No adjustments for multiple comparisons have been made. All statistical analyses were conducted using Stata version 14·2 (Stata/SE 14·2 software, StataCorp LLC, Texas, 1985–2015).

Results

Revision

A total of 12,989 (1.8%) of 723,904 implants were revised during follow-up; 84,294 (11.6%) died without undergoing revision. Indications for revision surgery by primary surgery approach group are summarised (Additional file 1: Table S3). Figure 2 shows the estimated cumulative percentage revised (Kaplan-Meier) up to 12 years for the 7 approach groups.

Fig. 2
figure 2

Cumulative percentage revised (Kaplan-Meier) up to 12 years for the 7 surgical approach groups

There were some differences in possible confounding factors between groups (Table 1); we adjusted for these using two approaches. The first set of analyses used a Cox PH regression model stratifying by 16 age/sex/risk subgroups (age < 55, 55–64, 65–74 and 75+ years at primary; male vs. female; risk group, i.e. ‘other indication than just OA’, no vs. yes). Exploratory graphical checking suggested that the hazard rates for these subgroups were not proportional; hence, the stratification allowed their baseline hazards to differ. Table 2 (i) shows the results of these analyses. In total, 723,747 cases had complete data on age and sex and all had complete data on ASA and fixation. The middle column (Table 2 (i)) shows results with fixation and ASA as ‘fixed’ effects. Whilst both fixation and ASA were significant, graphical checking suggested their effects may be time-varying. However this was explored more using FPM described below. Year of primary was added to the model but was not statistically significant (P = 0.125, likelihood ratio test). Neither was femoral head size 36 mm and above in those where head size was known (P = 0.256; n = 723,635). Finally, BMI subgroup was added, as in the right-hand side of Table 2 (i) (P < 0.001). The unadjusted model showed that, compared with the conventional posterior approach, there was a higher risk of revision with the conventional lateral (P = 0.009), minimally invasive lateral (P < 0.001), minimally invasive anterior (P < 0.001) and trans-trochanteric (P = 0.004) approaches. Only the difference for the conventional lateral (P < 0.001) and trans-trochanteric (P = 0.003) approaches remained significant after adjustment.

Table 2 Regression models to compare approach groups for revision risk (n = 723,747 with complete information)

The second set of analyses used FPM. The baseline hazard was modelled with 4 degrees of freedom (df); time-varying effects of age (as 4 restricted cubic splines), sex and risk group were modelled with 4, 1 and 1 df respectively (Table 2 (ii)). Further adjustment was made for fixation and ASA (here after combining P3 and P4/P5) which were also time-varying (and modelled with df = 2 and 3 respectively). We were unable to obtain convergence when we added BMI to this model. Similar to the Cox model, the unadjusted analyses showed a higher risk of revision for the conventional lateral (P = 0.003), minimally invasive lateral (P < 0.001), minimally invasive anterior (P < 0.001) and trans-trochanteric (P = 0.002) approaches. These findings persisted in the adjusted models (conventional lateral (P < 0.001), minimally invasive lateral (P < 0.001), minimally invasive anterior (P < 0.001) and trans-trochanteric (P < 0.001)).

The FPM model demonstrated that the minimally invasive lateral and anterior approaches have higher failure rates than their corresponding conventional approaches, but with the Cox model this effect dissipates with adjustment for BMI (Table 2 (i)).

PROMs

PROMs were able to be linked to 289,296 (54%) of the 533,477 unilateral THRs undertaken since PROM monitoring began. The cohort with PROMs was broadly similar to the remainder over this period in respect of age, sex, ASA, risk group and fixation (data not shown but available).

Oxford Hip Scores (OHS)

The pre-operative measurements (Q1) were normally distributed, but the post-operative measurements (Q2) were highly skewed (Additional file 1: Fig. S3) with a median score of 42 (IQR 35–46). There were differences between the groups in pre-operative Q1 scores (Additional file 1: Table S4).

Additional file 1: Table S5 shows between group comparisons of the Q2 scores. A non-parametric comparison (i) is shown because of extreme skewness. To adjust for differences in Q1, a regression model was sought. The relationship between Q2 and Q1, however, was non-linear; fractional polynomials of Q1 were explored in conjunction with the regression and the transformation that best linearised this relationship was ln(1 + Q1 score). Comparisons between the mean Q2 scores with adjustment for differences in (transformed) Q1 scores are shown in (ii). The results were virtually unchanged from (i) except the difference between (3) and (4) (effect of minimally invasive surgery for ‘Lat/Ant-Lat/Hard’) was no longer significant. Residuals from this analysis were only approximately normal, but analysis using robust variance estimation (shown in parenthesis) left the results unchanged.

Additional file 1: Table S5 also shows the improvements in OHS between Q1 and Q2, calculated simply from ‘Q2 minus Q1’. Because of the ceiling effect of the OHS (the maximum attainable score being 48), the potential to ‘improve between Q1 and Q2’ depends on the initial score; the right-hand side column of Additional file 1: Table S5 shows the ‘potential’ for improvement, calculated from ‘48 minus Q1’.

Both of these showed statistically, but not clinically [16], significant differences between the groups. We also repeated (i) and (ii) using proportional odds ordered logistic regression (not shown) with similar findings.

EQ-5D-3L Health Index

Analysis of the EQ-5D Health Scale (Visual Analogue Score) results also showed statistically, but not clinically significant differences between the subgroups (Additional file 1: Table S6).

EQ-5D-3L Health Index question responses are shown for complete pairs (Q1 and Q2) in Additional file 1: Tables S7 (a) to (e). Responses to each question took the form of 3 ordered categories as listed in these tables. In each case, logistic regression analyses were used to compare the approach groups at Q2, after first combining 2 of the 3 categories as indicated in the footnote of the table; both adjusted and unadjusted models showed differences between the approaches.

Self-reported post-operative problems

We analysed the PROM self-reported post-operative problems (Additional file 1: Table S8). There were significant differences between groups. Comparing the 2 commonest approaches, the lateral conventional approach was associated with slightly higher rates of complications than the conventional posterior approach.

90-day mortality

Figure 3 shows the cumulative mortality up to 90 days (Kaplan-Meier) for the 7 groups (logrank P < 0.001).

Fig. 3
figure 3

Cumulative mortality up to 90 days (Kaplan-Meier) for the 7 surgical approach groups

Our previous work on mortality after hip replacement [3] had identified confounding factors, and a series of univariable analyses (data not shown, but available on request) confirmed these. Thus, in our analysis shown in Table 3, we have adjusted for these factors. Analyses included a stratification for ‘risk group’, as the hazard rates were not proportional over time. Age and year of primary THR were analysed as continuous variables, rather than ‘grouped’, and modelled using restricted cubic splines. In all models, the conventional lateral approach was associated with a higher risk of mortality than the conventional posterior approach. There were no other significant differences in mortality compared to the referent conventional posterior approach group. Similar findings were demonstrated when models were also adjusted for Charlson comorbidity subgroups, which were based on HES inpatient admissions within 5 years prior to date of primary THR operation (Table 4).

Table 3 Cox ‘proportional hazards’ regression model to compare 90-day mortality between the 7 approach subgroups
Table 4 Cox ‘proportional hazards’ regression model to compare 90-day mortality between the 7 approach subgroups. Models from Table 3 were also adjusted for Charlson comorbidity subgroups, which were based on HES inpatient admissions within 5 years prior to date of primary THR operation

Duplication of data by inclusion of the bilateral implants did not impact the findings; only 17 of the 2675 90-day deaths were associated with bilateral THRs.

Discussion

We have compared the 7 common surgical approaches to the hip joint used in THR by risk of revision, PROMs and 90-day mortality. In the fully adjusted Cox model, the conventional lateral and conventional trans-trochanteric approaches were associated with higher risks of revision compared to the conventional posterior approach. All other approaches (including all minimally invasive approaches) were not associated with an increased revision risk. However, the fully adjusted FPM which accounts for time-varying effects showed higher risk of revision with both the conventional and minimally invasive lateral approaches and the minimally invasive anterior and the trans-trochanteric approaches.

Composite PROM scores showed statistical, but not clinically significant, differences. This is similar to findings from other registries [18]. Self-reported specific complications such as bleeding and reoperation were higher with the conventional lateral approach than the conventional posterior approach, but again, there were no other consistent meaningful differences found. Ninety-day mortality was higher in the conventional lateral approach group when compared with the conventional posterior approach, but there were no other significant differences.

Registry data is observational and thus causation cannot be attributed. However, the associations shown here are consistent even with thorough adjustment for confounders. Some BMI data is missing as this was not collected in the early years of the NJR. Previously when analysing this dataset, we have used multiple imputation for BMI data, but this did not affect the results [3], suggesting data was missing at random. Any missing revisions would also be expected to be missing at random when analysing by primary approach. Registry data is generalisable as it encompasses the entire population and is thus less susceptible to selection and reporting bias. Some iterations of the NJR MDS and reporting of the data from those versions of the MDS did not specify ‘anterior’ as a separate category of approach, but included it with ‘other’; therefore, these groups were combined. It is possible that some of the cases defined as ‘other’ were not anterior and did not fit into any of the other approach groups defined in this analysis or their minimally invasive variants, but we believe the number of such cases would be small given the limited options for other hip approaches.

Furthermore, limited information is collected in the NJR regarding what specific minimally invasive surgical techniques were used, other than the use of a small skin incision, with these approaches only accounting for a small subgroup (5.1%) of the cohort. Although we recognise further research is needed into defining and categorising minimally invasive approaches, we considered it important to include these patients given a conscious effort was made by the surgeon to minimise the incision length, which may influence the technical performance and/or outcome of the THR.

A number of small randomised controlled trials (RCTs) have compared the effect of two THR approaches; however, these have limitations. Cheng et al. (n = 72) observed that the anterior approach had similar results at 12 weeks to the posterior approach in terms of complications, patient-reported outcomes and gait, aside from neuropraxia which was more common following the anterior approach [11]. Taunton et al. (n = 101) found that although there were some early benefits of the anterior approach (e.g. earlier discontinuation of walking aids), it was similar to the minimally invasive posterior approach at 1 year for PROMs, complications and radiographic parameters [19]. Two trials comparing the lateral with the posterior approach showed the posterior approach had greater improvement in muscle strength at 1 year (n = 47) [20], and fewer dislocations at an average 3-year follow-up (n = 196) [21]. A recent RCT (n = 164) found the lateral approach was associated with more abductor muscle weakness and worse PROMs, but the anterior approach had more nerve injuries [22]. A systematic review, including 12 RCTs, reported similar risks of dislocation, nerve injury, infection and venous thromboembolism between the posterior approach and the minimally invasive posterior approach [10].

The RCTs comparing surgical approach are not large enough to compare less common but important events such as revision and mortality rates. We observed that the conventional lateral approach was associated with higher revision and mortality rates than the conventional posterior approach. Previous studies have also observed higher revision rates (for all causes apart from dislocation) [23] and higher mortality rates [3] with the lateral compared to the posterior approach. The posterior approach is a more muscle-sparing approach compared with the lateral approach and is associated with less bleeding [8]. Problems seen more commonly following the anterolateral approach include nerve injury [8], reduced muscle strength [9] and limping [10]. These problems invariably influence patient mobility, especially in the early post-operative phase, when the mortality risk is higher. Previous work on mortality after THR showed lower all-cause mortality, and mortality from respiratory issues with the posterior approach; this finding is most likely related to improved mobility after surgery [24]. It is therefore plausible that a combination of these factors contribute to the higher revision and mortality rates observed for the lateral approach compared to the posterior approach.

The conventional lateral approach (36.0%) is the second most popular approach and is currently used annually in over 20,000 primary THRs in the NJR. This approach was associated with worse outcomes in all measures than the commonest approach, the conventional posterior (54.5%). Therefore, the data presented here does not support its continued use over alternative approaches. Although the conventional trans-trochanteric approach was associated with higher revision rates (compared with the posterior approach), it is acknowledged that this approach is rarely used now, and when it is used it is usually for more complex cases (such as hip ankylosis, proximal femoral deformities and acetabular protrusio), which may explain the inferior outcomes observed here with the conventional trans-trochanteric approach. By contrast, the posterior approach is common, extensile and associated with good outcomes in comparison to other approaches. Therefore, we recommend the posterior approach should be considered the current standard surgical approach for THR. It would be difficult and perhaps unwise to attempt conversion of experienced surgeons to an approach with which they may be unfamiliar. However, surgeons in training should be taught alternative approaches to the lateral associated with better outcomes, like the posterior approach. The data does support continued use of minimally invasive approaches, with acceptable mortality and PROM outcomes, although minimally invasive lateral and anterior approaches may be associated with higher revision rates than their corresponding conventional approaches.

Conclusion

The lateral approach was associated with worse outcomes, including more deaths and more revisions, than the posterior approach for primary THR. We thus recommend that new surgeons do not routinely use the lateral approach for THR. The conventional posterior approach is common and associated with at least as good outcomes as other less common approaches; therefore, we recommend the posterior approach should be considered the current standard approach for THR and should be used when training future surgeons. As minimally invasive posterior approaches and the conventional anterior approach had outcomes in our observational data similar to the conventional posterior approach in all domains, these should be assessed further in randomised controlled trials and compared with the conventional posterior approach.