Introduction

Since 1996, more than one million metal-on-metal (MoM) articular couples have been implanted in patients worldwide and it is estimated that nearly 93,000 of these procedures included articular surface replacement (ASR) acetabular components (DePuy Orthopaedics, Warsaw, IN, USA) [11, 16]. The ASR hip resurfacing system and the ASR XL for THA (ASR and ASR XL; DePuy Orthopaedics) were commercially introduced in 2004. This system including both the resurfacing and the THA versions was recalled in 2010 as a result of an unexpectedly high early revision rate [30].

One of the critical issues in arthroplasty during the last few years has been to establish an appropriate protocol for monitoring patients who have implants with MoM bearings. The goal has been to identify specific measurable parameters that can be used, alone or in combination, to screen and stratify which patients are likely to develop adverse local tissue reactions (ALTRs) and thus need either revision or close monitoring. This has proven to be challenging as we learn more about the complex interrelationships among tissue reactions, symptoms, sex of the patient, activity level, serum metal ion levels, component position, modular junctions, and component head size [16]. Further complicating the matter is that higher risk patients need to be identified before severe tissue damage occurs and to do so, these patients need to be assessed at regular intervals to minimize excessive tissue damage [21]. Additionally, the financial burden and potential patient distress need to be considered and addressed. A recent algorithm published as a consensus statement of the American Association of Hip and Knee Surgeons, the American Academy of Orthopaedic Surgeons, and The Hip Society stratifies patients into low-, moderate-, and high-risk groups based on several of the parameters mentioned [16]; however, there are still many unanswered questions such as how often we should perform imaging studies or measure systemic metal ion levels, especially if patients are completely asymptomatic. Should we be concerned about patients who are subjectively doing well? The consensus statement emphasizes that clinicians should avoid relying too heavily on any single investigative tool in the decision-making process, which accentuates the complexity of risk stratification.

Adverse local tissue reactions around THAs are an important reason for failure of MoM implants [18]. Previous studies have shown the prevalence of ALTRs, based on MRI, to be as high as 57% to 69% [5, 6]. Several studies have also shown that the survivorship of ASR XL THA is significantly poorer than for ASR resurfacing yet the reason for this is not fully understood [18, 24]. Interestingly, some studies have also reported that many ALTRs occur in asymptomatic patients [4, 31]; this is cause for concern because patients may present with symptoms only after severe collateral tissue damage has occurred, making reconstructive procedures challenging and outcomes unpredictable [21]. Also, little is known about capsular dehiscence patterns as ALTRs decompress from the hip into the surrounding tissue planes. The dehiscence patterns may also influence the onset and severity of patient symptoms. Previous incision patterns into the hip may play a role by providing a path of least resistance for increased intracapsular pressure. Both anterior and posterior disruption of the pseudocapsule have been reported with posterior disruption possibly more common [7].

Through a multicenter study approach, we asked: (1) Is ALTR location related to the surgical approach used for arthroplasty in patients who underwent hip arthroplasty (resurfacing or THA) with a single, recalled hip arthroplasty system? (2) Do ALTR severity and location affect patient-reported outcomes in these patients? (3) Is ALTR severity different between patients who received the resurfacing version of this component (ASR) and those who received the THA implant in this system (ASR XL)?

Patients and Methods

We performed a prospective, multicenter, followup study of the ASR hip system (ASR resurfacing and ASR XL for THA). The purpose of this multicenter study was to collect longitudinal data on a population treated with the ASR hip system that had not been revised and was undergoing regular followup. The data were collected prospectively with the aim of assessing potential predictors for various outcomes including ALTRs and patient-reported outcomes. Data were collected at Massachusetts General Hospital in Boston, MA, USA, from the various participating sites through a secure, web-based data entry and monitoring system. All patients meeting our study inclusion criteria were enrolled from 15 sites worldwide: any patient (1) with on-label use of the ASR XL or ASR component system currently implanted; (2) able to provide informed consent previously approved by institution’s institutional review board or ethics committee; (3) able to return for followup annually for 5 years; and (4) able to complete the required patient-reported outcome measures. Exclusion criteria were: any patient (1) with off-label use indications for the ASR XL or ASR component system; (2) who received the ASR XL implant as a result of a hip resurfacing conversion or revision THA; and (3) with difficulty in comprehending the informed consent form for any reason.

Overall, 1258 patients (1427 hips) from 15 sites in five countries (United States, United Kingdom, South Africa, Denmark, Australia) were enrolled in the multicenter study. For our analysis, we only included data from two centers (255 patients from Center A and 33 patients from Center B) that routinely used metal artifact reduction sequence (MARS) protocol MRI as part of annual followup for ASR patients (Fig. 1). This subcohort of 288 patients (333 hips) was recruited from September 2012 to October 2014 and had an MRI of the hip performed at a mean time of 6 years postindex surgery (range, 1–10 years). The mean age of these 288 patients was 61 years at index surgery (range, 25–95 years) and 113 (39%) were women. Procedures included 166 hips (50%) with ASR resurfacing and 167 hips (50%) with THA using the ASR XL system. One hundred twenty-nine hips (39%) had been operated on using a direct lateral approach and 204 using a posterior approach (61%). At both centers MRI scanners were adjusted to produce MARS. Scanning was performed with 1.5-T clinical scanners (GE Healthcare, Waukesha, WI, USA, and Siemens Healthcare, Erlangen, Germany). Sequences used in imaging were coronal, axial, and sagittal proton density or T1-weighted fast spin echo as well as coronal and axial short tau inversion recovery (STIR) sequences.

Fig. 1
figure 1

Flowchart of the study cohort is shown.

The EuroQOL five dimensions questionnaire (EQ-5D) [13, 32], the self-reported Harris hip score (HHS) [19, 26], UCLA activity score [15], and visual analog scale (VAS) pain score (0–10) were obtained for all 288 patients to assess subjective health-related quality of life, hip function, activity, and pain levels, respectively. Patient-reported outcome measures were obtained at a mean (± SD) of 3.2 ± 6.8 months after having MRI. The outcome scores were collected using paper questionnaires and then uploaded into a web-based data entry system. The abduction angle of the acetabular components was determined from digital plain radiographs of all patients using mdesk software (RSA Biomedical, Umeå, Sweden). The mean abduction angle of the acetabular component was 43° (range, 21°–63°).

Patient MR images were assessed for presence and location of ALTRs (anterior, posterior, or on both sides of the hip in axial images). ALTRs were classified as anterior or posterior based on location relative to the axis of the femoral neck in the axial MRI plane. ALTRs were also classified according to the Anderson ALTR grading system, which has the highest intra- and interobserver reliability of currently used systems [1, 28]. Per this classification, a mild ALTR (C1) is defined as a periprosthetic soft tissue mass with no hyperintense T2-weighted fluid signal or a fluid-filled periprosthetic cavity less than 5 cm maximal diameter. A moderate ALTR (C2) is a periprosthetic soft tissue mass/fluid-filled cavity greater than or equal to 5 cm in diameter or C1 lesion with either (1) muscle atrophy or edema in any muscle other than short external rotators; or (2) bone marrow edema hyperintense on STIR sequences. Severe ALTRs (C3) include fluid-filled cavities extending through deep fascia, tendon avulsion, intermediate T1-weighted soft tissue cortical or marrow signal or fracture. The classification system was originally devised to correspond with the decision and urgency to treat such that C1 meant no immediate intervention but followup was recommended; C2 implied that an elective revision of the hip should be considered; and C3 meant an urgent need for revision surgery [1]. In addition to the Anderson ALTR grading system, synovial thickness and maximal diameter of ALTRs were also determined [10, 22].

All patients with extracapsular ALTRs with a maximal diameter of more than 5 cm were identified, which included both moderate and severe ALTRs. Mild (C1) ALTRs were excluded from the analysis of ALTR location, because these were often intracapsular or very small. MRI assessment was performed by one of the authors (RM) with 2 years of experience and validated by a musculoskeletal radiologist (HGP) with more than 10 years of experience with MARS MRI. Discrepancies were resolved by consensus. MRI assessment was blinded to all clinical and surgical patient data.

Statistical analysis was performed using the Kruskal-Wallis test to compare patient-reported outcome measures of patients with anterior and posterior ALTRs with those of patients with no ALTRs on MRI. The Mann-Whitney U test was used to assess differences in ALTR synovial thickness and diameter between patients with procedures using ASR and those with ASR XL. After excluding 45 patients with bilateral implants, multivariate logistic regression analysis was used to determine which factors were associated with the presence of moderate and severe ALTRs. First, logistic regression models were generated to identify any predictive variables to be used in the multivariate analysis. The models were based on the independent variables of implant type (ASR or ASR XL); gender; age at the time of MRI; time from index surgery to MRI; HHS; EQ-5D score; UCLA activity score; VAS pain score; femoral head size; surgical approach (lateral or posterior); and abduction angle. Significant risk factors were used to generate multivariable logistic regression models with the dichotomous outcome variable of presence or absence of C2/C3 ALTR.

Interobserver reliability for the assessment of categorical variables (ALTR grade and location) was assessed using Cohen’s kappa (κ) and, for continuous variables (synovial thickness and maximal ALTR diameter), using Pearson’s r. Interobserver reliability for assessment of both ALTR grade (κ = 0.64) and location (κ = 0.67) was substantial, whereas the interobserver reliability for the assessment of synovial thickness and maximal diameter was high (r = 0.91 and r = 0.84, respectively). A chi square test was used to assess the association between surgical approach and location of ALTR. Spearman’s rank order correlation was used to assess the relationship between Anderson grade and synovial thickness. A p value < 0.05 was considered significant. Statistical analyses were performed using SPSS (Version 17.0; IBM Corporation, Armonk, NY, USA).

Results

One hundred thirty (39%) hips had no visible ALTRs and 124 hips (37%) had mild (C1) ALTRs. Extracapsular ALTRs classified as severe (C3; Fig. 2) or moderate (C2; Fig. 3) were identified in 79 hips (24%). Forty-one of the 79 hips had been operated on using the direct lateral approach (41 of 129 [32% of hips with the lateral approach]) and the remaining 38 using the posterior approach (38 of 204 [19% of hips with the posterior approach]). The location of ALTR corresponded with the surgical approach to the hip in the majority of cases (Fig. 4). Of patients with moderate-to-severe ALTR whose surgery was performed using a lateral approach, 83% (34 of 41) had an ALTR that was located anteriorly. Similarly, 71% (27 of 38) of patients with moderate-to-severe ALTR who were operated on using a posterior approach had a posterior ALTR. Ten percent (four of 41) of patients in the direct lateral and 21% (eight of 38) in the posterior approach groups had ALTRs located both anterior and posterior to the hip simultaneously (Fig. 4). Only 8% (six of 79) of patients in both the direct lateral and posterior approach groups had ALTRs located opposite to the direction of surgical approach to the hip.

Fig. 2A–C
figure 2

(A) A plain radiograph shows the right hip of a 70-year-old man 9 years after undergoing ASR XL THA using a lateral approach. (B) A coronal STIR MRI image of the same hip demonstrating a large extracapsular ALTR, classified C3, in the region of the greater trochanteric bursa. (C) An axial STIR MRI image of the hip shows the anterior location and dehiscence pattern.

Fig. 3A–C
figure 3

(A) Plain radiograph of the left hip of a 64-year-old woman 10 years after ASR hip resurfacing arthroplasty performed using a posterior approach is shown. (B) A coronal STIR MRI image of the same hip demonstrating a large extracapsular ALTR, classified C2, in the region of the greater trochanteric bursa. (C) An axial STIR MRI image of the hip shows the posterior location and dehiscence pattern.

Fig. 4
figure 4

The relationship between surgical approach and location of ALTR is demonstrated.

Interestingly, there were no clinically relevant differences in patient-reported outcome measure scores between patient groups with mild, moderate, or severe ALTRs and those with no ALTR findings on MRI (HHS, p = 0.14; EQ-5D, p = 0.09; UCLA, p = 0.93; VAS pain, p = 0.20 [Fig. 5]). Similarly, the HHS, UCLA scores, and EQ-5D scores did not differ between patients with anteriorly and posteriorly located ALTRs (Table 1). The median pain score was slightly higher (p = 0.05) for patients with a posterior ALTR, although likely not clinically relevant (VAS pain = 0.5 compared with VAS pain = 1.0).

Fig. 5
figure 5

Median and standard error of patient-reported outcome measure values for patients with ALTRs of increasing severity is shown.

Table 1 Median and range of PROM values for patients with moderate and severe ALTRs based on location and compared with patients with no ALTR findings on MRI

Patients who had undergone THA with ASR XL had larger ALTRs (median diameter, 47.6 mm; range, 14.0–109.7 mm) compared with patients with ASR implants (median diameter, 38.4 mm; range, 17.2–118.0 mm; p = 0.02). Similarly, patients with ASR XL also had thicker synovia (median thickness, 3.6 mm; range, 1.2–10.6 mm) compared with those of patients with ASR implants (2.6 mm; range, 1.2–10.7 mm; p < 0.001). In the multivariate analysis, the only variable that remained independently associated with an increased risk of moderate-to-severe ALTR development was the ASR XL implant (odds ratio, 2.8; 95% confidence interval, 1.4–5.5; p = 0.004). The acetabular component abduction angle, femoral component head size, or surgical approach was not associated with an increased risk of moderate-to severe ALTR. Finally, we found a significant correlation between ALTR Anderson grade and synovial thickness (Spearmans’s rho = 0.762, p < 0.001).

Discussion

There is scant literature on the relationship between the surgical approach used in arthroplasty and the location of ALTRs after hip arthroplasty using MoM implants. Furthermore, the role of subjective patient symptoms in the risk stratification process has not been defined. We showed that the location of ALTRs is related to the surgical approach used in arthroplasty. We also demonstrated that patient-reported outcomes did not differ in patients with or without ALTRs. Finally we showed that the severity of ALTRs was greater in patients with ASR XL THA.

The current study has some limitations. We were not able to include objective clinical function tests that assess ROM and abductor function because these were unavailable and not included in the study protocol. However, we believe that subjective patient satisfaction, pain, and concerns related to the hip are equally important and these were thoroughly assessed using multiple, validated patient-reported outcome measures included in the study protocol. The current study only included patients with the ASR hip system who had not been revised at the time of recruitment and it is possible that some patients with severe ALTRs at an early stage were not included. However, the aim of the study was to focus on the patients who had not yet been revised and were currently undergoing followup so as to better understand their risks and optimize followup strategies. Another limitation was the wide variation in the time between surgery and MRI, because no standardized protocol for when to perform MRI was used. The recently published MoM risk stratification guidelines have aimed to establish such a protocol [16], yet even with protocols, it has been acknowledged that the decision-making process for stratifying patients can be complex [3, 16]. Although we did assess the location and severity of ALTRs on MRI, we did not have intraoperative data to confirm the location and extent of tissue damage. On the other hand, MRI has high specificity in detecting pseudotumors, and both synovial thickness and ALTR size have been shown to correlate with intraoperative tissue damage and histologic aseptic lymphocyte dominated vasculitis-associated lesion (ALVAL) severity [10, 17, 22]. Finally, we were unable to include patient weight as a variable in the multivariate analysis because this information was not available for all patients in the study. On the other hand, it is noteworthy that weight is not considered a risk factor in the recent risk stratification guidelines for patients with MoM hip arthroplasty [16].

The overall prevalence of ALTRs in our study was 61%, which is similar to the level reported in previous studies assessing MRI findings in patients with ASR implants [4, 6]. We showed that the location of moderate and severe ALTRs with a diameter greater than 5 cm can be predicted in the majority of patients based on the previous surgical approach to the hip. We theorized that the location of ALTRs would be related to the initial violation of the joint capsule and surrounding soft tissues and hence the focus on the surgical approach. Fritz and coworkers [7] suggested that disruption of the pseudocapsule in progressive ALTRs usually occurs at the posterolateral attachment. This becomes intuitive in light of the current findings when we consider that most hip arthroplasties are performed from the posterior approach. Although mechanical compromise of the tissues at previous incision sites may provide a low resistance path for ALTR dehiscence, the biologic processes associated with the timing of tissue or wound healing may also play a role in ALTR progression. It is not customary to include information regarding the surgical approach used when ordering soft tissue imaging. Based on our findings, this information may be useful when screening hips with MoM implants. To our knowledge this is the first study to evaluate the relationship between ALTR location and surgical approach.

We demonstrated that patients with moderate or severe ALTRs did not have inferior patient- reported outcome measures in terms of health-related quality of life, activity level, hip function, or pain. Moreover, the location or severity of ALTRs did not affect patient-reported outcome measures. Our study results also add to the growing body of evidence that moderate and even severe ALTRs can be asymptomatic [4, 31].

The three commonly used systems for grading pseudotumors are those described by Anderson et al. [1], Matthies et al. [20], and Hauptfleisch et al. [1, 9, 20]. Of these classification systems, the Anderson grading system for ALTRs has been shown to have the highest inter- and intraobserver reliability [1, 28]. The grading system is based on the size and collateral damage associated with the ALTR. However, the Anderson ALTR grading system does not account for synovial thickness or solid components of the reaction, which have been shown to correlate with symptoms, histologic ALVAL scores, and even a higher likelihood of revision [9, 10, 22]. The other two commonly used classifications (Matthies et al. [20] and Hauptfleisch et al. [9]) do take into account the wall thickness and contents of the ALTR but are substantially inferior in both inter- and intraobserver reliability compared with the Anderson grading system for ALTR [20, 28].

ALTR findings on MRI do not necessarily correlate with subjective symptoms. Chang et al. [4] found no correlation between patient symptoms and severity of ALTR based on presence or size of ALTR. They did, however, find that bone marrow edema and abductor tendon tears were associated with patient pain. Another study has also shown that the prevalence of cystic pseudotumors does not differ between well-functioning and poorly functioning hip implants [8]. Several studies have also highlighted the existence of “silent” ALTRs. In a study assessing the prevalence of soft tissue pathology in hips having undergone THA with ASR XL, it was found that half the patients with the best possible Oxford hip scores had adverse reactions based on MRI assessment [31]. On the other hand, van der Weegan and colleagues showed that asymptomatic pseudotumors after MoM hip resurfacing showed little or no variation in size over 1-year followup [27]. Another study assessing longitudinal changes in ALTR findings after hip replacements with ASR in 154 hips showed that substantial pseudotumor progression occurred in only 8% of the hips and most of these had a normal baseline MRI [23]. The clinical importance and progression rate of various types of ALTR are still not fully understood. The findings from our study demonstrated that even patients with moderate and severe ALTRs generally had good-to-excellent hip function and subjective general health equal to normative values of patients with standard THA [2, 25].

Finally, we demonstrated that patients with the ASR XL implant had larger ALTRs with thicker synovia compared with patients with ASR implants. Patients who had undergone arthroplasty with ASR XL were nearly three times more likely to have moderate-to-severe ALTRs than patients with the ASR implant. This is supported by a previous study that showed a failure rate of 49% for patients with ASR XL THA compared with 25% for patients with ASR revision at 6 years [18]. Similarly, a study by Reito et al. [24] also showed that the 7-year survivorship was 51% for the ASR and only 38% for the ASR XL THA. These observed differences are likely to be attributable to reciprocating movement at the taper junction of the ASR XL THA leading to fretting corrosion described as mechanically assisted crevice corrosion [12]. Interestingly, head size was not found to be an independent predictor of moderate-to-severe ALTR in the multivariate analysis. To our knowledge this study is the largest study that has assessed both synovial thickness and ALTR Anderson grade in patients with the ASR hip system. It is also the first study to show a correlation between ALTR Anderson grade and synovial thickness.

The location of the ALTR may have several consequences because location may determine which of the surrounding tissues becomes damaged first. None of the patients included in our study had neurovascular structure involvement, which usually leads to clear symptoms in the form of swelling or neurologic deficits, subsequently prompting revision. However, it is concerning that the relatively large ALTRs did not worsen subjective outcomes in any of the multiple categories assessed in the current study. It is possible that many ALTRs do not cause symptoms until they result in a mass effect on surrounding neurovascular structures or until substantial abductor or bone damage occurs leading to instability or component loosening and associated pain or mechanical symptoms [4, 14]. It is noteworthy that some ALTRs may even initially present with superimposed periprosthetic infection [29].

Current risk stratification guidelines for patients with MoM hip arthroplasty are being updated as we learn more about the risk factors for development and progression of ALTRs. Awareness of the initial surgical approach may be useful when assessing both MR images and probably also in performing ultrasound screening of hips with MoM implants. Because dehiscence patterns generally appear to follow the surgical approach, the direct lateral approach may be associated with earlier abductor damage or detachment compared with the posterior approach and may warrant earlier intervention. There are differences in the risk of ALTR even within the recalled implant category and our study suggests that increased vigilance in the followup of patients who underwent THA with ASR XL is crucial because they are more likely to develop moderate-to-severe ALTRs. Based on our study results, imaging studies should still play a key role in the risk stratification process, and lack of patient symptoms does not preclude substantial ALTRs, because even an extensive range of patient-reported outcome measures cannot identify all patients at high risk for implant failure.