Introduction

Although modularity of THAs has been around for decades, reports of modular head-neck junction corrosion and metal release continue to rise. Corrosion products can trigger adverse local tissue reactions (ALTRs) and subsequent revision surgery. Multiple case reports have implicated corrosion of modular head-neck junctions as the cause of hip pain resulting in revision surgery [2, 26, 28, 29, 33, 37, 38]. So far the largest clinical series have suggested that between 1.8% and 2.3% of patients undergoing THA with metal-on-polyethylene bearings will undergo revision surgery per year because of modular head-neck junction corrosion [9, 10]. This number may be underreported because it can masquerade as infection and instability. Even for reports of metal-on-metal bearing failure, the head-neck modular junction has been associated with high metal ion release [13, 22, 24, 36].

Factors previously identified as increasing corrosion and fretting of modular head-neck junctions include design characteristics such as larger femoral head diameter [5, 12], longer femoral neck offset [4, 11, 16, 25], larger angular mismatch between the stem taper and head taper [2, 6], and higher flexural rigidity [17]. Surgical assembly technique such as the force and angle at which the surgeon impacts the femoral head onto the stem taper [32] and the presence of fluid or debris on the taper surfaces during surgical assembly [16, 21] can negatively affect the taper’s initial stability. Material alloy of the head and stem taper also plays a role because different corrosion and fretting processes can occur depending on the material couple [18, 19].

One aspect that has received little attention is the surface topography of the head and stem taper surfaces. The taper surfaces have parallel machining lines, which create a rough surface meant to provide for strong interlocking between the femoral head and stem during assembly in the operating room. We previously characterized the stem taper surface topography as a wave defined by height and spacing parameters (Fig. 1A) and found that the surface topography of stem tapers was highly variable depending on the manufacturer and material alloy used for the femoral stem [27]. The most frequently used stem/head taper metal alloy couples are either a CoCrMo or a Ti6Al4 V stem paired with a CoCrMo head (CoCr/CoCr and CoCr/Ti, respectively). So far, it is unclear how taper surface topography influences the degree of damage on the taper surfaces and if the influence is mitigated or enhanced by design factors such as flexural rigidity and taper angle mismatch, previously identified as important for head-neck taper corrosion.

Fig. 1A–B
figure 1

(A) Schematic showing the stem taper surface topography consisting of parallel machining marks. The machining marks are characterized as a wave that can be represented by height and spacing parameters. (B) Schematic of head and stem tapers showing global dimensions that were measured with a digital caliper. Head and stem taper angles were measured with a coordinate measuring machine.

Therefore, we analyzed taper surface topography characteristics, design features, and damage scores of retrieved THAs with either CoCr/CoCr or CoCr/Ti modular junctions. Specifically, we investigated the following three research questions: (1) What are the differences in damage score and surface topography between CoCr/CoCr and CoCr/Ti modular junctions? (2) How are initial surface topography, flexural rigidity, taper angle mismatch, and time in situ related to visual taper damage scores for CoCr/CoCr couples? (3) How are initial surface topography, flexural rigidity, taper angle mismatch, and time in situ related to visual taper damage scores for CoCr/Ti couples?

Materials and Methods

We performed a retrospective analysis of retrievals in our institutional review board-approved implant registry. All implants were retrieved at revision surgery from 2004 to 2014 and implanted between 1995 and 2012. The registry contained 269 paired head and stem tapers of either CoCr/CoCr (140) or CoCr/Ti (129) material couples (Table 1). The retrievals represented 11 manufacturers with the majority of the implants from Zimmer (Zimmer Inc, now Zimmer-Biomet, Warsaw, IN, USA). The bearing surfaces of the retrievals were metal-on-polyethylene (N = 226), metal-on-metal (N = 22), hemiarthroplasty (N = 19), and two unknown. None were metal-on-metal failures or from recalled implant designs. All retrievals had CoCrMo heads.

Table 1 Retrieval demographics (n = 70)

The head and stem taper surfaces were scored using a stereo microscope (SMZ-U; Nikon, Melville, NY, USA) based on a validated visual damage scoring system (modified Goldberg score) [17, 35], where damage is scored from least to most severe as none (1), mild (2), moderate (3), or severe (4). The score included damage resulting from fretting and corrosion combined [9]. Two observers scored the surfaces (RMU, DJH). In the case of disagreement in scores between the two observers, the observers discussed and came to a consensus on the score. The reliability of this technique compared with volumetric measurements of material loss has been studied by Hothi et al. [20], who found an intraobserver and interobserver agreement of greater than 90%. Head and stem taper scores were correlated with volumetric wear measurements, and the agreement was especially good for lower visual damage scores (1–3).

A subset of 70 THAs (37 CoCr/CoCr, 33 CoCr/Ti) was randomly chosen for the evaluation of the initial as-manufactured taper surface topography (Table 2). Components were chosen so there was a comparable number of head and stem damage scores for each material couple; however, components with severely damaged stem tapers (score 4) were excluded, because evaluation of the initial surface topography could not be ensured. For stem tapers, surface topography was defined by the height and spacing of the circumferential machining marks (Fig. 1A). A white light interferometer (NewView 6000; Zygo, Middlefield, CT, USA) was used to measure the stem taper surface topography. Imaging was conducted at ×400 over an evaluation length of 700 μm. At this magnification, damaged machining mark peaks could easily be identified and avoided to ensure that only the initial as-manufactured surface topography was evaluated. Six images were collected: three around the circumference of the proximal stem taper and three around the circumference of the distal stem taper. Linear profiles were taken perpendicular to the machining marks through each image. Custom-written Matlab software (The Mathworks Inc, Natick, MA, USA) was used to identify the height and spacing between machining marks as previously described [27]. Preliminary investigations revealed that the head taper surfaces did not have consistent machining mark topography; therefore, average surface roughness (Ra) was used to define the head taper surface topography [27]. For this study, femoral heads could not be sectioned, which made surface evaluation by white light interferometry impossible. Therefore, the roughness of the head taper surfaces was measured with a tactile profilometer (Surftest SJ201P; Mitutoyo, Aurora, IL, USA). Great care was taken to avoid areas of visible damage.

Table 2 Frequency of head and stem damage scores for the subgroups of CoCr/CoCr and CoCr/Ti couples selected for the evaluation of surface topography

In addition to the evaluation of surface topography, design factors were measured including taper angle mismatch and head and stem taper width and length (Fig. 1B). A coordinate measuring machine (Smartscope; OGP, Rochester, NY, USA) in tactile mode with a 2-mm diameter ruby ball was used to determine the head and stem taper angles. Fifteen points were collected in areas optically free from damage in three planes (proximal, center, and distal) perpendicular to the axis of the head and stem tapers. The exact locations of the points were chosen to avoid areas with visible damage. A perfect cone was fitted to the points to compute the taper angle. Head and stem taper angles were defined as the angle of the cone; taper angle mismatch was defined as the head taper angle minus the stem taper angle. Flexural rigidity was calculated as the elastic modulus of the stem taper times the bending moment of inertia about the centroid of the stem taper [31] using the stem taper width and length. These taper dimensions were measured with a digital caliper (Fig. 1B).

Both semiquantitative and quantitative measures were used to address the three research questions. Damage scores for the stem and head taper surfaces were semiquantitative. The height and spacing of stem taper surface topographies, average roughness of head tapers, flexural rigidity of stem tapers, and angular mismatch between the stem and head tapers were quantitative. Another quantitative variable, time in situ, was also reported because it could be a confounding factor that influenced damage score.

To answer research question 1, a chi-square test was used to compare the frequency of damage scores within different material couples for the entire retrieval group. With a sample size of 269 retrievals, we achieved a power of 0.95 to detect an effect size (W) of 0.25 with a significance level of 0.05. For the analyzed subgroups of each material couple damage score, machining mark height and spacing, head taper roughness, flexural rigidity, and time in situ were compared using the Wilcoxon rank-sum test. Differences in taper angle mismatch were tested with a two-tailed t-test. The significance level for comparisons was set as p = 0.05. For comparing the two groups, our random sample of 70 implants could detect an effect size (difference of means divided by the SD) of 0.7 with a power level of 0.8 at a confidence level of 95%. To answer research questions 2 and 3, for each material couple group, the effect of the six quantitative variables on head and stem taper damage scores was evaluated using a multiple regression model (SPSS, IBM Corp, Armonk, NY, USA). Only significant variables (p < 0.05) remained in the model. Normality of the data was confirmed by qq-plots. Our random sample of 70 implants could detect slopes of 0.01 per month for time in situ, 0.14 µm−1 for stem machining mark height, 0.01 µm−1 stem machining mark spacing, and 0.9 µm−1 for head roughness with a power level of 0.8 at a confidence level of 95%.

Results

With the numbers available, there were no differences in damage scores, surface topography, flexural rigidity, or taper angle mismatch between CoCr/CoCr and CoCr/Ti modular junctions. For all couples in our implant registry, the mean damage scores for the stem and head tapers were 1.8 ± 0.9 SD and 1.7 ± 1 SD for CoCr/CoCr couples and 1.8 ± 0.8 SD and 1.7 ± 1 SD for CoCr/Ti couples, respectively. No damage (score 1) was the most common score for both CoCr/CoCr and CoCr/Ti couples for stem (73% and 60%; Fig. 2) and head tapers (74% and 63%; Fig. 3). CoCr/CoCr couples were more likely (p = 0.02) to have severely damaged (score 4) stem tapers (6%) than CoCr/Ti couples (1%) and less likely (p = 0.006) to have mildly damaged (score 2) stem tapers (Table 3). The percentage of severely damaged (score 4) head tapers was not different at 9% and 8% for CoCr/CoCr and CoCr/Ti couples, respectively (Table 4). However, the CoCr/Ti coupled head tapers exhibited a higher number of moderately damaged (score 3) tapers than CoCr/CoCr couples (17% versus 7%, respectively; p = 0.003).

Fig. 2
figure 2

Distribution of stem taper damage scores shown for CoCr/CoCr and CoCr/Ti couples.

Fig. 3
figure 3

Distribution of head taper damage scores shown for CoCr/CoCr and CoCr/Ti couples.

Table 3 Frequency of stem taper damage scores for CoCr/CoCr and CoCr/Ti couples
Table 4 Frequency of head taper damage scores for CoCr/CoCr and CoCr/Ti couples

The median machining mark height were spacing on the stem tapers for the subgroup of 37 CoCr/CoCr couples was 11 (range, 1.6–14) µm and 200 (range, 24–229) µm, respectively (Table 5). The median machining mark height and spacing on the stem tapers for the subgroup of 33 CoCr/Ti couples were 11 (range, 1.9–13) µm and 198 (range, 30–234) µm, respectively (Table 5). There were no differences between material couples for either stem taper machining mark height (Fig. 4) or spacing (Fig. 5). The topography of the head tapers did not follow a repeating pattern of machining marks in 24% (CoCr/CoCr) and 22% (CoCr/Ti) of cases. Therefore, topography was quantified by means of Ra instead of machining mark characteristics. The median Ra was 0.50 (range, 0.14–2.2) µm for CoCr/CoCr couples and 0.65 (range, 0.24–1.8) µm for CoCr/Ti couples. The median flexural rigidity was 365 (range, 230–581) Nm2 for CoCr/CoCr couples and 177 (range, 117–265) Nm2 for CoCr/Ti couples (Table 5). The mean taper angle mismatch between the head and stem taper was 0.05° (SD 0.1; 95% confidence interval [CI], 0.1–0.8) for CoCr/CoCr and 0.03° (SD 0.1; 95% CI, −0.2–0.8) for CoCr/Ti couples. There was no significant difference in taper angle mismatch between material couples (p = 0.6).

Table 5 Time in situ, damage scores, and design factors for CoCr/CoCr and CoCr/Ti couples*
Fig. 4
figure 4

Comparison showing stem taper machining mark height for CoCr/CoCr and CoCr/Ti couples. Circles mark mild outliers (between 1.5 × q to 3 × q away from the nearest quartile; q = interquartile range).

Fig. 5
figure 5

Comparison of stem taper machining mark spacing for CoCr/CoCr and CoCr/Ti couples is shown. Circles mark mild outliers and asterisks mark extreme outliers (> 3 × q away from the nearest outlier).

For CoCr/CoCr stem tapers, machining mark height and time in situ were associated with higher damage scores, but for CoCr/CoCr head tapers, only stem taper machining mark height was associated with higher damage scores. For stem tapers, the multiple regression had an improved outcome by replacing time in situ with an interaction term between machining mark height and time in situ. The resulting relationship between the two factors showed that a higher machining mark height slows damage score increase over time, whereas a shallower stem taper topography results in a steep initial increase in damage score followed by only a marginal increase over time (Fig. 6). At approximately 10 years in situ, the damage score became independent of the topography. For the head tapers, stem taper machining mark height was associated with higher damage scores (p = 0.001) (Fig. 7), but time in situ was not (p = 0.441). The difference was especially pronounced between mild and moderate damage, where higher stem taper machining marks resulted in a lower head taper damage score.

Fig. 6
figure 6

Regression model shows stem taper damage scores for CoCr/CoCr couples as a function of stem taper machining mark height and time in situ.

Fig. 7
figure 7

The relationship between head taper damage score and stem taper machining mark height for CoCr/CoCr couples is shown. The circle marks a mild outlier.

For CoCr/Ti couples, the only factors associated with stem taper damage were head taper roughness and the stem taper machining mark height, whereas only head taper roughness was associated with head taper damage. There appeared to be a pronounced difference in damage score for flexural rigidity (Fig. 8) and head taper roughness (Fig. 9) between mild and moderate damage. However, the multiple regression model that controlled for all factors revealed only the head taper roughness (p = 0.001) and stem taper machining mark height (p = 0.007) were associated with stem taper damage score. Increasing stem taper machining mark height correlated with a decrease in stem taper damage score and increasing head taper roughness correlated with an increase in stem taper damage score. Head taper roughness was also associated with higher head taper damage scores (p = 0.003) by multiple regression.

Fig. 8
figure 8

The relationship between stem taper damage score and flexural rigidity for CoCr/Ti couples is shown. The circle marks a mild outlier and the asterisk marks an extreme outlier.

Fig. 9
figure 9

The relationship between stem taper damage score and head taper roughness for CoCr/Ti couples is shown.

Discussion

Success of THAs can be limited by ALTRs triggered by corrosion products generated in the modular taper junction. Surface damage caused by corrosion processes is a multifactorial problem, which includes several design, material factors, and patient and surgical factors. We sought to determine whether surface topography applied to the stem and head taper surfaces plays a role in the damage process. Our results suggest that especially the machining mark characteristics on the stem taper contribute to the visual damage outcome of the head and stem tapers. Stem taper topography is most impactful in CoCr/CoCr couples.

This study has several limitations including (1) limitations associated with the repository from which the retrievals were drawn; (2) limitations of the retrieval sample (n = 70) reported; and (3) methodological limitations. The larger repository from which the retrievals were drawn has a large representation from one manufacturer, Zimmer (Warsaw, IN, USA), as a result of the use of those particular implants at Rush. Nevertheless, 10 manufacturers were represented in the 70 retrievals evaluated in this study. The retrievals in the repository were all obtained at the time of revision surgery and were not removed as a result of taper corrosion, but for various other reasons (Table 1). Therefore, we are investigating taper topography simply as an indicator of damage accumulation. There are several limitations regarding the subgroup of 70 retrievals, which include the obtained statistical power, the exclusion of severely damaged stem tapers, the low number of retrievals with long in situ times, and the fact that patient weight and implant alignment were not available. With 70 retrievals, this study had a power of 80% to detect a true difference. Further studies need to be conducted with a larger sample size to achieve higher power. We did not evaluate stem tapers categorized as “severely” damaged (score 4) as a result of the low numbers present in the repository. Stem tapers of score 4 only constituted 6% of the CoCrMo stem tapers and 1% of the Ti6Al4 V stem tapers in the entire retrieval repository. Additionally, it was not possible to accurately determine the initial surface topography for those components as a result of the excessive amount of corrosion damage. Knowledge of the initial stem taper topography was required for the inclusion as an input variable to the regression model. Only three retrievals had in situ times greater than 10 years. Retrievals with longer in situ times should be added to the model as they become available to determine definitively if specific surface topography characteristics result in implants that last longer in vivo. We also recognize that patient weight and implant alignment are additional contributors to taper damage, but those clinical data were not available for this study. Methodological limitations include the use of the semiquantitative damage grading (modified Goldberg score) and the method used to measure stem and head taper angles. A previous study showed that the visual-based Goldberg scoring system was positively correlated to volume loss [20]. For components with severe damage (score 4), there was a broad range of material loss. Therefore, future studies should include both stem tapers of damage score 4 and quantitative analysis of material removed resulting from corrosion and fretting to determine the relationship between the most severely damaged stem tapers and surface topography. The evaluation of stem and head taper angles was based on only 15 points taken in obviously undamaged areas. Other studies have used full metrology data sets to fit the initial taper geometry and to determine the taper angle [7, 8, 23]. However, the obtained results fit well within the range of these studies.

It was our goal to determine differences in damage score and surface topography for the modular junction material couples that were most common in our retrieval repository: CoCr/CoCr and CoCr/Ti. Interestingly, the average damage scores of stem and head tapers were very similar between the two material couples. The main difference was the higher number of severely damaged stem tapers for CoCr/CoCr couples. The reason for more distinct damage is the higher susceptibility of CoCrMo alloy to corrosion under specific circumstances. Earlier studies have shown that different modes of corrosion can occur such as grain and phase boundary corrosion, etching, fretting corrosion, and pitting [14, 15, 19]. Such mechanisms are known to cause severe damage and elevated material loss. Under damage scores 1 to 3, damage is more mechanically driven (cyclic creep, plastic deformation, mechanically dominated fretting). Interestingly, the occurrence of severe damage to head tapers appears to be independent of the counter alloy. With regard to surface topography, there appears to be no difference between CoCr and Ti6Al4V tapers. The machining mark height and spacing are almost identical, indicating that similar machining parameters have been applied. Other studies have reported values for head and stem taper topographies that fit well within our results [18, 23, 30, 34, 39, 40].

Stem taper machining mark height was associated with greater stem and head taper damage scores after controlling for time in situ, flexural rigidity, taper angle mismatch, head taper roughness, and stem taper machining mark spacing for CoCr/CoCr modular junction couples. Interestingly, we were able to show that a gradual increase in damage score over time occurred on stem tapers with higher machining marks, whereas a smoother topography resulted in elevated damage independent of time. Also, the stem taper topography appears to directly impact damage on the head taper side. One study presented results in disagreement with our data [1]. However, they did not consider machining mark height as a continuous variable but rather distinguished between a smooth and rough category. Another study found that rougher and shorter tapers had more taper wear in a large-head metal-on metal system [3]. Again, surface topography was characterized by average roughness, not machining mark height. Our own work based on multiscale finite element analysis supports our results that surface topography plays a meaningful role in damage evolution of modular junction [27]. The surface topography drives the actual and real contact area, the degree of plastic strain, and residual stresses. Interestingly, we did not find that flexural rigidity was associated with stem taper damage score within different material couples, although a previous study found that necks with higher flexural rigidity exhibited lower damage scores [17]. However, this study did not use a multiple regression model that takes other factors into account.

Although there was no difference in surface topography between the two material couples, unlike CoCr stem tapers, Ti6Al4V stem tapers exhibited a damage evolution that was independent of implantation time. In the multiple regression model, damage score did correlate with stem taper machining mark height and head taper roughness. Both factors drive the contact area and contact mechanics between the two surfaces. However, it needs to be stated that the initial head taper roughness could not be measured with the same certainty as the stem taper topography because a tactile profilometer had to be used instead of the interferometer. As a result of the lack of optical feedback of measuring, it was more challenging to exclude damaged areas. Thus, it is possible that the head taper roughness, which increases with damage score, is partly a result of head taper damage rather than a contributing factor. The reason for this manifesting noticeably in the CoCr/Ti couple may be related to the occurrence of material transfer from the softer Ti6Al4V alloy to the CoCrMo alloy. Such material transfer is the result of microjoints (cold welding) that may already have formed during the initial assembly or resulted from cyclic load. Potentially, the actual material transfer did not occur until separation of the head and stem. Material transfer would be detected by visual inspection as damage, but can only be distinguished from corrosion damage by scanning electron microscopy (Fig. 10), which may explain the higher proportion of moderately damaged head tapers for CoCr/Ti (Table 3). However, the stem taper topography would still play an important role because it is directly related to the local contact stresses necessary for the formation of microjoints and subsequent material transfer. Currently, it is unclear whether material transfer is harmful by increasing two-body wear or useful by preventing micromotion.

Fig. 10
figure 10

Scanning electron micrograph of a CoCr/Ti head taper surface showing an example of titanium transfer to CoCrMo. The image was taken in backscatter mode where dark areas represent titanium transfer and brighter areas represent the underlying CoCrMo alloy.

In conclusion, we were able to show that stem taper machining mark height plays a crucial role in modular junction performance, especially for CoCr/CoCr couples, where a higher stem taper machining mark height results in lower damage score over implantation time. Surface topography appears to also impact damage modes of CoCr/Ti couples; however, this interface deserves further investigation. It is of great importance to determine the acting damage modes (corrosion, wear, plastic deformation) and how they relate to damage score and material loss to better understand the role of surface topography. It needs to be stated that other design factors (such as engagement length), patient factors (eg, patient weight), and surgeon factors (eg, implant alignment) may play an important role, although they could not be considered in this study. Nonetheless, based on the results of this study, we believe that the accepted dimensions and tolerances of stem taper surface topographies need further attention to ensure that damage is minimized within each material couple. We expect that surface topography characteristics can be adjusted during machining to reduce the degree of damage, delay the onset of corrosion, and to reduce the release of corrosion products.