Introduction

Developmental dysplasia of the hip (DDH) is characterized by abnormal acetabular and femoral morphologies and is thought to be one of the leading causes of secondary osteoarthritis resulting in THA [3]. The osseous anatomy associated with hip dysplasia makes these reconstructions more difficult. The acetabulum often is shallow with superior or anterior bone loss. The femur often is excessively anteverted and valgus. The medullary canal can be small and abnormally shaped with a smaller mediolateral diameter than AP diameter [8, 12]. Because of these associated anatomic variants, some authors recommend the use of modular or specially designed components to accommodate the shape of the dysplastic femur, whereas other authors have published good long-term results with cementless tapered stems [4, 8].

Biant et al. [1] and Tamegai et al. [13] have both reported excellent results in patients with DDH who underwent THA with a modular femoral stem. The benefits of using the modular system include the ability to adjust femoral anteversion allowing for more optimal placement of the prosthesis as well as functional advantages of reliable fixation and the prevention of dislocations postoperatively [1, 13]. Unfortunately, these stems are not always readily available in the operating room, and it would be helpful if standard radiographic views could predict before surgery which patients would be likely to benefit from their use. Three-dimensional (3-D) CT scans allow for accurate assessments of anteversion but are time-consuming, costly, and require substantial radiation. To our knowledge, no parameters have been defined using plain radiographs that can help the surgeon anticipate the use of a modular stem during THA for DDH.

We therefore asked if we could reliably identify preoperative plain film radiographic parameters of the femur that predicted the use of modular stems during THAs for DDH. The general indication for using these stems was femoral anteversion of more than 25° measured at the time of surgery. In this retrospective study, we looked at the following: (1) the neck-shaft angle based on the AP radiograph (alpha); (2) the neck-shaft angle from the lateral radiograph (beta); and (3) the femoral anteversion angle as calculated from the prior two angles.

Materials and Methods

After we received an exemption from the institutional review board for secondary data analysis, we performed a retrospective search of our surgical database. We identified 630 primary THAs that were performed by the senior author (CLP) between February 1999 and January 2007 inclusive. From these, we identified 79 hips in 67 patients who had a primary diagnosis of DDH as determined by lateral center-edge angle < 20° and an upwardly sloping acetabular sourcil. Hips were excluded if they had a history of prior femoral-sided hip surgery (n = 2) or did not have digital preoperative films available (n = 19). This resulted in a series of 58 hips in 50 patients. To avoid potential bias related to bilateral hips and implant selection, where the surgeon might use a modular stem in the second hip because it was used in the first, the first hip that underwent THA was selected for the review resulting in 50 patients with 50 hips. We assessed operative reports to identify the type of implant used and the rationale for the implant. S-ROM modular femoral stems (DePuy Synthes, Warsaw, IN, USA) were placed in 21 of 50; the remaining 29 patients received a tapered nonmodular stem (Biomet Orthopaedics, Warsaw, IN, USA). This stem has a moderate metaphyseal flare that does not accommodate excessive anteversion (Fig. 1). In this single-surgeon experience, the rationale for femoral stem selection was principally based on femoral anteversion: if femoral anteversion was estimated to be greater than 25° by the senior surgeon during surgery, a modular stem was typically selected. There were no differences in patient age, height, weight, or sex between the modular stem (MS) group and tapered stem (TS) group (Table 1).

Fig. 1
figure 1

Tapered nonmodular stem is shown (Biomet Orthopaedics, Warsaw, IN, USA).

Table 1 Demographic comparison between stem types

To evaluate the radiographic characteristics that predicted the surgeon’s selection of MS versus TS, we performed a radiographic analysis of the 50 hips. Lateral center-edge angle [14] as well as Crowe’s classifications [3] and the Hartofilakidis classification [15] were used to confirm a diagnosis of dysplasia on preoperative films. The angles were obtained from a described radiographic method [9]. The MS group demonstrated smaller lateral center-edge angles (mean 9°; 95% confidence interval [CI], 3°–15° versus mean 19°, 95% CI, 15°–22°; p = 0.003). Modular stems were found to have a slightly higher classification using Crowe’s classification (median 1.5; interquartile range [IQR], 1–2 versus median 1; IQR, 1–1; p = 0.006). This was also the case for the Hartofilakidis classification where MS stems had a median classification of 2 (IQR, 2–2) compared with 1 (IQR, 1–2) in the TS group (p = 0.001).

Clinically, our radiographs have been standardized to ensure reproducibility. The crosstable lateral radiograph is obtained with the patient supine on the radiograph table. The contralateral hip and knee are flexed up out of the field of the hip being imaged. The extremity of interest is internally rotated 15° to expose the full anterolateral and posterolateral femoral head-neck junction. The radiograph tube is orientated parallel to the table at a 45° angle to the imaged limb and centered on the femoral head. The AP radiograph is obtained with the patient in the standing position with lower extremities oriented in 15° of internal rotation to maximize the femoral neck length. The distance from the imaging tube to the plate is approximately 120 cm in a perpendicular orientation. The image is centered between the superior border of the pubic symphysis and a line drawn connecting the anterosuperior iliac spines to visualize the entire bony pelvis and to minimize scatter and parallax effect. This standardization assures true anatomy and minimized a rotational misconstruction.

Radiographic measurements were performed using our digital PACs system (iSite v3.6; Philips Healthcare, Andover, MA, USA) by two experienced clinicians (JC, JAE) on preoperative films. The AP radiograph was used to obtain the alpha neck-shaft angle (Fig. 2) and the crosstable lateral radiograph was used to measure the beta neck-shaft angle (Fig. 3). The initial reader (JC) performed all reads from preoperative films, but may have visualized the stem in the PACS system. Similarly, a second, experienced clinician (JAE) then read the same radiographs twice (blinded to the first reads), 1 week apart, to allow for assessment of interrater and intrarater reliability and agreement. The concordance correlation coefficient (rc) and a Bland-Altman analysis were used for this analysis [2, 6]. The amount of agreement between readers for the rc was classified as minimal, < 0.2; poor, 0.2 to < 0.4; moderate, 0.4 to < 0.6; strong, 0.6 to ≤ 0.8; and almost perfect, > 0.8 [10]. There was strong agreement between the two clinicians when measuring the alpha neck-shaft angle and the beta neck-shaft angle (Table 2). Furthermore, the intrarater agreement also demonstrated almost perfect agreement between reads (Table 3). Thus, the measurements from the first reader were used in this analysis.

Fig. 2
figure 2

This figure demonstrates the supplementary neck-shaft angle of Ogata and Goldsand’s α angle.

Fig. 3
figure 3

This figure demonstrates the supplementary neck-shaft angle of Ogata and Goldsand’s β angle.

Table 2 Interrater agreement on alpha and beta neck-shaft angles
Table 3 Intrarater agreement on alpha and beta neck-shaft angles measured on a subsample (first 25 patient) 1 week apart from each other and blinded to prior results

Based on these measurements, we then calculated the patient femoral anteversion based on the formula presented by Ogata and Goldsand [9] (tanθ = tanβ/tanα). In their original publication, they described α (Fig. 2) and β (Fig. 3) as supplementary angles to what we calculated as neck-shaft angles, which we then corrected for in our equations. We used Microsoft Excel (Microsoft Inc, Redmond, WA, USA) to perform the calculations as follows:

  1. 1)

    Equation 1: 180°- AP neck-shaft angle = α

  2. 2)

    Equation 2: 180°- lateral neck-shaft angle = β

  3. 3)

    Equation 3: “θ = TAN (RADIANS [β])/TAN (RADIANS [α])”

  4. 4)

    Equation 4: “Anteversion = DEGREES(ATAN [θ])

Ordinal data were compared using a Wilcoxon Mann-Whitney test. Additionally, an independent samples t-test was used to compare radiographic measures between the groups. Spearman’s rank correlation (rs) was used to assess the relationship between the actual angle measured and the intraoperative stem decision. Correlation coefficients were assessed as previously mentioned. Furthermore, we dichotomized the beta neck-shaft angle at every possible cutoff point and computed the area under the receiver operating characteristic curve (ROC area) using the dichotomized variable as a binary diagnostic test for the outcome. This was repeated for every value of the variable. The cutoff point that resulted in the maximum ROC area was identified as the optimal cutoff point. To verify the stability of the optimal cutoff point, meaning that cutoff point would be selected as the optimal cutoff point in the future, we used an algorithm adapted from Royston and Sauerbrie [11]. The algorithm is as follows: (1) draw a bootstrap resample of size, the original sample size; (2) compute the ROC area for every possible cutpoint in the resample; (3) record which cutoff point optimized the ROC area; (4) repeat steps 1 through 3 1000 times; (5) summarize the frequency of the cutoff points that optimized the ROC area; and (6) select the cutpoint with the highest frequency percent as being the most stable cutpoint.

After the optimal cutpoint was selected, we then performed an internal bootstrap validation of the cutpoint’s test characteristics (sensitivity, specificity, and ROC area) using Harrell et al.’s [5] algorithm. The test characteristics for any prediction equation, or optimal cutoff point, are known to always be greater in the sample it was derived on. The internal bootstrap validation subtracts out this overfitting, or optimism, to provide test characteristic values and CIs that can be expected in future samples or future patients.

Results

The cutoff point for the AP neck-shaft angle that best predicted the use of a modular stem in patients undergoing THA for DDH was ≥ 142°. On evaluation of the preoperative radiographs, the MS group had a greater mean AP radiographic neck-shaft angle compared with the TS group (152°; 95% CI, 146°–157° versus 137°; 95% CI, 134°–141°; p < 0.001). There was a moderate agreement between the mean AP neck-shaft angle and the stem decision (rs = 0.54; 95% CI, 0.31–0.71; p < 0.001). The cutoff point was verified to be stable in a bootstrap simulation. In 1000 bootstrap resamples, which mimic future patient data sets, the cutpoint of ≥ 142° was found in 54% of the resamples. The next most frequent optimal cutoff point was 145°, which occurred in only 11% so that ≥ 142° was the clearcut most reliable cutoff. The decision rule of ≥ 142° to require a modular stem has validated sensitivity of 77% (95% CI, 54%–91%), specificity of 69% (95% CI, 49%–84%), and ROC area of 73% (95% CI, 61%–85%) (Table 4).

Table 4 Test characteristics

The cutoff point for the lateral neck-shaft angle that best predicted the use of a modular stem was ≤ 153°. The MS group had a smaller mean lateral neck-shaft angle (152°; 95% CI, 148°–157° versus 161°; 95% CI, 158°–164°; p = 0.003). There was a moderate negative correlation with the lateral neck-shaft angle and the intraoperative stem choice (rs = −0.41; 95% CI, −0.62 to −0.15; p = 0.003). In 1000 bootstrap resamples, which mimic future patient data sets, the cutoff point of ≤ 153° was found in 32% of the resamples. The next most frequent optimal cutoff point was ≤ 162°, which occurred in only 20% so that ≤ 153° was the clearcut most reliable cutoff point. The decision rule of ≤ 153° to require a modular stem has validated sensitivity of 51% (95% CI, 28%–72%), specificity of 80% (95% CI, 61%–91%), and ROC area of 65% (95% CI, 52%–78%) (Table 4).

The cutoff point for femoral anteversion that best predicted the use of a modular stem was ≥ 32°. Femoral anteversion, as calculated with our modification of Ogata’s original equation [9], revealed that the MS group had a higher femoral anteversion angle (mean 45°; 95% CI, 37°–54° versus 21°; IQR, 17°–25°; p < 0.001). There was a strong correlation between the estimated femoral anteversion and the intraoperative stem choice (rs = 0.67; 95% CI, 0.47–0.80; p < 0.001). This cutoff point was verified to be stable in a bootstrap simulation. In 1000 bootstrap resamples, which mimic future patient data sets, the cutoff point of ≥ 32° was found in 35% of the resamples. The next most frequent optimal cutoff point was 27°, which occurred in only 14% so that ≥ 32° was the clearcut most reliable cutoff point. The decision rule of ≥ 32° to require a modular stem has validated sensitivity of 78% (95% CI, 55%–91%), specificity of 83% (95% CI, 65%–93%), and ROC area of 80% (95% CI, 70%–91%) (Table 4).

These cutoff predictions (Fig. 3) confirm the accuracy of the alpha neck-shaft angle and the anteversion angle in identifying patients who received modular stems. Exceptions to the predictive cutoff values, however, were seen (Table 5). One patient who received a tapered stem, yet had met all three cutoff definitions for a modular stem, had angles as follows: lateral neck-shaft angle = 151°, AP neck-shaft angle = 155°, and an anteversion angle of 50°. Two patients who received a modular stem, yet had met none of the cutoff definitions, had angles as follows: lateral neck-shaft angle = 160°, AP neck-shaft angle = 140°, and anteversion = 23° and lateral neck-shaft angle = 162°, AP neck-shaft angle = 131°, and anteversion = 16°.

Table 5 Decisions based on the three cutpoints

Discussion

The dysplastic femur poses a challenge for adult reconstructive surgeons during THA attributable primarily to altered morphology, particularly on the femoral side. Femoral morphology includes an extensively anteverted and valgus femur as well as a small and misshapen medullary canal. Excellent results have been reported with the use of modular stems during THA in these challenging dysplastic hips [1, 13]; however, these stems are not always available in the operating room. The availability and ease of obtaining preoperative radiographs makes them an optimal choice for determining the need for modular stems in preoperative planning. We found that by using neck-shaft angles obtained from preoperative crosstable lateral and AP radiographs and calculating anteversion, from previously published formulas, allowed us to predict the use of modular stems [9].

There are several limitations to this study. First, it is a retrospective chart and radiographic review of a single-surgeon series of patients with hip dysplasia who underwent THA with either a tapered nonmodular stem or a modular stem that relies on proximal metaphyseal ingrowth. Thus, these results might not be applicable to all practices. For example, Faldini et al. [4] reported on a series of 46 patients who underwent primary THA secondary to DDH using a Wagner uncemented stem aimed at achieving 10° to 20° of anteversion. However, they did not report on how many patients achieved that alignment. Regardless, they note excellent osteointegration and remarked that all patients were satisfied with both function and pain relief. One caveat to the use of a Wagner cone-like stem is that anteversion is associated with a loss of femoral offset, which occasionally can be problematic. The Wagner stem does not account for this, but the SROM modular stem has the ability to address both anteversion and offset. Second, there is potential for inaccurate and potentially unreproducible measures obtained intraoperatively. The choice of femoral component was determined by the judgment of the senior author (CLP) intraoperatively and femoral anteversion was not quantified in operative reports. Although the intraoperative decision for the use of a modular femoral stem was based on femoral anteversion being approximately 25°, we had inadequate records to verify this. However, even if we routinely measured and recorded this, we still would not have an optimal cutoff point to reference, hence the need for this study. Furthermore, because the cohort was assembled over a long time period, the possibilities of changes in surgical technique are not controlled. However, with regard to the technique to address increased femoral anteversion, the surgeons’ intraoperative choice of using a modular stem when femoral anteversion exceeded approximately 25° was consistent throughout the study period. Although the statistical methodology of the radiographic analysis was independent of this and the question of the study was not dependent on intraoperative quantification of femoral anteversion, the stem decision was. Third, the two groups were not similar in grading or classification of DDH. However, this is in fact somewhat reassuring, because the level of femoral dysplasia agrees with the level of acetabular dysplasia. Finally, the cutoff points cannot be claimed to be the absolute best. A surgeon might consider being close to the cutoff points (Fig. 4), the peaks, or near the cutoff points, the plateaus, as a decision rule, rather than the exact values reported after observing the patient both clinically and within the operating room.

Fig. 4
figure 4

The line graphs demonstrate the optimal cutoff points of each angle for stem decision with + representing modular stem selection and – representing tapered stem selection.

We developed decision rules based on preoperative radiographs that should predict the intraoperative choice of a modular stem when that choice is made based on an estimated intraoperative anteversion of > 25°; the best cutoff values on preoperative radiographs were AP neck-shaft angle of ≥ 142°, lateral neck-shaft angle of ≤ 153°, and femoral anteversion of ≥ 32°. To the best of our knowledge, there are no previous studies investigating whether plain radiographs can predict the need for a modular stem in dysplastic femora. Our findings do seem consistent with observations of Noble et al., [8] who investigated the 3-D shape of the dysplastic femur and found that the primary deformity of the dysplastic femur is rotational with an increase in femoral anteversion of 5° to 16°. These authors concluded with a recommendation for the use of modular femoral components in dysplastic hips [8].

An optimal cutoff point is that point or degree wherein both the specificity and sensitivity of a diagnostic test are maximized [7]. In other words, it is the point in which a test can most accurately determine the normal from the abnormal. In our study, this point was defined as the degree at which a modular stem was used. We found that the most accurate cutoff point for determining the use of a modular stem was a femoral anteversion angle of ≥ 32° meaning that patients with an estimated femoral anteversion angle, as calculated from preoperative radiographs, of ≥ 32° would suggest that a modular stem be used. A request to have the stem available in the operating room at the time of surgery would be recommended. Ultimately, however, the need for the modular stem should be determined in the operating room by the surgeon.

In conclusion, we identified cutoff points for the use of modular femoral stems with standard AP and lateral preoperative plain film radiographs without the need for CT. The best cutoff value for the AP neck-shaft angle was ≥ 142°. For the lateral neck-shaft angle, the best cutoff value was ≤ 153° and for femoral anteversion, it was ≥ 32°. When patients lie close to these cutoffs, intraoperative decisions by the surgeon are still needed to determine the best implant choice for the patient. Nevertheless, these values allow for improved preoperative planning to permit the availability of these stems when the intraoperative anteversion is > 25°. Future studies should evaluate the use of these cutoff points in a prospective fashion to better assess the accuracy of the measurements and correlation with intraoperative decision-making.