Introduction

The terms “femoral anteversion” and “femoral torsion” have often been used interchangeably in the orthopaedic literature, yet they represent distinct anatomical entities. Anteversion refers to anterior tilt of the femoral neck, whereas torsion describes rotation of the femoral shaft. Before the widespread use of CT, the location of rotation was ignored. Early authors stated that neck anteversion and shaft torsion could not be distinguished roentgenologically, deeming such discrimination to be of “no practical importance” [2]. With CT, however, more accurate three-dimensional evaluation became possible. Currently, despite radiologic advances for characterizing axial femoral morphology, the terms “anteversion” and “torsion” are still used interchangeably in the primary literature [4, 6, 16, 17]. Treatment of femoral neck deformity may be different than diaphyseal deformity, and clinicians should use terms that differentiate these morphologies.

Assessment of femoral rotation by CT is now routinely measured by several methods. However, the most widely used radiographic technique (in which only two CT-derived axes are made, one through the femoral neck and one at the distal femoral condyles) may neither accurately quantify proximal femoral anatomy nor identify the center of rotation. A second technique, popularized by Kim, adds an intertrochanteric axis, and thus provides three axes for measurement. A final approach uses three-dimensional volumetric reconstruction of the proximal femur.

We therefore sought to identify what CT methodology (the traditional method, Kim method, or volumetric method) (1) most accurately quantifies transverse-plane femoral deformities; (2) localizes those deformities; and (3) is most reproducible across different observers.

Materials and Methods

We designed a sawbones study in which femoral rotational abnormalities could be readily simulated. Medium-sized, left, high-resolution femurs of standardized anteversion (Pacific Research Laboratories, Vashon, WA, USA) were used. Simple transverse osteotomies were performed with an oscillating saw in either the subtrochanteric region (ST), 4 cm distal to the inferior aspect of the lesser trochanter, or the intertrochanteric region (IT), 2 cm proximal to the superior aspect of the lesser trochanter. These levels were chosen because of the uniform thickness of the sawbones “cortex” (ST level) to simulate diaphyseal rotation, and IT osteotomies are generally performed within 2 cm of the lesser trochanter and an osteotomy further cephalad would have involved the femoral neck.

Femora were rotated in the diaphyseal or intertrochanteric region around their geometric centroid as defined by Murphy et al. [10]. The distal fragment of all femora was internally rotated relative to the proximal fragment. Each sample was marked circumferentially in 1° increments and then rotated a random value between 0° and 45° according to a sequential randomization schedule developed by an independent statistician (AT). The femora were secured with a radiolucent adhesive (Fig. 1). Rotational values were selected to mimic a range of values of femoral deformity that are encountered clinically [14]. The full lengths of all femora were then subjected to axial CT. All scans were performed on a GE Discovery 64-detector scanner (General Electric, Fairfield, CT, USA) with slice thickness 0.6 mm, 120 KvP, 312 mA, and 500 ms. Any axial images on which the osteotomy was visible were removed.

Fig. 1A–B
figure 1

(A) Pictured is a high-definition sawbones femur with an intertrochanteric osteotomy. (B) Pictured is a femur with a subtrochanteric osteotomy after rotation and security with radiolucent adhesive.

Measurements of femoral rotation were performed by each of three methods by two fellowship-trained musculoskeletal radiologists (DSS, CES) blinded to the study design. The “traditional” method described by Weiner et al. [18] incorporated the femoral neck axis from a single axial image and a single posterior condylar axis. The posterior condylar axis provides the most accurate and reproducible distal reference point [10]. The method of Kim et al. [7] involved these axes and a third “intertrochanteric” best-fit line in which the greater and lesser trochanters were bisected as determined from two axial images, allowing qualitative localization of rotation to either the supra- or infratrochanteric region. The third method incorporated three-dimensional (3-D) volumetric reconstruction of the proximal femur for a true cervical axis (Philips iSite Advanced Visualization Suite, Version 7.4.6.500, Amsterdam, The Netherlands) while the posterior condylar axis was identically obtained (Fig. 2). A repeat set of measurements of all femora was separated by an interval of 6 months. The observers were also asked to identify the location of rotation (IT versus ST region) in each sample.

Fig. 2A–C
figure 2

(A) An example of the traditional method is shown, in which femoral rotation is assessed by a single cervical axis and the posterior condylar axis. (B) An example of the Kim method is shown, whereby single cervical and posterior condylar axes are drawn. An intertrochanteric line (dashed) bisecting the greater and lesser trochanters allows evaluation of torsional contribution from the neck versus the diaphysis. The intertrochanteric line is drawn by finding the midpoint of the greater trochanter on a single axial image and the midpoint of the lesser trochanter on a single axial image and connecting those points. In this femur, the torsion is emanating from above the lesser trochanter. (C) A representative volumetric 3-D reconstruction of transverse plane femoral morphology from a true cervical axis is shown. This axis is generated from radiological software as a “true” composite neck axis.

An a priori sample size estimation analysis was performed under the following assumptions: a paired t-test design with a correlation between the two sets of observations of 0.94, power set to 0.80, and alpha set at 0.05 for a two-sided test. The sample size estimation investigated the number of pairs of observations that would be necessary to detect a significant effect given: (1) a difference in the rotational degree of either 5° or 10°; and (2) assuming a SD of 5°, all consistent with detectable differences in previous CT-based cadaveric femoral rotation studies [10]. A minimum of four samples from each osteotomy type (ST or IT) was statistically necessary, and we conducted our study with eight samples for each osteotomy type (16 total).

All analyses were performed using SAS 9.2 (Cary, NC, USA). The level of statistical significance was set at p ≤ 0.05. Fisher’s exact tests were used to compare the frequency of correctly identifying the location of the rotation by each of the three methods. The percentage of correct identifications by each method was compared for each observer and for both observers combined.

The intraclass correlation coefficient (ICC) was calculated to look at intermethod agreement and interrater agreement. It is a measure of reliability of the outcomes and indicates whether there is strong agreement between continuous values of rotation (true versus traditional, true versus Kim, true versus volumetric, Observer 1 versus Observer 2). An ICC of 0.81 to 1.00 represents “almost perfect agreement” [8]. Agreement across time was assessed for both location and degree of rotation. Similar to previous analyses, agreement for identifying the location of rotation (IT versus ST) was calculated using Cohen’s kappa.

Results

With the numbers available, no differences were identified among the traditional, Kim, and volumetric CT techniques in detecting the total amount of transverse-plane femoral deformity in this model (Table 1). Specifically, the traditional and Kim methods averaged −1° ± 4° from true rotation (range, −7° to 7°; p = 0.4895), and the volumetric method averaged rotation 1° ± 4° from true rotation (range, −7° to 7°; p = 0.3404).

Table 1 Assessment of femoral rotation with three distinct CT-derived methods

The Kim and volumetric techniques were superior to the traditional technique in terms of localizing the deformity, and with the numbers available, we identified no differences in performance between the Kim and volumetric techniques. With traditional measurements using two axial images (single femoral neck axis, single posterior condylar axis), the correct site of torsion was identified 47% of the time compared with 100% with the trochanteric method of Kim and 100% with volumetric analysis (p < 0.001; Table 2).

Table 2 Correct localization of femoral rotation

There was near perfect agreement between observers in quantifying rotation by all three methods (Table 1). At index measurement, the ICCs of the traditional, Kim, and volumetric methods were 0.9569, 0.9569, and 0.9359, respectively. The interrater reliability of the angular measurements persisted when a second set of measurements was taken 6 months later (Table 3). After allowing 6 months to elapse, the ICCs of the traditional, Kim, and volumetric methods were 0.9808, 0.9807, and 0.9353, respectively. Localizing the site of femoral rotation was inconsistent between observers by the traditional method (κ = 0.375). In contrast, the Kim and volumetric techniques showed almost perfect kappa values (κ = 1.0), indicating that every time one observer indicated a location, the other observer indicated that same location (Table 4).

Table 3 Accuracy and interrater reliability of measurements over time
Table 4 Interrater agreement on the location of femoral rotation

Discussion

Femoral rotation has been variably termed “anteversion” or “torsion” regardless of the site of rotation. Transverse-plane femoral profiles are currently measured using axial CT, and the often-used method of a single cervical axis has been suggested to introduce a measurement error. This study aimed to determine which of the existing techniques of CT-derived transverse plane femoral profiles is most accurate and reproducible. We found that a volumetric 3-D reconstruction of the proximal femur and a three-axis technique adding an intertrochanteric axis (the “Kim method”) were more accurate than a two-axis technique at (1) quantifying transverse plane femoral morphology; (2) localizing those deformities; and (3) were more reproducible among different observers.

This study has a number of limitations. First, it is a sawbones model and not a clinical study. Although a sawbones model allows complete investigator control of rotation, the heterogeneity of rotational deformity in vivo cannot be mimicked. Femoral rotational deformity can stem from multiple loci along the femoral length. Furthermore, some femora in the clinical setting may be retroverted and deformity may stem from the neck or epiphysis, which was not simulated in this investigation. Volumetric transverse plane profiles were constructed with complex software not widely available to all surgeons and radiologists; hence, the inclusion of the Kim method [7], which provides a straightforward technique for rotational localization available with any CT scanner. It also provides for quantification of a mixed supra- and infratrochanteric rotational deformity. This method can itself be subject to measurement error, because the profiles of the greater and lesser trochanters may not be readily identifiable in a deformed femur. One other limitation of the study is the anatomic variation of the location of the lesser trochanter in vivo. How this affects this study and the calculation and localization of rotation is not currently fully understood, although it is assumed that the lesser trochanter is generally consistently located on the posteromedial femur. It could affect and limit the applicability of the Kim method if in vivo variation exists. Although we examined eight femora in each group (IT and ST), some differences of modest size could be missed because of Type II error, although a prestudy power analysis confirmed adequate sample size.

In our study, we found equivalent accuracy for each measurement technique in quantifying femoral rotation, which was unanticipated. This was an unexpected finding because Murphy et al. [10] found significant underestimation of rotation using single neck axis cuts in their cadaveric CT investigation. Before the advent of CT, both direct [5] and indirect [2, 1113, 15, 17] plain radiographic techniques were used to characterize femoral rotation. CT allowed for direct superimposition of a single neck and condylar axis, which remains the basis for most measurements to this day [18]. However, the femoral neck is neither uniplanar nor cylindrical and can be best approximated by a series of ellipses that create a unique composite neck axis, which deviates from the shaft axis by an average 5 mm and the center of the femoral head by 2.5 mm [3]. As a result of this complex proximal femoral anatomy, some authors have suggested that the traditional methods of axial CT-derived femoral profiles are inaccurate and underestimate true anatomic measurements. The clinical significance of this measurement error in CT-derived profiles is not known. In this investigation, transverse-plane rotational profiles were quantified equally well by the traditional, Kim, and 3-D/volumetric methods.

The traditional method of a uniplanar neck axis did not enable anatomic localization of the site of rotation, whereas the Kim and volumetric methods did. In this study, the addition of the intertrochanteric CT axis described by Kim to the traditional method allowed differentiation of “supratrochanteric” and “infratrochanteric” femoral rotation and was the basis for localizing an otherwise hidden osteotomy. This added CT axis enabled the differentiation of femoral “anteversion” and “torsion” or the relative contributions of each to a mixed rotational profile. Without it, our observers identified the location of femoral rotation only 47% of the time and the two observers had poor interrater agreement in the instances in which they were correct. In sum, the site of rotation could only be identified when using a technique that considered anatomic details between the femoral neck and distal condyles (ie, the Kim or volumetric methods).

All three CT methodologies had high interrater reliability. Although other authors have successfully used two-dimensional (2-D) CT to obtain accurate rotational profiles despite complex femoral neck anatomy [1, 3, 9], it has been noted that variability in the traditional method is minimized with more experienced observers (attending physicians versus residents) and the use of predefined cut levels [9]. In our investigation, both the traditional and Kim methods (2-D) provided excellent inter- and intraobserver reliability as did the volumetric (3-D) technique.

Despite the complex 3-D anatomy of the proximal femur, we have demonstrated that the overall rotational profile can be accurately and reproducibly measured using traditional 2-D techniques, even with a single neck axis. However, the locus of rotation (neck versus shaft) cannot be identified without, at minimum, some evaluation of the intertrochanteric region. In the absence of software providing 3-D volumetric analysis of the proximal femur, an intertrochanteric CT axis localizing the trochanters can differentiate rotation occurring proximal to the lesser trochanter from rotation occurring distal to the lesser trochanter. As the biomechanical role of femoral rotation on hip mechanics becomes increasingly understood, it is important to both accurately measure and localize the site of femoral rotation. Furthermore, using unambiguous terminology that reflects the site of rotation will facilitate discussions between practitioners and investigators. In the future, we suggest that when referring to transverse plane rotation of the femur and its surgical treatment that “version” be used for rotation localized proximal to the lesser trochanter and “torsion” be used for rotation localized distal to the lesser trochanter.