Introduction

The lower limb alignment is an important reference for clinical diagnosis and treatment of conditions affecting the knee. Malalignment of the lower extremity is recognized as a critical component of the etiology of knee osteoarthritis [4, 17, 19]. Correct alignment is critical for the outcome of total knee arthroplasty (TKA) [12], especially for personalized surgical design [16, 21]. Lower limb alignment is commonly assessed using measurements of anatomical or mechanical (weight-bearing) axes of the femur and the tibia, as depicted on plain radiographs. Quantification of these axes usually requires the identification of anatomical landmarks from hip to ankle. For evaluation of the knee alignment, long-leg radiographs (LLR) are considered as the gold standard image modality, with high inter- and intra-observer reliability.

Correct and standardized patient positioning during imaging is the prerequisite for the accuracy of the lower limb alignment measurement while using LLR. However, considering patients with knee osteoarthritis or other joint disease are not able to extend the knee fully, it is reasonable to study the possible alignment measurement errors of patients with severe deformity. Studies have reported that limb rotation or foot rotation could significantly affect the outcome [6, 9, 11] [15]. The measurement error of hip–knee–ankle (HKA) angle could reach 4–10 degrees or even greater [14, 15]. In comparison, the recommended HKA after TKA ranges from − 3\(^{\circ }\) to 3\(^{\circ }\), which represents a proper knee alignment and prosthesis position. Therefore, the preoperative measurement error directly affects knee osteotomy. Previous studies have suggested that improved methods of measuring lower limb alignment are of greater importance to the advancement of knee osteotomy than the technical details of the surgical procedure itself [11, 13, 15].

Although studies involving synthetic prosthesis [11, 14], specimen of human lower limbs [3, 20] and in vivo images [7] all existed, the magnitude of these errors varies dramatically. Some researchers believed the long-leg radiograph is a reliable method [1, 10] and reported that the error caused by limb rotation was less than 1\(^{\circ }\) within the acceptable range [18], whereas others found significant errors in the measurements of severe varus knees [8, 14, 15, 18]. Coronal deformity, knee flexion contracture, or some other anatomical features in patients might be important factors that potentially affect the magnitude of measurement errors. Considering the in vivo images derived from patients can better reflect real clinical application scenarios, the measurement error of knee alignment in patients with complex deformities should be investigated more in-depth. It is meaningful to quantify the magnitude of alignment measurement errors in patients with different degrees of deformity.

The purposes of this study are: (1) to determine the influence of limb rotation on measurements of knee alignment derived from long-leg radiographs; (2) to measure the additional influence of coronal deformity and knee flexion contracture; and (3) to determine whether weight-bearing affects the accuracy of the radiographic measurements. The significance of this study is to assist in reducing measurement errors of mechanical and anatomical leg axis. This study attempts to prove the following hypotheses (Fig. 1).

Fig. 1
figure 1

This study attempts to prove these hypotheses

Results

The median value of knee flexion was 5.32\(^{\circ }\) (2.27\(^{\circ }\); 8.38 \(^{\circ }\)). The median knee coronal deformity was 7.37 \(^{\circ }\) (4.25\(^{\circ }\); 9.50\(^{\circ }\)). Descriptive statistics of the HKA and FTA deviation range are summarized in Table 1. The average deviation ranges of HKA and FTA were 4.84 ± 0.66\(^{\circ }\) (range: 0.56–25.03) and 7.35 ± 0.88\(^{\circ }\) (range: 1.30–30.42), respectively. As shown in Fig. 2, the mean difference between HKA and FTA deviation range was 2.50 (p < 0.01). The intraclass correlation coefficients for HKA and FTA deviation range were perfect, 0.976 (p < 0.05) and 0.935 (p < 0.05), respectively.

Table 1 HKA and FTA deviation range derived from varying degree of flexion and coronal deformity
Fig. 2
figure 2

Box-plot illustrated the average deviation ranges of HKA and FTA. p value < 0.01, the difference between HKA and FTA deviation range was significant

According to the two-way ANOVA: there were statistical differences in both HKA deviation range and FTA deviation range significantly for patients with varying flexion and coronal deformity (p < 0.05); the interaction between flexion and coronal deformity also had a significant effect on HKA and FTA deviation range (p < 0.05). Fig. 3 illustrates the average HKA and FTA deviation range in patients with different flexion and coronal deformities.

For flexion index, there were significant differences among the slight, moderate and severe groups in both HKA and FTA deviation range (p < 0.001) (Table 2). For the coronal deformity index, there was no significant difference between the slight and moderate groups in both HKA and FTA deviation range (p > 0.05). Yet, the coronal deformity index of the severe group was significantly different from that of the slight and moderate groups (p < 0.001) (Table 3). Among the different degrees of coronal deformity, only the severe group differed dramatically from the other two groups in the HKA deviation range (p < 0.001) and in the FTA deviation range (p < 0.001). No significant difference was found between the slight coronal deformity group and the moderate coronal deformity group in the HKA deviation range (p > 0.05) and in the FTA deviation range (p > 0.05).

Fig. 3
figure 3

The deviation range of HKA and FTA in patients with different flexion and coronal deformity. p value < 0.05 means that difference between the two corresponding groups is significant in 95\(\%\) confidence level. a Participants were divided into slight, moderate, and severe flexion groups by the degree of flexion contracture. The left part showed the HKA deviation range in the slight, moderate and severe flexion group. The right part showed the FTA deviation range in the slight, moderate and severe flexion group. b Participants were divided into slight, moderate, and severe coronal deformity groups by the degree of coronal deformity. The left part showed the HKA deviation range in the slight, moderate, and severe coronal deformity group. The right part showed the FTA deviation range in the slight, moderate and severe coronal deformity group

Table 2 Multiple comparisons of flexion group, the mean difference of HKA deviation range and the mean difference of FTA deviation range
Table 3 Multiple comparisons of coronal deformity group, the mean difference of HKA deviation range and the mean difference of FTA deviation range
Table 4 Correlation coefficients between the knee deformity and alignment measurement errors

The deviation ranges of HKA and FTA were significantly correlated with the knee flexion and coronal deformity (Table 4). In the non-weight-bearing condition, the correlation coefficient between coronal deformity and HKA deviation range was 0.307 (p < 0.05); the correlation coefficient between coronal deformity and with FTA deviation range was 0.504 (p < 0.01). In the weight-bearing condition, the correlation coefficient between coronal deformity and HKA deviation range was 0.635 (p < 0.01); the correlation coefficient between coronal deformity and with FTA deviation range was 0.639 (p < 0.01). Remarkable correlations were found between the flexion angle and deviation ranges of both HKA (0.933, p < 0.01) and FTA (0.861, p < 0.01).

Fig. 4
figure 4

Box-plot illustrating the coronal deformity measured in neutral position. Virtual LLR: virtual radiographs simulated from full-leg CT scan in non-weight-bearing condition. Real LLR: standing full-leg radiographs in weight-bearing condition

The box-plot in Fig. 4 illustrates the coronal deformity measured from virtual LLR and real LLR. In the neutral position, coronal deformity measured on real LLR 10.52 ± 0.94\(^{\circ }\)was significantly greater than that on virtual LLR 7.90±0.54\(^{\circ }\). The mean difference of coronal deformity in the non-weight-bearing condition and weight-bearing condition was 2.62 ± 0.68\(^{\circ }\) (p < 0.01).

Discussion

The lower limb alignment measurement is a crucial factor in the preoperative plan and postoperative assessment of a knee replacement. Inconsistent positioning during the capture of X-ray and resulting projected image can bring measurement errors of the lower limb alignment. However, the argument over whether these measurement errors are acceptable and related to the knee deformity has not been settled. This study quantitatively revealed that the influence of limb rotation on measurements of knee alignment derived from long-leg radiographs, the measurement errors corresponding to different deformed knees, and the difference of radiographic measurements on weight-bearing and non-weight-bearing conditions.

In this study, limb rotation caused the deviation ranges of HKA and FTA measurement to be greater than 3\(^{\circ }\). The degree of the anatomical axis variation was greater than that of the mechanical axis. Although tilted projections can lead to measurement errors of the lower limb alignment, the magnitude of these errors measured in different studies varies dramatically. Wright JG [20] reported that the deviation range of FTA measurement was less than 1\(^{\circ }\) under the limb rotation from 20\(^{\circ }\) external rotation to 20\(^{\circ }\) internal rotation, whereas Swanson KE [18] found significant errors in severe deformity knees up to 7.8\(^{\circ }\). This could be explained by the fact that the degree of deformity of the lower limb would influence the measurement error of the alignment axis.

Our results demonstrated that the coronal deformity affected the alignment measurement errors greatly. Multiple comparisons of slight, moderate, and severe degrees of coronal deformity indicated that the severe group (coronal deformity > 10\(^{\circ }\)) differed significantly from the other two groups in alignment measurement errors. Compared with the non-weight-bearing condition, the coronal deformity of the weight-bearing condition had a greater influence on the measurement errors. However, the effects of coronal deformity on the assessment of alignment on LLR in the previous studies were controversial. Swanson et al. [18] discussed in saw-bones research of alignment measurements in deformed limbs that the FTA measurement in limbs with severe valgus or varus deformity was more sensitive to the effect of rotation than in normally aligned limbs.This is in line with our results that mechanical and anatomical axis measurement errors in the severe coronal deformity group were greater than those in the slight or moderate group. Jud et al. [7] conclude that deviations in mechanical leg axis measurements did not vary relevantly through the coronal deformity. It is an interesting conclusion that appears to contradict the findings of our study. Still, it is, in fact, compatible with this study’s conclusions because they only analyzed coronary malformations below 9 degrees. The coronal deformities of our research subjects ranged from 0.62 to 19.55 degrees. No significant difference was found between the slight and the moderate coronal deformity group in alignment measurement errors. While the severe group (coronal deformity > 10\(^{\circ }\)) had significant differences with the other two groups in alignment measurement errors. Additionally, this study measured both the HKA and FTA deviation range of each subject. On the contrary, the study conducted by Kawakami et al. [8] reported no significant correlation between coronal deformity and measurement error. Kawakami et al. set different limb rotation ranges for different individuals to calculate the measurement error. In the present study, the limb rotation from internal 20\(^{\circ }\) to external 20\(^{\circ }\) was performed uniformly for each subject. And the present study further collected coronal deformity in the weight-bearing condition to compare with the result obtained from the non-weight-bearing condition. The opposite conclusion was probably reached from these two aspects. Therefore, the hypothesis that coronal deformity and the measurement errors were significantly related is credible. The effect of coronal deformity on the assessment of alignment on real LLR may be more pronounced. The method of alignment measurement should vary from slight to severe patients of varus and valgus.

The knee flexion angle plays an important role in the potential error of the alignment measurement. Multiple comparisons of the slight, moderate and severe flexion group showed significant differences in alignment measurement error for all combinations. With the increase of knee flexion, the measurement error of lower limb alignment increased significantly. According to a study with a synthetic model by Lonner et al. [11], the FTA deviation range of samples with flexion varied more than the control group. This statement was consistent with our study, and a similar conclusion was reached in a radiographic cadaver study conducted by Brouwer et al. [3]. Differently, the present study recruited clinical patients instead of using synthetic models or specimens of human lower limbs. Both knee osteoarthritis and normal individuals under clinical conditions were observed. Since the bony anatomy varies widely, the lower limbs alignment in vivo state can reflect the real morphology information. On the other hand, the knee flexion of each subject was not preset by the researchers but appeared naturally. Knee flexion angles were calculated by the self-developed program for quantitative analysis. The difference in the measured coronal deformity was significant between the weight-bearing condition and the non-weight-bearing condition. Brouwer et al. [2] performed a standing and a supine LLR in 20 patients with varus deformity and found an average of 2\(^{\circ }\) more varus deviation in the standing position than in the supine position. In our study, the virtual LLR reconstructed from CT scanning in the supine position was used to compare with the real LLR in the standing position. The average HKA measured on the virtual LLR is 2.83\(^{\circ }\) less than that on the real LLR. This could be explained by the reduction of loading forces across the knee joint in the supine position. Although the clinical examination of standing LLR is different from the supine position LLR we used, it is not expected to alter the trends of the above results observed from the virtual LLR in the present study.

There are some limitations to this study. Firstly, the virtual LLRs were simulated from supine CT scanning, which resulted in a slight difference from the clinical examinations. It is preferable to obtain the CT images in the weight-bearing condition. Consequently, this study compared the lower limb alignment measured on virtual LLRs with the real LLRs to complement our results obtained in the non-weight-bearing position. Secondly, when exploring the relationship between knee coronal deformity and measurement errors in the weight-bearing condition, it is not rigorous enough to use HKA reflected on the real LLR to represent knee coronal deformity. However, this provides some clues to the relation between coronal deformity and measurement errors in clinics. Thirdly, the gender-specific differences of coronal alignment in osteoarthritic knees were not clear according to a review by Hess et al. [5]. The study participants were primarily females (73\(\%\)), and thus, we could not show a gender difference. Some studies proposed that assessing the lower limb alignment in subjects with an uneven gender ratio might be biased. The last limitation is associated with the low proportion of severe lower limb malformation subjects. However, current results demonstrated a trend that the more severe the deformity, the greater the measurement error of lower limb alignment.

Conclusions

The alignment measurement of the lower limb on LLR can be affected by the limb rotation. However, the measurement errors caused by this effect vary individually. This kind of measurement error is positively associated with the knee deformity (coronal deformity and flexion contracture). When comparing the varying coronal deformity group, the severe group (coronal deformity>10\(^{\circ }\)) differed significantly from the other two groups in the HKA deviation range and in the FTA deviation range. Moreover, compared with the non-weight-bearing condition, the influence of coronal deformity on measurement errors will be amplified in the weight-bearing condition. In addition, the error of measuring the anatomic axis on LLR is greater than that of the mechanical axis. Therefore, attention should be paid to the errors of measuring lower limb alignment on LLR, especially for patients with severe knee deformity in the weight-bearing condition.

Materials and methods

Subjects and imaging procedure

Table 5 Patient baseline demographics and image information collection in clinics

From May 2018 to November 2019, 30 patients (45 knees) with a median age of 67 (range from 48 to 94) were enrolled in this study (Table 5). Twenty-two patients (33 knees) were female, and eight patients (12 knees) were males. Patients with a history of bone loss, infection, tumor, congenital disease, and lower extremity surgery were excluded from participation. CT scans of all subjects (45 knees) and weight-bearing LLR for some patients (31 Knees) with knee arthritis were detected preoperatively. Radiographs were performed with 7–22 mAs and 70–90 kVp, depending on the body mass. CT scans (Siemens SOMATOM Definition Flash, Germany) of the lower extremities in the supine position were obtained using a 1.0 mm section thickness with 120KVp. The method of measuring the knee flexion and coronal deformity is shown in Supplementary Materials. The knee flexion in the supine position and coronal deformity of each patient appeared naturally were calculated by a custom program. According to the degree of knee flexion and coronal deformity in the non-weight-bearing condition, 45 knees were graded as shown in Table 6. The degree of knee flexion and coronal deformity is divided by three levels: 0–5 degrees, 5–10 degrees, more than 10 degrees.

Table 6 According to the degree of knee flexion and coronal deformity, 45 knees were classified into the following levels

Alignment measurement

The determination of anatomical planes depends on three landmarks from 3D models (Fig. 5a): center of the femoral head (FH), the center of ankle joint (AC), and the tip of the greater trochanter (GT). Specifically, FH was defined as the well-fitting sphere center point of the femoral head; AC was defined as the diagonal intersection of the ankle joint surface; GT was defined as the tip of the greater trochanter from the frontal forward direction. Then the coronal plane was defined as the plane passing through Points FH, AC, and GT. The sagittal plane was defined as the plane perpendicular to the coronal plane and passing through line FH–AC.

Fig. 5
figure 5

Applying the 3D model of the lower limb reconstructed from CT scanning to set parameters and geometric relations for virtual X-ray capturing. a Three landmarks to determine anatomical planes on the 3D model: the center of the femoral head (FH), the center of ankle joint (AC), and the tip of the greater trochanter (GT). b The schematic diagram of capturing virtual X-rays by using the lower limb 3D model. The center of the lower limb was defined as the midpoint of line FH–AC. The X-ray source was 1.6 and 1.8 m away from the center of the lower limb and the projection plane, respectively. The initial shooting direction was perpendicular to line FH–AC, passing through the center of the lower limb

Fig. 6
figure 6

A real long-leg radiograph in clinics and a virtual long-leg radiograph in this study. a The real long-leg radiograph was captured in weight-bearing condition. b The virtual long-leg radiograph was captured in the non-weight-bearing condition. Both mechanical axis and anatomical axis were measured on virtual LLR. The HKA was defined as the mechanical axial of the femur and tibia. The FTA was defined as the anatomical axial of the femur and tibia

To measure the axial alignment of non-weight-bearing conditions and compare them with weight-bearing LLRs, the virtual LLRs were reconstructed by a Python program. The X-ray source was positioned at 1.6 meters forward from the midpoint of line FH-AC, and the projection plane was 1.8 meters from the X-ray source. The initial beam direction was oriented perpendicular to the coronal plane. Figure 5b shows the schematic of capturing virtual LLR. The virtual LLR of each knee was reconstructed from the projection of the 3D model by using a self-developed python program. Real LLR in the weight-bearing condition and virtual LLR in the non-weight-bearing condition are shown in Fig. 6. From the radiograph, the mechanical axis of the femur was defined as the line passing from the center of the femoral head to the center of the knee joint; the anatomical axis of the femur was defined as the line passing midpoint of the upper and lower 1/3 of the femoral medullary cavity; both the mechanical and anatomical axis of the tibia was defined as the line passing the center of the knee and ankle joint. HKA was defined as the medial side angle of the mechanical axis; FTA was defined as the medial side angle of the anatomical axis. Both HKA and FTA were measured on virtual LLR, as shown in Fig. 6a.

To simulate the clinical, radiographic capture, and quantify the variability of HKAs and FTAs measured on LLR, the 3D model was rotated from the neutral position. The rotation axis was defined as the line FH–AC. With the rotation of the lower limb in 10\(^{\circ }\) increments, ranging from 20\(^{\circ }\) internally to 20\(^{\circ }\) externally, five series of virtual LLRs were obtained. The range (maximum minus minimum) of HKAs and FTAs in each group was defined as the HKA deviation and the FTA deviation, respectively.

Statistical analysis

To assess the impact of knee coronal deformity and flexion angle on the deviation of HKA and FTA, two-way ANOVAs were calculated using SPSS 22.0 (SPSS Corp, Chicago, USA). The threshold for statistical significance was set at alpha = 0.05. The normality test and uniformity test of error variance are carried out for the raw data. Following that, multiple comparison least significant difference (LSD) was run to compare the mean changes between each event. In addition, Spearman’s correlation coefficients were accessed to analyze the relationship between variables. The method for performing the knee flexion and coronal deformity measurement was shown in Supplementary Materials. The percentages for categorical variables were summarized. Means, standard deviations, and the standard error of mean or medians with interquartile ranges were performed for continuous variables. The differences of coronal deformity between virtual LLR (non-weight-bearing condition) and real LLR (weight-bearing condition) were analyzed by a paired t-test. Intra-observer reliability was calculated by the intraclass correlation coefficient (ICC). The interpretation scales of ICC are classified according to the following: 0–0.20 is slight, 0.21–0.40 is fair, 0.41–0.60 is moderate, 0.61–0.80 is substantial, and 0.81–1.00 is perfect.