Introduction

Spinal fusion surgery is a commonly performed treatment with a broad spectrum of indications [1, 2]. Spinal fusion leads to a redistribution of forces throughout the spine, and the vertebral segments adjacent to the fusion must adapt to the rigid fixation beneath with compensatory increases in motion, intradiscal pressure, and force loading on facet joints [3,4,5]. Adjacent segment disease (ASD) is a broad term describing the condition that develops when the adjacent segment fails to adapt to these biomechanical redistributions and decompensates. Manifestations of ASD include adjacent disc degeneration and herniation, spinal stenosis, proximal junctional kyphosis, and adjacent vertebral compression fracture [6]. Several risk factors are associated with the occurrence of ASD, including age, female sex, obesity, osteoporosis, prior degenerative changes in the adjacent segment, and sagittal misalignment; among these, the surgeon may have a direct impact on sagittal misalignment [7,8,9,10,11]. Sagittal imbalance, most commonly measured by the pelvic incidence–lumbar lordosis mismatch (ΔPI-LL) and sagittal vertical axis, has been associated with poor health-related quality of life and increased disability [12,13,14]. Failure to restore lumbar lordosis leads to increased force loadings at adjacent segments and may trigger ASD [11, 15,16,17]. Thus, patient-specific planning to optimally balance the delicate relationship between the pelvic anatomy and lumbar alignment is of paramount importance to achieve optimal long-term outcomes.

Computerized preoperative planning tools for simulation of postoperative alignment are increasingly gaining importance. Tools requiring manual annotations are user-dependent and prone to input errors [18]. Machine learning algorithms have been shown to be helpful for predicting alignment [19]. A previously published statistical shape model (SSM) was trained and used to predict improved spinal alignment in patients with spinal disorders [20]. Annotated radiographic data were used to train the SSM, and the training dataset comprised only asymptomatic subjects without spinal pathology or deformity. The trained model contains information about the variation of sagittal spinal alignment within an asymptomatic population. Using patient-specific parameters, a potentially optimal plan for restoration or maintenance of the sagittal spinal balance can be provided.

The present study was performed to investigate differences in spinal alignment between postoperative radiographs after single-level lumbar fusion at L4/5 and the SSM-calculated optimal spinal alignment, comparing patients with and without the development of ASD. We hypothesized that non-conformance to the SSM-calculated patient-specific optimal spinal alignment is associated with the development of ASD.

Materials and methods

Patient population

In this case–control study, we retrospectively reviewed all patients after primary spinal fusion surgery, one group were revised for treatment of ASD from 2004 to 2017 and compared them with a control group of patients who did not develop ASD after primary fusion at a single institution. We included patients who primarily underwent single-level fusion at L4/5 using a posterior approach only with pedicle screw instrumentation with or without intervertebral cage placement. In the control group, patients with < 4 years of follow-up were excluded. In both groups, patients with preoperative or postoperative radiographs of insufficient quality were excluded. All radiographs were calibrated and displayed both femoral heads and all vertebrae from T12 to the sacrum. The preoperative degeneration of the adjacent segment was measured on MRIs with the Pfirrmann [21] and Weishaupt [22] classification.

Statistical shape model

As previously described, an SSM is used to predict the optimal positions of the vertebral body centers based on spinopelvic landmarks (femoral heads and S1 endplate) [20]. On preoperative lateral radiographs, the sacral endplate and center of the bicoxofemoral axis (determined by the center of each femoral head) were annotated. These patient-specific geometries were used to calculate the predicted physiological position of each center of the vertebral body from T12 to L5. Furthermore, all corners of the vertebral bodies from L5 to T12 were annotated on the preoperative and first postoperative lateral radiographs after single-level fusion. The center of each vertebral body was determined by calculating the mean of all annotated corner points (Fig. 1).

Fig. 1
figure 1

Two examples of postoperative spinal alignment versus optimal predicted sagittal alignment. Left: Case in ASD group. Right: Case in control group. For prediction of the optimal sagittal alignment, the spinopelvic landmarks (centers of the femoral heads and the sacral endplate) were annotated. The statistical shape model then calculated the optimal vertebral centers (blue circles) based on the spinopelvic annotations. The true vertebral centers (red circles) are the intersections of the annotated anterosuperior, posterosuperior, anteroinferior, and posteroinferior aspects of each vertebral body. ASD, adjacent segment disease; postop, postoperative

The predicted center of each vertebral body was compared with both the annotated preoperative and postoperative vertebral positions of each patient. This comparison was performed in a two-dimensional plane, and the vertical, horizontal, and total distance was measured. Additionally, the total distance from the preoperative to postoperative position of each vertebral body was evaluated without reference to the SSM-predicted positions.

To correct for the patient’s height, the T12 segment was set as given by the patient and was therefore the same height as the given output of the SSM-calculated T12 height.

Because the SSM-predicted vertebral positions were based on the preoperative pelvic configuration, the predicted points were placed on the postoperative radiograph by overlapping the posterior boundary of the S1 endplate (Fig. 1). In this way, the predicted vertebral centers could be compared with both the preoperative and postoperative vertebral positions.

Statistics

Statistical analysis was performed with IBM SPSS version 25 (IBM Corp., Armonk, NY, USA) and Stata version 13.1 (StataCorp LP, College Station, TX, USA). The distance to the statistical ideal was calculated for every vertebra using the formula for the root mean square error (RMSE). A parametric independent-samples t-test was performed for intergroup differences between the distances to the SSM. Data are presented as mean ± standard deviation. Epifusional classification crosstable was calculated with fisher’s exact test. The chi square test was used to compare the interbody fusions. The level of significance in all tests was set at p < 0.05.

RESULTS

Patient population

In total, 330 consecutive patients were screened for the ASD group. Of these, 22 patients were eligible for inclusion. Patients were excluded for fusion at levels other than L4/5 (n = 247); a complex medical history, incomplete medical records, or prior lumbar spine surgery (n = 21); or radiographs of insufficient quality or missing information about pixel dimensions (n = 40).

In total, 576 patients were screened for the control group, and 25 were included. Patients were excluded because of a follow-up of < 4 years (n = 186), previous operations (n = 91), recurrent pain or other surgery-related issues (n = 222), or radiographs of insufficient quality or missing segmenting of the pixels on the scan (n = 52).

The patient demographics were not significantly different between the groups with the exception of age at the primary surgery, which was lower in the control group (60 ± 12.0 years) than in the ASD group (66 ± 7.8 years, p = 0.043) (Table 1). There was no significant difference in body mass index or sex. There was also no significant difference in interbody fusion with an intervertebral cage between the control and ASD group (15 (60.0%) vs. 16 (72.2%) patients, respectively; p = 0.358). Most importantly there was no difference in preoperative degeneration of the adjacent segment between the groups (Pfirrmann p = 0.458, Weishaupt p = 0.212) (Table 2). The MRIs were taken 1.6 months (± 1.9) before the operation. Fourtyfive (96%) MRIs where analyzed, two (4%) were not available for review.

Table 1 Patient characteristics
Table 2 Preoperative epifusional degeneration L3/4

Spinal alignment

All spinopelvic parameters on the postoperative radiographs, the relationship between pelvic incidence and lumbar lordosis, as well as the calculated ideal pelvic tilt as described by Vialle et al. [23] and its deviation to our patient population, was not significantly different between the groups (Table 3).

Table 3 Postoperative spinopelvic parameters

Total distance from predicted optimal spinal alignment

Preoperatively, the mean distance (RMSE) from the centers of the vertebral bodies to the SSM-predicted ideal position was very similar between the two groups, with no significant difference (Fig. 2). At the L5 level, this distance was 6.7 ± 3.0 mm in the control group and 6.0 ± 2.5 mm in the ASD group (p = 0.390), whereas at the T12 level, the distance was 17.18 ± 11.6 mm in the control group and 17.19 ± 14.2 mm in the ASD group (p = 0.998). The sum of all total distances from the calculated ideal was 70.5 ± 40.2 mm the control group and 71.7 ± 44.1 mm in the ASD group (p = 0.918).

Fig. 2
figure 2

Total distance to the statistical shape model-calculated ideal in root mean square error. Left: Preoperative. Right: Postoperative. Green bars: ASD group. Blue bars: CTRL group. Error bars: 95% CIs. CTRL, control; ASD, adjacent segment disease; CI, confidence interval

Postoperatively, the most proximally measured vertebra (T12) had a mean distance of 14.84 ± 12.9 mm in the control group and 21.35 ± 13.0 mm in the ASD group (p = 0.092). The sum of all total distances was 67.71 ± 46.2 mm in the control group and 86.81 ± 34.3 mm in the ASD group (p = 0.119).

Horizontal distance from predicted optimal spinal alignment

Preoperatively, no significant difference was found between the groups (Fig. 3). For L5, the mean distance postoperatively was 5.48 ± 3.3 mm in the control group and 4.57 ± 2.3 mm in the ASD group (p = 0.286). At the T12 level, the mean distance was 14.84 ± 12.9 mm in the control group and 21.35 ± 13.0 mm in the ASD group (p = 0.092).

Fig. 3
figure 3

Horizontal distance to the statistical shape model-calculated ideal in root mean square error. Left: Preoperative. Right: Postoperative. Green bars: ASD group. Blue bars: CTRL group. Error bars: 95% CIs. CTRL, control; ASD, adjacent segment disease; CI, confidence interval

Vertical distance from predicted optimal spinal alignment

Preoperatively, no significant difference was found between the groups (Fig. 4). The greatest distance from the statistical ideal was found at the L4 level in both groups (4.09 ± 3.7 and 4.31 ± 2.9 mm in the control and ASD group, respectively; p = 0.823). The decreasing distance toward the cranial segments was due to the vertical referencing to the T12 segment, which equalized every patient’s height and was therefore used as a reference, as explained above.

Fig. 4
figure 4

Vertical distance to the statistical shape model-calculated ideal in root mean square error. Left: Preoperative. Right: Postoperative. Green bars: ASD group. Blue bars: CTRL group. Error bars: 95% CIs. CTRL, control; ASD, adjacent segment disease; CI, confidence interval

Postoperatively, significant differences were observed in the operated L4 and the adjacent L3 vertebrae between the two groups. At the L4 vertebra, the distance from the ideal was 5.69 ± 3.0 mm in the ASD group and 3.58 ± 3.5 mm in the control group (p = 0.034). At the L3 vertebra, this distance was 5.44 ± 3.13 mm in the ASD group and 3.08 ± 2.77 mm in the control group (p = 0.009).

Changes in preoperative to postoperative position of vertebral bodies

The distance from the preoperative to postoperative vertebral bodies was greater in the control group than in the ASD group. This trend began at the L4 level and continued through all cranial segments. At the L4 level, the distance was 7.50 ± 6.4 mm in the control group and 6.96 ± 4.50 mm in the ASD group (p = 0.742) (Fig. 5). This continued through all of the more cranial segments, with the greatest change at the T12 level (distance of 19.7 ± 19.8 mm in the control group and 16.8 ± 11.3 mm in the ASD group; p = 0.554).

Fig. 5
figure 5

Change in distance of the vertebral centers from the preoperative to postoperative position. Green bars: ASD group. Blue bars: CTRL group. Error bars: 95% CIs. CTRL, control; ASD, adjacent segment disease; CI, confidence interval

Discussion

As the numbers of primary spinal fusion procedures continuously increase, so do the rates of revision spinal surgery [24,25,26]. ASD is the most important yet insufficiently controllable long-term complication associated with lumbar spinal fusion. Within 10 years after primary lumbar fusion, 22% to 36% of patients require additional decompression and/or proximal extension of the spinal fusion because of ASD [7, 27]. Among other risk factors, unfavorable spinopelvic alignment has been identified as a major driver for the occurrence of ASD in the lumbar spine. It is therefore the surgeon’s duty to optimally influence the sagittal balance when performing fusion surgery [28, 29]. The relationship between the physiologic upright posture and the lumbosacral alignment was well described by Legaye et al. [30] and Duval-Beaupère et al. [31]. Boulay et al. [31] further developed a formula to predict lumbar lordosis based on the pelvic incidence and stated that if the standing position is outside the “efficiency zone,” adaption of the spine and pelvis is exceeded and causes pathological conditions and loading patterns. Rothenfluh et al. [32] demonstrated that patients with ΔPI-LL of > 10 degrees have a tenfold higher risk of developing ASD. However, although the definition of optimal alignment for spinal fusion surgery follows geometrical criteria, it strongly relies on the surgeon’s subjective decisions.

Because the spinopelvic system is a highly complex construct and not yet fully understood, statistical models have been introduced to simulate physiological alignments and ameliorate preoperative planning. The SSM used in the present study was developed by Caprara et al. [20]. The SSM was initially trained with 60 annotated radiographs of asymptomatic healthy subjects and calculates the optimal position of each vertebral body based on spinopelvic landmarks. In contrast to more common spinopelvic parameters, such as the ΔPI-LL and sagittal vertical axis, the SSM predicts the patient-specific optimal position of each vertebra. This may be especially helpful in lower back fusion constructs because the lowest two segments are in charge of the major part of the lordosis.

The present retrospective study compared the spinal alignment evaluated by the patient-specific SSM calculation of patients with ASD versus an asymptomatic cohort of patients before and immediately after single-level fusion at L4/5. We found greater misalignment in patients with ASD than in the asymptomatic control group, which confirms the accuracy of the SSM in clinical situations. Preoperatively, a homogenous and similar distribution of spinal alignments around the calculated ideals was found in all patients. In contrast, greater misalignment was seen in all directions (horizontal, vertical, and total distance) on standing postoperative radiographs in the ASD group than in the controls. These differences in the horizontal and total distance were more prominent in the more cranial segments. This greater deviation in the cranial segments is likely related to the increased uncertainty of the SSM towards cranial as described in Caprara et al. [20] and was attributed to the increasing distance from the input pelvic configuration. However, the cranial segments deviated more from the SSM ideal in the ASD group than in the controls. Because the postoperative radiograph is taken within the first few postoperative days and long before the occurrence of ASD, the differences from the SSM ideal may indicate the long-term progression immediately after surgery. In fact, there was no statistical significant difference in the preoperative degeneration of the adjacent segment in both groups according to the classification of Pfirrmann [21] und Weishaupt [22].

Interestingly, a greater change in preoperative to postoperative alignment was seen in the control group than ASD group (Fig. 4). Because the fusion was at the L4/5 level, these segments were mainly influenced intraoperatively. A greater change to the L4 segment was seen in the control group than ASD group, which further influenced the cranial segments. We interpret this as an intraoperative change, which is crucial because the segments from L4 to S1 are responsible for 66% of the total lumbar lordosis [33]. Therefore, surgeries involving the lower lumbar spine have a tremendous effect on the total spinal alignment, the development of ASD, and the patient’s quality of life.

Better alignment in the control group was also found in view of the vertical distance to the ideal. Because the SSM is trained with young individuals, the deviation of the L5 vertebra in the vertical direction to the mean is partly due to degeneration of the L5/S1 segment and partly due to the individual patient’s shape [20]. However, significantly better postoperative height restoration was seen in the control group than in the ASD group. Height restoration of the fused level is an important factor associated with the occurrence of ASD [34] and is another corresponding factor with the here found results.

The postoperative spinopelvic parameters as well as the preoperative degeneration of the epifusional segment showed no significant difference. As mentioned above, even the ΔPI-LL as one of the major biomechanical risk factors was the same in both groups. This indicates that the SSM might be an additional tool, in conjunction with the established indexes, for preoperative planning.

This study confirms the hypothesis that patients who develop ASD show greater non-conformance in spinal alignment to the SSM-calculated ideal than do controls on the first postoperative radiographs. However, comparative trials with large sample sizes are needed to clarify the accuracy of patient-specific simulation of the spinal alignment using the SSM.

This study has limitations. The sample sizes of the two cohorts included in this study were small, thereby limiting its statistical power. The SSM’s training population was initially constituted by rather young subjects and does not address variances in age, sex, or race. Thus, the SSM remains a rough estimation of the ideal spinal alignment. The authors are aware of the physiological spinal degeneration with age. The degenerated spine is in our understanding not the ideal spinal alignment for the individual patient, and therefore, our statistical ideal is trained with a young and healthy population. But to better individualize the patient-specific simulations we could think of an SSM that addresses not only the patient’s individual spinopelvic anatomy, but also the body habitus. Further technical limitations of the SSM were described in detail in the original publication [20]. The authors would like to point out that SSM should not be understood as a more effective tool compared to the already known indexes but might be an interesting additional point of view that warrants further research.

Conclusion

Patients who develop ASD requiring revision after lumbar spinal fusion have greater differences from the SSM-calculated optimal spinal sagittal alignment on the first postoperative radiographs. Further research might help to understand the value of SSM, in conjunction with already established indexes, for the preoperative planning of optimal spinal alignment.