Introduction

In the elderly, adult spinal deformity (ASD) remains controversial within the field of spinal disorders due to the important effect it might have on health-related quality of life [1, 2]. Despite advances in surgical techniques and implant selection, surgical treatment for ASD remains one of the most challenging because of the high risk of perioperative complications [3]. Minimally invasive lateral lumbar interbody fusion (LLIF) techniques have attracted attention as alternative or adjuvant procedures in ASD surgery with a goal of reduced surgical access morbidity and perioperative complications. Past study reported elderly patients can successfully be treated using LLIF techniques [4]. Advantages of LLIF include significantly less bleeding, and fewer neurological and operative complications [5,6,7,8]. Substantial sagittal and coronal correction by LLIF has been reported [7, 9]. However, the necessity for three-column osteotomy in the treatment of severe sagittal imbalance or rigid deformities has been indicated despite the risk of massive blood loss and complications [10,11,12]. Several preoperative radiographic methods to assess curve flexibility and preoperative planning of surgery to correct spinal deformity have been suggested such as fulcrum bending radiographic images and lateral radiographs with the patients sitting, prone, or supine [2, 13, 14]. The indications and preoperative evaluation for osteotomy in ASD surgery remain controversial. To establish an optimal strategy for ASD surgery, the limitations and problems surrounding spinal corrective surgery with LLIF without osteotomy should be known.

A novel pelvic-incidence-based proportional method to analyze the sagittal plane to predict mechanical complications after surgery for ASD, the global alignment and proportion (GAP) score has been developed [15]. The formula for the GAP score was developed under the concept of a continuum of states that provides a pelvic incidence-based proportional indication of pelvic version, magnitude and distribution of lumbar lordosis, and global spinopelvic alignment to assess disproportion compared with the calculated “ideal” for any given individual.

This study aimed to clarify the limitations and issues of spinal corrective surgery with LLIF without osteotomy using the GAP score, and to establish preoperative radiographical evaluation to understand the necessity for three-column osteotomy.

Methods

Patients and surgical techniques

We conducted a retrospective observational study of a cohort of consecutive patients with a diagnosis of ASD who underwent corrective spinal surgery. The study was approved by our institutional review board (Application no. 1183). We received written informed consent from all eligible patients. Patients were considered candidates for thoracolumbar correction if fusion was indicated because of ASD and if a full course of conservative care had been exhausted. The inclusion criteria were age > 60 years and a radiographic diagnosis of ASD defined by at least one of the following parameters: a coronal Cobb angle > 30°; a C7 sagittal vertical axis (SVA), which is the distance between the C7 plumb line and the posterosuperior edge of S1, > 5 cm; and/or pelvic tilt (PT), which is the orientation of the pelvis with respect to the femurs and the rest of the body, > 30°. Patients were excluded if they had ankylosing spondylitis, or a rounded back because of Parkinson’s disease. A total of 103 spinal correction surgeries for ASD were performed between April 2012 and March 2017 by two board-certified spinal surgeons at our institution. In this study, we included data from 57 consecutive patients treated with spinal correction surgery with LLIF and followed up for a minimum of 2 years. Patients who demonstrated corrective surgery with Schwab grade 3–6 osteotomy [16] were excluded in this study. Basic demographic and surgical data: age, sex, bone mineral density (BMD), type of procedure and area of fusion were noted (Table 1).

Table 1 Preoperative patient characteristics

Surgical procedure

We used an anterior approach to lateral interbody fusion (LIF) from L1–L2 or L2–L3 to the level of the L4-5 disk to obtain adequate coronal and sagittal global spine alignment in patients with ASD [17]. Subsequently the patient was placed in a prone position to undergo posterior lumbar interbody fusion (PLIF) at the level of the L5-S disk. Intraoperative lateral lumbosacral radiographs were taken to evaluate how much additional correction was needed using the rod cantilever technique to achieve ideal LL determined in a preoperative plan according to previous reports [18,19,20]. After setting dual iliac screws as previously described [21], the S1 pedicle screws were connected to the rod with the iliac screws using an offset connector. The spinopelvic deformity was corrected using a cantilever force technique with the pelvis retroverted, raising the pelvis to an optimal alignment. A rod was connected to each pedicle screw from caudal to cranial aspects. Where flexibility of spinal motion was lost, we added a suitable osteotomy, which was classified as grade 1–6 by SRS-Schwab [16]. Allogenic and local autogenous bone grafts were used.

Radiographic measurements

Radiographic data consisted of full-length lateral radiographs obtained pre- and 4–6 weeks postoperatively, and at 2 years postoperatively, with the patient in a freestanding posture with their fingers placed on their clavicles. To evaluate flexibility of the pelvis and lumbar spine, full-length lateral radiographs were obtained with the patients prone with both arms next to the trunk and without any cushions under the body (Fig. 1). The following radiographic parameters were measured pre- and postoperatively using a lateral view: T5-T12 TK; T12-S1 LL angles; pelvic incidence (PI); PT; sacral slope (SS); SVA; T1 pelvic angle (TPA), which is the angle between the line from the center of femoral heads to the center of S1 and the line from the femoral head to the center of T1 vertebra [22]; and global tilt (GT), which is the angle formed by the intersection of two lines, the first line drawn from the center of C7 to the center of the sacral endplate and the second line drawn from the center of the femoral heads to the center of the sacral endplate [23]; spino-sacral angle (SSA) is defined as the angle connecting the center of the C7 vertebra to the center of the S1 endplate and the line parallel to the superior S1 endplate [24]. L1–S1 lordosis and L4–S1 lordosis were measured on postoperative radiographs to assess the distribution of the lordosis (LDI). Kyphosis was expressed as a positive value, and lordosis was expressed as a negative value. The proximal junctional angle was measured as the angle between the caudal endplate of the UIV to the cephalad endplate of 2 proximal vertebrae. Increased proximal junctional angle (PJA) was calculated as increased angle between the PJA obtained on postoperative radiographs and the PJA obtained 2 years postoperatively. Radiographic measurements were obtained by two board-certified spine surgeons (HO and TO). These surgeons had > 10 years of experience in spinal surgery and were blinded to patient data before the measurements were taken.

Fig. 1
figure 1

The lumbar lordosis construction described by the lordosis distribution index (LDI; L4–S1 lordosis/L1–S1 lordosis × 100) and SS while the patient was prone

GAP score

GAP scores were calculated using early postoperative radiographic parameters following methods previously described by adding the scores for relative pelvic version (RPV), relative lumbar lordosis (RLL), LDI, relative spinopelvic alignment (RSA), and age, and can range from 0 to 13 points. Additionally, a GAP score of 0 to 2 was categorized as indicating a proportioned spinopelvic state; 3 to 6 as moderately disproportioned; and ≥ 7 as a severely disproportioned spinopelvic alignment [15] (Suppl. Table).

Clinical outcomes

Postoperative baseline patient health status was measured (for lumbar pain-related factors) using a Roland–Morris Disability Questionnaire (RDQ) and Oswestry Disability Index (ODI), where 0% indicates no disability and 100% indicates extreme debilitating disability [25] at 2 years after surgery.

Statistical analyses

All data are reported as mean ± SD. Data were analyzed using a two-sided Student t test, or a Fisher exact test to determine significant differences. Correlations between pre- and postoperative radiographic parameters and GAP score were determined using Pearson’s correlation coefficient. We examined whether SS in a prone position (prone SS) preoperatively impacted the insufficient alignment correction. We employed logistic regression analysis with disproportioned spinopelvic alignment, which was considered as a GAP score of more than 2 points, as the response variable, and the prone SS–ideal SS based on preoperative PI as the explanatory variable. Then, we used a receiver operating characteristic curve (ROC curve) to determine the threshold of prone SS–ideal SS for insufficient alignment correction. All statistical calculations were performed using Prism (version 6.0; GraphPad Software, La Jolla, CA) and the statistical package R (version 3.6.1; available at http://www.r-project.org). P < 0.05 was considered significant (*P < 0.05, **P < 0.0001).

Results

Patient population

We included 57 eligible patients in this study; 88% were female, mean age was 71.9 years (± 7.5), body mass index (BMI) was 22.3 (± 3.71) and bone mineral density (BMD) (%YAM) was 73.2% (± 14.2) (Table 1). Postoperatively, the spinopelvic alignment of patients improved significantly. Preoperative SS and LL with the patient prone were significantly greater than while they were standing. The mean pre- and postoperative alignments are summarized in Table 2.

Table 2 Comparison of preoperative spinopelvic parameters between high and low PI

Comparison of pre- and postoperative spinopelvic parameters between high and low PI

Based on a past report [17], patients were divided into groups of those with low PI Preop (PI ≤ 50°) or high PI Preop (PI > 50°) and spinopelvic parameters were compared between groups (Tables 2 and 3). Preoperative SS and LL of those in the high PI group were significantly higher than preoperative SS and LL of those in the low PI group. However, significantly worse GT and TPA were observed in those in the high PI group compared with those in the low PI group. There was no significant difference in postoperative SS and LL between the groups. By contrast, PI–LL, GT, and TPA were significantly larger in the high PI group compared with these variables in the low PI group. These findings indicated patients with high PI should undergo correction to restore a larger SS and LL to achieve balanced spinopelvic alignment.

Table 3 Comparison of postoperative spinopelvic parameters between high and low PI

Postoperative GAP score

Postoperative GAP score is summarized in Table 4. Based on total GAP score, 23 (40.4%) patients were grouped into proportioned, 26 (45.6%) into moderately disproportioned, and 8 (14%) into severely disproportioned spinopelvic alignment. According to subgroup analysis, 50 (87.7%) of the RLL subgroup and 48 (66.7%) of the LDI subgroup accomplished “alignment.” By contrast, only 29 (50.8%) of the RPV subgroup and 29 (50.9%) of the RSA group were “aligned.” This result indicated parameters of the lumbar spine were more easily “aligned” than pelvic parameters by corrective surgery with LLIF without osteotomy.

Table 4 Postoperative GAP score

Comparison of ODI, RDQ, and increased PJA 2 years after surgery between categories of total GAP score

The frequency of rod fractures was not significantly different between categories of total GAP score (Table 5). Increased PJA 2 years after surgery was significantly larger in the disproportioned group than it was in the proportioned group (Table 4). There was no significant difference in preoperative ODI or RDQ between the proportioned and disproportioned groups (data not shown). By contrast, ODI 2 years postoperatively was significantly worse in the disproportioned group than it was in the proportioned group (Table 4).

Table 5 Comparison of ODI, RDQ, and increased PJA 2 year after surgery between categories of total GAP score

Correlation between pre- and postoperative spinopelvic parameters in the standing position

The correlation between pre- and postoperative parameters is summarized in Table 6.

Table 6 Correlation between pre- and postoperative spinopelvic parameters

We found a significant positive correlation between preoperative SS while the patients were standing and postoperative SS (r = 0.32, P < 0.05). By contrast, there was no significant correlation between any other pre- or postoperative parameters while the patients were standing.

Utility of preoperative radiographic assessment with the patients prone

We found a significant positive correlation between preoperative LL while the patients were prone and postoperative SS (r = 0.38, P < 0.05). We found a significant positive correlation between preoperative prone SS and postoperative LDI (r = 0.29, P < 0.05) (Table 6). Importantly, a significant positive correlation was found between prone SS and postoperative SS (r = 0.7, P < 0.0001) and significant negative correlation with GAP score (r = 0.31, P < 0.05) (Fig. 2a, b). Prone SS–ideal SS was significantly associated with disproportionate spinopelvic alignment, considered as that with an odds ratio of 3.3 (95% confidence interval 1.8 to 7.1, P < 0.05) per − 5°. The cut-off value was determined as − 16.5° using ROC curve analysis, which indicated a sensitivity of 78.6%, specificity of 69.0%, and area under the curve of 0.804 (Fig. 2c). After the formula transformation, a patient whose prone SS was larger than PI × 0.59–7.5 preoperatively tended to achieve sufficient spinopelvic alignment using LLIF.

Fig. 2
figure 2

a Correlation between preoperative SS while prone and postoperative SS while standing (**P < 0.0001), b correlation between preoperative SS while prone and postoperative global alignment and proportion (GAP) score (*P < 0.05), c receiver operating characteristic curve to determine the threshold of prone SS–ideal SS for insufficient alignment correction

These results indicate that preoperative LL and prone SS are useful to evaluate flexibility of the pelvis and lumbar spine, and to predict postoperative SS, LDI, and GAP score in ASD surgery with LLIF without osteotomy.

Discussion

The present study has clarified the indications and limitations of spinal corrective surgery for ASD with LLIF without osteotomy. Based on total GAP score, we found that 23 (40.4%) patients were grouped into proportioned, 26 (45.6%) into moderately disproportioned, and 8 (14.0%) into severely disproportioned spinopelvic alignment. This result indicated LLIF was useful and effective for surgery to correct ASD and an acceptable surgical outcome was achieved in the majority of patients. By contrast, detailed examination of patients who had a poor GAP score should be conducted using subgroup analysis to clarify which spinopelvic parameters are problems. Formulas following Schwab’s system have been used by many to calculate an ideal LL to establish an optimal surgical strategy for patients with ASD [19, 20, 26]. However, a remaining high rate of reoperation due to mechanical revision has been reported despite achieving goals based on these formulas [3, 27, 28]. The GAP score was developed because of the limitations of existing formulas that focus on only LL values and the score has been used to set surgical goals to decrease the prevalence of mechanical complications [15, 29]. Actually, the critical importance of harmony among SS, upper LL and lower LL has been long known to maintain a favorable global spine balance [30,31,32]. Studies have been conducted to correct not only LL but also pelvic alignment with surgery for ASD patients [17, 33,34,35]. In our retrospective observational cohort study, GAP score has clarified the limitations and problems of spinal corrective surgery with LLIF without osteotomy for ASD. The present study showed most patients achieved a sufficiently ideal LL in RLL subgroups (87.7%). By contrast, ideal SS in RPV subgroups was achieved in only 50.8% of all patients (Table 4). This result indicates preoperative radiographic evaluation should be established to predict postoperative SS value after ASD surgery with LLIF without osteotomy to decide the optimal surgical strategy including use of osteotomy.

There is a limitation to evaluate flexibility of the pelvis and lumbar spine and to predict postoperative spinopelvic parameters with radiographs while the patient is standing. Several preoperative radiographic methods to assess curve flexibility and preoperative planning of spinal deformity have been suggested such as fulcrum bending radiographical images, and lateral radiographs while the patients are sitting, prone, or supine [2, 13, 14]. However, to our knowledge, there are no reports of radiographic methods to predict postoperative SS value. In the present study, we found preoperative prone SS had a significant positive correlation with LDI, a strong positive correlation with postoperative SS and a significant negative correlation with GAP score (Fig. 2a, b and Table 5). Finally, we showed that when preoperative prone SS was relatively small to ideal as calculated using PI, the resulting GAP score was clearly worse. The present study suggests preoperative SS in prone patients was useful to evaluate flexibility of the pelvis and lumbar spine, and moreover to predict postoperative SS, LDI, and GAP score in ASD surgery with LLIF without osteotomy (Fig. 3). This result indicates that when preoperative SS in prone patients is small relative to PI, osteotomy and alternative correctors such as oblique lateral interbody fusion at L5–S1 should be considered to achieve satisfactory postoperative spinopelvic parameters.

Fig. 3
figure 3

Representative schemas and radiographs of patients with low or high SS while prone

Limitations of this study include its small sample size, and retrospective nature. Differences in the thickness of the soft tissue around the abdomen and thorax might make the measurements in the prone position unreliable. The patients were followed up for only 2 years, and longer-term follow-up is necessary to determine the reliability of the results.

Conclusions

The cause of poor outcome of GAP score of ASD corrective surgery with LLIF without osteotomy is postoperative small SS value, and preoperative prone SS is useful to predict postoperative SS. When preoperative SS in prone patients is relatively small to ideal as calculated using PI, osteotomy, or other correctors should be considered to achieve satisfactory postoperative spinopelvic parameters.