Introduction

Adult spinal deformity is a common disease with a reported incidence ranging from 29 to 68% [1, 2]. Adult patients with spinal deformity can present with back pain, symptoms of nerve compression, and impaired quality of life [3,4,5,6]. Surgical management offers better radiographic and clinical outcomes than non-surgical treatment, with many patients experiencing relief of pain and improvement in disability after surgery [7, 8]. The goal of surgery is to achieve neural decompression, correct the deformity, and restore the balance of spine in both coronal and sagittal planes. Achieving these goals can be challenging and cause various complications such as wound infection, neurological impairment, pseudarthrosis, and adjacent segment disease [9,10,11,12,13]. Unlike scoliosis in adolescents, ASD is often accompanied by lateral listhesis, osteoporosis, and degenerative lumbar disorders such as spinal stenosis [10, 14]. Therefore, surgery for adult deformity is often more complicated and technically difficult. Additionally, patients are older and may have some medical comorbidities [15]. These factors contribute to the high incidence of complications.

A long thoracolumbar fusion extending to the lower lumbar or sacral spine is usually necessary in the treatment of adult spinal deformity. It is generally accepted that if the L5-S1 level is involved in the disease process such as advanced disc degeneration, spinal stenosis, L5 spondylolisthesis, and fixed lumbosacral obliquity, multi-level fusion to the sacrum should be indicated [16,17,18]. But for patients with healthy or minimal degenerated L5-S1 discs, whether to preserve this motion segment or fuse to the sacrum is still debated [19, 20]. L5 and the sacrum each have advantages and limitations as a choice for the level of distal fusion [16, 18,19,20,21,22,23,24,25]. End at L5 offers preservation of lumbosacral motion and less invasion. On the other hand, it may bring about subsequent degeneration of the L5-S1 disc. Subsequent disc degeneration is often related to axial pain, neurological compression, and loss of sagittal balance. In addition, the functional significance of preserving a single open segment below a long fusion is poorly defined and theoretical convenience on daily activities has rarely been shown [26, 27].

Therefore, choosing which vertebra to terminate a long fusion remains a controversial topic. Although several studies had compared the outcomes of distal fusion to L5 versus the sacrum, there were no prospective randomized controlled trials (RCTs) existed to illuminate the issue directly. Aiming to compare the radiographic and clinical outcomes of fusion stopping at L5 with extension to the sacrum in ASD, we performed a meta-analysis focusing on curvature correction, complications, and symptom improvement. It is hoped that the article could provide some guidance value to current operation choice for patients with ASD.

Materials and methods

Inclusion/exclusion criteria

Studies were included according to the following inclusion criteria: (1) patients were diagnosed with adult spinal deformity (adult idiopathic/degenerative scoliosis, etc.); (2) patients with at least 18 years of age at the time of surgery; (3) fusion of four or more segments; (4) patients with healthy or minimal degenerated L5-S1 discs (Grade 0 to 1); (5) availability of comparative data of distal fusion level between L5 and the sacrum; and (6) a minimum 1-year follow-up.

Studies were excluded according to the following exclusion criteria: (1) patients were diagnosed with neuromuscular scoliosis; (2) child or adolescent patients; (3) patients were treated by short fusion or non-fusion procedure; (4) pooled results were reported without distinguishing endpoint L5 from the sacrum; (5) the article was a case/case series report; and (6) biomechanical study or studies of corpses.

Study search strategies

We searched on electronic databases of PubMed, Embase, Cochrane, and Google Scholar using the following keywords: “lumbar”, “L5”, “sacrum”, “S1”, “pelvis”, “adult”, “scoliosis”, “spinal deformity”, “fusion”, “arthrodesis”, “distal”, “caudal”, and “instrumented” with Boolean operators “AND”, “OR”, and “NOT”. We retrieved all clinical studies on distal fusion levels without restriction of language and year of publication. Full texts of the published articles were manually obtained, and references of relevant studies were reviewed to ensure that no relevant researches were omitted. The PRISMA checklist is given in Online Resource 1.

Data extraction

Data extraction was performed independently by two investigators with disagreement resolved by discussion. The following parameters were extracted: (1) study’s author and publication year; (2) study type; (3) patient demographic information; (4) number of fusion segments; (5) follow-up time; (6) surgical technique and approach; (7) radiographic outcomes (final changes in scoliosis angle, lordosis, angle and sagittal vertical axis (SVA)); (8) number and rate of complications, type of complications; (9) revision rate; and (10) clinical outcomes (final changes in Oswestry Disability Index (ODI) and visual analogue scale (VAS) scores).

Quality assessment and publication bias assessment

Quality assessment was performed independently by two investigators using the Newcastle–Ottawa scale (NOS). The NOS is recommended by Cochrane Handbooks version 5.2.0 to evaluate the qualities of non-randomized studies and is widely used nowadays. Since the study types of our included studies were all retrospective studies, we used NOS to assess the risk of bias. Disagreements were resolved through consensus. Publication bias was evaluated by a funnel plot analysis.

Data analysis

Continuous variables were presented as mean difference (MD) and 95% confidence interval (95% CI), whereas dichotomous variables were reported as odds ratios (OR) and 95% CI. The rate of distal adjacent segment disease was an exception because it was 0% when fusion ended at the sacrum. Hence, ratio difference (RD) and 95% CI were used instead. To get the change values and its standard deviation (SD) from pre-operation to the last follow-up that we needed, we conducted a statistical transformation from the data in some articles with the formula from the Cochrane Handbooks version 5.2.0. Heterogeneity was tested using a Chi-square test, for which P < 0.1 was considered to be statistically significant. The I2 index was used to quantify heterogeneity. In the case of extensive heterogeneity (I2 > 50%), the random effects model was used; otherwise, the fixed effects model was used. Sensitivity analysis was conducted to explain the source of heterogeneity. All statistical tests were performed using Review Manager, version 5.3 (Cochrane Collaboration, Copenhagen, Denmark).

Result

Search results

368 studies were extracted from the PubMed, Embase, Cochrane, and Google Scholar databases initially. After review of the abstract and title, 310 studies were excluded for the following reasons: duplicates; unrelated studies; studies of corpses or animals; case reports; and review articles. Then, the full texts of the remaining 58 studies were reviewed. A further 47 studies were excluded for the following reasons: single-arm studies (n = 13); less than four segments fused (n = 3); insufficient data (n = 20); and pooled results without segregating termination site L5 or the sacrum (n = 11). Hence, a total of 11 studies were finally selected into the meta-analysis [10, 14,15,16,17,18, 23, 28,29,30,31]. The flow diagram is shown in Fig. 1.

Fig. 1
figure 1

Flow diagram of study selection

Study characteristics

All of the 11 studies were non-randomized retrospective studies. The basic characteristics of included studies are shown in Table 1. There were no significant differences on age, gender, diagnosis, and follow-up between the two groups. Because the preoperative scoliosis angles, lordosis angles, SVA, ODI, and VAS scores were significantly different in some studies [14, 16, 17, 29], the change values from pre-operation to the last follow-up were used for analysis to eliminate the influence of different preoperative values.

Table 1 Basic characteristics of included studies in this meta-analysis

Quality assessment and publication bias assessment

Two investigators assessed the quality of studies independently. All of the 11 included studies scored more than 7 points; hence, the studies were of relatively high quality. The quality assessment is summarized in Table 2. Evidence of publication bias was not indicated in the symmetrical funnel plot on visual inspection (Online Resource 2).

Table 2 Quality assessment according to the Newcastle–Ottawa scale

Final correction of scoliosis

We used the change values from pre-operation to the last follow-up for analysis. The data of main scoliosis angles were extracted from four articles [14, 16, 17, 29] (Fig. 2). There was no significant difference between L group and S group in analysis (MD 0.50; 95% CI − 1.44 to 2.44, P = 0.61) with moderate heterogeneity (P = 0.14, I2 = 45%).

Fig. 2
figure 2

Forest plot for change values of main scoliosis angle from pre-operation to the last follow-up between fusion to L5 group and fusion to the sacrum group

Final correction of lordosis

Data on the preoperative and final lordosis were available from four studies [14, 16, 17, 29] (Fig. 3). There was a significant difference between L group and S group in analysis (MD − 4.12; 95% CI − 7.85 to − 0.38, P = 0.03) with significant heterogeneity (P = 0.03, I2 = 66%).Hence, the random effects model was used. Sensitivity analysis was conducted to explain the source of heterogeneity. When omitting the study Koller 2016 [14], the I2 index fell to 0%. The reason might lie in their adequate correction and little loss of lumbar lordosis with the use of appropriate surgical approach in L group.

Fig. 3
figure 3

Forest plot for change values of lordosis angle from pre-operation to the last follow-up between fusion to L5 group and fusion to the sacrum group

Final correction of SVA

The SVA was calculated as the distance from the C7 plumb line to the posterior endplate corner of S1. Data on the preoperative and final SVA were available from five studies [14, 16,17,18, 29] (Fig. 4). There was a significant difference between L group and S group in analysis (MD − 1.15; 95% CI − 1.80 to − 0.50, P = 0.0005) with moderate heterogeneity (P = 0.17, I2 = 38%).

Fig. 4
figure 4

Forest plot for change values of SVA from pre-operation to the last follow-up between fusion to L5 group and fusion to the sacrum group

Overall complication rate and revision rate

The records of complications were different in forms—some studies listed all complications, whereas some provided the overall complication rate. Data on overall postoperative complications were retrieved from seven studies totalling 527 patients [16,17,18, 23, 28,29,30] (Fig. 5). There was no significant difference between L group and S group in analysis (OR, 0.89; 95% CI 0.60 to 1.30, P = 0.54) with moderate heterogeneity (P = 0.12, I2 = 41%). As for revision rate, data of revisions were available from seven studies totalling 673 patients [14,15,16,17,18, 28, 30] (Fig. 6). There was also no significant difference between two groups (OR, 1.08; 95% CI 0.59 to 1.99, P = 0.80) with significant heterogeneity (P = 0.06, I2 = 51%). When omitting the study Complex Spine Study Group (CSSG) 2011 [30], the I2 index dropped to 14%. The relatively low incidence of pseudoarthrosis and revision rate in this study’s S group might lead to the clinical heterogeneity.

Fig. 5
figure 5

Forest plot for overall complication rates between fusion to L5 group and fusion to the sacrum group

Fig. 6
figure 6

Forest plot for revision rates between fusion to L5 group and fusion to the sacrum group

Rate of proximal/distal adjacent segment disease

The evidence of adjacent segment disease is based on clinical symptoms instead of radiographic assessment, which included symptomatic junctional degeneration, kyphosis, and failure. Data on proximal adjacent segment disease were available from seven studies [15,16,17,18, 23, 29, 31] (Fig. 7). There was a significant difference between L group and S group in analysis (OR, 0.57; 95% CI 0.35 to 0.92, P = 0.02) without heterogeneity (P = 0.81, I2 = 0%). Data on distal adjacent segment disease were available from eight studies [14,15,16,17,18, 23, 28, 29] (Fig. 8). There was a significant difference between L group and S group in analysis (RD, 0.15; 95% CI 0.10 to 0.19, P < 0.00001) with moderate heterogeneity (P = 0.12, I2 = 39%). In other words, since no distal junctional disease would occur in S group, the incidence of advanced degeneration of L5-S1 disc with clinical sequelae in L group was 15% (10 to 19%).

Fig. 7
figure 7

Forest plot for rates of proximal adjacent segment disease between fusion to L5 group and fusion to the sacrum group

Fig. 8
figure 8

Forest plot for rates of distal adjacent segment disease between fusion to L5 group and fusion to the sacrum group

Rate of pseudarthrosis and implant-related complications

Data on pseudarthrosis and implant-related complications occurred during follow-up were available from nine studies totalling 887 patients [10, 14,15,16,17,18, 23, 28, 29] (Fig. 9). There was a significant difference between L group and S group in analysis (OR, 0.43; 95% CI 0.29 to 0.64, P < 0.0001) without heterogeneity (P = 0.74, I2 = 0%).

Fig. 9
figure 9

Forest plot for rates of pseudarthrosis and implant-related complications between fusion to L5 group and fusion to the sacrum group

Final improvement of ODI and VAS scores

We used the change values from pre-operation to the last follow-up for analysis. Data on the preoperative and final ODI were available from three studies [16, 17, 29] (Fig. 10). There was no significant difference between L group and S group in analysis (MD 0.94; 95% CI − 1.19 to 3.07, P = 0.39) with moderate heterogeneity (P = 0.18, I2 = 41%).Data on the preoperative and final VAS scores were available from two studies [17, 29] (Fig. 11). We introduced two subgroups according to the painful place (back and leg). There was no significant difference between L group and S group in subgroup analysis, respectively (MD = − 0.22, 95% CI − 1.29 to 0.86, P = 0.69; MD = − 0.46, 95% CI − 1.63 to 0.72, P = 0.45). Heterogeneity existed within both subgroups (P = 0.07, I2 = 70%; P = 0.11, I2 = 61%); hence, the random effects model was conducted.

Fig. 10
figure 10

Forest plot for change values of ODI from pre-operation to the last follow-up between fusion to L5 group and fusion to the sacrum group

Fig. 11
figure 11

Forest plot for change values of VAS scores from pre-operation to the last follow-up between fusion to L5 group and fusion to the sacrum group

Subgroup meta-analysis

Although we required that the fusion length must be greater than four in the inclusion criteria, the fusion segments were significantly different in some studies. We made the subgroup meta-analysis to eliminate the influence of different fusion lengths. There were two groups as subgroups, 4 ≤ fusion segments ≤ 8 group and fusion segments > 8 group. Eight were chosen because fusion of so many segments could just cross the thoracolumbar junction to the thoracic spine.

For the final correction of SVA, there was significant difference in each subgroup (MD = − 0.74, 95% CI − 1.47 to − 0.01, P = 0.05; MD = − 2.70, 95% CI − 5.33 to − 0.07, P = 0.04), which was consistent with the previous meta-analysis. There was no significant difference between subgroups (P = 0.16) (Online Resource 3).

For the overall complication rate, there was no significant difference in each subgroup (OR = 1.16, 95% CI 0.60 to 2.23, P = 0.67; OR = 0.52, 95% CI 0.20 to 1.36, P = 0.19), which was also consistent with the previous meta-analysis. There was no significant difference between subgroups (P = 0.18) (Online Resource 4). For the revision rate, there was no significant difference in each subgroup (OR = 1.08, 95% CI 0.69 to 1.70, P = 0.74; OR = 0.40, 95% CI 0.12 to 1.39, P = 0.15), which identified with the previous meta-analysis. There was no significant difference between subgroups (P = 0.14) (Online Resource 5).

For the rate of proximal adjacent segment disease, there was no significant difference in each subgroup (OR = 0.68, 95% CI 0.26 to 1.81, P = 0.44; OR = 0.65, 95% CI 0.35 to 1.21, P = 0.17), which was still consistent with the previous meta-analysis. There was no significant difference between subgroups (P = 0.94) (Online Resource 6). Similarly, for the rate of distal adjacent segment disease, there was significant difference in each subgroup (RD = 0.17, 95% CI 0.11 to 0.24, P < 0.00001; RD = 0.15, 95% CI 0.04 to 0.26, P = 0.007), which was consistent with the previous meta-analysis. There was no significant difference between subgroups (P = 0.70) (Online Resource 7).

For the rate of pseudarthrosis and implant-related complications, there was significant difference in each subgroup (OR = 0.53, 95% CI 0.29 to 0.97, P = 0.04; OR = 0.33, 95% CI 0.19 to 0.60, P = 0.0002), which was still in line with the previous meta-analysis. There was no significant difference between subgroups (P = 0.27) (Online Resource 8). Other variables could not be divided into two subgroups.

Discussion

ASD is a highly intricate clinical condition in which the main associated issues are pain and disability, which should be taken into account when making treatment decisions [6]. Despite a relatively high rate of complications, surgery is generally considered to be an effective and rational treatment option that has the capacity to significantly reduce pain in patients [7, 8]. In the treatment of patients with ASD, decompression is usually combined with fusion [32]. Decompressive surgery is used to release the neurological symptoms, while the purpose of spinal fusion is to stabilize the corrected alignment and prevent deterioration of the curve [33]. Long-segment fusion has been extensively used with better curve correction and long-term outcomes in ASD than short-segment fusion [34, 35].

An important aspect of long fusion surgery is the determination of fusion extent. Most studies presented unique advantages and disadvantages of L5 or the sacrum as the choice of distal endpoint, respectively. Edwards et al. [18] compared 27 patients with fusions to L5 to 12 patients with sacral fusions. Complication rates, including pseudarthrosis and medical morbidity, were higher in the sacral group at final follow-up. A systematic review of subsequent progression of L5-S1 disc degeneration found the risk of developing new symptoms was approximately 18–20%, with most requiring re-operation [36]. Cho et al. [16] found a 58% rate of subsequently advanced L5-S1 disc degeneration in patients fused to L5 and 21% developed associated symptoms. Fusion to the sacrum was recommended by the author in patients with preoperative sagittal imbalance and lumbar hypolordosis. Kuhns et al. [21] reported 69% of patients with long fusions to L5 had evidence of advanced L5-S1 degeneration, and 23% required subsequent surgery to extend the construct to the sacrum, over a minimum 5-year follow-up.

Thus, the choice of distal fusion level remains equivocal today. To our knowledge, the study is the first meta-analysis comparing the results of long fusion stopping at L5 with extending to the sacrum in ASD. We collected data from 1211 patients with healthy or minimal degenerated L5-S1 discs in 11 articles. The quality of included studies was relatively high according to the Newcastle–Ottawa scale.

Scoliosis correction angle is an important parameter to judge the efficacy of ASD surgery. The results showed no statistically significant difference between the two groups, which means that stopping a long fusion at L5 may be effective enough to correct spinal deformity in coronal plane. The magnitude of curve improvement in ASD patients was not affected by an additional lumbosacral fusion.

For lumbar lordosis angle, the final change was significantly greater in the S group than in the L group. In other words, patients fused to the sacrum obtained better restoration of lumbar lordosis. It was mainly due to the greater lordotic angle at L5-S1 given by lumbosacral instrumented correction in S group and more loss of correction in L group, which might be related to the degeneration of L5-S1 disc [16, 18]. Loss of L5-S1 segmental lordosis contributes to loss of global lumbar lordosis.

Most of included studies revealed that global sagittal balance deteriorated after long fusion in both groups [14, 16, 17]. However, there was a significant difference in the final change in SVA between two groups. Patients with sacral fusion tended to maintain global sagittal balance better than their L5 counterparts. The inability to maintain sagittal balance with long fusion to L5 might be attributed to multiple factors, such as subsequent degeneration of L5-S1 disc and fixation failure [25]. Patients with subsequent L5-S1 disc degeneration were usually in a compensatory position with a greater forward shift of the sagittal C7 plumb than in patients without disc degeneration [37]. The forward shift in sagittal balance was not benign [18]. Loss of lumbar lordosis was also in part responsible for the sagittal imbalance. Moreover, because the L5 pedicles were short and cancellous and were subject to large cantilever force at the base of long fusions, there might be insufficient fixation with a strong tendency of L5 screws to pullout [19, 38]. Fixation failures would result in loss of balance in the sagittal plane. For the coronal plane, the stiffness of the lumbosacral junction or main lumbosacral curve was usually the cause of fixation to the pelvis, whereas fixation to L5 or above was recommended for normal lumbosacral junction. Fixation to the pelvis might improve coronal balance and pelvic obliquity [39].

Overall complication rate was similar in two groups with no significant difference. The result was counter-intuitive and inconsistent with the results of some previous studies involving only perioperative medical complications [10, 15, 18]. There was also no significant difference in revision rate between both groups. The result was consistent with a previous systematic review of three studies on this topic [40]. In L group, extension of fusion to the sacrum was the most common revision procedure [15, 18]. It implied that complications related to disc degeneration at L5-S1 were the main cause of revisions in L group, and patients suffering might benefit from the extended fusion. Patients in S group were revised for various causes, such as pseudarthrosis, fixation failure, and proximal junctional kyphosis (PJK).

Specifically, the rate of proximal adjacent segment disease was significantly higher in S group in our meta-analysis. Longer fusion would create a greater lever arm that increased the stress concentration and predisposed adjacent segments to degeneration [41]. Considering the low revision rate of PJK patients, fusion to the sacrum still seemed beneficial for ASD patients to avoid advanced L5-S1 disc degeneration despite the correlation between PJK and fusion to the sacrum [42]. The rate of symptomatic distal adjacent segment disease was 15% (10 to 19%) in L group. The studies of cho et al. [16] and Brown et al. [36] presented similar results—21% and 19%. A higher incidence of subsequent degeneration of L5-S1 disc (38 to 61%) has been reported in the literature [18, 21, 43, 44]. Most cases remained asymptomatic over time, however.

Pseudarthrosis was strongly related to fixation loosening, and we combined them for analysis. The rate of pseudarthrosis and implant-related complications was significantly higher in S group than in L group. Likewise, arthrodesis to S1 with long fusion was considered one risk factor for pseudarthrosis in Kim and his colleagues’ study [10]. The location of the pseudarthrosis varied, but they occurred mostly at L5-S1 due to the high mechanical demand and short pedicles at S1 [45]. Interbody fusion at L5-S1 level and sacropelvic fixation using iliac screws were frequently recommended to reduce the incidence of lumbosacral pseudarthrosis and screw loosening [17, 28].

Clinical outcomes of individuals with ASD were evaluated using validated tools. The degree of disability was quantified by ODI, and the degree of back and leg pain by VAS. Similar improvements in pain and disability based on patients’ subjective evaluation were obtained in both groups. The result was similar to many previous findings. Bafus et al. [26] found a similar incidence of perineal care difficulties in patients with fusion to L5 or the sacrum. Daniels et al. [27] showed the functional impacts of lumbar stiffness were not significantly different between patients with distal endpoints of L5 versus S1. Therefore, the functional significance of retaining an open segment below a long fusion may need to be further measured according to age stratification and specific daily activity in the future.

Subgroup meta-analysis showed that the fusion length did not affect the change in sagittal balance, overall complication rate, and incidence of various complications. Its effect on scoliosis and kyphosis correction was unclear. Furthermore, the use of different surgical techniques and approaches may lead to clinical heterogeneity. In reference to previous studies, interbody fusion at the lumbosacral junction was conductive to reinforce the fusion and reduce the risk of fixation failure [16, 45]. It is also well accepted that in case of long fusion ending at sacrum, iliac fixation could reduce the occurrences of pseudarthrosis and result in better radiological and clinical outcomes [46,47,48]. Many of the included studies did not clearly distinguish the iliac fixation group from the non-iliac fixation group, so subgroup meta-analysis was not conducted in our study.

Several limitations of the meta-analysis should be mentioned. Firstly, all of the included studies were non-randomized controlled studies and had relatively low quality. Further randomized controlled trials should be performed to obtain more convincing conclusions. Secondly, relatively small numbers of studies and patients are analysed. It will be significant to carry out multicentre studies with large samples in the future. Thirdly, the patients with ASD included several different pathological types that presented differently and required different surgical procedures. In addition, disease severities of included patients were also different. These factors might lead to clinical heterogeneity. Finally, some of our data were statistically transformed data and it might influence the quality of our conclusion.

Conclusion

According to our meta-analysis, we found that long fusion terminating at L5 or the sacrum was similar in scoliosis correction, overall complication rate, and revision rate. In addition, patients fused to the sacrum improved in pain and disability to a similar extent as patients fused to L5. However, long fusion to L5 had advantages in lower rate of pseudarthrosis, implant-related complications, and proximal adjacent segment disease, while long fusion to the sacrum had advantages in the restoration of lumbar lordosis, maintenance of sagittal balance, and absence of distal adjacent segment disease. Surgical strategy should be taken into comprehensive consideration weighing all aspects of each patient.