Introduction

Adolescent Idiopathic Scoliosis (AIS) is a three-dimensional spinal deformity that affects 2–3% of the teenage population [1, 2]. Surgical correction for AIS is advised for large and progressive curves as prophylactic treatment to prevent further complications to cardiovascular and respiratory systems, and for aesthetic purposes.

The surgical treatment of AIS requires a careful choice of fusion levels. The spinal fusion will involve sacrificing motion segments, making the selection of fusion levels more critical when sacrificing highly mobile levels, such as those in the lumbar spine, since the fused spine tends to move as a single unit determining increased stress in unfused regions and eventually leading to early degenerative changes. Several factors, including an imbalanced spine in both the coronal and sagittal planes as well as a tilted lowest instrumented vertebra (LIV), have been previously linked to disc degeneration (DD).

The goals of AIS surgery are to minimize risk of future deformity progression, to optimize spinal balance both in the coronal and sagittal plane and fuse the least possible number of motion levels [3].

Historically, the fusion area was considered to be the portion of the spine between two stable vertebrae [4], and this was mainly due to the limitations of available instrumentation, such as the Harrington instrumentation, which allowed correction solely based on distraction forces. The advent of segmental instrumentation allowed spinal surgeons to apply translation and derotation forces on all instrumented levels; therefore, it has become possible to preserve motion levels distally [5]. Up to date, the selection of lowest instrumented vertebra (LIV) when fusing a TL/L curve is substantially limited to L3 or L4, in rare cases L5 [3, 5,6,7,8,9,10,11].

Current criteria for choosing L3 rather than L4 as the LIV in TL/L curves are as follows:

  • L3 reaches the center sacral vertical line (CSVL) on bending radiographs towards the concavity of the lumbar curve.

  • L3 exhibits vertebral rotation according to Nash–Moe < 2 on bending radiographs towards the convexity of the lumbar curve.

  • L3–L4 disc wedging becomes neutral in concave side-bending radiograph.

  • L4 vertebra has a coronal tilt < 20°.

However, even with clear criteria of LIV selection (Fig. 1), there is still a certain a lack of agreement among surgeons regarding the choice of LIV due to variability in bending radiographs and projection errors or artifacts. Some authors suggest conducting full spine supine bending radiographs under orthopedic consultant supervision[12].

Fig. 1
figure 1

Criteria for LIV selection of L3 based on supine bending X-rays in TL/L curves according to Kim et al. and Shao et al.

The consequences of the preservation of one unfused distal segment below a long spinal fusion in an adolescent patient are still subject to debate. Outcomes have been investigated in few research papers, but a consensus has not yet been reached.

The aim of the present paper is to systematically review the current literature regarding long-term radiographical and clinical outcomes of adolescent idiopathic scoliosis surgery in which spinal fusion has been extended in the lumbar spine, to provide additional guidance to spine surgeons when choosing between fusion to L3 and L4.

Materials and methods

Review design

A systematic review of the literature regarding long-term outcomes of adolescent idiopathic scoliosis surgery was carried out following the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA guidelines) [13].

The Oxford level of evidence scale [14] was used to assess the level of evidence of the included studies (full version for randomized and non-randomized clinical trials, modified version for all other studies).

Inclusion criteria: papers describing the radiological and clinical outcomes of adolescent idiopathic scoliosis surgery with minimum of 5 years of follow-up.

Exclusion criteria: incomplete data regarding clinical outcomes measurements, LIV selection, and radiological data. Isolated case reports/series with less than 5 patients. In vitro studies and animal model studies.

Only articles in English on peer-reviewed journals who met the population, intervention, comparison, and outcomes criteria on systematic reviews were considered for inclusion.

Randomized-controlled trials, prospective and retrospective cohort studies, and case series (CS) were considered for inclusion.

Search strategy

Pubmed-MEDLINE, The Cochrane Central Registry of Controlled Trials, Google Scholar, and the Embase Biomedical Database were searched over the years 1980–2023 to identify eligible studies describing the clinical and radiological outcomes in surgically treated adolescent idiopathic scoliosis patients at long-term follow-up. The online literature search was conducted in June 2023 by two authors (MM, and MT). The authors stated the following research questions: “Are there radiological and clinical differences in long-term surgically treated adolescent idiopathic scoliosis patients in whom L3 or L4 were selected as LIV? Does LIV selection of L3 prevents from disc degeneration?”.

The search was conducted using combinations of the following keywords: “scoliosis”, “MRI”, “long-term outcomes”, “PROMs”, “degenerative disc disease”, “DDD”, “adjacent segment degeneration”, “ASD”. Details on the search strategy are summarized in Supplementary Table S1.

Study selection

After screening the titles and abstracts, the full-text articles were obtained and reviewed. A manual search of the bibliography of each of the relevant articles was also performed to identify potentially missed eligible papers. Reviews and meta‐analyses were also analyzed to potentially broaden the search for studies that might have been missed through the electronic search. Duplicates were removed. The study selection process was carried out in accordance with the PRISMA flowchart [13] (Fig. 2). The present systematic review was accepted for registration in the PROSPERO database for systematic reviews [15] (ID: CRD42023438172).

Fig. 2
figure 2

PRISMA flow diagram and the selection of studies

Data extraction

Two authors (MM and MT) extracted the data through a standardized data collection form. Three authors (MM, FB, and MT) checked the data for accuracy, and inconsistent results were discussed. Data concerning study design, number of patients, demographics of patients, Lenke type or curve pattern, LIV selection, follow-up time, clinical assessment, radiological assessment, and results were extracted and summarized in Table 1. The following outcomes were considered for analysis: presence of low back pain; PROMs scores; Pfirmann [16] and Modic [17] grading, coronal and sagittal alignment, and LIV tilt.

Table 1 Characteristics of treated patients, surgical treatments, evaluated outcomes and main findings of included studies;

When studies involved both patients with LIV selection not limited solely to L3–L4, data about patients with L3 or L4 as LIV selection were pooled: if this was not possible, the study was excluded.

Methodological quality assessment of included studies

The Quality in Prognosis Studies (QUIPS) tool[18] was used to assess the methodological quality of the included studies. The quality of each study was reported assessing 6 domains: study participation, study attrition, prognostic factor measurement, outcome measurement, study confounding, and statistical analysis and reporting. Each domain can present a low, moderate, or high risk of bias: these combined together form an overall risk of bias. For each included study, the total risk of bias was categorized as low risk with ≥ 4 low-risk domains, moderate risk with < 4 low-risk domains, and high risk with ≥ 1 high-risk domains. As with the evaluation of titles and abstracts, any disagreement was solved by the senior Author (CF). Details on the quality of the studies included are summarized in Figs. 3 and 4.

Fig. 3
figure 3

Risk of bias assessment of the included study in qualitative synthesis according to QUIPS tool

Fig. 4
figure 4

Weighted bar plots of the distribution of risk-of-bias judgments within each bias domain

Statistical analysis

Meta-analyses were performed when possible. Heterogeneity between studies was assessed using the inconsistency statistic (I2 > 75% was considered as highly heterogeneity). Publication bias was assessed with Egger’s test and represented with forest plots. Odds ratio with 95% CI and p value were used as a measure of effect size. A random-effects model was applied. All statistical analyses were conducted with Jamovi version 2.2 (The Jamovi Project, Sydney, Australia) software. p value < 0.05 was considered to be significant.

Informed consent and institutional review board approval

Ethical approval and institutional review board approval were not required, because this study would retrieve and synthesize data from already published studies.

Results

Included studies

A total of 257 studies were found through electronic search. After screening, 14 studies [19,20,21,22,23,24,25,26,27,28,29,30,31,32] met the inclusion criteria and were included in the systematic review. There were eight [19, 20, 22, 24,25,26, 30, 32] retrospective case series, four [21, 27, 28, 31] retrospective case–control studies, one [23] retrospective cross-sectional study, and one [29] retrospective cohort study.

In all analyzed studies, each group of patients in which L3 was selected as LIV was matched with a relatively homogeneous group composed by patients undergoing similar surgery with L4 as LIV.

Included studies analyzed both small and large-sized populations, describing the clinical and radiological outcomes in surgically treated adolescent idiopathic scoliosis patients in whom L3 or L4 were selected as LIV. Included studies are heterogeneous in the description of curve pattern/Lenke type, radiological assessment, clinical assessment, and population demographics (Table 1).

Risk-of-bias assessment

Two authors (MM and MT) assessed the risk of bias for each study using the QUIPS tool, and results are shown in Fig. 3 and 4.

All studies [19,20,21,22,23,24,25,26,27,28,29,30,31,32] indicated an overall risk of bias low or moderate (respectively 73 and 27%). Since most studies well described the outcome measurement with a clear definition of the result, accurate and reliable outcome measurements, and outcome assessment, they demonstrated a low outcome and prognostic factor measurement items (< 85%). Furthermore, the confounding measurement and account item was consistently moderate for half of the studies since the observed influence of prognostic variables on outcome may be skewed by another component linked to the outcome. The other half of the studies performed multiple multivariate analysis to reduce the influence of confounding variables and identify independent risk factors for disc degeneration.

Cohort demographics

Included studies reported data on a total of 1264 patients (1004 females, 79.4%) who underwent radiological and/or clinical assessment at long-term follow-up after adolescent idiopathic scoliosis surgery. The median age at first evaluation ranged from 13.4 ± 1.78 to 33.3 ± 8.7 years and the mean follow-up ranged from 7.4 to 36.1 years, with a mean of 15.9 years.

Curve pattern and type of instrumentation

Only four studies [22, 24, 29, 31] reported data regarding the Lenke pattern of scoliotic curves (Table 1). The other studies reported various patterns of scoliotic curves (e.g., thoracic, lumbar, and double), because, in most cases, patients were treated before the introduction of the Lenke classification. In the cohort reported by Nohara et al. [29], patients were classified according to Lenke retrospectively.

Regarding the type of instrumentation, the included studies utilized various techniques and approaches. Eight studies employed Harrington rod [19,20,21, 25,26,27,28, 32], while 5 studies utilized all pedicle screws and rods [22, 24, 29, 31]. Only one study [30] reported the use of Cotrel–Dubousset instrumentation. Remnant studies reported the use of hybrid instrumentations (e.g., wires and hooks) either segmental or non-segmental [22, 23, 29, 32].

LIV assessment

Twelve studies reported the LIV selection of the included patients [20,21,22,23,24,25,26, 28,29,30,31,32] (Table 1). Two studies [19, 27] reported LIV as selection of either high or low lumbar vertebrae (such as L2 or above/L3 or below). The included studies reported data on a total of 116 thoracic LIVs, 200 L1 LIV, 301 L2 LIV, 342 L3 LIV, 332 L4 LIV, 89 L5 LIV, and 4 S1 LIV. The majority of included studies did not clearly define LIV selection criteria, and details regarding LIV selection criteria are depicted in Table 1.

X-ray assessment

All the included studies, except two [20, 30], evaluated their cohort with an X-ray study. The X-ray evaluation focused on the most widely recognized radiographic indexes of scoliosis preoperative planning, such as Cobb angle, apical vertebral rotation and translation, coronal and sagittal alignment, thoracic kyphosis, lumbar lordosis, sacral slope, pelvic tilt, and PI-LL mismatch. LIV radiographic parameters were evaluated in seven studies [22,23,24, 28, 29, 31, 32], measuring the LIV tilt, LIV disc angle, and LIV-to-S1 lordosis. Three [19, 25, 26] studies evaluated the LIV anteroposterior translation motion on lateral flexion and extension lumbar spine X-ray, to detect possible instabilities and/or listhesis.

Hayes et al. [19] in their retrospective case series of 48 AIS patients surgically treated with Harrington instrumentation described an increased incidence of retrolisthesis and translational motion at junctional level in patients fused to L4 when compared with patients fused to L3 or above (13/16, 81%, retrolisthesis when fused to L4 vs 4/10, 40%, when fused to L3; 4.4 mm of average translational motion when fused to L4 vs 3.5 mm when fused to L3).

Analogous results were obtained by Perez-Grueso et al. [26], where a higher rate of anteroposterior translation at junctional level was found between patients fused to L4 rather than L3. In 10 of 32 patients, the first disc below fusion showed instability at flexion–extension radiographs. However, those findings did not show significant differences among groups and were described by the authors as similar in degree and incidence to those found in an asymptomatic aging population.

Akazawa et al. [32] evaluated the lumbar disc degeneration among two groups of surgically treated AIS patients, L group where LIV was L4 or below and H group where LIV was L3 or above. The authors found that L group showed less lumbar lordosis than the H group (48.1° vs 32.2°) and greater SVA (1.2 vs 5.5 cm).

Lonner et al.[22] conducted a comprehensive evaluation of disc degeneration indicators at radiographic imaging, including disc space height, presence of osteophytes, Schmorl's nodes, intradiscal calcification, and endplate sclerosis or shape modifications. Through a composite radiographic score (CRS), calculated by summing each indicator, they analyzed the eventual association between degenerative changes at X-rays and LIV selection. The findings showed that patients fused to L4 exhibited a higher risk of significant disc degeneration (27.3%; p < 0.0267) compared to those with more cephalad LIV. Moreover, patients with LIV translation > 2 cm from the central sacral vertical line (CSVL) were eight times more likely to develop significant disc degeneration than those with LIV translation ≤ 2 cm and patients fused caudal to L3 were five times more likely to develop significant disc degeneration 10 years after surgery. No significant influence of LIV tilt or disc wedging on disc degeneration was found.

The other included studies [21, 23,24,25, 27,28,29, 31] did not show any statistically significant differences between radiological parameters and LIV choice in surgically treated AIS patients.

MRI evaluation

Six included studies[23, 24, 26, 29, 31, 32] performed MRI evaluation at long term-follow-up. Most of the studies performed MRI imaging at last follow-up evaluation searching for degenerative changes in unfused distal levels. Jakkepally et al. [24] compared pre-operative and post-operative MRI images looking for degenerative disc disease progression. The data were extracted to assess degenerative disc disease rates in patients in whom spinal fusion was extended to L3 or L4. Results of the studies regarding MRI evaluation are summarized in Table 2.

Table 2 Characteristics of included studies that evaluated magnetic resonance imaging at long-term follow-up and pooled data regarding patients in which LIV selection was L3 or L4

Meta-analysis was conducted on comparative studies and studies in whom data regarding patients with LIV selection of L3 or L4 could be pooled. No significative heterogeneity (I2 = 5.88, p = 0.276) and publication bias (p = 0.286, Fig. 5) were found. No significant difference in disc degeneration, intended as Pfirrmann’s grade progression or Pfirrmann’s grade > 3 in at least one intervertebral disc distal to fused region at last follow-up, between patients fused at L3 or L4 was found at meta-analysis (p = 0.916, Fig. 6), with an estimated average log odds ratio of – 0.05 (95% CI −1.019 to 0.915).

Fig. 5
figure 5

Funnel plot of effect sizes for publication bias of the included study in meta-analysis

Fig. 6
figure 6

Forest plot of overall meta‐analysis of the included studies with data about disc degeneration rate observed at MRI images at long-term follow-up in patients in

Perez-Grueso et al. [26] found that 60% of evaluated intervertebral discs in distal unfused segments showed degenerative changes at more than 10 years of follow-up. Those findings did not correlate with number of unfused segments, radiographic results of curve correction, and final clinical outcome. The authors did not provide data regarding degenerative disc disease classification.

Nohara et al. [29], Akazawa et al. [32], and Chiu et al. [23] evaluated MRI images at last follow-up and found significant degenerative changes in the unfused distal lumbar segments. However, pre-operative MRI images were not available. Nohara et al. [29] found that degenerative disc disease (Pfirrmann grade > 3) was present in at least one unfused disc in 68 and 50% of patients fused down to L3 and L4, respectively. It is important to notice that only 2 patients in the cohort had been fused to L4. Akazawa found that 100% of patients in which L4 was chosen as LIV showed degenerated disc below fusion, on the contrary only 55% of patients fused to L3 showed disc degeneration. Chiu et al. [23] found higher rate of disc degeneration in patients fused to L3, rather than and L4, 58.3 and 40.7%, respectively. Jakkepally et al. [24] evaluated the eventual progression of disc degeneration comparing pre-operative MRI images with MRI images obtained at last follow-up (mean follow-up of 9.1 years). Eleven patients showed degenerative progression from Pfirrmann’s grade 1 to grade 2. Only in 4 patients, Pfirrmann’s grade progressed to value higher than 3, which is commonly considered the cut-off to define a degenerated disc. No patient progressed to Pfirrmann’s grade 5.

Clinical assessment

Clinical assessment of included patients was performed in all studies [19,20,21,22,23,24,25,26,27,28,29,30,31,32]. Patient reported outcomes measures (PROMs) represented the used tools to measure patient-reported outcomes in most of the included studies. The following tools were administered: SRS-22 [33] with all its domains (Function, Pain, Self-Image, Mental Health, and Satisfaction with management), SF-36 [34] (short form health survey), ODI [35] (Oswestry Disability Index), VAS [35] (Visual analogue scale), NRS [35] (Numeric rating scale), Roland-Morris Disability questionnaire [35], and Modified Cincinnati Sports Activity Scale (MCSAS) [35].

Other studies focused their clinical evaluation in assessing the presence/absence of low back pain or using questionnaires or classifications such as Moskovitz, Moe, Winter and Binner low back pain classification [19], Arbitrary Scale of Back pain and activity Grading [20], Dallas pain questionnaire [25], and Ransford pain drawings [25].

Hayes et al. [19] investigated the presence of low back pain in 48 surgically treated AIS patients with a mean follow-up of 11 years. Patients fused to L4 when compared with patients fused to higher levels showed higher incidence of low back pain, particularly in those who showed retrolisthesis in dynamic lumbo-sacral X-ray (77% of those showing retrolisthesis). These results were confirmed by Fabry et al.’s [20] study, where postoperative severe low back pain increased from patients fused to L3 (17%) to patients fused to L4 (37%), concluding that ideally the LIV should not go beyond L2, and lower fusions are prone to give more low back pain. In Krismer et al.’s [25] study, low back pain depended on fusion level, with occurrence of 80% of pain in patients fused down to L4. Nevertheless, no relationship between LIV or pain on the one hand and segmental range of motion or increased translation on the other hand, and none between lordosis, pain and increased translation was found. Chiu et al. [23] divided their population study in Group 1 (G1) and Group 2 (G2) depending on LIV selection, L3 or higher and L4 or below, respectively. They found that in the SRS-22r questionnaire results, only pain domain demonstrated significant difference between G1 and G2 (4.3 ± 0.5 versus 4.0 ± 0.6, p = 0.044). Similarly, G1 obtained significantly higher scores in the bodily pain domain in SF-36 questionnaire (88.7 ± 12.3 versus 77.8 ± 18.7 (p = 0.018)). The other included studies [21, 24,25,26,27,28,29,30,31,32] did not show any statically significant differences between clinical parameters\PROMs results and LIV selection in surgically treated AIS patients. Table 3 reports PROMs of included studies stratified per LIV selection.

Table 3 Patient-reported outcomes measures details of the included studies, stratified per LIV selection

Discussion

The surgical management of adolescent idiopathic scoliosis has undergone substantial advancements in recent decades, driven by both technological developments in instrumentation and conceptual breakthroughs leading to new classification systems that better stratify patients and guide surgical treatment. A milestone in pre-operative planning was reached with the introduction of the concept of "structural curve" which allowed to clearly define criteria for ideal fusion area and identified compensatory/non-structural curves that might undergo spontaneous correction after surgery. According to Lenke classification [1], inclusion of all structural curves in the spinal fusion area is recommended. Thus, when dealing with double major or TL/L curves (Lenke pattern 3, 4, 5 and 6), the fusion area must be extended to the lumbar spine, and the usual recommendation for the selection of the lowest instrumented vertebra (LIV) falls on L3 or L4 based on TL/L factors [1, 3, 10].

Up to date, spine surgeons still face the concern of whether to spare a distal lumbar segment, accepting a higher risk of postoperative coronal imbalance [36], greater LIV–S1 coronal Cobb angle and suboptimal LIV tilt, or to achieve better coronal correction and LIV leveling at the cost of sacrificing a motion segment. There is a moderate consensus on the importance of leveling the LIV using distraction and/or compression, allowing further improvement of the residual lumbar curve over time [10]. The long-term outcomes of preserving a distal motion segment remain unknown, since there is no long-term study that clarifies whether there is a clinical and radiological benefit from leaving an unfused lumbar spine with moderate residual curvature over a partially fused lumbar spine with leveled subadjacent disk spaces [10].

A few studies have examined the effects of long spinal fusions performed in adolescent patients on unfused caudal segments using magnetic resonance imaging (MRI).

When considering all the eligible studies, the overall meta-analysis showed that the LIV selection of L3 rather than L4 in adolescent idiopathic scoliosis surgery does not lead to a significant difference in disc degeneration rate at MRI evaluation in distal unfused lumbar segments at long-term follow-up (p = 0.916).

A possible explanation of the findings of the present work is that DD in regions not included in the spinal fusion may proceed as acceleration of physiological aging processes, affecting particularly the junctional and lumbosacral region which are subjected to higher stresses. Consequently, a certain degree of degenerative acceleration occurs after each spinal fusion surgery, and this acceleration may depend on the overall extension of the fusion area. As a result, addition of a distal level may represent a relatively modest increase in lever arm and junctional stresses, including those at the lumbosacral junction. Additionally, the relative effect of the residual lumbar curve is still unclear, so the weight of an addition of one distal level to the spinal fusion and a greater LIV–S1 coronal Cobb angle might balance each other with the result of a similar disc degeneration rate at long term.

Two distinct pathological conditions can be distinguished based on the location of disc degeneration. The first is adjacent segment disease (ASD) in which the affected disc is placed at junctional level and clearly subjected to overload and hypermotility. ASD is defined as new degenerative changes at a level adjacent to a spinal fusion, accompanied by related symptoms (radiculopathy, myelopathy, or instability). This condition typically occurs within a short time span from index surgery, making it often detectable during the conventional 2-year follow-up in cohorts of adult patients who underwent spine fusion [37, 38].

The above-mentioned condition differs from the acceleration of physiological disc degeneration processes that also occur in healthy individuals over long time span and affects all unfused discs, but predominantly the lumbosacral junction, with L5-S1 being the most affected level which is naturally more vulnerable due to the frequent concomitance of AIS with low pelvic incidence morphotypes [26, 39].

Another important factor to consider is the clinical impact of disc degeneration. In the cohort of Green et al., healthy asymptomatic patients showed a prevalence of disc degeneration approximately 38%, with the majority of affected discs being at the L5-S1 level. However, the occurrence of a similar degree of degeneration is found in 100% of patients who have undergone long spinal fusion, with 45% of patients having a Pfirrmann grade > 3 at over 10 years of follow-up. The most affected level was again L5-S1, with only one patient showing significant adjacent segment disc degeneration.

Clinical assessments of patients of the included studies at long-term follow-up show a slight tendency to have worse clinical outcomes over the long term in terms of back pain and PROMs if spinal fusion is extended to L4 rather than L3. Despite that, meta-analysis could not be performed on clinical parameters because of heterogeneity of evaluated PROMs in included studies.

Fabry et al. [20] reported 17% back pain in patients fused to L3, 37% in patients fused to L4 and 46% in patients fused to L5. On the contrary, Enercan et al. [31] reported no significant difference in any of SRS domain between patients fused to L3, patients fused to L4 and control group. Bartie et al. [28], Ding et al. and Danielsson et al. [27, 40] found no statistically significant relationship between LIV and any of the evaluated clinical parameters; furthermore, no measurement was found to be predictive of a painful outcome. Patients with L4 as LIV showed a tendency to have a more intense pain, but not a more frequent or durable back pain. Up to date, the clinical implications of preserving one motion segment below a long spinal fusion extended to the lumbar spine is still uncertain.

The present work does not come without limitations; the heterogeneity of follow-up periods among papers is one of the major confounding features, since development of degenerative disc disease is a strictly time-dependent phenomenon, 10 years of time might be a short time to observe disc degeneration and to establish any cause-effect relationship between spinal fusion and DDD, since disc degeneration also occurs in healthy patients starting from second–third decade of life [41]. Based on these assumptions, true outcomes can solely be appraised through prolonged patient follow-up over an extended period, allowing the disc degeneration acceleration resulting from spinal fusion to become more evident, as demonstrated in Fig. 7 [42].

Fig. 7
figure 7

Graph depicting the disc degeneration rate over different follow-up periods in patients of the included studies, based on whether L3 or L4 was selected as LIV

Moreover, it is important to consider the post-operative sagittal spinal alignment of instrumented spine. Different spinal instrumentations determine different post-operative sagittal alignments. Included studies were heterogeneous in term of surgical techniques (anterior or posterior instrumentation) and spinal instrumentations (Harrington rod, Cotrel–Dubousset instrumentation, Hybrid instrumentation, Pedicle screws, and rod). Jakkepally et al. [24] included patients treated only with posterior spinal fusion with pedicle screws and rods, Chiu et al. [23] included patients treated with both segmental and non-segmental instrumentation. Akazawa et al. [32] included patients treated with both anterior and posterior instrumentation. Perez-Grueso et al. [26] included patients treated with Cotrel–Dubousset instrumentation.

Harrington instrumentations extended in the lower lumbar spine often determines lumbar flatback and subsequent compensatory hyperlordosis of distal unfused segments leading to degenerative changes such as retrolisthesis, facet joint arthrosis, spinal stenosis, and DDD [43]. In fact, patients treated with Harrington instrumentation often exhibit a higher rate of degenerative disc disease compared to those treated with segmental instrumentation, which enables surgeons to achieve a more physiological sagittal alignment. Akazawa et al. [32] reported 100 and 45% of disc degeneration in patients fused to L4 and L3, respectively. On the contrary, Jakkepally et al. [24] reported significantly lower rate of disc degeneration with 28.6 and 23.1% of disc degeneration rate in patients fused to L4 and L3, respectively.

Another confounding factor is that the clinical impact of disc degeneration (Pfirmann grade > 3 or Pfirmann grade progression over time) might be modest in some patients, as similar degenerative changes can be observed in a relatively high percentage of asymptomatic subjects [44].

An additional limitation of the study is attributed to the general poor quantity and quality of the literature available. In the majority of cases, the endpoint of these works was the qualitative assessment of the long-term degeneration status of the spine following spinal fusion from both clinical and radiographic point of view.

Another major confounding factor of the present work may be represented by the lack of information in included studies regarding LIV selection criteria and comparability assessment of curves fused to L3 rather than L4.

Finally, included studies that evaluated clinical parameters were heterogeneous in terms of PROMs’ assessment and cut-off criteria for DDD definition.

Conclusion

A few studies previously suggested that the LIV selection of L3 or L4 may not have an impact on disc degeneration over the long term both from the clinical and radiological point of view. This review, although the numerous limitations, confirmed the absence of significant clinical and radiological differences between patients fused to L3 or L4. Long-term studies of patients treated with contemporary spinal instrumentations are needed.