Introduction

Since the first description and classification of signal abnormalities of the vertebral endplate and subchondral bone marrow oedema in magnetic resonance tomography (MRI) of the lumbar spine, the interpretation and clinical correlation has been the subject of debate [1, 2]. To date the understanding of Modic changes (MC) favours two main aetiologies: occult discitis [3] and autoimmune reaction [4]. These radiographic findings are understood as persistent stimuli, which presuppose structural disc damage [5]. In its physiological function, the disc is chronically exposed to forces and stresses. This may lead accumulatively to damage, which can also cause bone marrow oedema [6] or microfractures [7]. It has also been observed that biomechanical stress might contribute to the distribution of MC in the lumbar spine. In patients without specific pathology or surgery, MC were more common in the lowest two levels, especially L5–S1 [8]. Therefore, a biomechanical factor should also be considered in the pathogenesis of MC. In this context, an alteration of the biomechanical equilibrium is effected by iatrogenic damage of the disc. Correspondingly, a limited discectomy in the case of disrupted annulus fibrosus and herniated disc might alter the biomechanical forces directly on a microstructural level and predispose the adjacent vertebral to MC. Hence, we hypothesize that following discectomy the prevalence of MC should increase in both incidence and grade. And there is also little data on MC following lumbar discectomy [9,10,11,12]. To date there is no study available to evaluate the natural course of MC after a limited lumbar discectomy in a cohort of patients with prospectively collected data with a two-year follow-up and multiple imaging. This would allow real insight into these dynamic vertebral endplate changes, including any possible clinical relevance.

Materials and methods

Study design

This study describes outcomes of the control group only, from a multicenter, prospective randomized trial of subjects that underwent microscopic limited lumbar discectomy (only the herniated part of the disc was removed) and were followed for 2 years [13]. This study retrospectively reviewed the prospective data to analyse the natural course of MC in serial imaging. In addition, a possible clinical relevance of MC was assessed. The clinical trial (registered at ClinicalTrials.gov—NCT 01,283,438) was approved by local ethics committees, and informed consent was obtained from all individual participants included in the study.

Patient selection

Two hundred seventy-eight patients (enrolled between December 2010 and October 2014) between 21 and 75 years of age with the presence of lumbar radiculopathy due to a primary, single-level disc herniation between L1 and S1, and failure of at least 6 weeks of nonsurgical treatment was eligible for the study. Exclusion criteria included prior index-level surgery, spondylolisthesis with > 25% slip, scoliosis of greater than 10 degrees (both angular and rotational), insulin-dependent diabetes, and contraindication for MRI. Further details were described in the study protocol published previously [14].

Baseline characteristics are presented in Table 1. Complete data sets including MRI images and evaluation of MC at the index level were available at 1 and 2 years for 89.9% (250/278) and 85.4% (239/278) (Fig. 1). The mean volume of nucleus removal was 1.3 ml.

Table 1 Baseline characteristics of the patients
Fig. 1
figure 1

Enrollment of Patients. Intent-to-treat (ITT) population consisted of 278 patients assigned to control group. Compliance with clinical follow-up and MRI Modic data sets at 1 year was 89.9% (250/278) and at 2 years was 85.4% (239/278)

Follow-up

Clinical follow-up was done at 6 weeks, 3 months, 6 months, 1 year, and 2 years. MRI of the lumbar spine was performed preoperatively and at 1 and 2 years.

MRI

According to the imaging guidelines for the study, T1 and T2 images were required. Recommended axial slice thickness was 3–4 mm (sagittal ≤ 5 mm), a gap equal or less than 1 mm (sagittal ≤ 1.5 mm), and a maximum slice interval of 5 mm (sagittal 6.5 mm).

X-ray

According to the standard protocol, only true X-ray mode was used (no fluoro mode). For the lateral lumbar spine, 75–90 kVp and 15–150 mAs were preferred. Neutral–lateral, anteroposterior, and flexion/extension views were taken.

Magnetic resonance and conventional images were read by two board-certified radiologists at an independent imaging core laboratory (Intrinsic Imaging, LLC, Bolton, MA, USA) who were blinded to patient outcomes. Disagreement between raters was resolved with adjudication by a third radiologist.

Outcome measures

Radiological and MRI assessment

Within three months prior to surgery, MRI with T1- and T2-weighted axial and sagittal images were performed. Neutral AP, lateral, and flexion–extension radiographs were performed within sixty days prior to surgery. Postoperative imaging was obtained at each annual follow-up (12 and 24 months). An independent radiological core laboratory analysed all radiological imaging. In this analysis, we concentrated on angular motion (index and adjacent), translational motion (index and adjacent), spondylolisthesis, and MC.

The Modic classification describes three types of different vertebral endplate changes on T1- and T2-weighted MRI [2, 15]. To define the alterations of signal intensity changes at the index level during follow-up distinctly, there was the need for further subdividing each type into “new/increased”, “not changed”, or “decreased” signal intensity relative to baseline (Table 2).

Table 2 Description of sub-classification of Modic changes to assess the alterations in follow-up

Clinical assessment

Subjects’ function and pain was assessed using the Oswestry Disability Index (ODI) [16] and visual analogue scales for back, right, and left leg pain using a 100-mm scale. The postoperative scores of the assessments were compared to baseline scores.

Statistical analysis

Baseline patient characteristics, including age, BMI, sex, smoking history, together with VAS (for back and leg pain), and ODI are presented as medians and interquartile ranges for continuous variables and percentages for categorical variables. In order to gain an impression of Modic changes over time (i.e. presurgery, after 12, and after 24 months), itemized data of Modic subtypes “new/increased”, “not changes”, and “decreased” were pooled to Modic Types 1, 2 or 3, respectively, for reasons of clarity and comprehensibility. Results are represented as percentage values. Furthermore, an analysis of potential confounders, which could be a contributory cause of a specific Modic Type or Modic change over time, was carried out by the use of group comparisons between Modic Types and Modic changes over time regarding all collected patient characteristics. Initially, for each Modic Type the distribution of a potential confounder was visualized by boxplots for a continuous variable (e.g. age, BMI, size of annular defect, radiological findings) and by barplots for a categorical variable (e.g. gender, smoking,) with reference to the time point, i.e. presurgery, after 12, and after 24 months. Afterwards, possible significant differences between Modic Types regarding potential confounders were verified by using Student’s t test (in case of a two-group comparison) or ANOVA and, subsequently, Tukey’s post hoc test (in case of more than two groups) for continuous data. In case of categorical data, Chi-square test was employed. Additionally, to analyse confounding variables regarding the conversion of Modic Types from one, e.g. after 12 months, to another, e.g. after 24 months, more precisely, group comparisons were employed. To subtype within one preoperative Modic Type, e.g. within group “preoperative Type 0”, group comparisons between group “Type 0 after 12 months” and “Type 1 after 12 months” and so on, were made in the same way as described before. In case of non-normal distributed data and unbalanced groups, Mann–Whitney U test was employed. Statistical significance was set at p < 0.05. Statistical analyses were performed using version 3.3.2 of the R statistics package.

Results

Modic changes

Details are presented in Tables 3, 4, and 5. Results in per cent are based on the respective number with data for each follow-up examination.

Table 3 Modic changes of study population categorized into classical Modic types and defined subtypes (new/ increased, not changed, decreased) at presurgery (= baseline) and each follow-up
Table 4 Conversion of Modic changes categorized into Types (0–3) between each follow-up and presurgical (= baseline) situation. Percentage values based on the respective population in each categorized subtype. Not categorized images are not shown
Table 5 Visual analogue scale for back pain and leg pain and ODI score of investigated population and their statistically significance

Remarkably, there was a different pattern of transformation over time when analysing the MC Types and subtypes (Table 4). Preoperatively, MC Type 2 are observed to convert to Type 3 at one year in only 2% (2/121) of the cases, but a notably downward conversion to Type 1 and even to Type 0 at 1 and 2 years was observed [1 year FU: 33% (40/121) resp. 7% (9/121); 2 years FU: 23% (28/121) resp. 10% (12/121)]. The first year following surgery seems to be the most active phase in Type 2, whereas Type 1 changes seem to have the most active phase of change between the 12 and 24 month follow-ups (35%, 28 out of 81 convert to Type 2). Preoperatively MC Type 0 has a constant pattern of change to Type 1 (1 year FU: 17%, 2 years FU: 14%) and Type 2 (1 year FU: 10%, 2 years FU: 12%). Type 3 seems to be a less active situation of bone marrow changes.

The percentage changes of MC type and subtype are shown in Table 4. The majority of MC remain in their subtype between presurgery and 12 months (62.6% Type 0, 66.7% Type 1, 48.8% Type 2) and between 12 and 24 months (63.5% Type 0, 45.7% Type 1, 71.6% Type 2). Between presurgery and 12 months, it was observed that 33.1% Type 2 converted into Type 1 (at 12 months/24 months 14.8%) and that 20.0% Type 1 converted into Type 2 (at 12 months/24 months 34.6%). Comparing MC between presurgery and 24 months, nearly 50% remain unchanged (48.7% Type 0, 53.3% Type 1, 52.9% Type 2). Further details are shown in Figs. 2, 3 and 4.

Fig. 2
figure 2

The percentage conversion of each Modic Type (0–3) and subtype (new/increased, not changed, decreased) into another at 12 months relative to presurgery (= baseline)

Fig. 3
figure 3

The percentage conversion of each Modic Type (0–3) and subtype (new/increased, not changed, decreased) into another at 24 months relative to 12 months

Fig. 4
figure 4

The percentage conversion of each Modic Type (0–3) and subtype (new/increased, not changed, decreased) into another at 24 months relative to presurgery

Relationship of MC to clinical outcomes

Subjects achieved a highly significant (p < 0.0001) improvement in back pain, leg pain, and ODI score, at the regular 12-month visit. This clinically meaningful improvement was stable at 24 months. Regarding the sub-analysis of Modic types, no significant differences were found concerning these outcome parameters (Table 5). Even though 70% of subjects had recurrent herniations identified on imaging regardless of symptoms at 24 months, only 45 patients (37 at the index level) needed further surgical treatment.

For sensitivity assurance, patients re-operated for symptomatic reherniation within 2 years were excluded from the analysis. This did not change the reported significances.

Radiological parameters and MC

During the 2-year follow-up period, no statistically significant difference could be observed between rotation, translation, and spondylolisthesis at the index level and these radiological characteristics at baseline. Moreover, no correlations to MC Types 1, 2, or 3 nor to lack of MC were observed.

Rotational motion was −4.5° (−1.4° to −7.1°) at baseline, −4.3° (−1.4° to −6.6°) at 12 months, and −4.3° (−1.3° to −6.9°) at 24 months. Translational motion was 0.3 mm (1.1 to −0.7) at baseline, 0.5 mm (1.2 to −0.4) at 12 months, and 0.2 mm (1 to −0.6) at 24 months. Spondylolisthesis was −1.6 mm (−0.8 to –2.6) at baseline, −1.5 mm (−0.9 to –2.6) at 12 months, and −1.3 mm (−0.6 to −2.2) at 24 months.

Further parameters and MC

No statistically significant correlations could be observed between MC and age, sex, BMI, or smoking status. Insulin-dependent diabetes was excluded from the study.

Discussion

To date this is the largest retrospective analysis of prospectively collected data to evaluate the natural course of Modic changes following a limited discectomy in lumbar spine with serial control visits up to two years. Much to our surprise, we did not observe a simple evolution of MC with increasing frequency and grade of changes. The pattern is much more complex. The majority of Modic changes remain in their subtype comparing presurgery to 12 months and 12 months to 24 months. In addition, there is variable activity observed in Type 2 (e.g. downgrading at 12 months) and in Type 1 (e.g. upgrading between 12 and 24 months). Conversion from one grade to another may occur and may be upward or downward (not only MC Type 0–1, MC Type 1–2, or MC Type 2–3), whereas a change to Type 3 occurs less frequently. Comparing the changes at 24 months relative to presurgery, the dynamic variances of MC following surgery are less obvious. The impression that MC only progress following disc surgery reported elsewhere [10, 17] might be due to the lack of serial imaging in a dynamic event like Modic changes. The phenomenon of downgrading that we observed was also reported in the results of a longitudinal cohort study with conservatively treated lumbar disc herniation with 1.2% (2/166) for Type 1 and 6.6% (11/166) for Type 2 after 14 months [9]. Of note, a downward conversion of 33% of MC Type 2 to Types 1 and 0 was observed in our prospective study group at two years. Therefore, the assumption of a strict conversion from MC 0 to MC 3 should be reconsidered in the case of surgery for disc herniation. In addition, this is supported by the observation of a nonlinear evolution in MC types in a retrospective analysis of 358 patients with a medical history of discectomy [18].

In our investigation, we observed that 45% of surgical candidates for lumbar discectomy (mean age 44.0 years) had no MC in their imaging before surgery. Similar observations were reported in an observational study with a prevalence of no MC in 44.4% (249 individuals) with a mean age of 47.7 years [19]. Furthermore, we found MC most frequently in the lower lumbar region (adjacent to L4/5 and L5/S1) which has also been described elsewhere [8, 19]. Therefore, despite the limitations of clinical study conditions, distribution and prevalence of MC are close to what has previously been described in the literature [8, 19, 20].

Of note, there are different characteristics of MRI findings in the human lumbar spine besides MC, for instance disc displacement and endplate lesions. However, their prevalence is questionable. Some found endplate defects in two-thirds of their investigated cohort (108 patients) and showed a significance with disc degeneration scores, MC, and posterior disc displacement [21]. Others detected no endplate lesions in the whole lumbar spine in nearly two-thirds of 996 low back pain patients investigated with MRI [22]. Moreover, they observed a clear association between the presence of endplate defects and disc degeneration, but not for MC [22]. We concentrated on Modic changes following surgery for single-level disc herniation with unresponsiveness to conservative treatment. We did not evaluate different reasons for lumbar back pain as MC at others levels. But, main reasons such as instability, discitis, severe degeneration, and tumour were ruled out per study protocol [14].

Interestingly, prospective MRI investigation in a Finish population provided particular insights in unspecific chronic low back pain patients [23]. Baseline MRI with MC Type 1 was compared to follow-up. They found also up- and downgrading of MC in their highly selected 54 subjects. They did not present a distinct pattern, but they hypothesized a link between endplate lesions and subchondral signal abnormalities. Kerttula et al. reported in 54 subjects after 11–18 months of follow-up of constant MC Type 1 in 9%, increased in 40%, and decreased in 45% [24]. This is only partly comparable to our findings at 12 months follow-up showing two-thirds of MC Type 1 had no conversion to another type. Of these, the majority (91%) were new or increased. These differences may be mostly due to the different study population and study protocol (period for follow-up MRI, female preponderance, all disc levels included, no surgical candidates, and low back pain patients). However, it highlights the fact that also in low back pain patients a distinct pattern of change in MC Type 1 (increase, disappear, or change into another Type—MC Type 2) exists, even though their proportional prevalence is not similar to our findings. The similarities of the findings are supported by the consideration that MC seems to be a combination of endplate or disc injury, inflammatory mediators, and low-grade infection [25].

In our study population, we observed more MC following surgery than before surgery. MC subtypes had different patterns over time. Whereas MC Type 1 markedly increased by sevenfold (81/30) at the twelve months follow-up, MC Type 2 was reduced more than fivefold (81/121), and at 24 months the pattern was inverse as Type 1 decreased (20%) and Type 2 increased (19%). This leads us to the interpretation that the reaction to mechanical stress is different and long lasting. This is underlined by the results of a minor cohort of 41 subjects, who received a surgical disc prolapse removal [10]. After a median follow-up of 41 months, Rahme et al. found an increase of 81% in MC at the operated level (46.3% to 78%, being Type 2 lesions in 26 out of 32 cases). A conversion to Type 0 was not observed. In contrast, this conversion from Type 1 to Type 0 was observed after posterior screw-rod osteosynthesis and posterolateral arthrodesis. When disc degeneration after discectomy was the reason for fusion procedure, a recovery to no MC was observed in 57% [26]. Furthermore, an animal model confirmed that chronic compression led to significantly increased vertebral bone marrow oedema in mice [6]. In addition, a biomechanical testing on 27 human cadaveric lumbar spines found that in the presence of MC (or degenerative disc) the strength and stiffness of the bony endplate decreased greatly, by up to 46% [27]. Therefore, it should be evident that biomechanical stress leads to MC in the case of disc herniation, considering that loss of nucleus material (by herniation or more violently by surgery) reduces intradiscal pressure [28] and subsequently alters the pressure load distribution over the disc [29]. Due to their physical nature, these alterations are therefore likely occurring before they are evident on imaging evident below an imaging level. Moreover, a severe spinal imbalance that would further influence segmental function was ruled out by the exclusion criteria. Additionally, postoperative radiological assessment showed no segmental hypermobility after surgical procedure.

Limitation

First, due to the nature of this study (control arm of an RCT) we focused on a selected group of subjects. We therefore can only extrapolate our results to patients with symptomatic disc herniation. There are other aetiologies discussed in the literature, which might be linked with MC such as chronic low back pain. Second, this study focused on single-level pathology in the context of disc herniation. We did not consider further possible reasons for low back pain originating from different levels than the treated one or generated by endplate lesions. Third, presence of low virulent anaerobic microorganisms in the disc samples was not investigated. However, all subjects were immunocompetent and treaded conservatively at least for 6 weeks, which did not rule out treatment with steroids. Finally, longer patient follow-up is required to draw more definite conclusions regarding Modic changes.

Conclusion

In longitudinal MRI investigation, the pattern of Modic changes following lumbar limited discectomy is complex and not simply increasing. There is variable activity in MC Types 1 and 2 at the different time points of follow-up. Downgrading or upgrading of observed MC is possible. In the first year, the majority of MC Type 1 stay stable but one-third of MC Type 2 is converting (upgrading). In the second year, this changes to the opposite, as the majority of MC Type 2 stay stable but one-third of MC Type 1 is converting (downgrading). During the 2-year follow-up, the MC did not show statistically significant correlation to either radiological parameters or clinical outcome. Interpretation of these results in the context of chronic low back pain should be done with caution.