Introduction

Assessment of vertebral endplate bone marrow lesions (BML) is an important clinical task because of the positive association between MRI evidence of endplate BML and chronic low back pain (CLBP) [1, 2]. For example, BML characteristics such as size, location, and type, i.e. fibrovascular versus fatty, may provide information for diagnosis, prognosis, and treatment planning [3,4,5]. The most widely used methodology for assessing endplate BML is based on the presence of signal intensity changes adjacent to the endplates on T1- and T2-weighted MRI, often called “Modic changes” (MC) [6]. A limitation of these anatomical sequences is that they provide only qualitative visual information about lesion structure, and they are not sensitive to subtle variations in marrow composition. Thus, marrow composition in endplate BML remains poorly understood. For example, it is unknown how BML composition varies within or between patients, relates to clinical symptoms, or changes during BML progression. Such knowledge could form the basis for objective and continuous measures of endplate BML, inform new treatments, and provide targets for monitoring efficacy.

One unexplored approach for evaluating marrow composition in endplate BML is based on chemical shift encoding-based water–fat imaging. This technique enables spatially resolved assessments of bone marrow fat fraction (BMF) [7]. In water–fat imaging, water and fat signals can be separated based on the chemical shift difference between water and fat, and accurate fat quantification can be achieved by incorporating a multi-peak fat spectrum in the signal model [8]. Prior studies using water–fat imaging reported vertebral BMF values that demonstrate good agreement with histological results [9, 10] and showed age-related increases in BMF that are consistent with natural conversion of red to yellow marrow [11]. The purposes of this study were to measure BMF in CLBP patients with endplate BML using water–fat MRI and to assess the relationship between BMF measurements and patient-reported clinical characteristics.

Material and method

This prospective cross-sectional study was approved by the site’s institutional review board (#13–12224; approved August 21, 2014). Informed consent was obtained from each subject.

Subjects

Between January 2016 and July 2018, we enrolled forty-two patients with CLBP (more than three consecutive months with low back pain; VAS ≥ 4 or ODI ≥ 30) and eighteen control subjects without prior history of back pain or spinal pathology (VAS ≤ 1). Major exclusion criteria were pregnancy, diabetes, positive smoking status, cancer, spondylolisthesis, scoliosis, prior lumbar surgery, disc herniation, compression fracture, and active use of osteoporosis medication(s). A subset of the subjects was included in previous studies that did not evaluate MC (forty-six subjects [12]) or vertebral BMF (fifty-two subjects [13]). Demographic information is summarized in Table 1.

Table 1 Demographics of chronic low back pain patients and asymptomatic control subjects

Imaging

Lumbar MRI was performed on a Discovery MR 750 3-T scanner using an 8-channel phased-array spine coil (GE Healthcare). Clinical fast spin echo images with T1 and T2 weighting were acquired in the sagittal orientation (see Detailed Methods, Supplemental Material).

Water–fat imaging consisted of a 3D spoiled gradient-recalled echo (SPGR) sequence with six echoes and iterative decomposition of water and fat with echo asymmetry and least-squares estimation (IDEAL) reconstruction algorithm [7, 14] (see Detailed Methods, Supplemental Material). To test for repeatability of water–fat imaging and analysis, two subjects were each scanned three times on the same day, having exited and re-entered the scanner between scans.

Image analysis

BMF maps were generated from the six acquired echoes using the validated IDEAL algorithm that accounts for multiple peaks in the fat spectrum and performs T2* correction [14, 15]. The maps were analysed two ways. In the first analysis, BMF values were averaged for five circular ROIs (10.08 mm2 area) randomly placed inside the trabecular bone encompassed by the endplate BML, using conventional MC grading (see below) as a guide for ROI placement on the BMF maps. The BMF measurements inside the endplate BML were compared to BMF measurements averaged for five circular ROIs placed outside the BML in the adjacent, unaffected trabecular bone. For levels without endplate BML, the BMF values were averaged for five mid-sagittal ROIs placed in the trabecular bone adjacent to the endplate. The ROI analysis was performed in the 10 lumbosacral endplates from the inferior L1 endplate to the superior S1 endplate in each subject (600 endplates total).

In the second analysis, the five lumbar vertebral bodies plus first sacral vertebral body were manually segmented on the five mid-sagittal slices in IDL 8.7 (Harris Geospatial Solutions) to determine the distribution of vertebral BMF values at each level. Parameters derived from the BMF distributions—mean, median, standard deviation, coefficient of variation (SD/mean), skewness, and full width at half maximum—were compared between levels with and without endplate BML in order to understand how BML presence impacts the overall BMF distribution within the vertebral body. The whole-vertebral analysis was performed for six lumbosacral vertebrae in each subject (L1-S1; 360 vertebrae total).

Clinical grading

Assessment of endplate BML and disc Pfirrmann grade was performed by two raters blinded to group. Both are board-certified subspecialists in Pain Medicine with 20 years (C.O.) and 8 years (Z.M.) of experience reading and interpreting spine MRIs for clinical and research purposes. Endplate BML grading was done on T1-and T2-weighted images according to standard MC definitions [2], and all MC cases were large enough for ROI placement. To gauge inter-rater reliability, both raters scored a subset of the endplates (90/600). Subjects completed VAS and ODI to assess clinical symptoms.

Statistical analysis

Statistical analysis was performed using JMP Pro 14 (SAS Institute). A two-sided p value less than 0.05 was considered significant. The Kappa statistic was used to assess inter-rater reliability of MC classification. Independent t tests and χ2 tests were used to compare differences between two groups for continuous and dichotomous variables, respectively. Mixed effects models accounting for multiple measurements per subject were used to estimate the least squares mean of the various BMF parameters [16], and Tukey’s HSD post hoc tests were used to test for differences in the least squares mean between groups. Paired t tests were used to test for BMF differences between affected and unaffected regions of the same vertebral body. Parametric tests were used since the data did not violate the normality assumption (p > 0.05, Anderson–Darling test).

To test associations between BMF measures (endplate-level variables) and clinical outcomes (subject-level variables), we derived two composite BMF measures for each patient with MC, using the absolute differences in BMF values between affected and unaffected marrow from the ROI analysis. The first BMF measure was the mean difference in BMF between affected and unaffected marrow for all MC-affected endplates in a particular patient; the second measure was the maximum difference. Linear regression and logistic regression were used to assess relationships between the BMF measures and ODI and VAS, respectively.

Repeatability of BMF mapping was characterized by the precision error, which was defined as the root-mean-square average of the precision errors in mean BMF for subjects who underwent repeat MRI [17].

Results

Study population characteristics

No significant differences were found in age, sex, weight, height, or BMI between CLBP patients and controls (Table 1). In the patient group, mean VAS was 6.6 ± 1.7 and mean ODI was 33.3 ± 14.4.

Endplate BML prevalence

Twenty-seven subjects (26 patients, 1 control) presented with at least one level having endplate BML, and BML presence was significantly associated with pain status (OR 13.0, 95% CI 2.6–64.2; p = 0.002). The overall prevalence of endplate BML by type was: MC 1 (4.3%, 26/600 endplates), MC 2 (7.8%, 49/600 endplates), and MC 3 (0%, 0/600 endplates). For all types except MC 3, the prevalence was significantly higher in the CLBP group (Table 1). Inter-rater reliability of MC classification was excellent (κ = 0.838).

Mean BMF – ROI analysis

In endplates with MC, mean BMF was altered compared to endplates without MC (Fig. 1a). Specifically, for endplates with fibrovascular BML (MC 1), mean BMF was significantly lower than in endplates without MC (absolute difference −22.3%; 95% CI −24.8, −19.8; p < 0.001). For endplates with fatty BML (MC 2), mean BMF was significantly higher (absolute difference 21.0%, 95% CI 19.1, 22.9; p < 0.001). For both MC types, mean BMF in the affected endplate was also significantly different compared to unaffected regions of the same vertebral body (Fig. 1b). For levels without MC, mean BMF was positively associated with age (p < 0.001) and higher in males than females (p = 0.012).

Fig. 1
figure 1

(a) Comparison of bone marrow fat fraction (mean ± 95% CI) between endplates with and without Modic changes (MC). Mean fat fractions were as follows: no MC, 45.6% (95% CI: 42.8–48.4; n = 525 endplates); MC 1, 23.3% (95% CI: 19.9–26.7; n = 26 endplates); MC 2, 66.6% (95% CI: 63.5–69.7; n = 49 endplates). Mixed effects models were used to estimate the least squares mean fat fraction, accounting for multiple measurements per subject; Tukey’s HSD tests were used compare the mean fat fraction between different MC types. (b) Pairwise differences in bone marrow fat fraction between the affected and unaffected regions of endplates with MC. Differences greater than zero indicate higher fat fraction in MC; differences less than zero indicate lower fat fraction in MC. P values show the result of paired t tests for all endplates with each type of MC. (c) Representative images of patients with and without MC. Arrows point to endplates with MC identified using the T1 and fat-saturated T2 images. The BMF maps are shown with an overlay of the employed segmentation masks

BMF distributions – vertebral body analysis

Table 2 shows the values of the six parameters derived from the BMF distributions of the segmented vertebral bodies. Each parameter differed significantly between vertebral bodies with and without MC (mean, standard deviation, coefficient of variation, full width at half maximum) or differed significantly between vertebral bodies with MC 1 versus MC 2 (mean, median, coefficient of variation, skewness). For example, the low BMF values inside MC 1 resulted in left-leaning distributions compared to the high BMF values inside MC 2, which resulted in right-leaning distributions (Fig. 2). Overall, MC presence was associated with BMF distributions that were more heterogeneous, having shorter and broader peaks.

Table 2 Characteristics of bone marrow fat fraction (BMF) distributions derived from segmentations of vertebral bodies with and without Modic changes (MC)
Fig. 2
figure 2

Histograms showing representative distributions of vertebral BMF values derived from water–fat MRI. Low BMF values inside MC 1 (blue arrow) resulted in right-skewed (left leaning) distributions compared to the high BMF values inside MC 2 (orange arrow), which resulted in left-skewed (right leaning) distributions. In both instances, MC presence within the vertebral body associated with BMF distributions having shorter and wider peaks compared to BMF distributions from vertebral bodies without MC

Associations between BMF measures and clinical outcomes

For patients with endplate BML, the absolute differences in BMF between affected and unaffected marrow were positively associated with ODI (p = 0.029–0.032; Fig. 3). These composite BMF measures were not associated with the VAS pain intensity scale (p = 0.49–0.83), age (p = 0.06–0.17), sex (p = 0.21–0.67), Pfirrmann grade (p = 0.10–0.11), or MC type (p = 0.19–0.80).

Fig. 3
figure 3

Scatterplots show associations between composite BMF measures and patient-reported disability (n = 26 patients). Composite BMF measures were defined using the absolute differences in bone marrow fat fraction (BMF) between affected and unaffected regions for each level with an endplate BML. Higher composite BMF values indicate greater absolute differences between affected and unaffected marrow in a particular patient. (a) The mean difference in BMF for all levels with endplate BML in a particular patient was positively associated with disability. (b) The maximum difference in BMF for all levels with endplate BML in a particular patient was positively associated with disability. Note that for patients with only one level with endplate BML, the mean difference in BMF and the maximum difference in BMF are equivalent

Repeatability of water–fat MRI

Test–retest repeatability of the imaging protocol and segmentation procedure indicated low inter-test variability in BMF relative to inter-subject variability (Fig. 4). The overall precision errors in mean BMF values were 3.43% (L4), 3.33% (L5), and 4.04% (S1).

Fig. 4
figure 4

Test–retest repeatability of water–fat imaging and bone marrow fat fraction (BMF) segmentation for two subjects that were imaged three times after exiting and re-entering the scanner between tests. The curves in each histogram show the distribution of BMF values for each test for the vertebral bodies of (a) L4, (b) L5, and (c) S1, the three levels with the highest incidence of endplate BML seen on MRI. (d) Representative images showing qualitative repeatability of segmented BMF maps for the same subjects

Discussion

We used IDEAL water–fat separation to measure bone marrow fat for the first time in CLBP patients with endplate BML. The data indicate that BMF was significantly altered in endplate BML: in MC 1, BMF was 22.3% lower (p < 0.001), and in MC 2, BMF was 21.0% higher (p < 0.001) than in unaffected marrow. The nature of those alterations agrees with histopathology reports of fibrovascular and fatty replacement of the bone marrow in endplate BML that appear as MC 1 and MC 2, respectively [6, 18], which support BMF as a quantitative and continuous assessment of marrow composition that reflects the pathophysiology of MC. Our findings also reveal increased heterogeneity in vertebral BMF distributions for levels with endplate BML, which may have value for objectively diagnosing MC presence. Finally, whereas past work explored the clinical significance of endplate BML size [19, 20], our present findings establish the potential importance of BML composition. Specifically, we found the absolute difference in BMF between affected and unaffected marrow varied widely, even between patients with a particular MC type, and larger differences in BMF were associated with more severe disability (p = 0.029–0.032). This suggests that these BMF measures may relate to the clinical presentation of CLBP patients with MC. The pathogenesis and clinical relevance of endplate BML is controversial, owing partly to the qualitative and binary assessment methodology [2]. Taken together, our present findings suggest that evaluation of vertebral BMF provides measures of BML composition that can form the basis for quantitative and continuous assessments. This could clarify BML progression and inform treatment development and patient phenotyping.

The sensitivity of the IDEAL method for detecting subtle differences in BMF within and between individuals has potential clinical implications for quantitative evaluation of endplate BML. The progression of endplate BML in relation to symptoms is unclear [21] and is hindered by the dynamic clinical presentation and by the lack of quantitative tools for measuring the composition of the underlying marrow. For example, previous studies reported that MC 1 and MC 2 are inter-convertible [22, 23] and that approximately 20% of BML show characteristics of both MC 1 and MC 2 or MC 2 and MC 3 [24]. This confounds binary MC classification, which could partly explain why the size of any one type of MC may not directly correlate with symptoms [20]. Compared to binary classification, continuous quantitative assessments of BMF appear well suited for capturing the dynamic and heterogeneous pathophysiology of endplate BML. For example, we found that heightened BMF values inside MC 2 were often marginated by a narrow band of low BMF values (Fig. 1c), consistent with active conversion. Also, for a particular MC type, the differences in BMF between affected and unaffected marrow spanned a wide range (Fig. 1c; MC 1: −37.9% to −14.3%; MC 2; −2.5% to 44.2%), indicating that binary classification belies the heterogeneity in marrow composition. Moreover, the significant associations between those BMF differences and ODI suggest clinical relevance of measuring BML composition. This is further corroborated by results from a histopathology study indicating that the degree of innervation in endplate BML—a theoretical basis for nociceptive pain—may associate with BML composition but not with BML size [18]. Although requiring confirmation in longitudinal studies, these data collectively suggest that BMF assessment may aid in clarifying BML progression in relation to symptoms.

Our framework using water–fat MRI is based on a 6-echo SPGR sequence with IDEAL reconstruction algorithm. Other approaches used for augmenting endplate BML diagnosis include chemical shift imaging (e.g. Dixon), short T1 inversion recovery (STIR), diffusion-weighted imaging (DWI), and ultrashort echo time (UTE) imaging. However, STIR is mainly used to confirm marrow oedema, DWI may differentiate between MC 1 with degenerative vs. infectious aetiologies [25], and UTE imaging can distinguish the bony endplate from cartilage endplate [13]. We are not aware of prior studies that measured marrow composition in MC; nonetheless, mean BMF in non-MC regions was in the range reported for similarly aged adults [9, 10], which supports our findings’ validity.

This study has limitations. First, it is a single-site, single-scanner study with a small number of CLBP patients. Although MC prevalence in the CLBP group (26/42 patients, 61.9%) was in the range found previously [1], there were only 11 patients with MC 1 and 15 patients with MC 2. A larger number of MC cases, including MC 3, are needed to explore the role of BML size and to investigate whether the relationship between the composite BMF measures and clinical outcomes depends on MC type or the severity of disc degeneration. Related, although endplate BML are highly specific predictors of CLBP [1, 2], other factors may contribute to disability and pain sensation independently. Second, this study was cross-sectional, which could partly explain the modest associations with clinical presentation; also, we were unable to determine how changes in BMF relate to changes in clinical characteristics. Nevertheless, our findings motivate BMF assessment in the future longitudinal studies.

In summary, we found that vertebral BMF was significantly altered in CLBP patients with endplate BML and that for a given patient, larger differences in BMF between affected and unaffected marrow were associated with more severe disability. Taken together, these data demonstrate that IDEAL water–fat separation enables measures of marrow composition that can form the basis for quantitative and continuous assessments of endplate BML. Future studies will use this approach to clarify BML progression, inform treatment development, and guide patient selection.