Introduction

Multiple myeloma (MM) and its precursors monoclonal gammopathy of undetermined significance (MGUS) and smoldering myeloma (SMM) are characterized by monoclonal proliferation of plasma cells with the spine as the predominant site of active hematopoietic bone marrow being the primary target. For MM, different infiltration patterns of the bone marrow have been described in conventional MRI [1]—normal, focal, diffuse, mixed, and variegated salt-and-pepper pattern—and were shown to have a prognostic impact [2, 3]. Diffusion-weighted imaging sequences (DWIs) and the apparent diffusion coefficient (ADC) in MRI have been shown to correlate with cellularity of the bone marrow in MM [4, 5]. ADC values differ significantly between infiltration patterns [6]. The International Myeloma Working Group currently recommends whole-body low-dose CT as the imaging modality of choice for the initial assessment in monoclonal plasma cell diseases based on the importance of detection of osteolytic lesions for disease definition and detection of imminent disabling fractures [7, 8]. Only in certain cases, such as equivocal CT results in MGUS and inconclusive CT in suspected SMM and MM, MRI is currently recommended as a further imaging modality [7]. However, particularly diffuse hypercellularity is obscured on conventional CT, and to date, MRI is the best non-invasive modality to detect bone marrow infiltration which is not necessarily associated with osteolytic bone destruction [7, 9].

In dual-layer spectral CT (DLCT), two different detector layers atop each other absorb different parts of the polychromatic patient-attenuated X-ray spectrum instead of activating a second X-ray tube or rapid-voltage switching tube before performing the examination. The benefit is the opportunity of retrospective spectral analyses and the selective depiction or suppression of materials, e.g., uric acid [10], iodine [11], or calcium [12]. In virtual non-calcium (VNCa) images, the osseous component is removed from the spectral base data in order to improve visualization of bone marrow. The degree of calcium suppression depends on the calcium suppression (CaSupp) index, which defines the calcium composition level.

Using the dual-source dual-energy technique, significantly increased VNCa-CT numbers in MRI-confirmed bone marrow lesions compared to non-infiltrated bone marrow were found [13] and VNCa-CT numbers differed significantly between different infiltration patterns [14]. The aim of this study is to correlate conventional and VNCa-CT images with MRI images to evaluate the quantitative and qualitative assessment of bone marrow infiltration in DLCT compared to the gold standard MRI.

Material and methods

Ethics approval and consent

This retrospective exploratory single-center study was approved by the local review board. The need for written informed consent was waived.

Clinical data selection and study design

Patients were screened in the hospital and radiological information system (I.S.-H.*med., SAP; Centricity RIS-i, GE Healthcare) for study inclusion. Patients with an initial diagnosis of a plasma cell dyscrasia (MM, SMM or MGUS) and routinely performed whole-body low-dose DLCT and whole-body MRI within 2 months of each other from August 2018 until October 2019 were enrolled into the study. Exclusion criteria were prior anti-myeloma chemotherapeutic treatment, additional solid malignancies, extensive imaging artifacts, or missing spectral data.

The patient cohort consisted of 38 patients matching the inclusion criteria, of whom 6 patients were excluded due to missing spectral information. A diffuse infiltration of the axial bone marrow was present in 14 cases and a non-diffuse infiltration in 18 cases. Twenty-seven patients were diagnosed with MM (14 with diffuse infiltration and 13 with non-diffuse infiltration), 4 patients with SMM (all with non-diffuse infiltration), and 1 patient with MGUS (non-diffuse infiltration) (Table 1). Median age at time of CT was 62.5 years (range 38–78). Median time between CT and MRI was 2.5 days (range 0–50).

Table 1 Study population with type of plasma cell dyscrasia and infiltration

DLCT acquisition and post-processing

Non-contrast CT acquisitions were performed as helical scans from the vertex of the skull to the knees using a dual-layer detector technique (IQon Spectral CT, Philips) with the following acquisition parameters: tube voltage 120 kVp, dose right index 15 (automated, attenuation-based dose modulation), average tube current-time product 93 mAs, mean computed tomography dose index (CTDIvol) 6.5 mGy (standard deviation [SD] 0.7 mGy, range 5.1–7.6 mGy, 32 cm body-phantom), pitch 1.0, gantry rotation time of 0.75 s, and collimation 64 × 0.625 mm. The field of view (FOV) was variable depending on patient body volume.

VNCa image reconstructions of DLCT data and CT number measurements were performed with the manufacturer’s dedicated image post-processing software (IntelliSpace Portal Version 11, Philips). VNCa images were reconstructed with CaSupp indices ranging from 25 to 95 in steps of 10 with minimum visibility of bony structures at index 25 and maximum visibility at index 95. Reconstruction was performed with slice thickness 3 mm, increment 1.5 mm, sharp kernel YA, and iterative reconstruction with iDose level 2 for conventional images and kernel B for VNCa images.

Using the same software, regions of interest (ROIs) were positioned manually in the vertebral bodies C7, Th12, L1-L5, and the right iliac bone to measure the respective CT numbers (mean value and SD). ROI sizes were 50 mm2 in C7 and 150 mm2 in all other locations (± 5 mm2). Inclusion of focal lesions into the ROI was avoided, if visible on CT or MRI. Pretreated vertebrae—radiotherapy, kyphoplasty, or spondylodesis—and adjacent vertebrae with treatment-associated artifacts or with unavoidable measurement of sclerosis were excluded. To ensure comparability, ROIs were copied between different CaSupp indices and conventional images.

A contrast-to-noise ratio (CNR) was calculated for each measurement location and for each CaSupp index:

$$ \mathrm{CNR}=\frac{\mid {\mu}_{\mathrm{diffuse}}-{\mu}_{\mathrm{non}-\mathrm{diffuse}}\mid }{\sigma } $$

Here, μnon − diffuse is the mean CT number of all patients with non-diffuse infiltration, μdiffuse is the mean CT number of all patients with diffuse infiltration, and σ is the mean of all measured SDs for the measurement location and the CaSupp index.

MRI acquisition and analysis

Whole-body MRI was performed on a 1.5 Tesla MRI (MAGNETOM AvantoFit, Siemens Healthineers) with the following imaging protocol: unenhanced coronal T1-weighted turbo spin echo sequences, short tau inversion recovery (STIR) sequences, and transversal DWI sequences (b-values 50 and 800 s/mm2, five blocks of 46 × 46 cm) with calculation of ADC maps. A signal loss in T1 and concurrent signal increase in STIR was defined as plasma cell infiltration of the bone marrow—either with signal alteration of focal areas within a normal fat signal or with diffuse homogeneous signal alteration of the spine [1]. By means of a consensus read by all radiologists, the infiltration pattern of each patient was considered either as diffuse (defined as diffuse infiltration, variegated salt-and-pepper pattern [1] or intense disseminated focal infiltration) or as non-diffuse (defined as normal pattern or sporadic focal infiltration). T1 and STIR sequences were also screened to avoid inclusion of large focal lesions into the quantitative evaluation. The consensus read was considered standard of reference for this study.

In the institutional PACS (GE Healthcare), ROIs were positioned manually in the ADC maps of the vertebral bodies C7, Th12, L1-L5, and the right iliac bone to obtain the respective mean signal intensity (ADC value) and SD. ROI sizes were 50 mm2 in C7, 200–250 mm2 in Th12, L1-L5, and 100–150 mm2 in the iliac bone (± 5 mm2). Slice thickness was 5 mm. Inclusion of focal lesions was avoided if determinable. CNR for each measurement location and CaSupp index was calculated for ADC values as described above for CT numbers.

Image analysis

Quantitative and qualitative image analysis was performed independently by two radiologists (4 and 8 years of experience).

For qualitative assessment of VnCa images, areas within the bone marrow that visually showed a density comparable to that of the erector spinae or gluteal muscles were defined as areas with plasma cell infiltration. Using the same criteria as described above for MRI, a decision was made for non-diffuse (Fig. 1) or diffuse infiltration (Fig. 2) on VNCa images at CaSupp index 65, which showed the strongest correlation to ADC in the quantitative evaluation.

Fig. 1
figure 1

Patient with non-diffuse infiltration. a CaSupp index 25; b CaSupp index 65; c CaSupp index 95; d conventional CT soft tissue window; e conventional CT bone window; f, g magnification of b and d for L1-5

Fig. 2
figure 2

Patient with diffuse infiltration; a CaSupp index 25; b CaSupp index 65; c CaSupp index 95; d conventional CT soft tissue kernel; e conventional CT sharp kernel; f, g magnification of b and d for L1-5

Statistical analysis

Statistical analysis was performed using SAS Version 9.4 (SAS Institute Inc.). Descriptive statistics were calculated, determining means and SDs for normal distributed data. For assessment of significant differences in CT numbers between the infiltration patterns, an independent two-sample t-test was applied (conventional and VNCa-CT) following a normal distribution. For MRI, a Wilcoxon-signed-rank test was applied for the same question (non-normal distribution of data). Correlations between VNCa-CT numbers and ADC values were calculated using Pearson’s correlation. Receiver operating characteristic (ROC) analysis with calculation of the area under the ROC curve (AUC) and Youden’s J statistic were performed to determine the cut-off for the mean VNCa-CT number of L1-L5 and of L3 at CaSupp index 65, which showed the highest correlation to ADC. AUCs were compared with each other using a large-sample χ2 test. The intraclass correlation coefficient (ICC) was calculated for determination of the interrater reliability of the quantitative and qualitative assessment. The significance level for statistical testing was set at p < 0.05.

Results

Quantitative analysis

Table 2 shows CT numbers and ADC values of different vertebral bodies, infiltration types, and CaSupp indices. CT numbers in VNCa images were significantly different between diffuse and non-diffuse infiltration at CaSupp indices from 55 to 95 (each with p < 0.0001). CT numbers in conventional images as well as ADC values also showed significant differences between both groups (both p < 0.0001).

Table 2 Means and standard deviations of MRI ADC and CT numbers in conventional and VNCa images for all measurement locations, splitted into diffuse and non-diffuse infiltration pattern.

Figure 3 summarizes the statistical results regarding the correlation between CT numbers and ADC values for different CaSupp indices and measurement locations. Regardless of the location, the highest correlation of VNCa-CT numbers and ADC values was at CaSupp indices from 55 to 95 (Pearson’s r between 0.26 and 0.68; all p < 0.05). Correlation of CT numbers in conventional images and ADC values was lower than that of CT numbers measured at CaSupp indices from 55 to 95 (Pearson’s r between 0.05 and 0.23; all p > 0.05). No significant correlation of CT numbers and ADC values could be shown in the vertebra C7 for all CaSupp indices. For each CaSupp index, the highest correlation of CT numbers and ADC values was in the vertebra L3. The strongest correlation of VNCa-CT numbers and ADC values for averaged L1-5 vertebrae (r = 0.66; p < 0.0001) and L3 vertebra (r = 0.68; p < 0.0001) was found at CaSupp index 65.

Fig. 3
figure 3

Heat map of Pearson’s correlation r and p value between CT numbers and ADC values for different CaSupp indices and measurement locations

ROC analysis for the differentiation between diffuse and non-diffuse infiltration for mean values of L1-5 and L3 alone revealed similar AUCs for DLCT at CaSupp index 65 (AUC = 0.819 and AUC = 0.885) (Fig. 4a) and for ADC (AUC = 0.863 and AUC = 0.846) (Fig. 4b), but lower AUCs for conventional CT (AUC = 0.641 and AUC = 0.636) (Fig. 4c). For mean values of L1-5, the AUC for conventional CT was significantly below that for VNCa-CT with CaSupp index 65 (p = 0.048) and ADC (p = 0.01), while there was no significant difference in AUC between VNCa-CT with CaSupp index 65 and ADC (p = 0.526). The optimum cut-off point for VNCa-CT in L1-L5 was −1.6 HU, with higher values indicating a diffuse infiltration with a sensitivity of 78.6% and a specificity of 75.0% (Fig. 5). The cut-off for ADC was 487.6 AU with values above indicating a diffuse infiltration with a sensitivity of 78.6% and a specificity of 86.1%. Considering L3 alone, the optimum cut-off points were 3.1 HU for VNCa-CT (sensitivity: 78.6%, specificity 88.9%) and 516.7 AU for ADC (sensitivity: 75.0%, specificity 86.1%).

Fig. 4
figure 4

ROC curves for CT and MRI; a ROC curve analysis: good differentiation between diffuse and non-diffuse infiltration for mean L1-5 vertebrae at CaSupp index 65 with AUC of 0.8185; b ROC curve of the MRI ADC for mean of L1-5 vertebrae with AUC of 0.8631. c ROC curve of conventional CT for mean of L1-5 vertebrae with AUC of 0.6409

Fig. 5
figure 5

Boxplots for VNCa-CT numbers for diffuse and non-diffuse infiltration at CaSupp index 65 for mean of L1-5 vertebra. Dashed line: optimum cut-off point of −1.6 HU for differentiation between both patterns.

On average, CNR of VNCa images was highest in L3. For VNCa images, the CNR for CaSupp indices from 65 to 95 exceeded the CNR for conventional CT on average. For CaSupp indices from 65 to 95, the highest CNR was found in the iliac bone.

Table 3 summarizes the CNR for different locations and image data in conventional CT, VNCa images, and MRI.

Table 3 Contrast-to-noise ratio for different locations and image data in conventional CT, VNCa-CT, and MRI. RIB right iliac bone

Qualitative analysis

At CaSupp index 65, diffuse infiltration was detected with a sensitivity and specificity of 83.3% and 78.6% for reader 1 and 92.3% and 68.4% for reader 2. Accordingly, average sensitivity and specificity were 87.8% and 73.5% for qualitative analysis.

Interrater reliability of quantitative and qualitative analysis

The interrater reliability of all quantitative measurements in CT was very high with an ICC of 0.98.

The interrater reliability of the qualitative assessment for the differentiation between diffuse versus non-diffuse infiltration in DLCT at CaSupp index 65 was substantial with an ICC of 0.69 [15].

Discussion

VnCa imaging on DLCT providing spectral data gives the opportunity to detect osteolytic bone lesions and evaluate bone marrow involvement in a quantitative and qualitative manner [16]. Therefore, our study aimed for a detailed quantitative assessment of the performance of VNCa-DLCT in the evaluation of bone marrow infiltration in plasma cell dyscrasias compared to MRI, considering CaSupp index and location of measurement.

The results show that differentiating MRI-confirmed diffuse infiltration from non-diffuse infiltration in quantitative DLCT is possible at high CaSupp indices, also showing comparable ROC curves and AUCs to ADC measurements for VNCa-CT. Strong correlations to ADC suggest that bone marrow infiltration can best be assessed in VNCa images at CaSupp index 65. The threshold of −1.6 HU for the average from measurements in L1-L5 vertebrae can be used for differentiation between diffuse and non-diffuse infiltration, reaching a sensitivity of 78.6% and a specificity of 75.0%.

The highest correlation of CT and MRI in the L3 vertebra might be due to the fact that L3 has the largest distance to junctions between kyphosis and lordosis with less likely sclerosis/fractures. The poor results for C7 can be attributed to beam hardening artifacts in CT—which might be overcome with future photon-counting detector technique [17]—and to position-provoked oblique sectioning of the small C7 in transversal MRI slices.

For CT number cut-off calculation, we chose to focus on the mean of L1-5 due to the large size of those vertebrae with maximized reliable measurement avoiding sclerosis, fractures, or disc herniations. Although bone marrow involvement in plasma cell dyscrasias is quite heterogeneous, ROI-measurement of the whole lumbar spine in the best performing CaSupp index 65 might be as representative as possible for differentiating diffuse and non-diffuse infiltration, while possible to integrate into clinical routine with acceptable effort.

Using DSCT, Kosmala et al distinguished infiltrated from normal bone marrow with a sensitivity of 93.3% and a specificity of 92.4%, exceeding the values reported in this study. However, Kosmala et al defined infiltrated marrow independent from the intensity of infiltration and placed ROIs directly on MRI-confirmed focal lesions [14]. In contrast, the main focus of this study is to delineate diffuse infiltration in CT, comprising MRI-confirmed diffuse or intense disseminated focal infiltration. Differences between the DSCT study and this DLCT study are also reflected by differences in cut-off CT numbers (−44.9 HU versus −1.6 HU) [14].

High sensitivities and specificities of 97–100% were also reached by another recent DSCT study from Kosmala et al differentiating MRI-determined diffuse or focal infiltration patterns from a normal pattern with cut-offs of 35.7 HU and −31.9 HU after placing ROIs in L4-S1, Th12 and the ilium [13]. However, for the differentiation between diffuse and focal pattern, they reported a low sensitivity of 33% and specificity of 25%. Again, besides different DECT techniques, different approaches of marrow infiltration classification and sites of measurement have to be considered.

Regarding qualitative assessment of VNCa-CT, a sensitivity and specificity of 91.3% and 90.9% was reported for differentiating infiltrated from normal bone marrow using color-coded DSCT VNCa maps [14], compared to the sensitivity and specificity of 87.8% and 73.5% for differentiation between diffuse and non-diffuse infiltration using DLCT VNCa grayscale images in this study, again considering different definitions of infiltration patterns.

In another DSCT study, Thomas et al achieved a sensitivity of 75.0% and specificity of 82.4% for the qualitative differentiation of no infiltration versus high-grade infiltration with color-coded maps, but a sensitivity of just 40.0% and specificity of 85.7% for the differentiation between no infiltration versus moderate- and high-grade infiltration [18]. For detection of solid tumor metastasis, diagnostic performance can be improved using contrast agent and low CaSupp indices (25 to 50) [19]. However, contrast media are usually not administered to patients with monoclonal plasma cell diseases due to increased risk for renal insufficiency, dose aspects, and the lack of necessity for the detection of osteolysis.

Currently, conventional MRI is the gold standard for bone marrow evaluation, but ADC measurements have been recommended to be added to standard protocols [4, 7]. Our study revealed an ADC cut-off of 487.6 for the differentiation between diffuse and non-diffuse infiltration with a sensitivity of 78.6% and a specificity of 86.1%. Koutoulidis et al showed higher sensitivity and specificity (100% and 98%) for the differentiation of diffuse versus normal bone marrow in MRI [6]. However, aside from different categories of bone marrow involvement, a comparison of specific ADC cut-offs in different studies is still not feasible due to the influence of technical parameters and patient-specific factors [4].

In summary, this study shows an approach for quantitative assessment of bone marrow infiltration in plasma cell dyscrasias in DLCT but further studies have to be performed in order to refine this approach for clinical application. The additional evaluation of spinal bone marrow infiltration in routinely performed CT would be of value, especially for patients with contraindications to MRI. The main limitation of this study is that with the current technical level of visualization in DLCT, it was not feasible to separate the five different infiltration patterns as proposed for MRI [1]. Hence, the main objective was to distinguish widespread infiltration in DLCT from low/no infiltration which was implemented by integration of intense disseminated focal infiltration into the diffuse category and sporadic focal infiltration together with the normal pattern into the non-diffuse category. This is in contrast to other studies, which used different definitions of infiltration patterns. Furthermore, future studies should investigate comparability of CT and MRI data acquired by different scanners, imaging protocols, and post-processing tools.

Conclusion

Quantitative and qualitative assessment of VNCa images in DLCT is feasible to determine the extent of bone marrow infiltration in plasma cell dyscrasias, especially when the gold standard MRI is not possible. Quantitative assessment of VNCa images produces comparable results to ADC of MRI at CaSupp index 65, averaging L1-L5 vertebrae, and ROC analysis suggests a cut-off of −1.6 HU for the differentiation between diffuse and non-diffuse infiltration.