Introduction

The incidental detection of adrenal lesions has increased over the past decades due to improved imaging techniques and increasing numbers of CT scans, with an estimated prevalence of 4–8% on abdominal CT scans [1,2,3,4]. Recent guidelines from the American College of Radiology provide an algorithm for the workup of incidentally detected adrenal lesions ≥ 1 cm [4]. Depending on size, the presence of indeterminate or high-risk imaging features, and history of cancer, either a follow-up or an adrenal CT is recommended [4]. An adrenal CT consists of a pre-contrast scan, a portal venous phase, and a late phase (15 min) [4, 5].

Adrenal adenomas can be differentiated from metastases by low CT values on unenhanced CT (< 10 HU) or by rapid washout between portal venous and late contrast phases (relative washout > 60%) [1, 4, 6]. In clinical routine, adrenal lesions are often detected on contrast-enhanced CT scans and therefore cannot be confidently identified as adenomas. In these cases, further imaging (e.g., an adrenal CT) or follow-up is recommended [4], resulting in additional radiation dose to the patient or a delayed clarification with the potential psychological burden respectively.

Previous studies on dual-energy CT (DECT) have addressed the value of virtual non-contrast (VNC) images for the assessment of adrenal lesions [1, 7,8,9]. However, most studies indicated that VNC overestimated CT values compared to true non-contrast (TNC). VNC series were therefore not recommended for routine clinical use.

In 2021, the photon-counting detector (PCD) technology was introduced. It enables the direct conversion of single X-ray photons into an electric signal, resulting in the reduction of electronic noise and the availability of broad spectral imaging with each scan [10]. A recent study assessed the value of VNC reconstructions on a PCD-CT for the evaluation of adrenal adenomas. However, similar to previous studies, VNC showed an over- or underestimation of CT values compared to TNC [11].

Besides the conventional VNC (VNCConv), a new algorithm is available for creating VNC series: the PureCalcium algorithm (VNCPC). Because iodine has a similar attenuation as calcium, calcium contrast is partially removed in VNCConv series. To address this problem and preserve calcium contrast, VNCPC creates a calcium mask before material differentiation. Recent studies analyzing this algorithm reported more consistent VNC values not only for calcified lesions but also for soft tissue [12,13,14].

Aims of this study were (i) to analyze the conventional VNC (VNCConv) and the new VNC algorithm (VNCPC) for assessment of adrenal lesions in comparison to TNC and (ii) to analyze the value of spectral imaging for the discrimination of adrenal lesions on contrast-enhanced CT scans.

Material and methods

Study population

We searched the local database consisting of patients with suspected malignancy and a contrast-enhanced CT of the abdomen between 04/2021 and 09/2022.

The local Medical Research and Ethics Committee (MREC) approved this retrospective single-center study and waived the need for informed consent for the patients included in the retrospective cohort (protocol number: 23–0451).

Inclusion criteria for the retrospective cohort were as follows: (1) age ≥ 18 years, (2) contrast-enhanced CT of the abdomen on a dual-source photon-counting detector CT (PCD-CT) between 04/2021 and 09/2022 in portal venous phase, (3) presence of an adrenal lesion ≥ 1 cm.

Reference standard

The definition of lesions as adenomas or metastases was performed as described previously [1, 4]. Lesions were diagnosed as adenomas when (1) unenhanced CT showed CT values ≤ 10 HU or (2) size did not change during at least 6 months or (3) abdominal MRI including chemical shift imaging showed signal drop in opposed-phase or (4) the lesion showed an absolute washout > 60% in multiphasic CT. Lesions were diagnosed as metastases when (1) size of the lesion changed within 6 months or (2) abnormal 18F FDG uptake was shown at PET/CT in a known malignancy.

Imaging protocol

All PCD-CT scans were performed on a dual-source photon-counting detector CT (NAEOTOM Alpha, Siemens Healthineers) as routine clinical acquisitions using a monophasic contrast injection protocol in portal venous phase. Each patient received a contrast bolus of 100 ml (Ultravist 300 mgI/mL, Bayer) that was injected via an antecubital vein (flow rate 4.0 ml/s) and followed by a saline bolus of 30 ml. Images were acquired in portal venous contrast phase (after fixed delay of 75 s after contrast injection).

All patients were scanned craniocaudally in a supine position during a single breath-hold. The following parameters were applied: acquisition mode with readout of spectral information (Abdomen QuantumPlus, Siemens Healthineers), 120-kV tube voltage, 0.5 s or 0.25 s, rotation time, 144 × 0.4 mm collimation.

A part of the patient cohort also received true non contrast CT, either within the same protocol prior to contrast application or within a separate acquisition, e.g., during FDG-PET or radiation planning.

Image reconstruction

CTs were reconstructed from image raw data using a dedicated research software (ReconCT 16.0, Siemens Healthineers). For each patient, two VNC series, one conventional (VNCconv) and one pureCalcium (VNCPC) at a virtual monoenergetic level of 70 keV, iodine maps, and virtual monoenergetic image (VMI) series from 40 to 90 keV in 10-keV increments were reconstructed. The slice thickness and increment were 1.0 and 0.5 mm. A soft tissue kernel optimized for quantitative evaluation, Qr40, was used with an iteration strength of 3. Image reconstruction of TNC images was performed using SyngoVia (Syngo.via VB60A, Siemens Healthineers) and same slice thickness and increment as in the PCD-CT.

Image analysis

Semiautomatic segmentation of adrenal lesions was performed by a board-certified radiologist (S.B.) with 7 years of experience in abdominal CT imaging using open-source software 3D Slicer (http://www.slicer.org [15]) (Fig. 1). Quantitative data were extracted after lesion segmentation for all reconstructions (VMI 40–90 keV, VNCConv, VNCPC, and iodine maps) using pyradiomics (version 3.1.0 [16]). First-order features (mean HU values) were used for further analyses. True non-contrast (TNC) images were available for 49/90 patients (for 14/45 metastasis and 35/45 adenomas).

Fig. 1
figure 1

Example of semiautomatic segmentation of a left adrenal adenoma using 3D Slicer

Statistical analysis

Analysis of descriptive data and statistical analyses were performed using R (R Statistics, version 4.3.1, R Core Team, [17]), RStudio (version 2023.06.2 [18]), and Python 3.7.1 (www.python.org). Shapiro–Wilk tests were performed to check for normal distribution. Non-normally distributed data are presented as median and interquartile range (IQR), normally distributed data as mean (± standard deviation). Mann–Whitney U tests (non-normally distributed data) or t-tests (normally distributed data) were performed to compare between two groups. Bonferroni correction was performed for multiple testing. Bland–Altman plots were used to describe the similarity between VNC and TNC. Logistic regression models with imaging features (HU values from spectral data) as independent variables and outcome (adenoma vs. metastasis) as dependent variable were performed after splitting the cohort into training and test cohort (80/20). Five models were randomly built on the training cohort (fivefold cross-validation). After 5 models were trained and validated, the method gave as final trained model the one that obtained the best results. This model was then applied on the test cohort. ROC analyses were performed in R using the pROC-package. Data are shown for all established models (folds 1–5) as well as for the selected best model (test set). Area under the curve (AUC) is shown for the model that was applied on the test cohort. The optimal cutoff value was determined using the cutpointr-package in R. For feature selection, the Boruta package was applied in R. The Boruta algorithm is a wrapper method for feature selection and often applied in radiomics analyses [19]. The algorithm uses a random forest (RF)–based classification model to select the most important features. Statistically significant differences were assumed at p-values ≤ 0.05.

Results

Patient population

From the local database, 110 patients with adrenal lesions were identified. We excluded patients due to missing ground truth (n = 10), lesions too small to measure (n = 4), missing contrast-enhanced scan (n = 1), diagnosis of adrenal carcinoma (n = 1), and missing raw data (n = 4).

Finally, 90 patients (36 female assigned at birth, mean age 67.8 years [range 39–87]) were included in this study. Twenty-six patients had bilateral adrenal lesions; in these cases, we analyzed the left side (except for one case with calcification in the left adrenals). Baseline characteristics are shown in Table 1.

Table 1 Baseline characteristics

Quantitative analysis

Adrenal adenomas showed significantly lower median CT values in VNCConv reconstructions (18.15 [12.92–24.26] vs. 31.28 [26.00–35.89], p = 0.007) and in VNCPC reconstructions (16.00 [11.02–21.56] vs. 27.22 [22.59–30.64], p = 0.007) compared to adrenal metastases (Fig. 2, Table 2).

Fig. 2
figure 2

Boxplots for quantitative analyses of mean CT values a VNCConv (p = 0.007), b VNCPC (p = 0.007), c 70 keV (p = 1.000) and d iodine density maps (p = 0.726) in adrenal lesions and differentiation between metastases and adenomas. 0 = no metastasis/adenoma; 1 = metastasis; VNCConv = conventional virtual non-contrast; VNCPC = Virtual non-contrast pure calcium

Table 2 Mean CT values at different reconstructions in metastasis and adenoma

For patients with available TNC images (n = 49; metastasis = 14, adenoma = 35), Bland Altman plots showed a mean difference of 14.85 for VNCPC reconstructions and of 17.67 for VNCConv reconstructions with higher values for both VNC algorithms compared to TNC (Fig. 3).

Fig. 3
figure 3

Bland Altman plots for difference between TNC and different VNC algorithms ­(a VNCConv and ­b VNCPC) in the subgroup of patients with available TNC (n = 49). TNC = true non-contrast; VNCConv = conventional virtual non-contrast; VNCPC = Virtual non-contrast pure calcium

Feature selection

Using the Boruta random forest method for feature selection, VNCConv and VNCPC were selected as the features with the highest importance (Table 3, Fig. 4). Also, iodine maps as well as VMI 80 and 90 keV were important features in this model yet with lower importance.

Table 3 Feature importance
Fig. 4
figure 4

Feature importance in a Random Forest Model (Boruta)

Logistic regression models

Logistic regression models showed that use of all available spectral data had the best ability to discriminate between adrenal adenomas and adrenal metastases (AUC = 0.938) (Fig. 5). Used independently, TNC showed an AUC of 0.762, VNCConv reconstructions showed an AUC of 0.857, and VNCPC reconstructions an AUC of 0.931 in ROC analyses (Fig. 5). Table 4 gives an overview of different cutoff values for VNCConv and VNCPC reconstructions and TNC and their sensitivity and specificity as well as accuracy and precision for discrimination between adenomas and metastases. Using for example 26 HU in the VNCConv reconstruction as a cutoff value, there is a sensitivity of 86.7% and a specificity of 75.6% for the correct diagnosis. For TNC, an optimal cutoff value of 10.8 was observed in our cohort and showed best discrimination between benign and malign adrenal lesions with a sensitivity of 71.4% and a specificity of 82.9%.

Fig. 5
figure 5

ROC analysis using metastasis as dependent variable and either a all features, b TNC, ­c VNCConv, or ­d VNCPC as independent variables. Data are shown for all five trained models (folds 1–5) and for the best model that was selected after validation and then applied to the test cohort (test set). AUC is shown for the model that was applied to the test cohort. TNC = true non-contrast; VNCConv = conventional virtual non-contrast; VNCPC = Virutal non-contrast pure calcium

Table 4 Best cutoff values

Discussion

Although the mean differences to TNC are smaller on VNCPC than on VNCConv series, there is still an overestimation of CT values in the assessment of adrenal lesions. Nevertheless, the use of VNC facilitates the discrimination of adenomas and metastases on contrast-enhanced scans, using a higher threshold for discrimination (e.g., 26 HU instead of the established 10 HU for TNC).

The diagnostic workup of incidental adrenal lesions on contrast-enhanced CT scans is a well-known challenge in radiology. In most cases, further imaging (e.g., adrenal CT or MRI) or follow-up is recommended. This leads to increased costs due to additional testing, which can also lead to increased radiation exposure and/or delayed diagnosis. In the era of DECT and PCD-CT, one might think that virtual non-contrast imaging could solve this problem. However, similar to previous studies on DECT, the first published experiences on PCD-CT also reported that VNC over- or underestimates CT values compared to TNC [1, 9, 20].

Therefore, VNC has not been recommended for routine clinical use because even a discreet discrepancy in CT values may have a significant impact on the diagnosis and further workup of adrenal lesions [4, 11]. Similar to previous studies, we also reported significant overestimation of VNCConv compared to TNC with a mean difference of about 17 HU.

In this study, we aimed also to assess the value of a new and promising VNC algorithm (VNCPC) in the diagnosis of adrenal lesions [13]. Whereas the discrepancy in CT values compared to TNC is smaller using this new algorithm, there is still a mean difference of about 15 HU. A mean deviation of 15 HU might be too high in the assessment of adrenal adenomas and might entail incorrect diagnoses.

Therefore, it is necessary to address this problem in the near future and to improve algorithms of VNC. However, VNC reconstructions might still have value in clinical routine currently if a “new” (different) threshold for VNC reconstructions is agreed on. Logistic regression analyses in this work revealed higher optimal thresholds for VNC in the discrimination between adenomas and metastases. For example, using a threshold of 26 HU instead of the established 10 HU for TNC results in high sensitivity and specificity for differentiating between adenomas and metastases. In our study, specificity and sensitivity were even higher using, e.g., 26 HU in VNC reconstructions compared to 10 HU in TNC. Our analyses reported different “optimal” cutoff values for VNCConv and VNCPC, which are presented in Table 4. They differ regarding specificity and sensitivity. With lower HU values, sensitivity increases while specificity decreases and vice versa. In our opinion, 26 HU was the best “compromise” between specificity and sensitivity for VNCConv. Therefore, we decided to choose this value. However, further studies with larger patient cohorts must confirm the results of this study and define the “final” optimal cutoff value. An integration of this threshold in clinical routine also requires further analyses; the findings of well-established 10 HU as the best threshold in TNC in our cohort support the validity of our examinations. Yet, one major challenge might be the standardization of the results in VNC reconstructions. Due to rapid technical developments, different VNC algorithms exist. Therefore, standardization of VNC pipelines or alternatively a harmonization of data will be crucial aspects for upcoming studies with larger patient cohorts. Especially multi-center studies with consented protocols are necessary to confirm the results of this study and to redefine the threshold for the diagnosis of adrenal lesions in contrast-enhanced CT.

The present study also analyzed the value of different imaging features derived from spectral imaging (e.g., iodine maps, VMI) in discriminating between adrenal adenomas and metastases. A previous study on DECT showed promising results for the combination of iodine density and VNC [1]. Other studies have also highlighted the discriminative power of fat fraction and radiomics [7, 8].

To our knowledge, no previous studies have addressed the value of spectral imaging for the discrimination between adrenal adenomas and adrenal metastases. Our study shows that adrenal adenomas and metastases have significantly different imaging characteristics (e.g., adenomas have lower CT values in VNCConv and VNCPC compared to metastases), which is similar to previously reported results on DECT [1].

Logistic regression analysis and random forest classifiers identified VNC (both VNCConv and VNCPC) as the most important parameters in discriminating between adrenal adenomas and metastases.

In view of these results, it is very important to address the technical optimization of VNC algorithms to approximate CT values of TNC and/or the definition of a “new” (different) threshold for VNC. The possibility to safely replace TNC with VNC could save many CT or MRI scans in the workup of adrenal lesions, might accelerate the diagnosis, and potentially reduce psychological burden for the patients due to delayed clarification.

In contrast to the previous study on DECT [1] and also to the previous study on PCD-CT [11], we performed a complete 3D segmentation of the entire adrenal lesion and did not use only ROI-based measurements. We aimed to improve the diagnostic accuracy and avoid bias that may be introduced by ROI-based measurements. However, similar results with an over- and underestimation of CT values in VNC (compared to TNC) were also observed with this method. The spectral information used in this study is based on PCD-CT differentiating photon energy (above a certain level to eliminate electronic noise) in high and low, e.g., using one threshold to create two so-called bins. In the future, up to four bins (differentiation with three thresholds) will most likely be available according to the manufacturer. This could provide even more information, potentially leading to improved CT values in VNC reconstructions.

Body mass index (BMI) is known to affect image noise and image quality [21]. A previous study on PCD-CT showed that contrast-to-noise ratio (CNR) for detection of liver metastases did not differ in a wide range of BMI suggesting a preservation of CNR on a PCD-CT [22]. Similar results were also shown in a previous study analyzing the conspicuity of pancreatic ductal adenocarcinoma on a PCD-CT; this study also found no significant differences in CNR in portal venous phase for patients with higher and lower BMI [23]. The present study showed no significant differences in BMI between patients with adenomas and metastases; however, a tendency towards a lower BMI in the metastases cohort was observed. According to the results of the previous studies and the suggested CNR-preserving potential of PCD-CT in a wide range of BMI, we do not expect these findings to significantly influence the results of the present study (also due to the small patient cohort).

This study has limitations: first, this was a single-center study with inclusion of both—patients with known or suspected malignancies and multiphasic CT scans for assessment of adrenal lesions [24]. Second, TNC was only available for a subgroup of patients with a higher proportion of patients with adenomas which might introduce a bias. Third, the diagnosis of adenomas or metastases was not confirmed histopathologically, but by either further imaging (e.g., adrenal CT, unenhanced CT, MRI) or follow-up.

Conclusion

VNC reconstructions tend to overestimate CT values in comparison to TNC in the assessment of adrenal lesions. However, there is still a high diagnostic accuracy for both, the conventional and the new calcium-preserving VNC algorithm in the discrimination of adrenal lesions, especially when elevating the established threshold of 10 HU to about 26 HU. Therefore, a “new” threshold for VNC reconstructions may safely discriminate adrenal lesions and might save further workup of unclear cases. Further studies with larger patient cohorts and multi-center studies using standardized methodology and data harmonization are necessary to confirm the results of this study.