Introduction

18F-2-Fluoro-2-deoxy-d-glucose ([18F]FDG) positron emission tomography/computed tomography (PET/CT) has secured an important role in the diagnosis and follow-up of large vessel vasculitis (LVV) [1,2,3]. Current guidelines recommend a visual evaluation of [18F]FDG PET with a standardized grading system based on the comparison with liver uptake, wherein uptake in a vessel equal to that of the liver is rated as possibly positive (grade 2) and uptake greater than that in the liver as definitively positive (grade 3) [4].

Although visual interpretation is robust if readers are expert, semi-quantitative values have been suggested, which may help in the diagnostic work-up by increasing readers’ confidence. In two recent publications, the ratio between maximum standardized uptake value (SUVmax) within a large vessel and the mean SUV in the liver (L-SUVmean) has been suggested as the most accurate semi-quantitative parameter for the diagnosis of LVV. Specifically, a ratio of 1.0 for supra-aortic (SA) vessels and 1.3 for thoracic aorta (TA) and infra-aortic (IA) vessels yielded the best diagnostic performance [5, 6].

However, validated semi-quantitative values and their thresholds rely on studies featuring analogue PET scanners only. It is well known that semi-quantitative parameters, especially SUVmax can vary considerably across different scanners [7, 8]. This appears of utmost importance for studies performed on newer generation digital scanners, including long-axial field-of-view (LAFOV) PET/CT systems. The limited resolution and poor signal recovery of small structures such as the vessel wall are partially overcome by digital systems using silicon photomultiplier (SiPM) systems [9]. Such scanners have improved spatial and temporal resolution compared to analogue systems [8, 10] and impact thresholds in the semiquantitative evaluation in the assessment of LVV.

As such, we aimed to evaluate the diagnostic accuracy of various semi-quantitative parameters in [18F]FDG LAFOV PET/CT and to identify the most accurate thresholds.

Materials and methods

Patient population

We retrospectively evaluated the first 50 patients with a final clinical diagnosis of LVV who had undergone [18F]FDG PET/CT for the diagnostic work-up of LVV starting from November 2020. All patients were referred for evaluation of a new diagnosis of LVV. No patients were included who underwent therapy assessment or follow up in known LVV. The final diagnosis of LVV was reached in a multidisciplinary setting based on laboratory results, clinical symptoms and imaging results (ultrasound and PET). All patients were scanned on a LAFOV PET/CT system (Biograph Vision Quadra, Siemens Healthineers, Erlangen, Germany) within 48 h from the first clinical consult, wherein LVV was suspected. As control group, another 50 patients were randomly selected from our oncologic database. To that end, all patients were free from signs of vasculitis. Patients with lymphomas, those referred for the search for infectious or inflammatory foci and those patients under therapy with monoclonal antibodies were excluded.

Imaging protocol

Patients fasted for at least 6 h prior to scanning, blood glucose levels were always  < 120 mg/dl (6.7 mmol/L). Sixty minutes after the intravenous administration of a weight-adapted activity of [18F]FDG (3.0 MBq/kg), images were acquired on the LAFOV PET/CT scanner in list-mode for 10 min in a single bed position (skull-vertex to mid femur). Image reconstruction was performed as previously described using high sensitivity mode (HS, maximum ring difference of 85) [10].

Whole body PET images were reconstructed in 3D to a 440 × 440 × 644 matrix with a voxel size of 1.65 × 1.65 × 1.65 mm3, with a zoom factor of 1.0 using the proprietary time of flight (TOF) point-spread-function (PSF) algorithm with 4 iterations and 5 subsets. A Gauss filter was applied (2-mm FWHM). Emission data were corrected for randoms, scatter, and decay. Non-contrast enhanced, low-dose CT images were used for attenuation correction, parameters have been also previously published [8].

Image evaluation

Images were visually and semi-quantitatively evaluated using appropriate workstation (Syngo.via MMOncology, Siemens Healthineers, Erlangen, Germany). Semi-quantitative parameters (SUVmax, SUVmean, and SUVpeak, respectively) were calculated in the relevant large vessels by manually placing a volume-of-interest (VOI) with a 40%-iso-contour around their whole diameter. Large vessels were divided into three groups: (1) SA vessels: temporal, carotid and subclavian arteries; (2) TA; and (3) IA vessels: abdominal aorta, external/internal iliac, and femoral arteries. Values were calculated for each group, as mean of the single values of all the relevant large vessels.

Consistent with previous reports [5], to-liver normalized values were also calculated for all vessels as the ratio of SUVmax of the relevant vessel to SUVmax of the liver (L-SUVmax) and SUVmean of the liver (L-SUVmean). To that end, these semi-quantitative parameters were calculated for the liver also by placing a standard 10 cm3 VOI in healthy liver tissue in the right lobe. To-blood pool (BP) normalized values were also calculated, by placing a standard 2 × 2 pixels wide VOI centred at the mitral valve plane.

Statistical analysis

Statistical analysis was carried out using IBM SPSS (Version 28.0.1.1, IBM Corp. Armonk, NY, USA). Data are presented as mean ± SD. Comparison between continuous variables in patients with and without LVV was tested using Mann–Whitney U test. The diagnostic accuracy of different semi-quantitative parameters was evaluated by means of receiver-operating-characteristic (ROC) curves analysis on a per-patient basis (with 95% CI) with calculation of Youden Index for the assessment of the best thresholds. P values < 0.05 were considered statistically significant.

Results

Semi-quantitative parameters

Of the 50 patients diagnosed with LVV, SA vessels were affected in 38 (76%), thoracic aorta in 42 (84%), and IA vessels in 26 (52%). Clinical charachteristics are displayed in Table 1. Only 3 patients with LVV (6.0%) presented with the involvement of only one segment (2 with involved temporal arteries and 1 with left subclavian artery), while the majority of patients had multilevel involvement. All semi-quantitative parameters were significantly different between patients with and without LVV (Table 2). Of note, L-SUVmax was also different between patients with and without LVV (2.98 ± 0.13 vs. 3.32 ± 0.14, p = 0.027), while L-SUVmean was not (2.51 ± 0.10 vs. 2.32 ± 0.09, p = 0.125).

Table 1 Clinical characteristics of the patients with LVV
Table 2 Age and semi-quantitative values in patients with and without large vessel vasculitis (LVV). Data are provided as mean ± SD. Area under the curve (AUC) is provided with 95% confidence intervals (CI)

Per-patient diagnostic accuracy

ROC-curves analysis showed higher diagnostic accuracy for to-liver normalized values (AUC always ≥ 0.90) than for non-normalized ones (AUC 0.74–0.89, Table 2). Best separators were calculated for to-liver normalized values, whose ROC curves are displayed in Figs. 1 and 2. For the SA vessels, best thresholds were 0.66 for SUVmax/L-SUVmax and 0.88 for SUVmax/L-SUVmean; for the TA, 1.0 for SUVmax/L-SUVmax and 1.30 for SUVmax/L-SUVmean; finally, for IA vessels, the best threshold was 0.83 for SUVmax/L-SUVmax and 1.11 for SUVmax/L-SUVmean. The thresholds and their sensitivity and specificity are reported in Table 3. To-blood pool normalized ratio proved less accurate (Supplemental Table).

Fig. 1
figure 1

ROC-curves analysis for normalized maximum standardized uptake value (SUVmax) of the relevant vessels to liver SUVmax (L-SUVmax). SA = supra-aortic vessels; TA = thoracic aorta; IA = infra-aortic vessels

Fig. 2
figure 2

ROC-curves analysis for normalized maximum standardized uptake value (SUVmax) of the relevant vessels to liver SUVmean (L-SUVmean). SA = supra-aortic vessels; TA = thoracic aorta; IA = infra-aortic vessels

Table 3 Best separators for semi-quantitative values with values of sensitivity and specificity

Accuracy of to-date suggested thresholds on LAFOV PET

The currently suggested thresholds showed inferior sensitivity and slight higher specificity when applied to LAFOV PET imaging. Using the current threshold of SUVmax/L-SUVmean = 1.0 for SA vessels, sensitivity was 68% and specificity 90%, (with threshold 0.88 sensitivity was 86% and specificity 80%). Using the threshold of SUVmax/L-SUVmean = 1.3 for IA vessels resulted in sensitivity 80% and specificity 96% for IA vessels (with threshold 1.11, sensitivity 96% and specificity 90%).

Discussion

While current guidelines recommend visual interpretation only in PET imaging in the assessment of LVV, recent evidence suggests that semi-quantitative methods may be preferred in clinical practice. Besides the abovementioned report on semi-quantitative parameters able to assist the clinicians in the diagnosis of LVV [5, 6], other scoring systems have been recently suggested [11], also able to differentiate between LVV and atherosclerosis with very good accuracy.

In this regard, it should be noted that a meta-analysis showed that the pooled sensitivity of a visual interpretation is good but not excellent, ranging between 75.9 and 83.3% using the clinical diagnosis as reference standard [12]. Adding also uptake intensity to the visual analysis yields higher diagnostic accuracy, as demonstrated by a recent prospective study featuring 64 patients with suspected giant cell arteritis, wherein PET had sensitivity 92% and specificity 85% [13].

As such, there is a clear rationale to pursue a semi-quantitative evaluation in PET imaging interpretation, although to-date there is still insufficient evidence of a superiority over visual interpretation only [3, 14]. However, another two questions arise: (1) what is the best semi-quantitative parameter and with which separator? and (2) is there a chance that semi-quantitative thresholds are not interchangeable across different PET scanners?

Our work expands on this topic, confirming that the to-liver normalized semi-quantitative values yield higher accuracy than non-normalized ones, consistent with previous reports [5, 6]. But in contrast to the previous works, we here demonstrate a substantial equivalence between SUVmax/L-SUVmax and SUVmax/L-SUVmean in all vascular territories. Furthermore, we provide for the first time data on the diagnostic accuracy of semi-quantitative PET using LAFOV scanners.

We found that the best separators are lower than reported for conventional, analogue PET systems. The most conceivable explanation relies on the intrinsic differences in the scanners. On LAFOV PET, using a 10-min acquisition 1 h post injection reduces the background noise compared to a standard-axial field-of-view scanner (SAFOV) [15]. Moreover, using the advantage of covering all coincidences in the whole FOV simultaneously with a LAFOV system leads to a gain in scanner sensitivity. It should be noted that the exact contribution from digital vs. analogue systems rather than LAFOF vs. SAFOV could not be elucidated.

Thresholds reported in the literature should be decreased and updated when scanning patients on LAFOV PET systems, in order to avoid false negative findings. It should be acknowledged that the to-date established SUVmax/L-SUVmean threshold of 1.3 is adequate for inflammation involving the thoracic aorta. We highly recommend using normalized values as we showed to avoid inter-individual variations of [18F]FDG-uptake.

Taken together, our results seem to indicate a complementary role of the semi-quantitative PET analysis. As a matter of fact, a global scan assessment by PET-experienced nuclear medicine physicians including both visual and semi-quantitative analysis provides high accuracy [13], and this may reflect the fact that an experienced reader can weigh the impact of the degree of [18F]FDG uptake depending on the location, the pattern of [18F]FDG-uptake (i.e., diffuse vs. focal) and the presence of possible atheroma [13].

Some limitations of our study should be acknowledged. First, due to the retrospective nature of the present study, patients already diagnosed with LVV were evaluated, and their results were compared with patients without any clinical and radiologic sign of LVV. Hence, conclusions about our proposed thresholds in patients with clinical signs of LVV but unclear diagnosis may not be fully applicable in clinical practice. To note, the same limitation also pertains to previous studies featuring analogue PET scanners [5, 6]. Furthermore, the patients’ sample is relatively small and further prospective studies are needed for their precise definition. However, the main aim of our study was to underline the need for different thresholds when using LAFOV PET, which bears importance in clinical practice for the nuclear medicine community.

It should also be noted that we cannot rule out incorrect information on ongoing steroid therapy in our population. This may have impacted our results, as glucocorticoids may lower [18F]FDG uptake and mask a subtending inflammation. In this regard, a recent study showed a decrease in the [18F]FDG uptake after 3 days of high-dose glucocorticoids, but without loss of diagnostic accuracy, which only occurred after 10 days of treatment [16]. In clinical practice, the start of glucocorticoid therapy cannot often be delayed during severe clinical symptoms, and therefore, it is conceivable that some of our patients were on steroid therapy at the time of PET/CT. But given the fact that PET was always performed within 48 h from the first clinical visit, it is extremely unlikely that patients were on medications for more than 3 days. Hence, the impact on our values is expected to be negligible.

The multisciplinary team was aware of the PET/CT results as they provided relevant information for patient management, and this may have an impact on the diagnosis. However, the team has ample clinical experience, and based its judgment on the extensive information available from all sources at the end of the diagnostic work-up. Although this gold standard may be subject to criticism, still it represents a common clinical situation, wherein biopsy cannot be performed, and is adhering to current recommendations [17]. Finally, we could not assess the impact of a different timing of imaging after [18F]FDG injection. While different timing was reported to affect the sensitivity for active vasculitis, being higher for later acquisitions [18], our study features the same uptake period post injection of the previous reports, wherein the current thresholds for semi-quantitative PET were suggested. As such, there is a full comparability among our studies, which gives more reliability when assessing the need of different thresholds using LAFOV-PET.

Conclusion

Our results confirm the importance of [18F]FDG-PET/CT in the diagnostic workup of LVV. In this regard, LAVOF PET/CT may yield increased diagnostic accuracy owing to reduced background noise and improved spatial/temporal resolution, but different thresholds in semi-quantitative parameters should be considered. Prospective studies are warranted to implement new reference values in clinical practice when using high sensitivity scanners.