Introduction

The latest generation positron emission tomography (PET) integrated with computed tomography (CT) systems is equipped with silicon photomultiplier (SiPM)-based detector elements. These systems with improved detection capabilities may contribute to enhance diagnostic performance [1,2,3], but could also allow a reduction in scan duration and/or reduction in administered radioactivity [3, 4].

Over the past decades, antibody-based PET (immunoPET) imaging has become increasingly important in drug development [5]. In addition, it may be a valuable tool to determine which patients may benefit from treatment with a specific monoclonal antibody (mAb) via non-invasive characterisation of tumour lesions, and for evaluation of treatment response [6].

Numerous advantages of zirconium-89 (89Zr), such as the long half-life of 78.4 h matching the pharmacokinetic behaviour of antibodies, its relatively low average positron energy of 395 keV resulting in low positron range (3.6 mm in water) for high-resolution PET imaging, and good in vivo stability, make it a suitable candidate for labelling of mAb [7,8,9]. For 89Zr immunoPET imaging, the higher sensitivity of SiPM-based PET/CT systems could be particularly beneficial as the low positron abundance (22.7%) causes the acquired PET images to have a low signal-to-noise level. In addition, the high-energy gamma emission of 909 keV causes high radiation burden and limits the amount of radiotracer that can be administered to patients [10]. Hence, long scan durations are required to obtain adequate statistical image quality, especially at later scan time points.

Efforts to characterise and harmonise 89Zr-PET imaging by means of phantom measurement comparisons among different types of systems to ensure quantitative images have been done by Makris et al. [11]. These efforts have been continued by Kaalep et al. [12]. Other system performance comparisons using 89Zr-filled phantom studies have been performed by Christian et al. [13] to compare and optimise image quality and phantom sphere (lesion) detectability between systems.

Currently, total body 89Zr immunoPET imaging can last up to 2 h acquisition at later time points at 6–7 days postinjection (p.i.), using the conventional photomultiplier tube (PMT)-based mCT Biograph PET/CT (from now on referred to as mCT (Siemens Healthineers)), our clinical reference system. The improved performance characteristics of the SiPM-based Biograph Vision PET/CT (from now on referred to as Vision (Siemens Healthineers)) are expected to lead to improved 89Zr immunoPET image quality, and therefore may allow for a reduction in scan duration to increase patient comfort and throughput, and/or administered radioactivity for reduction of radiation exposure.

This study aimed to investigate the difference in semiquantitative performance between the mCT and the Vision PET/CT systems in 89Zr immunoPET imaging. In addition, the effects of acquisition time reduction on semiquantitative imaging parameters and its influence on image quality assessment were evaluated.

Materials and methods

Patient population

Patients (n = 15 (5 men, 10 women; age 33–79, mean ± SD 58 ± 13 years; weight 52–109, mean ± SD 73 ± 15 kg) with cancer with visible [89Zr]mAb PET tracer uptake at day 4 p.i. in at least one tumour lesion were enrolled in this prospective study between June 2018 and February 2020 in case of a referral for an 89Zr immunoPET acquisition to solve a clinical dilemma [14, 15] or for research purposes (ClinicalTrials.gov identifiers NCT02453984 [16] and NCT04029181 [17]). All patients were scanned on the Vision PET/CT system. Patients (n = 5) that underwent the dual acquisition protocol were scanned on both PET/CT systems and gave additional written informed consent after being informed on the study aims, procedures and the additional acquisition of a low-dose CT (\(\sim\) 1 mSv). For this purpose, the local medical ethics committee exempted approval without additional procedures (waiver number: METc2017/489).

Imaging protocol

Patients received an intravenous injection of 37 MBq 89Zr-labelled mAb. PET/CT data were acquired on day 4 p.i. On the Vision PET/CT, a standard low-dose CT scan (an X-ray tube current of 43 mAs, a tube voltage of 100 kV and a spiral pitch factor of 1) was performed from the vertex to the toes and used for attenuation and scatter correction. A consecutive emission PET scan was acquired in listmode at 300 s per bed position (s/bp). In case patients were scanned on the mCT, the acquisition parameters of the low-dose CT were as follows: an X-ray tube current of 99 mAs, a tube voltage of 140 kV and a spiral pitch factor of 1.5. PET/CT imaging on the mCT were also performed in listmode at 300 s/bp.

Subsequently, PET listmode data acquired on the Vision were reprocessed to produce additional sets of sinograms corresponding to 225, 150 and 75 s/bp (scan durations are hereinafter referred to as 100%, 75%, 50% and 25% of the acquisition time mimicking shorter scan durations). For reconstruction of Vision PET images, three different protocols were used for each of the four scan durations. We applied the vendor provided reconstruction protocol currently used for [18F]FDG imaging, i.e. an ordinary Poisson ordered-subset expectation maximisation (OP-OSEM) 3D-iterative algorithm [18] using 4 iterations, 5 subsets, time-of-flight (ToF) application and resolution modelling, without filtering (hereinafter referred to as the Clinical Vision protocol). In addition, the European Association of Nuclear Medicine (EANM) Research Ltd. (EARL)1 and EARL2 reconstructions [19,20,21] currently used for quantification of clinically acquired oncological [18F]FDG images were obtained using 3D OP-OSEM with 4 iterations and 5 subsets, ToF, with resolution modelling and a Gaussian filter of 7 mm and 5 mm, respectively (hereinafter referred to as the EARL1 and EARL2 Vision protocols). The resulting image size of the images obtained on the Vision was 220 × 220 with a voxel size of 3.3 × 3.3 × 1.5 mm.

For PET data acquired on the mCT, also three different reconstruction protocols were used. The clinically preferred multicentre validated 89Zr PET reconstruction protocol [11, 12] using 3D OP-OSEM with 3 iterations, 21 subsets, ToF, resolution modelling and a Gaussian filter of 8 mm (hereinafter referred to as the 89Zr-EARL mCT protocol [11]). In addition, images acquired on the mCT were also reconstructed to comply with EARL [18F]FDG imaging settings using 3D OP-OSEM with 3 iterations, 21 subsets, ToF, resolution modelling and a Gaussian filter of 6.5 mm (the EARL1 mCT protocol) and a Gaussian filter of 5 mm (the EARL2 mCT protocol). The resulting image size of the images obtained using the mCT was 256 × 256 with a voxel size of 3.2 × 3.2 × 2.0 mm, thus closely matching the image voxel sizes between systems.

Semiquantitative image analysis

Reconstructed PET/CT data were semiquantitatively analysed using the quAntitative onCology moleCUlaR Analysis suiTE (ACCURATE) version v08072019 [22]. Per image, individual tumour lesions were manually delineated to obtain the maximum and the peak standardised uptake value (SUVmax and SUVpeak) of the tumour. In addition, 1-cm-diameter spherical volumes of interest (VOIs) were placed in healthy tissues: blood pool, kidney cortex and spleen well within the boundaries of the organ to avoid partial volume effects. From these VOIs, SUVmax, SUVpeak and mean standardised uptake value (SUVmean) were obtained. A 3-cm-diameter spherical VOI was placed in a homogeneous part of the liver to obtain SUVmax, SUVpeak and SUVmean, and to characterise image noise using the standard deviation of the activity within the VOI with regard to the mean activity within the VOI.

Qualitative image analysis

Images obtained on the Vision at the four different reconstructed scan durations using three different reconstruction protocols were evaluated on image quality. Two nuclear medicine physicians (AHB and WN, with 20 and 5 years of experience in 89Zr immunoPET image reading, respectively) independently assessed the images using a dedicated syngo.via VB30 (Siemens Healthineers) workstation. All images were scored based on a 5-point Likert scale regarding image noise, lesion margin demarcation and overall image quality (see Supplemental Fig. 1  for the used visual image assessment form).

Fig. 1
figure 1

Patient example 89Zr immunoPET images obtained using the Vision PET/CT. Maximum intensity projection PET images acquired at day 4 p.i. of 37 MBq [89Zr]mAb of a 79-year-old patient (weight 86 kg) with metastatic breast cancer acquired at 100%, 75%, 50% and 25% of the scan duration (from left to right, respectively) using the Clinical Vision, EARL2 Vision and EARL1 Vision reconstruction protocols (from top to bottom, respectively). Images were scaled at equal contrast intensities

Statistical analysis

Statistical analyses were performed in SPSS Statistics, version 25.0 (IBM Corp.). To evaluate the difference in semiquantitative performance between systems, non-parametric Wilcoxon signed rank tests were conducted. The difference in lesion SUVmax and SUVpeak and the difference in healthy tissue SUVmax, SUVpeak and SUVmean between systems were explored.

For each reconstruction method performed to obtain images on the Vision, lesion SUVmax and SUVpeak per scan duration, 75%, 50% and 25%, were compared with the lesion SUVmax and SUVpeak of images acquired at 100% of the count time. A repeated-measures analysis of variance (ANOVA) with post hoc Bonferroni adjustment for pairwise comparison was used. A P value of less than 0.05 was considered significant. This comparison was also performed for assessing the difference in healthy tissue SUVmax, SUVpeak and SUVmean between the 100% scan time images and images acquired at shorter scan durations.

Furthermore, the standard deviation of the voxel values within the liver VOIs was compared between the 100% scan time images and images acquired at shorter scan durations as well.

Inter-reader agreement concerning image noise, tumour lesion demarcation and overall image quality was analysed using kappa statistic. To this aim, the original 5-point scores were reassigned to 2-point scores: 1 + 2 + 3 became 1, and 4 + 5 became 2. A P value of less than 0.05 was considered significant.

Results

Semiquantitative image analysis

Five patients were scanned on both PET/CT systems and acquired images were evaluated for the semiquantitative performance comparison of the mCT versus Vision. Each acquisition using the mCT resulted in three images (obtained using three different reconstruction protocols), whereas each acquisition using the Vision resulted in 12 images (three reconstruction methods times four scan durations). For illustrative purposes, Fig. 1 shows example patient PET/CT images acquired using the three reconstruction protocols at different scan durations ranging from 100 to 25%. From top to bottom, Fig. 1 shows images obtained using the Clinical Vision, EARL2 Vision and EARL1 Vision reconstruction protocol, arranged by amount of applied smoothing from least to most. In the Clinical Vision images obtained using 100% scan duration, a clearly demarcated lesion in the vertebra is visible; however, the image has a noisy (speckly) outlook which becomes more disturbing towards shorter acquisition times. The images reconstructed according to EARL2 Vision settings show a sharply demarcated lesion and, because of the applied 5-mm Gaussian filter, the noise is smoothed away which prevents the speckly noise appearance to take the upper hand, also in the images obtained with shorter scan durations. The EARL1 Vision reconstructed images still clearly show the lesion in the vertebra, however apart from smoothing away the noise, the 7 mm Gaussian filter also smooths the edges of the lesion causing a slightly blurred demarcation. Regarding shorter scan durations, the EARL1 reconstructed images show the highest robustness to noise with the least increase in speckly noise pattern towards shorter acquisition times.

For each of the five patients scanned on both the mCT and Vision, a total of 15 images were obtained, whereas 12 images were obtained for each of the 10 patients undergoing acquisition on the Vision only. Overall, 195 images were collected and tumour segmentations were performed on each of the images individually. A total of 5 tumour lesions were found in the double scans, a single lesion per patient. The lesions were first identified on the images acquired on the mCT, our clinical reference system, and subsequently confirmed on the Vision images. Furthermore, 17 tumour lesions were included in the single acquisitions obtained using the Vision PET/CT (a total of 2 tumour lesions per tissue type per patient). In total, 279 tumour segmentations were made (a segmentation for each reconstruction method and acquisition time resulted for the acquisitions on the mCT in three segmentations per lesion (i.e. 5*3 tumour segmentations), and for the Vision in 12 segmentations per lesion (i.e. (17 + 5)*12 tumour segmentations).

Concerning the double acquired scans, median lesion SUVmax and SUVpeak derived from the images obtained using the mCT and the EARL standard 1 compliant harmonised reconstruction protocol were 14.1 (range 2.4–35.7) and 10.3 (range 1.7–14.4). For the dual images obtained with the Vision, median lesion SUVmax and SUVpeak derived from images reconstructed according to the EARL1 standard compliant protocol were 17.9 (range 4.1–29.7) and 10.1 (range 2.0–13.6), respectively. An overview of median lesion SUVmax and SUVpeak comparison between PET/CT systems obtained using different reconstruction protocols can be found in Table 1. Regarding the included healthy tissues, a comparison of semiquantitative parameters between PET/CT systems is shown in Table 2.

Table 1 Tumour lesion median SUVmax and SUVpeak comparison between both systems (only shown for acquisitions on both PET/CT systems (n = 5)
Table 2 Healthy tissue median SUVmax, SUVpeak and SUVmean comparison between systems (only shown for acquisitions on both PET/CT systems (n = 5))

Scatter plots in Fig. 2 show the difference in lesion SUVmax and SUVpeak between images derived from both PET/CT systems. For each of the healthy tissues, a similar comparison between systems was performed using SUVmax, SUVpeak and SUVmean (see Fig. 3). The results from the semiquantitative performance comparison between PET/CT systems using Wilcoxon signed rank tests are shown in Table 3; no significant difference is indicated with ‘equivalent’. A significant difference in lesion SUVmax and SUVpeak was found between systems when comparing the images obtained using the Clinical Vision protocol and the 89Zr-EARL mCT protocol (Z =  − 2.02, P < 0.05); no significant differences in tumour lesion SUVmax and SUVpeak were found when comparing system semiquantitative performance using the EARL standard compliant settings 1 and 2. Concerning the healthy tissues, significantly different SUVmax were found between systems in the blood pool, kidney cortex, spleen and liver when using the Clinical Vision and 89Zr-EARL mCT reconstruction protocol (P < 0.05). Using these reconstruction settings, SUVpeak measured in the kidney cortex differed significantly between systems as well (Z =  − 2.02, P < 0.05). No significant differences in healthy tissue SUVmax, SUVpeak and SUVmean were found when comparing system semiquantitative performance using the EARL standard compliant settings 1 and 2 for the blood pool, kidney cortex and liver. However, for the spleen, significant differences between systems regardless of reconstruction settings and semiquantitative parameters were observed (P < 0.05). Table 2 shows an overall increase of approximately 25% in median SUV for the spleen irrespective of reconstruction protocol.

Fig. 2
figure 2

Semiquantitative tumour lesion comparison between PET/CT systems. For each patient (n = 5), the lesion SUVmax (A) and SUVpeak (B) derived from images obtained with the Vision PET/CT system and using the Clinical Vision (Vision CLIN) (white dot), the EARL2 Vision (grey dot) and the EARL1 Vision reconstruction protocol (black dot) are compared directly with tumour lesion SUVs derived from images using the mCT PET/CT system and 89Zr-EARL mCT, EARL2 mCT and EARL1 mCT reconstruction protocols (white, grey and black triangles, respectively). Please note, for readability reasons, the tumour lesion SUVmax outliers found in patient 4 of 64.1 obtained using the Clinical Vision protocol and of patient 2 of 48.3 obtained using the EARL2 mCT protocol are not shown in subfigure A

Fig. 3
figure 3

Semiquantitative healthy tissue compared between PET/CT systems. For each patient (n = 5), SUVpeak (left column) and SUVmean (right column) of healthy tissues (blood pool (A and B), kidney cortex (C and D), spleen (E and F) and liver (G and H)) derived from images obtained through use of the Vision PET/CT system and the Clinical Vision (Vision CLIN) reconstruction protocol (white dot), the EARL2 Vision reconstruction protocol (grey dot) and the EARL1 Vision reconstruction protocol (black dot) are compared directly with healthy tissue SUVs derived from images using the mCT PET/CT system and.89Zr-EARL mCT, EARL2 mCT and EARL1 mCT reconstruction protocols (white, grey and black triangles, respectively)

Table 3 Semiquantitative performance in tumour lesion comparison between both PET/CT systems (n = 5) using the Wilcoxon signed rank test

Boxplots in Fig. 4 show lesion SUVmax and SUVpeak comparisons between different scan durations derived from images obtained using the Vision. A significant difference was found in lesion SUVmax between images using the Clinical Vision reconstruction protocol obtained at 100% scan duration and 25% (P = 0.009, 95% CI (− 18.0 to − 2.0)), and between images obtained at 75% scan duration and 25% (P = 0.008, 95% CI (− 17.1 to − 2.0)). When using the EARL2 Vision reconstruction settings, lesion SUVmax differed significantly between 75 and 25% scan duration (P = 0.02, 95% CI (− 5.5 to − 0.3)). Images reconstructed using the EARL1 Vision protocol showed significant differences in lesion SUVmax between 100 and 25% scan duration (P = 0.001, 95% CI (− 2.3 to − 0.5)), and between 75 and 25% (P = 0.009, 95% CI (− 2.5 to − 0.3)). No significant differences were found in lesion SUVpeak between images obtained at different scan durations.

Fig. 4
figure 4

Vision only semiquantitative tumour lesion comparison. For all patients (n = 15), the tumour lesion SUVmax (A) and SUVpeak (B) derived from images obtained using the Vision PET/CT system and the Clinical Vision reconstruction protocol (light grey), the EARL2 Vision reconstruction protocol (grey) and the EARL1 Vision reconstruction protocol (dark grey) are compared at different scan durations (100 to 25%, from left to right for each reconstruction protocol). The boxes bound the interquartile range (IQR) divided by the median SUV. The whiskers extend to a maximum of 1.5*IQR beyond the box. \(*\) indicates P < 0.05 and \(**\) indicates P < 0.01. Outliers are represented by dots

The results of the healthy tissue comparisons between scan durations are shown in Fig. 5. No significant differences in healthy tissue SUVmean and SUVpeak were found between images obtained using the Vision at different scan durations. Because of different 89Zr-labelled mAbs used in this study, a substantial visual difference was observed in the uptake in the spleen. For clarity, the healthy tissue comparisons per 89Zr-labelled mAb for the spleen are shown in Supplemental Fig. 2 .

Fig. 5
figure 5

Vision only semiquantitative healthy tissue comparison between scan durations. For all patients (n = 15), SUVpeak (left column) and SUVmean (right column) of healthy tissues (blood pool (A and B), kidney cortex (C and D), spleen (E and F) and liver (G and H)) derived from images obtained through use of the Vision PET/CT system and the Clinical Vision reconstruction protocol (light grey), the EARL2 Vision reconstruction protocol (grey) and the EARL1 Vision reconstruction protocol (dark grey) are compared at different scan durations (100 to 25%, from left to right for each reconstruction protocol)

Noise estimates from calculation of the coefficient of variation (COV) derived from the 3-cm-diameter liver VOIs in each image are shown in Fig. 6. A difference in image noise levels can be observed between reconstruction methods and scan durations. Noise levels increase with shorter scan times; this effect is more prominent when using the Clinical Vision reconstruction protocol compared to EARL2 and EARL1 compliant reconstruction settings.

Fig. 6
figure 6

Liver COV (n = 15) obtained from the Clinical Vision (light grey), EARL2 Vision (grey) and EARL1 Vision compliant (dark grey) reconstructed images at 100%, 75%, 50% and 25% of the scan time (from left to right)

Qualitative image analysis

Vision only images (n = 15 patients) were reconstructed using three different reconstruction protocols and visually assessed on noise levels, lesion demarcation and overall image quality. Highest mean scores on noise levels and overall image quality were assigned to the images reconstructed according to the EARL1 Vision protocol (mean scores on noise and image quality of 4.3 and 4.4 at 100% of the scan duration, 3.6 and 3.9 at 75% and 3.1 and 3.3 at 50%, respectively). The EARL2 Vision reconstructed images received a slightly higher appreciation on lesion demarcation with respect to the EARL1 Vision reconstructed images (mean score of 4.4 versus 4.2 at 100% of the scan duration, 4.1 versus 3.9 at 75% and 3.7 versus 3.5 at 50%).

Inter-reader agreement ranged from fair to moderate on noise, lesion demarcation and overall image quality with \(\kappa\) = 0.23 (P = 0.000, 95% CI (0.12–0.34)), \(\kappa\) = 0.27 (P = 0.000, 95% CI (0.13–0.41)) and \(\kappa\) = 0.41 (P = 0.000, 95% CI (0.28–0.55)), respectively.

Discussion

In the current study, semiquantitative performance of the Vision with regard to its predecessor, the mCT, was evaluated for 89Zr immunoPET imaging in oncology patients. In addition, possibilities of reducing scan time while maintaining image quality using the Vision were explored.

This study shows that when using the Vision PET/CT system, a reduction in scan time of 50% is possible regardless of reconstruction settings according to Fig. 4. However, the use of images reconstructed according to EARL2 Vision or the Clinical Vision protocol would result in too much elevated noise levels (see Fig. 6). Therefore, we recommend to apply the EARL1 Vision settings for image reconstruction for a 50% reduction in scan duration while maintaining semiquantitative PET image accuracy.

Significant differences in semiquantitative PET image parameters were found for shorter scan durations when using the Clinical Vision reconstruction protocol, especially SUVmax increases at shorter scan times (50% and 25%) with regard to 100% acquisition time. A non-negligible consideration when using SUVmax for uptake measurements in PET images is statistical quality. When reducing scan time, variability in SUVmax can largely be explained by the associated increase in noise [23] (see Fig. 6). The addition of a Gaussian filter (as used in the EARL2 and EARL1 Vision reconstruction settings (7 mm and 5 mm, respectively)) smooths the image hereby reducing noise and SUVmax variability [24] which results in similar results for lesion quantification at shorter scan times.

A more robust alternative, and diminishing the need for a filter at shorter acquisition times, is to use SUVpeak for lesion quantification. Although SUVpeak is expected to be more susceptible to partial volume effect in small lesion segmentation (19), its semiquantitative performance is less affected by scan duration and reconstruction protocol (see Figs. 4 and 5). Makris et al. [11] previously recommended the use of SUVpeak for performing semiquantitatively accurate 89Zr immunoPET imaging studies. They found very low variability in SUVpeak between various PET/CT systems and imaging sites. Moreover, this recommendation has then been affirmed in an 89Zr immunoPET imaging study by Kaalep et al. [12] describing a multicentre PET/CT system and reconstruction comparison trial in which SUVpeak was found to be least sensitive to noise and reconstruction differences. As we also found SUVmax to vary not only between reconstructed images obtained from different PET/CT systems, but also between differently reconstructed images obtained from a single system, our recommendation is (in line with previously reported results described above) to use SUVpeak for quantification of 89Zr immunoPET images. In case EARL standard 1 compliant settings are used (for comparison between systems), SUVmax could be reported as well for lesion quantification besides SUVpeak.

In order to obtain quantitatively comparable results, we standardise and harmonise PET imaging procedures [19] incorporating various methods, including different amounts of applied smoothing. With the introduction of new PET systems with improved performance characteristics, the harmonisation and standardisation specifications are updated as well [20] to preserve some of the improved image quality that can be obtained. However, in the case of 89Zr immunoPET imaging, only a low amount of activity can be administered (37 MBq) due to the long half-life associated with high radiation exposure. This low amount of activity in addition to the low positron abundance causes image quality of 89Zr immunoPET images to be in the bottom range; low count statistics result in overall poor image quality. Hence, reconstruction protocols that smooth in excess, such as incorporated in the EARL1 compliant reconstruction settings, are still required under these circumstances.

In the current study, the difference between SUVpeak and SUVmean derived from healthy tissues is minimal due to the small 1-cm-diameter spherical VOI that was used in the blood pool, kidney cortex and spleen. SUVmean was included in the evaluation of semiquantitative performance comparison of healthy tissues between systems as SUVmean measurement of these organs is used, for example, in whole organ dosimetry analyses. Using SUVmean for whole organ dosimetry avoids susceptibility to segmentation variability as opposed to using SUVmax or SUVpeak [25]. Please note, the significant difference in SUVmax for tumour lesions and all healthy tissues found between systems using the Clinical Vision reconstruction protocol versus the 89Zr-EARL mCT protocol (see Tables 3 and 4). This difference can be explained by the 8-mm Gaussian smoothing filter applied to the images acquired using the mCT, whereas images acquired on the Vision were not smoothed at all. With regard to semiquantitative performance between systems at 100% acquisition time, no further significant differences between tumour lesions and blood pool, kidney cortex and liver measurements were found. For the spleen however, a significant increase in SUV measured on the images obtained from the Vision PET/CT was observed with respect to the images obtained from the mCT PET/CT. This was only the case for the spleen (and not for the other healthy tissues). Thus far, we have not found a plausible explanation. We suspect the improved tissue demarcation due to improved sensitivity and ToF on the Vision PET/CT system could play a role here. No patient instructions with respect to food and fluid intake prior to 89Zr immunoPET acquisition were given, resulting in large cold areas with the size of a filled stomach on the acquired images. These cold spots were more prominently visible on the Vision images with a clearer demarcation of the stomach. Due to the improved ToF, better contrast recovery in the spleen using the Vision PET/CT may have resulted in a better reflection of the true counts originating from the spleen as opposed to some possible larger signal spill over between the cold stomach and the very intense spleen on the mCT images. Future 89Zr immunoPET studies with a larger homogeneous patient population will have to explore this phenomenon to clarify these findings. Until then, the spleen should not be used as reference tissue in 89Zr immunoPET imaging studies.

Table 4 Quantitative performance in healthy tissue comparison between systems (n = 5) using the Wilcoxon signed rank test

Regarding qualitative image assessment, overall, the images reconstructed according to the EARL1 protocol received the highest scores resulting from the visual assessment; higher mean scores were obtained for noise levels and overall image quality, and there was only a slight difference in lesion demarcation scores in favour of the EARL2 reconstructed images. Furthermore, fair to moderate inter-reader agreement was achieved. As the nuclear medicine physicians were asked to score the images individually without a direct comparison with optimal 89Zr immunoPET image quality, scoring was considered difficult. Another factor that should be considered when interpreting these results is that the image quality of [18F]FDG PET/CT scans (compared to 89Zr PET/CT images) on the Vision is excellent. These excellent [18F]FDG PET images might have been an unintentional reference for visual 89Zr immunoPET image assessment. Furthermore, previous experience with 89Zr immunoPET readings, and personal preference of the reading Nuclear Medicine physicians regarding acceptable image quality could have played a role in the observed variation in the image quality assessment.

Previous work also performed by our research group explored the effect of scan time reduction on semiquantitative PET image parameters and image quality in [18F]FDG PET imaging using the Vision [4]. Here, a factor 3 reduction in scan time was considered possible while maintaining image quality using the clinically preferred Vision reconstruction protocol with additional 2-mm Gaussian filtering. In the current study, possibilities to reduce scan duration in 89Zr immunoPET imaging were explored. Using the EARL Vision protocols, semiquantitative performance remains reliable when decreasing scan duration up to a factor of 2 (see Figs. 4 and 5) at the cost of a slight increase in noise (see Fig. 6). Therefore, for 89Zr immunoPET imaging in the clinic using an SiPM-based PET/CT, one may choose to reduce scan duration to improve patient comfort and increase throughput. On the other hand, in case of paediatric patients or for non-life-threatening diseases, an equally proportional reduction in the amount of injected activity would be recommended to reduce radiation exposure. However, as shown before for [18F]FDG PET studies [4], more room for optimising image quality by changing amount of administered activity and/or scan duration is feasible because of the higher positron abundance and typically higher injected activities allowed from a radiation safety perspective.

Conclusion

In this study, we found, when using the SiPM-based Vision PET/CT for 89Zr immunoPET imaging and the EARL standard 1 compliant reconstruction settings, semiquantitative PET image parameters to remain reliable when using images obtained at reduced scan durations up to a factor of 2 compared to using the conventional PMT-based mCT PET/CT system.

Also, as SUVmax is highly affected by noise and reconstruction settings, and differs considerably in quantification of tumour lesions as well as healthy tissues between various PET/CT systems, we strongly recommend using the EARL standard 1 compliant reconstruction protocol and to report SUVpeak for reliable, comparable across systems, tumour lesion quantification in 89Zr PET/CT imaging.