A cross-scanner and cross-tracer deep learning method for the recovery of standard-dose imaging quality from low-dose PET

Xue, Song; Guo, Rui; Bohn, Karl Peter; Matzke, Jared; Viscione, Marco; Alberts, Ian; Meng, Hongping; Sun, Chenwei; Zhang, Miao; Zhang, Min; Sznitman, Raphael; El Fakhri, Georges; Rominger, Axel; Li, Biao; Shi, Kuangyu

doi:10.1007/s00259-021-05644-1

A cross-scanner and cross-tracer deep learning method for the recovery of standard-dose imaging quality from low-dose PET

Original Article
Open access
Published: 24 December 2021

Volume 49, pages 1843–1856, (2022)
Cite this article

Download PDF

You have full access to this open access article

European Journal of Nuclear Medicine and Molecular Imaging Aims and scope Submit manuscript

A cross-scanner and cross-tracer deep learning method for the recovery of standard-dose imaging quality from low-dose PET

Download PDF

Song Xue¹,
Rui Guo^2,3,
Karl Peter Bohn¹,
Jared Matzke⁴,
Marco Viscione¹,
Ian Alberts¹,
Hongping Meng^2,3,
Chenwei Sun^2,3,
Miao Zhang^2,3,
Min Zhang^2,3,
Raphael Sznitman⁵,
Georges El Fakhri⁶,
Axel Rominger¹,
Biao Li ORCID: orcid.org/0000-0002-0493-2062^2,3 &
…
Kuangyu Shi^1,4

4435 Accesses
29 Citations
Explore all metrics

Abstract

Purpose

A critical bottleneck for the credibility of artificial intelligence (AI) is replicating the results in the diversity of clinical practice. We aimed to develop an AI that can be independently applied to recover high-quality imaging from low-dose scans on different scanners and tracers.

Methods

Brain [¹⁸F]FDG PET imaging of 237 patients scanned with one scanner was used for the development of AI technology. The developed algorithm was then tested on [¹⁸F]FDG PET images of 45 patients scanned with three different scanners, [¹⁸F]FET PET images of 18 patients scanned with two different scanners, as well as [¹⁸F]Florbetapir images of 10 patients. A conditional generative adversarial network (GAN) was customized for cross-scanner and cross-tracer optimization. Three nuclear medicine physicians independently assessed the utility of the results in a clinical setting.

Results

The improvement achieved by AI recovery significantly correlated with the baseline image quality indicated by structural similarity index measurement (SSIM) (r = −0.71, p < 0.05) and normalized dose acquisition (r = −0.60, p < 0.05). Our cross-scanner and cross-tracer AI methodology showed utility based on both physical and clinical image assessment (p < 0.05).

Conclusion

The deep learning development for extensible application on unknown scanners and tracers may improve the trustworthiness and clinical acceptability of AI-based dose reduction.

Deep learning-assisted ultra-fast/low-dose whole-body PET/CT imaging

Article Open access 25 January 2021

Low-count whole-body PET/MRI restoration: an evaluation of dose reduction spectrum and five state-of-the-art artificial intelligence models

Article 12 January 2023

Artificial intelligence-based joint attenuation and scatter correction strategies for multi-tracer total-body PET

Article Open access 19 July 2024

Discover the latest articles, news and stories from top researchers in related subjects.

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Positron emission tomography (PET) is one of the main imaging modalities in clinical routine procedures of oncology [1, 2], neurology [3], and cardiology [4]. One of the critical bottlenecks for the wide application of PET is the ionizing radiation dose [5]. Although the general principle of as low as reasonably achievable (ALARA) [5] is followed in clinical practice, patients are typically exposed to more than 4 mSv of equivalent dose [6]. In general, the imaging quality of PET is directly influenced by the activity of the injected tracer and the consequent radiation dose. A reduction of the radiation dose in PET protocols however leads to the degradation of imaging quality.

The technical advancement of PET scanners in recent decades has steadily reduced the radiation burden while preserving the imaging quality [7]. Breakthroughs have been made in signal measurement and imaging generation, including developments on scintillator crystals, photodetectors, acquisition electronics, and image reconstruction techniques [8]. Modern commercial PET scanners have introduced time-of-flight (TOF) techniques at a higher level of coincidence time resolution, which largely improved image quality [9,10,11]. Analog scanners are still commercially available but are increasingly being replaced by solid-state solutions. The transition of all major commercial vendors to silicon photomultiplier (SiPM)-based (digital) scanners has enabled a much-improved TOF resolution [12,13,14,15], as well as higher sensitivity, which increased measurement efficiency [16,17,18] and might afford radiation dose reductions of more than 40% [19,20,21]. The recent innovation of total-body PET technology further improves the sensitivity of PET and may allow for further reductions of radiation exposure associated with PET imaging [22,23,24,25]. However, such high-end scanners are only available in a small number of centers.

By contrast, computational techniques provide an alternative, cost-effective solutions to improve image quality for low-dose PET imaging. Denoising methods such as nonlocal means [26] or multi-scale curvelet and wavelet analysis [27] were developed to reduce the noise in low-dose PET images. Data-driven methods have been employed to synthesize high-quality standard-dose PET images from low-dose measurements using machine learning, such as random-forest-based regression [28], mapping-based sparse representation [29], semi-supervised tripled dictionary learning [30], multilevel canonical correlation analysis framework [31], and so on. However, these small patch-based learning estimations may result in over-smoothed images lacking texture information that limit the quantification of small structures in synthesized PET images. Recently developed deep learning techniques have been shown to better predict textural information in radiological images. Xiang et al. [32] proposed a concatenated end-to-end convolutional neural network (CNN) to estimate full-dose PET images, which effectively utilize the structural information from input data.

One challenge in deep learning is defining an analytical error function that enables an image quality perception comparable to human perception. GAN [33] is a special type of neural network model consisting of two units, with the generator unit synthesizing candidates while the discriminator unit attempts to decipher whether the candidate’s images are synthetic or real. The development of GAN has strengthened the capability of neural networks in this regard, allowing them to capture complex probability distributions. Wang et al. employed the adversarial training scheme to recover full-dose PET images from low-dose PET using a conditional GANs model [34] and further improved the performance by incorporating MRI images that provide extra anatomical information [35].

However, the translation of this technology to a clinical setting is not straightforward. PET imaging is characterized by the variability of instrumentation and imaging protocols [36, 37], such as geometric configuration, detector capability (e.g., TOF [38], depth-of-interaction (DOI) [39]), data correction, and system calibration. Furthermore, PET imaging is also strongly influenced by the variability of injected radiopharmaceuticals. Even in different tracers using the same radioisotope, the signal texture may be different due to other different molecules of the tracers. This issue may be especially important for the development of new tracers, where PET datasets from new or uncommonly used tracers may not be adequately available. Moreover, the trustworthiness of AI has been rigorously questioned over the last decade, for its reproducibility and stability when applied to external datasets.

Therefore, our goal was to develop and optimize a deep learning method for the recovery of standard-dose imaging quality from low-dose PET in a versatile clinical setting, including different imaging instrumentations and radiopharmaceuticals.

Materials and methods

Patient cohorts

The study was conducted in accordance with the requirements of the respective local ethics committees in Switzerland and China. Seven cohorts with 310 subjects were retrospectively included in this study (Table 1). For the Chinese cohorts, we selected 255 subjects who referred to [¹⁸F]FDG PET for various non-neurological/psychiatric purposes and that were considered neurologically healthy on PET imaging between April and December 2019. We also selected 10 patients who underwent [¹⁸F]Florbetapir PET for suspected neurodegenerative disease between April and August 2021. For the Swiss cohorts, we selected 27 patients who underwent [¹⁸F]FDG PET for suspected neurodegenerative disease and 18 patients who underwent [¹⁸F]FET PET for suspected brain tumors between February and November 2019.

Table 1 Information on patients’ demographics and diagnosis

Full size table

The subjects were scanned on 3 different PET scanners (GE Discovery MI, Siemens Biograph mCT, Siemens Biograph Vision) with 3 different tracers ([¹⁸F]FDG, [¹⁸F]Florbetapir, and [¹⁸F]FET). The first cohort consists of 237 subjects considered neurologically healthy referred to [¹⁸F]FDG PET on DMI (GE, Discovery MI), which was employed for the development of our deep learning methods. The second cohort consists of 10 patients for suspected neurodegenerative disease, who underwent [¹⁸F]Florbetapir PET were scanned on DMI. The third and fourth cohorts with suspected neurodegeneration were scanned on a mCT (Siemens, Biograph mCT) (n = 20) and Vision (Siemens, Biograph Vision) (n = 7) with [¹⁸F]FDG. The fifth cohort contained 18 subjects with suspected brain tumors in the brain who underwent [¹⁸F]FDG PET on DMI. The last two cohorts with suspected brain tumors were acquired on mCT (n = 10) and Vision (n = 8) with [¹⁸F]FET PET.

Imaging protocols

All data was acquired in list mode allowing for rebinding of data to simulate different acquisition times. PET data were reconstructed using OSEM (ordered subset expectation maximization). More detailed information concerning scanner properties and reconstruction parameters can be found in Supplementary Table S1. Each simulated low-dose PET with a certain dose reduction factor (DRF) was reconstructed from the counts of a time window resampled at the middle of the acquisition with correspondingly reduced time. For example, the full-dose PET images from the DMI are reconstructed with 5-min raw data, while the simulated low-dose PET with DRF = 2 is reconstructed with 2.5-min (from the 75th second to the 225th second) resampled raw data but with the same reconstruction parameters and post-processing procedure, ensuring that both images have a comparable spatial resolution.

Deep neural network setup

Our network was developed based on the conditional GANs (c-GANs) [33, 34] architecture, which consists of a generator network to synthesize the full-dose images from low-dose measurements and a discriminator to distinguish between the synthesized full-dose image and the real input. As shown in Figure 1, we specifically customized our model for cross-scanner and cross-tracer application including a U-net like architecture featuring skip connection (referred to as “Concatenate” in Figure 1) [40], batch normalization (BN) [41], a modified objective function with both conventional content loss [33] and also voxel-wise loss. Techniques like skip connection and BN allow the network architecture to be much deeper, which endows the network with a better capability of generalization. Customized loss function helps to preserve complex image details. The model was trained by mixing the image pairs of all DRF up to 20 from DMI and later tested on datasets from different scanners and tracers with DRF up to 100. More information on the network design and training procedure is attached in the corresponding part of the Supplementary material.

Evaluation based on physical metrics

To evaluate the quality of the enhanced images on all test datasets, we calculated and compared the physical metrics including the normalized root mean squared error (NRMSE) which measures the overall pixel-wise intensity deviation, peak signal-to-noise ratio (PSNR) as well as structural similarity index measurement (SSIM) that reflects perceived image quality [42]. Differences between the AI-enhanced and non-AI-enhanced groups for NRMSE were assessed for statistical significance by means of the paired two-tailed t-test. Furthermore, to examine the level of difference of AI enhancement in a cross-scanner and cross-tracer setting, an unpaired two-tailed t-test was performed for NRMSE improvement (percentage error calculated between AI-enhanced and non-AI-enhanced groups) on results from all three scanners and both included tracers. A p-value lower than 0.05 was considered statistically significant.

Clinical assessment for cross-scanner application

For the cross-scanner assessment, the neurodegeneration cohorts imaged with [¹⁸F]FDG (scanned with mCT n = 20 and Vision n = 7) were assessed with NEUROSTAT/3D-SSP [43] according to a standardized procedure used in everyday clinical practice, comparing each patient’s images with an age-matched healthy collective.

In a first step, the 3D-SSP results as well as complete axial images (full-dose, AI-enhanced, and non-AI-enhanced low-dose images from DRF 2 to 100) of each patient were directly visually compared with each other by two board-certified nuclear medicine physicians (A.R. and K.P.B.). Subsequently, the physicians determined at which DRF the AI-enhanced images started preserving a better diagnostic value in comparison with non-AI-enhanced images and thus came closer to full-dose images. In a second step, the two nuclear medicine physicians independently assessed three subsets of the images of the neurodegeneration cohorts (full-dose, DRF = 50 with and without AI enhancement) as explained in the following passage. The DRF = 50 subset was chosen based on the results of the first step. The physicians were blinded regarding the source of the image (e.g., full-dose or DRF image) as well as patient clinical information.

The results from the 3D-SSP analysis were rated regarding the visual hypometabolism compared to healthy controls in four regions (frontal, parietal, temporal lobe, and PCC), for each hemisphere on a four-point scale (0 = no hypometabolism, 1 = little hypometabolism, 2 = medium hypometabolism, 3 = strong hypometabolism). The results of the rating were also simplified to a binary scale (0 = no or little hypometabolism, 1 = medium or strong hypometabolism). The four-point scale and binary results of the rating were compared between the three subsets by the Friedman test (p < 0.05) for significant differences using SPSS Version 25.0. In case of significant differences on the Friedman test, additional post hoc tests using Wilcoxon signed-rank test with Bonferroni adjustment were performed, with p < 0.017 considered significant.

Clinical assessment for cross-tracer application

For the cross-tracer assessment, [¹⁸F]Florbetapir standardized uptake value ratio (SUVR) maps were generated using the cerebellum gray matter as reference regions, for the purpose of visual assessment [44] by a nuclear medicine physician. As for the brain tumor cohorts (imaged with [¹⁸F]FDG and [¹⁸F]FET), we measured clinical imaging parameters such as SUVmean, SUVmax, as well as the most relevant radiomics features [45] described in literature within [46,47,48,49,50,51,52,53,54,55]. The lesions were delineated manually and reviewed by a board-certified nuclear medicine physician. The accuracy of the clinical imaging parameters and radiomics features of the lesions were calculated in reference to full-dose images (percentage error). The results of the AI-enhanced and non-AI-enhanced groups were compared at all DRFs. More detailed information regarding feature selection and the analysis procedure can be found in the corresponding part of the Supplementary material.

Results

Physical metrics evaluation for cross-scanner application

The customized c-GAN trained on [¹⁸F]FDG images from DMI was tested on [¹⁸F]FDG images on three different scanners. The results for NRMSE on [¹⁸F]FDG imaging are shown in Figure 2. Figure 2A, B, and C showed that NRMSE improvement using AI tended to increase with increasing DRF on all three scanners. Compared to non-AI-enhanced group, the AI-enhanced group achieved statistically significant advantage for the paired t-test on DMI from DRF = 2 (p = 1.8E−6), on mCT from DRF = 10 (p = 4.5E−5), and on Vision from DRF = 20 (p = 0.03). Additional results of PSNR and SSIM on [¹⁸F]FDG imaging on the three different scanners showed the same tendency as the NRMSE results (Supplementary Figure S2).

Figure 2D and E illustrated the improvement by AI enhancement referring to baseline image quality. The baseline image quality (x-axis) was represented by the normalized dose acquisition (D), which is the injecting dose corrected for acquisition time and patient weight, and SSIM (E) of the non-AI-enhanced images. The NRMSE improvement (y-axis) on low-dose images by using AI enhancement significantly negatively correlated with the baseline image quality (normalized dose acquisition: r = −0.60, p = 3.6E−24 and SSIM: r = −0.71, p = 1.1E−37).

Figure 2A–E overall suggested that the benefits of AI increase with decreasing image quality and the image quality degradation of mCT and Vision was less affected by the dose reduction and was milder compared to DMI. The unpaired t-test results illustrated that the application of AI on different scanners achieved comparable results, although not as good as the trained scanner (DMI). For example, the NRMSE improvement on mCT at DRF = 100 achieved the same level as in the case of DRF = 4 on DMI (p = 0.12). The level of improvement on Vision at DRF = 100 achieved the same level as in the case of DRF = 2 on DMI (p = 0.63).

The aforementioned points were also confirmed by the visual reading (Figure 3), namely that our model was able to enhance image quality on all three scanners, especially at high DRF. AI enhancement achieved overall good performance on DMI. As for the mCT data, AI enhancement started to show its advantages from DRF = 50, with the non-AI-enhanced images still maintaining good image quality under DRF = 50. The level of improvement on Vision was not as evident as on mCT.

Physical metrics evaluation for cross-tracer application

The same trained c-GAN was tested on cross-tracer data from three different scanners. The results for NRMSE are shown in Figure 2F–H. Compared to the non-AI-enhanced group, the AI-enhanced group achieved statistically significant advantage for the paired t-test on [¹⁸F]Florbetapir (DMI) from DRF = 2 (p = 0.03), on [¹⁸F]FET (mCT) from DRF = 10 (p = 0.001), no significant advantage observed on [¹⁸F]FET (Vision). Furthermore, the unpaired t-test results illustrated that there were no statistically significant differences between the application of AI to a different tracer with the same scanner. For example, the NRMSE improvement on [¹⁸F]Florbetapir achieved almost the same level as [¹⁸F]FDG (p = 0.6, DRF = 10; p = 0.8, DRF = 20). Additional results of PSNR and SSIM on both tracers showed the same tendency as the NRMSE results (Supplementary Figure S2).

The aforementioned points were also confirmed by the visual reading (Figure 4), namely that our model was able to enhance image quality for both [¹⁸F]Florbetapir and [¹⁸F]FET, especially at high DRF. AI enhancement achieved overall good performance on [¹⁸F]Florbetapir. As for [¹⁸F]FET, AI enhancement started to show its advantages from DRF = 50 on mCT. The level of improvement on Vision was not as evident as on mCT.

Clinical assessment for cross-scanner application

The comparison of the 3D-SSP data and the axial images of the neurodegeneration data for all available DRF showed an advantage of AI enhancement starting at DRF = 50 in most cases. This was mainly due to mCT data, which makes up the biggest part of the neurodegeneration group. For DRF = 50, non-AI-enhanced images tended to be more blurred and to overestimate the extent of pathology. For example, in Figure 5A, all 3D-SSP images showed a fairly stable pattern of predominantly temporal bilateral hypometabolism, with a slight tendency of the non-AI-enhanced images to be more blurred. The corresponding axial images showed the disadvantages of the non-AI-enhanced images clearer, as they were overall more blurred, and as the areas of temporal hypometabolism were harder to separate from the adjacent non-affected areas, as well as basal ganglia being less demarcated. The increased tendency of non-enhanced images compared to AI-enhanced images to overestimate the extent of pathology can be seen in the frontal lobes in Figure 5B.

In some cases, the 3D-SSP results of the non-AI-enhanced images even showed strong incorrect/artificial hypometabolism of some regions, which was not visible in the 3D-SSP results of the full-dose images. This is demonstrated by Figure 5C, where non-AI-enhanced images showed bilateral frontal hypometabolism, which could not be seen on full-dose or AI-enhanced images. This erroneous frontal hypometabolism on non-AI-enhanced images was not visible for images under DRF = 50. More examples can be found in Supplementary Figure S6.

In contrast, the effect of AI enhancement was not as evident on data from Vision, being the scanner with the overall best imaging quality (Supplementary Figure S7).

On an additional inspection, AI enhancement performed best on data from DMI, with the advantage of AI being particularly evident in the case of high DRF or poor image quality. An exemplary case is shown in Supplementary Figure S8.

The rating of the 3D-SSP data also showed an overall advantage of AI enhancement. The Friedman test showed significant differences (p < 0.05) between the three assessed groups for rater 1 on the four-point scale (p = 0.017, χ² 8.133) and the binary scale (p = 0.002, χ² 12.133), whereas there were no significant differences for rater 2 (four-point scale p = 0.551, binary scale p = 0.472). For rater 1, the following post hoc test showed significant differences between the full-dose and the DRF = 50 non-AI-enhanced groups (four-point scale p = 0.005, binary scale p = 0.013), and partly between the DRF = 50 non-AI-enhanced and AI-enhanced groups (four-point scale p = 0.133, binary scale p = 0.004). No significant differences were found between the full-dose and DRF = 50 AI-enhanced groups.

Clinical assessment for cross-tracer application

Results of [¹⁸F]Florbetapir dataset showed an overall advantage of AI enhancement, especially starting from DRF 10. The most noticeable improvement in image quality was observed with DRF 100, but some inconsistencies were observed compared to the full-dose images (Figure 6).

Regarding the brain tumor dataset, results of [¹⁸F]FDG imaging from DMI suggested that the AI-enhanced images overall preserved an improved quality in terms of the selected features and the improvement tended to increase with higher DRF (Figure 7). Yet, none of the clinical features of the [¹⁸F]FET images benefited from the enhancement (Supplementary Figure S5). Additional results of lesion segmentation and analysis can be found in Supplementary Figures S4 and 5.

Discussions

A critical concern when using machine learning is its reproducibility and extensibility to unknown complexity in real application [56]. Methods optimized in one cohort have been reported to have limited performance in other cohorts or other applications [57]. Despite the demonstrable potential of AI for PET dose reduction, the main challenge for its clinical translation for routine clinical use remains its ability to take the large complexity involved in molecular imaging into account, such as the variety of tracers, scanners, imaging protocols, reconstruction settings, metabolic dynamics, and so on [36, 37]. The strength of this study lies in its trustworthy design. The model, trained with data from one center was applied to data from different scanners, diseases, and tracers in another center. Our results demonstrated that the customized deep learning was able to synthesize images comparable to full-dose PET images from low-dose PET images with certain restrictions. The improved capability of cross-scanner and cross-tracer application can enhance the translational credibility of the AI methods in nuclear medicine, considering the diversity and rapid growth of new instruments and radiopharmaceuticals. Our study attempts to explore the translational potential of deep learning for low-dose PET protocols in-depth and for moving a step toward clinical practice.

We included both digital and analog scanners for variability. The digital PET scanners were equipped with SiPM that enables higher efficiency and better TOF measurements, compared to conventional analog PET scanners [58, 59], which is a major source of variability of input image quality. Our results indicated that although our model was developed based on a digital scanner (DMI), AI tends to be more helpful when recovering from low-dose PET on an analog scanner (mCT). Considering the overall better properties of the digital scanner, e.g., producing images with higher spatial resolution and less noise or artifacts, less room for AI improvements seems to be left. Acquisition protocols including aspects of injected dose or acquisition time may also contribute to variability. As shown in Table 1 and Supplementary Table S2, the two included centers follow different protocols each with respect to different local conditions. As shown in Figure 2D, owing to longer acquisition time, the image quality degradation of mCT and Vision was less affected by the dose reduction compared to DMI, especially in the case of the SiPM-based digital scanner (Vision) as seen in Figure 2E. Therefore, we additionally obtained DRF = 50, 100 data from both Siemens scanners (mCT and Vision) to make the data more comparable. Additionally, image reconstruction was performed using manufacturer-provided software with recommended parameters, which differ in several aspects such as iterations and subsets when performing iterative reconstruction with OSEM [60]. Algorithms for physical corrections including attenuation and scatter corrections also vary between scanners. The reconstruction procedures deliberately followed the vendors’ recommendations and were in line with normal clinical settings, in order to fairly assess the robustness of the proposed model in handling routine applications. Despite all the aforementioned variabilities in the cross-scanner application setting, the results demonstrated that our customized c-GAN was able to achieve a comparable level of enhancement regarding image quality.

The clinical assessment overall showed AI to be advantageous when applied to low-dose PET images. Although the clinical and physical evaluations were carried out independently, the results were consistent with each other. Accordingly, the clinical evaluation also showed that the positive effect of AI becomes greater with decreasing image quality as shown in Figures 3, 4, 5, and 6. As the clinical evaluation had a focus on DRF = 50 on the cross-scanner setting, the benefit of AI should be evaluated in a clinical setting with even higher DRFs. Nevertheless, it also remains unclear how AI enhancements will perform in a real clinical setting, in which the raters have further clinical information that they can use to interpret the images and come to a conclusion/diagnosis. Therefore, it needs further assessment in a routine setting and within larger cohorts. Furthermore, it should be possible to significantly reduce the dose without a relevant impact on clinical assessment results or image quality, even without the use of AI, e.g., up to DRF = 20 on mCT data. However, we should also be aware of cases like the one in Figure 5C where the 3D-SSP results of the non-AI-enhanced images showed strong incorrect/artificial hypometabolism for DRF = 50 which might lead to a false diagnosis in clinical routine. In such a situation, information might have been affected during the 3D-SSP processing pipeline. Since the corresponding axial slices showed the same findings independently of the 3D-SSP data, we can state that this was not the case. In summary, the clinical evaluations showed that AI is beneficial, especially in the cross-scanner application of AI enhancement on mCT data.

We employed imaging with the same radioisotope but different molecules for the tracer. AI overall performed well on [¹⁸F]Florbetapir imaging of Alzheimer’s disease, since it was acquired from the same scanner as the training dataset (DMI). The improvement became more evident starting from DRF 10, while inevitably producing some artifacts at a higher reduction rate (100), which must be treated with caution when diagnosing. This fact may also be related to the highest DRF included in our training is only 20. We observed that AI enhancement led to an increase in NRMSE for [¹⁸F]FET imaging obtained from Vision (Figure 2H), which was most pronounced at low DRF. This can be explained that our current AI training may have limited performance when dealing with complicated situations, i.e., cross-scanner and cross-tracer at the same time. The large variability imposed in both cross-tracer and cross-scanner can place too much burden on the AI model trained with limited complexity. Future work of incorporating diverse training data may overcome the limitation and further improve the performance of AI.

Overall, some limitations of AI application and potential risks need to be considered. There might be some hidden problems associated with GAN technology in image synthesis such as feature hallucination, where GANs may add or remove image features since the source and target domain distributions are paired data [61]. It is therefore important to recruit domain experts to further evaluate the resulting images, considering that physical indicators often fall into this trap. Another limitation of this study is the inherent bias in the limited datasets and the inclusion of additional subjects may further improve the generalizability and robustness of the developed model. Additionally, the low-dose images are simulated by reconstructions with shorter acquisition time and do not originate from patients studied with reduced injection dose and reconstruction over the entire acquisition time. Our study trained a model on a dataset from one scanner and one tracer, which was not optimal for AI development. Nevertheless, our preliminary results confirmed the potential of our initial hypothesis, albeit in such a challenging cross-scanner and cross-tracer setup. This proof of concept can therefore support the design of more realistic studies in the future, by including a larger and heterogeneous dataset that is not limited by the center, scanner, tracer, disease, or body region. It would be also helpful to further develop algorithms directly based on high-level information extracted from PET raw data. In addition, multimodal methods for dose reduction may be of benefit. Finally, since CT is another major contributor to the total effective dose when performing PET/CT, it would be helpful to investigate deep learning methods for the dose reduction on CT imaging as well. However, this aspect might be more relevant in body PET/CT protocols, where CT is the main contributor to the effective dose whereas the used dose of the radiopharmaceutical is the main contributor to brain PET/CT [62].

Conclusion

The deep learning approach developed for low-dose PET image enhancement had the potential to be applied on different scanners and tracers with certain limitations. The improvement of image quality by using AI tended to increase with decreasing image quality when applied on cross-scanner and cross-tracer data. When applying high DRFs in cross-tracer applications, potential artifacts must be treated with caution, especially when applied to radiomics feature analysis. Clinical evaluations suggested that using AI is advantageous, although further validation is needed, including in the context of clinical routine. It is reasonable to suggest training with more available data would further consolidate the capability of AI.

References

Beer AJ, Eiber M, Souvatzoglou M, Schwaiger M, Krause BJ. Radionuclide and hybrid imaging of recurrent prostate cancer. The lancet oncology. 2011;12:181–91. https://doi.org/10.1016/S1470-2045(10)70103-0.
Article PubMed Google Scholar
Moskowitz AJ, Schoder H, Yahalom J, McCall SJ, Fox SY, Gerecitano J, et al. PET-adapted sequential salvage therapy with brentuximab vedotin followed by augmented ifosamide, carboplatin, and etoposide for patients with relapsed and refractory Hodgkin's lymphoma: a non-randomised, open-label, single-centre, phase 2 study. The lancet oncology. 2015;16:284–92. https://doi.org/10.1016/S1470-2045(15)70013-6.
Article CAS PubMed Google Scholar
Barthel H, Gertz HJ, Dresel S, Peters O, Bartenstein P, Buerger K, et al. Cerebral amyloid-beta PET with florbetaben (18F) in patients with Alzheimer's disease and healthy controls: a multicentre phase 2 diagnostic study. Lancet neurology. 2011;10:424–35. https://doi.org/10.1016/S1474-4422(11)70077-1.
Article CAS PubMed Google Scholar
Skeoch S, Williams H, Cristinacce P, Hockings P, James J, Alexander Y, et al. Evaluation of carotid plaque inflammation in patients with active rheumatoid arthritis using (18)F-fluorodeoxyglucose PET-CT and MRI: a pilot study. Lancet. 2015;385(Suppl 1):S91. https://doi.org/10.1016/S0140-6736(15)60406-8.
Article PubMed Google Scholar
Voss SD, Reaman GH, Kaste SC, Slovis TL. The ALARA concept in pediatric oncology. Pediatric radiology. 2009;39:1142.
Article PubMed Google Scholar
Martí-Climent JM, Prieto E, Morán V, Sancho L, Rodríguez-Fraile M, Arbizu J, et al. Effective dose estimation for oncological and neurological PET/CT procedures. EJNMMI research. 2017;7:37.
Article PubMed PubMed Central Google Scholar
Cherry SR, Jones T, Karp JS, Qi J, Moses WW, Badawi RD. Total-body PET: maximizing sensitivity to create new opportunities for clinical research and patient care. Journal of Nuclear Medicine. 2018;59:3–12.
Article CAS PubMed PubMed Central Google Scholar
Jones T, Townsend DW. History and future technical innovation in positron emission tomography. Journal of Medical Imaging. 2017;4:011013.
Article PubMed PubMed Central Google Scholar
Lecoq P, Morel C, Prior JO, Visvikis D, Gundacker S, Auffray E, et al. Roadmap toward the 10 ps time-of-flight PET challenge. Physics in Medicine & Biology. 2020;65:21RM01.
Tsoumpas C. Why ultrafast is ultra-good. Physics World. 2020;33:41.
Article Google Scholar
Lecoq P. Pushing the limits in time-of-flight PET imaging. IEEE Transactions on radiation and plasma medical sciences. 2017;1:473–85.
Article Google Scholar
Hsu DF, Ilan E, Peterson WT, Uribe J, Lubberink M, Levin CS. Studies of a next-generation silicon-photomultiplier–based time-of-flight PET/CT system. Journal of Nuclear Medicine. 2017;58:1511–8.
Article CAS PubMed Google Scholar
Van Sluis J, De Jong J, Schaar J, Noordzij W, Van Snick P, Dierckx R, et al. Performance characteristics of the digital Biograph Vision PET/CT system. Journal of Nuclear Medicine. 2019;60:1031–6.
Article PubMed Google Scholar
Chen S, Hu P, Gu Y, Yu H, Shi H. Performance characteristics of the digital uMI550 PET/CT system according to the NEMA NU2-2018 standard. EJNMMI physics. 2020;7:1–14.
Article Google Scholar
Zhang J, Maniawski P, Knopp MV. Performance evaluation of the next generation solid-state digital photon counting PET/CT system. EJNMMI research. 2018;8:97.
Article PubMed PubMed Central Google Scholar
Alberts I, Prenosil G, Sachpekidis C, Weitzel T, Shi K, Rominger A, et al. Digital versus analogue PET in [(68)Ga]Ga-PSMA-11 PET/CT for recurrent prostate cancer: a matched-pair comparison. European journal of nuclear medicine and molecular imaging. 2020;47:614–23. https://doi.org/10.1007/s00259-019-04630-y.
Article CAS PubMed Google Scholar
Lopez-Mora DA, Flotats A, Fuentes-Ocampo F, Camacho V, Fernandez A, Ruiz A, et al. Comparison of image quality and lesion detection between digital and analog PET/CT. European journal of nuclear medicine and molecular imaging. 2019;46:1383–90. https://doi.org/10.1007/s00259-019-4260-z.
Article PubMed Google Scholar
Nguyen NC, Vercher-Conejero JL, Sattar A, Miller MA, Maniawski PJ, Jordan DW, et al. Image quality and diagnostic performance of a digital PET prototype in patients with oncologic diseases: initial experience and comparison with analog PET. Journal of nuclear medicine : official publication, Society of Nuclear Medicine. 2015;56:1378–85. https://doi.org/10.2967/jnumed.114.148338.
Article CAS Google Scholar
Shkumat NA, Vali R, Shammas A. Clinical evaluation of reconstruction and acquisition time for pediatric (18)F-FDG brain PET using digital PET/CT. Pediatric radiology. 2020;50:966–72. https://doi.org/10.1007/s00247-020-04640-1.
Article PubMed Google Scholar
Gnesin S, Kieffer C, Zeimpekis K, Papazyan JP, Guignard R, Prior JO, et al. Phantom-based image quality assessment of clinical (18)F-FDG protocols in digital PET/CT and comparison to conventional PMT-based PET/CT. EJNMMI physics. 2020;7:1. https://doi.org/10.1186/s40658-019-0269-4.
Article PubMed PubMed Central Google Scholar
Alberts I, Sachpekidis C, Prenosil G, Viscione M, Bohn KP, Mingels C, et al. Digital PET/CT allows for shorter acquisition protocols or reduced radiopharmaceutical dose in [18 F]-FDG PET/CT. Annals of Nuclear Medicine. 2021;35:485–92.
Article CAS PubMed PubMed Central Google Scholar
Vandenberghe S, Moskal P, Karp JS. State of the art in total body PET. EJNMMI physics. 2020;7:35. https://doi.org/10.1186/s40658-020-00290-2.
Article PubMed PubMed Central Google Scholar
Berg E, Gill H, Marik J, Ogasawara A, Williams S, van Dongen G, et al. Total-body PET and highly stable chelators together enable meaningful (89)Zr-antibody PET studies up to 30 days after injection. Journal of nuclear medicine : official publication, Society of Nuclear Medicine. 2020;61:453–60. https://doi.org/10.2967/jnumed.119.230961.
Article CAS Google Scholar
Badawi RD, Shi H, Hu P, Chen S, Xu T, Price PM, et al. First human imaging studies with the EXPLORER total-body PET scanner. Journal of nuclear medicine : official publication, Society of Nuclear Medicine. 2019;60:299–303. https://doi.org/10.2967/jnumed.119.226498.
Article CAS Google Scholar
Alberts I, Hünermund J-N, Prenosil G, Mingels C, Bohn KP, Viscione M, et al. Clinical performance of long axial field of view PET/CT: a head-to-head intra-individual comparison of the Biograph Vision Quadra with the Biograph Vision PET/CT. European Journal of Nuclear Medicine and Molecular Imaging. 2021;1-10.
Dutta J, Leahy RM, Li Q. Non-local means denoising of dynamic PET images. PloS one. 2013;8.
Le Pogam A, Hanzouli H, Hatt M, Le Rest CC, Visvikis D. Denoising of PET images by combining wavelets and curvelets for improved preservation of resolution and quantitation. Medical image analysis. 2013;17:877–91.
Article PubMed Google Scholar
Kang J, Gao Y, Shi F, Lalush DS, Lin W, Shen D. Prediction of standard-dose brain PET image by using MRI and low-dose brain [18F] FDG PET images. Medical physics. 2015;42:5301–9.
Article PubMed PubMed Central Google Scholar
Wang Y, Zhang P, An L, Ma G, Kang J, Shi F, et al. Predicting standard-dose PET image from low-dose PET and multimodal MR images using mapping-based sparse representation. Physics in Medicine & Biology. 2016;61:791.
Article CAS Google Scholar
Wang Y, Ma G, An L, Shi F, Zhang P, Lalush DS, et al. Semisupervised tripled dictionary learning for standard-dose PET image prediction using low-dose PET and multimodal MRI. IEEE Transactions on Biomedical Engineering. 2016;64:569–79.
Article CAS PubMed Google Scholar
An L, Zhang P, Adeli E, Wang Y, Ma G, Shi F, et al. Multi-level canonical correlation analysis for standard-dose PET image estimation. IEEE Transactions on Image Processing. 2016;25:3303–15.
Article PubMed PubMed Central Google Scholar
Xiang L, Qiao Y, Nie D, An L, Lin W, Wang Q, et al. Deep auto-context convolutional neural networks for standard-dose PET image estimation from low-dose PET/MRI. Neurocomputing. 2017;267:406–16.
Article PubMed PubMed Central Google Scholar
Goodfellow I, Pouget-Abadie J, Mirza M, Xu B, Warde-Farley D, Ozair S, et al. Generative adversarial nets. Advances in neural information processing systems; 2014. p. 2672-80.
Wang Y, Yu B, Wang L, Zu C, Lalush DS, Lin W, et al. 3D conditional generative adversarial networks for high-quality PET image estimation at low dose. NeuroImage. 2018;174:550–62.
Article PubMed Google Scholar
Wang Y, Zhou L, Yu B, Wang L, Zu C, Lalush DS, et al. 3D auto-context-based locality adaptive multi-modality GANs for PET synthesis. IEEE transactions on medical imaging. 2018;38:1328–39.
Article PubMed PubMed Central Google Scholar
Galavis PE, Hollensen C, Jallow N, Paliwal B, Jeraj R. Variability of textural features in FDG PET images due to different acquisition modes and reconstruction parameters. Acta oncologica. 2010;49:1012–6.
Article PubMed Google Scholar
Yan J, Chu-Shern JL, Loi HY, Khor LK, Sinha AK, Quek ST, et al. Impact of image reconstruction settings on texture features in 18F-FDG PET. Journal of nuclear medicine. 2015;56:1667–73.
Article CAS PubMed Google Scholar
Moses W. Time of flight in PET revisited. IEEE Transactions on Nuclear Science. 2003;50:1325–30.
Article Google Scholar
Ohi J, Tonami H. Investigation of a whole-body DOI-PET system. Nuclear instruments and methods in physics research section a: accelerators, spectrometers, detectors and associated equipment. 2007;571:223–6.
Article CAS Google Scholar
Ronneberger O, Fischer P, Brox T. U-net: convolutional networks for biomedical image segmentation. International Conference on Medical image computing and computer-assisted intervention: Springer; 2015. p. 234-41.
Ioffe S, Szegedy C. Batch normalization: accelerating deep network training by reducing internal covariate shift. arXiv preprint arXiv:150203167. 2015.
Wang Z, Bovik AC, Sheikh HR, Simoncelli EP. Image quality assessment: from error visibility to structural similarity. IEEE transactions on image processing. 2004;13:600–12.
Article PubMed Google Scholar
Minoshima S, Frey KA, Koeppe RA, Foster NL, Kuhl DE. A diagnostic approach in Alzheimer's disease using three-dimensional stereotactic surface projections of fluorine-18-FDG PET. Journal of Nuclear Medicine. 1995;36:1238–48.
CAS PubMed Google Scholar
Pascoal TA, Mathotaarachchi S, Shin M, Park AY, Mohades S, Benedet AL, et al. Amyloid and tau signatures of brain metabolic decline in preclinical Alzheimer’s disease. European journal of nuclear medicine and molecular imaging. 2018;45:1021–30.
Article CAS PubMed PubMed Central Google Scholar
Ha S, Choi H, Paeng JC. Cheon GJJNm, imaging m. Radiomics in oncological PET/CT: a methodological overview. 2019;53:14–29.
Google Scholar
Brooks FJ. Grigsby PWJJoNM. The effect of small tumor volumes on studies of intratumoral heterogeneity of tracer uptake. 2014;55:37–42.
CAS Google Scholar
Hatt M, Majdoub M, Vallières M, Tixier F, Le Rest CC, Groheux D, et al. 18F-FDG PET uptake characterization through texture analysis: investigating the complementary nature of heterogeneity and functional tumor volume in a multi–cancer site patient cohort. 2015;56:38–44.
CAS Google Scholar
Hatt M, Tixier F, Pierce L, Kinahan PE, Le Rest CC, DJEjonm V, et al. Characterization of PET/CT images using texture analysis: the past, the present… any future? 2017;44:151–65.
Google Scholar
Presotto L, Bettinardi V, De Bernardi E, Belli M, Cattaneo G, Broggi S, et al. PET textural features stability and pattern discrimination power for radiomics analysis: an “ad-hoc” phantoms study. 2018;50:66–74.
CAS Google Scholar
Kim BH, Kim S-J, Kim K, Kim H, Kim SJ, Kim WJ, et al. High metabolic tumor volume and total lesion glycolysis are associated with lateral lymph node metastasis in patients with incidentally detected thyroid carcinoma. 2015;29:721–9.
CAS Google Scholar
Kong Z, Lin Y, Jiang C, Li L, Liu Z, Wang Y, et al. 18 F-FDG-PET-based radiomics signature predicts MGMT promoter methylation status in primary diffuse glioma. 2019;19:58.
Li L, Mu W, LIU Z, Liu Z, Wang Y, Ma W, et al. A non-invasive radiomic method us 18F-FDG PET predicts isocitrate dehydrogenase genotype and prognosis in patients with glioma. 2019;9:1183.
Wu Y, Jiang J-H, Chen L, Lu J-Y, Ge J-J, Liu F-T, et al. Use of radiomic features and support vector machine to distinguish Parkinson’s disease cases from normal controls. 2019;7.
Lohmann P, Kocher M, Ceccon G, Bauer EK, Stoffels G, Viswanathan S, et al. Combined FET PET/MRI radiomics differentiates radiation injury from recurrent brain metastasis. 2018;20:537–42.
Google Scholar
Lohmann P, Stoffels G, Ceccon G, Rapp M, Sabel M, Filss CP, et al. Radiation injury vs. recurrent brain metastasis: combining textural feature radiomics analysis and standard parameters may increase 18 F-FET PET accuracy without dynamic scans. 2017;27:2916–27.
Google Scholar
Roberts M, Driggs D, Thorpe M, Gilbey J, Yeung M, Ursprung S, et al. Common pitfalls and recommendations for using machine learning to detect and prognosticate for COVID-19 using chest radiographs and CT scans. Nature Machine Intelligence. 2021;3:199–217.
Article Google Scholar
Maier-Hein L, Eisenmann M, Reinke A, Onogur S, Stankovic M, Scholz P, et al. Why rankings of biomedical image analysis competitions should be interpreted with care. Nat Commun. 2018;9:1–13.
Article Google Scholar
Degenhardt C, Rodrigues P, Trindade A, Zwaans B, Mülhens O, Dorscheid R, et al. Performance evaluation of a prototype positron emission tomography scanner using digital photon counters (DPC). 2012 IEEE Nuclear Science Symposium and Medical Imaging Conference Record (NSS/MIC): IEEE; 2012. p. 2820-4.
Nguyen NC, Vercher-Conejero JL, Sattar A, Miller MA, Maniawski PJ, Jordan DW, et al. Image quality and diagnostic performance of a digital PET prototype in patients with oncologic diseases: initial experience and comparison with analog PET. Journal of Nuclear Medicine. 2015;56:1378–85.
Article CAS PubMed Google Scholar
Hudson HM, Larkin RS. Accelerated image reconstruction using ordered subsets of projection data. IEEE transactions on medical imaging. 1994;13:601–9.
Article CAS PubMed Google Scholar
Cohen JP, Luck M, Honari S. Distribution matching losses can hallucinate features in medical image translation. International conference on medical image computing and computer-assisted intervention: Springer; 2018. p. 529–36.
Google Scholar
Martí-Climent JM, Prieto E, Morán V, Sancho L, Rodríguez-Fraile M, Arbizu J, et al. Effective dose estimation for oncological and neurological PET/CT procedures. EJNMMI research. 2017;7:1–8.
Article Google Scholar

Download references

Funding

This work was supported by the Swiss National Science Foundation (SNF 188350 and 188914) and Shanghai Municipal Key Clinical Specialty (No. shslczdzk03403).

Author information

Authors and Affiliations

Department of Nuclear Medicine, University of Bern, Bern, Switzerland
Song Xue, Karl Peter Bohn, Marco Viscione, Ian Alberts, Axel Rominger & Kuangyu Shi
Department of Nuclear Medicine, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China
Rui Guo, Hongping Meng, Chenwei Sun, Miao Zhang, Min Zhang & Biao Li
Collaborative Innovation Center for Molecular Imaging of Precision Medicine, Ruijin Center, Shanghai, China
Rui Guo, Hongping Meng, Chenwei Sun, Miao Zhang, Min Zhang & Biao Li
Department of Informatics, Technical University of Munich, Munich, Germany
Jared Matzke & Kuangyu Shi
ARTORG Center, University of Bern, Bern, Switzerland
Raphael Sznitman
Gordon Center for Medical Imaging, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
Georges El Fakhri

Authors

Song Xue
View author publications
You can also search for this author in PubMed Google Scholar
Rui Guo
View author publications
You can also search for this author in PubMed Google Scholar
Karl Peter Bohn
View author publications
You can also search for this author in PubMed Google Scholar
Jared Matzke
View author publications
You can also search for this author in PubMed Google Scholar
Marco Viscione
View author publications
You can also search for this author in PubMed Google Scholar
Ian Alberts
View author publications
You can also search for this author in PubMed Google Scholar
Hongping Meng
View author publications
You can also search for this author in PubMed Google Scholar
Chenwei Sun
View author publications
You can also search for this author in PubMed Google Scholar
Miao Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Min Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Raphael Sznitman
View author publications
You can also search for this author in PubMed Google Scholar
Georges El Fakhri
View author publications
You can also search for this author in PubMed Google Scholar
Axel Rominger
View author publications
You can also search for this author in PubMed Google Scholar
Biao Li
View author publications
You can also search for this author in PubMed Google Scholar
Kuangyu Shi
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Song Xue, Rui Guo, and Karl Peter Bohn are shared first authors.

Corresponding author

Correspondence to Biao Li.

Ethics declarations

Ethics approval

All procedures performed in studies involving human participants were in accordance with the ethical standards of the institutional and/or national research committee and with the 1964 Helsinki Declaration and its later amendments or comparable ethical standards. This article does not contain any studies with animals performed by any of the authors.

Consent to participate

Informed consent was obtained from all patients included in this study.

Conflict of interest

The authors declare no competing interests.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

This article is part of the Topical Collection on Advanced Image Analyses (Radiomics and Artificial Intelligence).

Supplementary Information

ESM 1

(DOCX 2436 kb)

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Xue, S., Guo, R., Bohn, K.P. et al. A cross-scanner and cross-tracer deep learning method for the recovery of standard-dose imaging quality from low-dose PET. Eur J Nucl Med Mol Imaging 49, 1843–1856 (2022). https://doi.org/10.1007/s00259-021-05644-1

Download citation

Received: 29 June 2021
Accepted: 30 November 2021
Published: 24 December 2021
Issue Date: May 2022
DOI: https://doi.org/10.1007/s00259-021-05644-1

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

A cross-scanner and cross-tracer deep learning method for the recovery of standard-dose imaging quality from low-dose PET

Abstract

Purpose

Methods

Results

Conclusion

Similar content being viewed by others

Deep learning-assisted ultra-fast/low-dose whole-body PET/CT imaging

Low-count whole-body PET/MRI restoration: an evaluation of dose reduction spectrum and five state-of-the-art artificial intelligence models

Artificial intelligence-based joint attenuation and scatter correction strategies for multi-tracer total-body PET

Explore related subjects

Introduction

Materials and methods

Patient cohorts

Imaging protocols

Deep neural network setup

Evaluation based on physical metrics

Clinical assessment for cross-scanner application

Clinical assessment for cross-tracer application

Results

Physical metrics evaluation for cross-scanner application

Physical metrics evaluation for cross-tracer application

Clinical assessment for cross-scanner application

Clinical assessment for cross-tracer application

Discussions

Conclusion

References

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Ethics approval

Consent to participate

Conflict of interest

Additional information

Publisher’s note

Supplementary Information

ESM 1

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation