Evaluation of a 2D UNet-Based Attenuation Correction Methodology for PET/MR Brain Studies

Presotto, Luca; Bettinardi, Valentino; Bagnalasta, Matteo; Scifo, Paola; Savi, Annarita; Vanoli, Emilia Giovanna; Fallanca, Federico; Picchio, Maria; Perani, Daniela; Gianolli, Luigi; De Bernardi, Elisabetta

doi:10.1007/s10278-021-00551-1

Evaluation of a 2D UNet-Based Attenuation Correction Methodology for PET/MR Brain Studies

Open access
Published: 28 January 2022

Volume 35, pages 432–445, (2022)
Cite this article

Download PDF

You have full access to this open access article

Journal of Digital Imaging Aims and scope Submit manuscript

Evaluation of a 2D UNet-Based Attenuation Correction Methodology for PET/MR Brain Studies

Download PDF

Luca Presotto¹,
Valentino Bettinardi¹,
Matteo Bagnalasta¹,
Paola Scifo¹,
Annarita Savi¹,
Emilia Giovanna Vanoli¹,
Federico Fallanca¹,
Maria Picchio^1,2,
Daniela Perani^1,2,
Luigi Gianolli¹ &
…
Elisabetta De Bernardi ORCID: orcid.org/0000-0002-8394-0342^3,4

1962 Accesses
Explore all metrics

Abstract

Deep learning (DL) strategies applied to magnetic resonance (MR) images in positron emission tomography (PET)/MR can provide synthetic attenuation correction (AC) maps, and consequently PET images, more accurate than segmentation or atlas-registration strategies. As first objective, we aim to investigate the best MR image to be used and the best point of the AC pipeline to insert the synthetic map in. Sixteen patients underwent a 18F-fluorodeoxyglucose (FDG) PET/computed tomography (CT) and a PET/MR brain study in the same day. PET/CT images were reconstructed with attenuation maps obtained: (1) from CT (reference), (2) from MR with an atlas-based and a segmentation-based method and (3) with a 2D UNet trained on MR image/attenuation map pairs. As for MR, T1-weighted and Zero Time Echo (ZTE) images were considered; as for attenuation maps, CTs and 511 keV low-resolution attenuation maps were assessed. As second objective, we assessed the ability of DL strategies to provide proper AC maps in presence of cranial anatomy alterations due to surgery. Three 11C-methionine (METH) PET/MR studies were considered. PET images were reconstructed with attenuation maps obtained: (1) from diagnostic coregistered CT (reference), (2) from MR with an atlas-based and a segmentation-based method and (3) with 2D UNets trained on the sixteen FDG anatomically normal patients. Only UNets taking ZTE images in input were considered. FDG and METH PET images were quantitatively evaluated. As for anatomically normal FDG patients, UNet AC models generally provide an uptake estimate with lower bias than atlas-based or segmentation-based methods. The intersubject average bias on images corrected with UNet AC maps is always smaller than 1.5%, except for AC maps generated on too coarse grids. The intersubject bias variability is the lowest (always lower than 2%) for UNet AC maps coming from ZTE images, larger for other methods. UNet models working on MR ZTE images and generating synthetic CT or 511 keV low-resolution attenuation maps therefore provide the best results in terms of both accuracy and variability. As for METH anatomically altered patients, DL properly reconstructs anatomical alterations. Quantitative results on PET images confirm those found on anatomically normal FDG patients.

A deep learning approach for 18F-FDG PET attenuation correction

Article Open access 12 November 2018

Validation of PET/MRI attenuation correction methodology in the study of brain tumours

Article Open access 25 November 2020

Evaluation of zero-echo-time attenuation correction for integrated PET/MR brain imaging—comparison to head atlas and 68Ge-transmission-based attenuation correction

Article Open access 22 October 2018

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Several investigators have shown that synthetic computed tomography (CT) images can be obtained with deep learning (DL) strategies applied to magnetic resonance (MR) images [1,2,3,4,5,6,7]. This approach currently appears to be the most promising both in attenuation correction (AC) of positron emission tomography (PET) images in hybrid PET/MR scanners and in MR-based radiotherapy treatment planning [8, 9]. In this work, we focused on the AC of brain PET/MR studies, where the challenge is to estimate 511 keV gamma ray attenuation maps to correctly reconstruct the uptake of an injected radiotracer in brain areas.

The AC pipeline implemented in PET/MR scanners directly derives from PET/CT scanner AC pipelines and schematically consists of the following steps: (1) generation of a synthetic CT image corresponding to a high resolution map of tissue attenuation coefficients at ~80–120 keV; (2) conversion from ~80 to 120 keV attenuation coefficients to 511 keV attenuation coefficients by means of bilinear or trilinear functions; (3) smoothing with Gaussian kernel to match PET low spatial resolution; (4) resampling on a coarser grid (for some vendors). The obtained 511 keV low-resolution attenuation map (LRAM) is then used inside the PET image reconstruction process. Tools currently implemented on PET/MR scanners generate synthetic CT images following atlas-based or segmentation-based methods. In atlas-based approaches, single or multiple pairs of MR/CT templates are registered to patient MR images; in segmentation-based approaches, MR images are segmented in three or four tissue classes, and proper attenuation coefficients are then assigned to voxels in each class [10]. In brain studies, Ultrashort Echo Time (UTE) or Zero Echo Time (ZTE) MR sequences are generally used, thanks to their ability to collect MR signal from bones [11].

Most of the DL approaches presented so far in literature for AC correction in PET/MR brain studies aim to construct synthetic CT images starting from diagnostic [12, 13] or non-diagnostic [4, 14, 15] MR images. These methods outperform conventional synthetic CT construction methods, reducing the average bias in PET quantification from about 5% to about 2% [8, 9]. The rationale at the base of this work is that synthetic CTs, which are not used for diagnostic purpose, are not needed for AC in PET/MR studies if synthetic LRAM images can be directly generated. LRAMs contain information at lower resolution and contrast than CTs, and therefore, simpler and leaner DL networks than those required to generate CTs may be sufficient. An approach of this kind has been followed by Spuhler et al. [3] who used a Convolutional Neural Network (CNN) to directly estimate LRAMs, starting from MR T1-weighted (T1) images. T1 images may however be suboptimal for X-ray attenuation estimation, since they do not collect any signal from air and cortical bone, which both appear comparably dark. ZTE images are expected to perform better than conventional sequences in this task, thanks to their ability to discriminate air and bone and to the inversely proportional relationship between ZTE signal intensity and attenuation coefficients in bones [16].

The aim of this work is therefore to generate LRAMs for AC in brain PET/MR from ZTE images and to compare their AC performance against T1-based LRAMs and T1- or ZTE-based synthetic CTs. A unique standard 2D UNet architecture [17] was used throughout the study. Synthetic images to be used at different points of the AC pipeline were generated, namely synthetic CTs, synthetic LRAMs before resampling and synthetic LRAMs after resampling. T1-weighted images or ZTE images were considered as inputs. AC-corrected reconstructed PET images were compared, taking as reference the ones obtained with patient-specific true CT images. The proposed methodologies were assessed both on normal and surgically altered cranial anatomies. Atlas-based and segmentation-based methods were also considered for comparison.

Material and Methods

Brief Literature Review

MR-based DL approaches presented so far in literature for AC correction in brain PET/MR are described in Table 1.

Table 1 MR-based DL approaches for AC correction in brain PET/MR

Full size table

Patient Information

Two datasets were considered in this study. The first one (indicated as FDG dataset in the following) contains images of sixteen patients (9 men, 7 women, 68 ± 9 years old) who underwent a 18F-FDG PET/CT study (Discovery-STE scanner, General Electric Medical Systems, GEMS, Waukeska, WI, USA) for a neurological evaluation of cognitive impairment and accepted to perform a second scan on a PET/MR system (PET/MR SIGNA, GEMS, Waukeska, WI, USA). The double-study protocol was approved by the IRCCS San Raffaele Hospital local ethical committee and patients signed a written informed consent form. The second dataset (indicated as METH dataset in the following) contains images of three women who underwent a 11C-methionine-PET/MR study for oncological evaluation after radiotherapy. These patients were selected to test the proposed techniques as they present cranial anatomy variations due to surgery. They all signed the informed consent allowing using images for research and educational purpose.

FDG PET/CT Data Acquisition

A standard 18F-FDG PET imaging procedure was performed [18]. Patient preparation required at least 4 h of fasting, an i.v. injection with 125–250 MBq (average 150 MBq) of 18F-FDG followed by an uptake time of 45 min to achieve an optimal cerebral uptake. Patients were then positioned on the PET/CT scanner bed with the head in a head holder (HH) to reduce head movements. The PET/CT acquisition protocol consisted of a low dose CT scan (120kVp, 30 mA) followed by a 3D PET study (15 min).

FDG PET/MR Data Acquisition

The PET/MR acquisition started at about 90 min from the FDG injection time. The PET/MR study consisted in four MR sequences (3D LAVA-Flex, 3D Proton Density weighted ZTE, T1-weighted 3D-BRAVO and 2D T2-weighted PROPELLER) acquired using a 32-channel coil array within a simultaneous 20-min PET scan. Only the first three MR sequences were used in this work. Acquisition parameters were as follows:

Axial 3D LAVA-Flex: TR = 4 ms, TE = 1.7 and 2.23 ms, flip angle = 5˚, acquisition matrix = 256 × 128 × 480; reconstructed matrix: 256 × 256, reconstructed FOV = 264 × 264 mm², reconstructed voxel size = 1.95 × 1.95 × 2.6 mm³
3D ZTE: TR = 399.5 ms, TE = 0.016 ms, flip angle = 0.8˚, acquisition matrix = 110 × 110 × 166, reconstructed matrix = 128 × 128, reconstructed FOV = 264 × 264 mm², reconstructed voxel size = 2.06 × 2.06 × 2.4 mm³
Axial 3D T1-weighted (BRAVO): TR = 8.34 ms, TE = 3.10 ms, TI = 500 ms, flip angle = 12˚, acquisition matrix = 240 × 240 × 328, reconstructed matrix = 512 × 512, reconstructed FOV 240 × 240 mm², reconstructed voxel size 0.47 × 0.47 × 0.5 mm³

511 KeV LRAM Generation for FDG Patients

FDG PET data acquired with the PET/CT scanner were reconstructed off line with the Recon Research Tool provided by GEMS (PETTOOLBOX 2.0). The reconstruction algorithm requires as inputs PET raw data and a LRAM. In the Discovery-STE PET/CT scanner AC pipeline, the LRAM is obtained by applying a bilinear scaling and a 10-mm full width at half maximum (FWHM) Gaussian filter to CT images and by resampling the result on a coarse voxel grid of 5.47 × 5.47 × 3.27 mm³. Eight LRAM volumes were generated:

LRAM_CT-System: LRAMs automatically generated by the PET/CT system after processing the original CT images reconstructed on a 0.8 × 0.8 × 2.5 mm³ voxel grid. To ensure coherence among multiple datasets, the bilinear scaling function used in the SIGNA PET/MR system was used. PET images reconstructed with LRAM_CT-System were considered as reference. LRAMs obtained after the bilinear scaling and the 10 mm FWHM Gaussian filter but before the coarse grid resampling were called LRAM_{CT-System-HighRes} and were used as reference output by UNets generating LRAMs before resampling, as described in the following.
LRAM_ATLAS and LRAM_ZTE: LRAMs automatically generated by the SIGNA PET/MR system with atlas-based and segmentation-based methods, respectively relying on LAVA-Flex and LAVA-Flex and ZTE sequences. Both LRAM_ATLAS and LRAM_ZTE accounted for the radiofrequency coil contribution to attenuation. To allow a comparison of the MR based attenuation maps when using PET emission data obtained in the PET/CT study session, the MR coil attenuation component had to be removed and substituted with the PET/CT head holder (HH). Volumes were then registered to CT volumes by means of a rigid transformation with bilinear interpolation using MIPAV software (https://mipav.cit.nih.gov/).
LRAM_CNN-ZTE and LRAM_CNN-T1: LRAMs obtained with a UNet trained on 0.8 × 0.8 × 2.5 mm³ LRAM_{CT-System-HighRes} and ZTE (T1) pairs.
LRAM_{CNN-ZTE-Coarse}: LRAM obtained with a UNet trained on 5.47 × 5.47 × 3.27 mm³ LRAM_CT-System and ZTE pairs.
LRAM_{CNN-pseudoCT-ZTE} and LRAM_{CNN-pseudoCT-T1}: LRAMs obtained applying the AC pipeline to pseudoCTs (pseudoCT-ZTE and pseudoCT-T1) obtained with a UNet trained on 0.8 × 0.8 × 2.5 mm³ CT and ZTE (T1) pairs.

The process of image generation is represented in Fig. 1.

Corresponding images are shown in Fig. 2.

CT, LRAM_{CT-System-HighRes} and LRAM_CT-System to be given as reference outputs to UNets were obtained by removing the HH. ZTE and T1 volumes to be given as inputs to UNets were obtained by correcting ZTE and T1 volumes for magnetic field inhomogeneity using the N4 algorithm implemented in the 3D Slicer Software [20, 21], by normalizing intensities on a case by case basis by median tissue values and by registering the resulting volumes to corresponding CTs by means of a rigid transform with bilinear interpolation using MIPAV software (https://mipav.cit.nih.gov/). HH was lastly added to volumes estimated by UNets.

METH PET/MR and CT Data Acquisition

The PET/MR acquisition protocol for METH studies consisted in axial 3D LAVA Flex and 3D ZTE sequences (already described in “FDG PET/MR Data Acquisition”) and in a set of diagnostic MR sequences acquired simultaneously to a 20-min PET scan. Patients had also a diagnostic CT scan (120 kVps, 250–330 mA) performed to exclude complications immediately after surgery, about 1 month before the PET/MR session.

511 KeV LRAM Generation for METH patients

METH PET data acquired with the PET/RM scanner were reconstructed using five LRAM volumes:

LRAM_CT-System: LRAM generated off line with the Recon Research Tool provided by GEMS (PETTOOLBOX 2.0) starting from the diagnostic CT scan rigidly registered to corresponding RMs with MIPAV software (https://mipav.cit.nih.gov/)
LRAM_ATLAS and LRAM_ZTE, automatically generated by the SIGNA PET/MR system with atlas-based and segmentation-based methods
LRAM_CNN-ZTE: LRAM obtained with a UNet trained on LRAM_{CT-System-HighRes} and ZTE pairs of the sixteen FDG patients, when receiving in input the patient ZTE volume
LRAM_{CNN-pseudoCT-ZTE}: LRAM obtained applying the AC pipeline to pseudoCTs obtained with a UNet trained on CT and ZTE pairs of the sixteen FDG patients, when receiving in input the patient ZTE volume

PET Reconstruction

FDG PET images were reconstructed with 3D Ordered Subsets Expectation Maximization (OSEM). OSEM parameters were constrained to be consistent to those used in our institution to reconstruct the PET database needed for Statistical Parametric Mapping (SPM) analysis [19] (2 iterations, 24 subsets, FOV 25 cm, voxel grid 128, voxel size 1.95 × 1.95 × 3.27 mm³, Transaxial Gaussian Filter of 4 mm FWHM, Axial-Post Filter Standard). METH PET images were reconstructed with 3D OSEM with time of flight (TOF) information. OSEM parameters were: 3 iterations, 16 subsets, FOV 25 cm, voxel grid 128, voxel size 1.95 × 1.95 × 3.27 mm³, Transaxial Gaussian Filter of 3 mm FWHM, Axial-Post Filter Standard.

U-Net Architecture, Implementation and Parameters

A 2D UNet model was implemented in Keras with TensorFlow as backend on a NVIDIA Quadro RTX 8000 GPU. The network architecture is shown in Fig. 3. The network takes in input a 2D axial slice of the ZTE (T1) volume and provides in output the corresponding slice of the estimated CT or LRAM volume. The UNet is composed by 5 convolution layers in the encoding/decoding part with 3 × 3 kernels and Elu activation function. In encoding, each convolution layer is constituted by a first convolution extracting 16/32/64/128/256 features, by dropout, by a second convolution extracting 16/32/64/128/256 features and by 2 × 2 max-pooling. In decoding, feature maps from deeper layers are deconvolved with stride 2, concatenated with corresponding feature maps coming from the encoding part and given in input to a convolution layer identical to the encoding one. The squared L2 norm of the difference between reference and UNet outputs was used as loss function during training with the Adam stochastic optimization method (learning rate = 0.0005, β₁ = 0.9, β₂ = 0.999). UNet weights were initialized with the He Normal Keras initialization, i.e. weights were sampled from a truncated normal distribution N(0,σ) with $\sigma =\sqrt{2/\nu }$, where ν is the number of input units in the weight tensor.

On the FDG dataset, a leave-one-out (LOO) scheme was used to fairly evaluate the strategy performance on all the sixteen subjects. For each LOO run, twelve of the remaining patients were used for training and three for validation, with an early stopping on validation loss to prevent overfitting. On the METH dataset, UNets trained on the whole FDG dataset were used.

Data Analysis

As for the FDG dataset, PET images reconstructed with LRAM_CT-system, LRAM_ATLAS, LRAM_ZTE, LRAM_CNN-ZTE, LRAM_CNN-T1, LRAM_{CNN-pseudoCT-ZTE}, LRAM_{CNN-pseudoCT-T1} and LRAM_{CNN-ZTE-Coarse} were all normalized to the stereotactic space by using the SPM software (https://www.fil.ion.ucl.ac.uk/spm/). PET images were then qualitatively evaluated by two expert nuclear medicine physicians, who looked for differences in the radiotracer distribution depiction and ultimately in the clinical interpretation.

For each patient, a mask was defined on the reference (i.e. PET reconstructed with LRAM_CT-system) by selecting voxels with more than 1/10 of the FOV mean activity. This mask practically discards all background (air) voxels, which are not of interest for the analysis as they are not impacted by attenuation. PET images were then compared to the reference inside the mask in terms of mean-error (ME) and root mean-square-error (RMSE), defined as follows:

$$M{E}_{PET}=\frac{1}{{\sum }_{{i\in N}_{vox}}{PET}_{REF}}{\sum\limits_{i\in {N_{vox}}}}\frac{PET-{PET}_{REF}}{{N}_{vox}}$$

$$RMS{E}_{PET}=\frac{1}{{\sum }_{{i\in N}_{vox}}{PET}_{REF}}\sqrt{{\sum\limits_{{i\in N}_{vox}}}\frac{{\left(PET-{PET}_{REF}\right)}^{2}}{{N}_{vox}}}$$

PET images were finally quantified on a regional basis inside six macro volumes of interest (VOIs) (Fig. 4) identified on the automated-anatomical-labelling (AAL) atlas [22]. In each lobe, we specifically chose the most critical areas for attenuation factor estimation: (1) frontal superior orbital and frontal mid orbital region in the frontal lobe; (2) frontal superior and frontal mid region in the frontal lobe; (3) calcarine, cuneus and lingual region in the occipital lobe; (4) parietal superior and inferior areas in the parietal lobe; (5) temporal inferior areas in the temporal lobe; (6) whole cerebellum. Bilateral regions were always used. Significant differences with respect to the reference were assessed by means of the Mann Whitney test.

As for the METH dataset, multiple 2D ROIs were manually delineated on registered CT images in the anatomically altered region. Mean PET counts in each ROI were then computed and compared, taking as reference the counts obtained on PET images reconstructed with LRAM_CT-system. ROI examples are shown in Fig. 5.

Results

As for the FDG dataset, visual image analysis did not reveal any qualitative differences or visually recognizable artefacts in any of the sixteen patients. In Fig. 6, a representative transaxial PET image is shown for three patients (A, B, C) reconstructed using the eight considered LRAMs. Patients B and C show regions of reduced uptake (white arrows) which are due to their brain dysfunction.

ME and RMSE computed on PET images are shown in Fig. 7.

Atlas-based and segmentation-based LRAMs together with LRAMs computed on the coarse voxel grid provide the largest error on PET images. Errors are smaller and comparable on PET images corrected with LRAMs computed on the finer grid and with LRAMs derived from pseudoCTs. In particular, LRAMs obtained from ZTE provide smaller bias and smaller variability than LRAMs obtained from T1-weighted images.

Results of the regional analysis are shown in Fig. 8.

Atlas-based LRAM and segmentation-based LRAM provide a null intersubject 18F-FDG uptake average bias respectively in frontal region and temporal lobe, but larger bias in the other regions. In particular, LRAM_ATLAS induces a 3% average bias in temporal and parietal lobes, LRAM_ZTE a nearly 3% average bias in parietal lobe and frontal superior region. Similarly, LRAM computed from coarse grid ZTE generates a negligible average bias in the frontal region, but an unacceptably large bias in most of the other regions. LRAMs derived from pseudoCT and those directly estimated on the finer grid, from both ZTE and T1-weighted images, perform more regularly among regions, with an intersubject 18F-FDG uptake average bias always smaller than 1.5%. The intersubject bias variability is large for LRAM_{CNN-ZTE-Coarse} and for LRAMs generated starting from T1-weighted images. It is instead the lowest (always lower than 2%) for LRAMs obtained with UNet and ZTE images. No differences with respect to the reference resulted statistically significant.

As for the METH dataset, original CTs and corresponding pseudoCTs generated by the UNet trained on the anatomically normal FDG patients are shown in Fig. 9. Anatomic variations appear all well reconstructed.

In Fig. 10, original CTs and LRAMs generated by the PET/MR system (LRAM_ZTE, LRAM_ATLAS) and by the UNet (LRAM_{CNN-pseudoCT-ZTE}, LRAM_CNN-ZTE) are shown for a METH patient.

Results of the regional analysis for a METH patient (over 29 ROIs corresponding to 29 brain consecutive image sections) are shown in Fig. 11. LRAM_CNN-ZTE and LRAM_{CNN-PseudoCT-ZTE} appear able to provide values similar to LRAM_CT-System with respect to LRAM_ZTE and LRAM_ALTAS. Globally, LRAM_CNN-ZTE, LRAM_{CNN-PseudoCT-ZTE} and LRAM_ZTE obtained a mean error smaller than 2% in all the three patients; LRAM_ATLAs on a patient got a mean error of 4%.

Discussion

Many works in literature have already shown that deep learning strategies applied to MR images in PET/MR are able to provide synthetic AC maps, and consequently PET images, more accurate than segmentation or atlas-registration strategies. In this work, we investigated the best MR image to be used and the best point of the AC pipeline to insert the synthetic map in. For this reason, we used the same UNet architecture throughout the study and evaluated all the possible inputs/outputs combinations.

The assessment was firstly done on FDG PET data acquired on a PET/CT scanner and reconstructed with the vendor offline tool. MR images of the same patient to be given in input to the UNet were acquired on a PET/MR scanner. MR images were corrected for bias field inhomogeneity, normalized and registered to CT images; MR coils were removed and substituted with the CT HH. We can confirm the results obtained by other authors [2] about the greater importance of data pre-processing (bias field correction and spatial registration) compared to CNN architecture parameters and hyperparameters (results not shown). We think that, probably, even better results could be obtained by further improving these data pre-processing steps.

Sixteen patients were considered and a LOO training/testing scheme was adopted. A 2D UNet with 5 levels of convolution in the encoding/decoding parts was successfully trained on 12 patients with an early stopping on the validation loss computed on 3 patients. PET images reconstructed with AC maps estimated by the UNet were quantitatively compared to reference PET reconstructed with AC maps derived from acquired CT data. Attenuation maps generated by the PET/MR scanner were also assessed, for comparison.

Results show that UNet AC models generally provide an estimate of 18F-FDG uptake with a lower bias than state-of-the-art atlas-based or segmentation-based AC methods. This confirms the results previously obtained by other authors [4, 8, 12,13,14]. As for the best point in the AC pipeline to insert the synthetic map, results show that it is nearly equivalent to generate synthetic CT or directly LRAM after bilinear scaling and low pass filtering. Synthetic CTs provide a mean average bias of −0.7% (−1.0%) and a mean average variability of 2.4% (1.1%) among brain regions, if obtained from T1-weighted (ZTE) images; LRAM has a mean average bias of −0.7% (−0.8%) and a mean average variability of 2.5% (1.4%). It is instead not recommended to generate LRAM after the coarse resampling: in this condition, average bias and bias variability nearly double. The MR image resampling on the coarse grid induces in fact an excessive detail loss which leads to the reconstruction of incorrect LRAM and PET images. As regards instead the best MR sequence, results show that the average bias among patients is nearly the same on images corrected with AC maps obtained from ZTE and T1-weighted images. Only in the frontal region the average bias is larger on images coming from ZTE images (about −1.3%) and lower on images coming from T1-weighted images (about −0.5%). If, however, we look at the bias variability, results clearly show that the least variability is obtained with AC methods that exploit ZTE information. This also confirms the results previously obtained by other authors [16]. In particular, we observed that the same UNet architecture for CT (LRAM) generation provides on average a 120% (70%) lower variability when working on ZTE images than on T1 images. This can be explained by the fact that a CNN can always “learn” the characteristics of the population on average, but accurate prediction at the individual level requires subject-specific information to be present in the input layer. Overall, we can conclude that UNet models working on ZTE and generating synthetic CT or LRAM provide the best results in terms of both accuracy and variability.

To contextualize these numbers within the clinical application, we analysed a typical ¹⁸F-FDG brain analysis pipeline, which involves pixel-wise statistical comparisons against a pool of healthy controls [23]. We found that, after image scaling to the mean brain activity, the coefficient of variation among subjects lies between 7 and 9%, depending on the location. This means that, even at very liberal statistical thresholds (p < 0.01; uncorrected), regions are highlighted as hypometabolic only when the activity reduction is at least larger than 14%. Indeed, in a group comparison on a large number of Alzheimer’s disease subjects [23], we observed a 19% average activity reduction in the most impacted region, the parietal lobe. It is therefore very important to have a systematic bias less than 1% in all the regions and a standard deviation smaller than 2% as provided by ZTE-based AC methods, since these values are mostly guaranteed not to significantly influence the outcome and therefore to provide robust quantification.

We successively assessed the UNet ability to provide AC maps on three PET/MR METH patients with anatomical abnormalities due to surgery. We did not use T1 images in this case, but only ZTE images which intrinsically contain the whole information needed for AC. UNets trained on the sixteen anatomically normal FDG patients on ZTE/CT and on ZTE/LRAM_CT-System pairs resulted able to correctly reconstruct bone abnormalities, with improved accuracy with respect to LRAM_ATLAS and LRAM_ZTE. The generation of LRAM volumes, which intrinsically have a low resolution, resulted less sensitive to such anatomical variations respect to the generation of high resolution CT volumes. PET mages were quantified near bone abnormalities and results were compared with those obtained with AC coming from coregistered diagnostic CTs. Results confirmed those obtained on the FDG dataset. Further studies on this kind of patient/lesions are however needed.

This study has several limitations, which will be discussed in the following.

We chose to use a 2D UNet instead of the best-performing 2.5D or 3D UNets because of the scarce number of patients. However, we do not believe that the results obtained in terms of best MR sequence and best synthetic image insertion point would change as the network changes. Other strategies have been recently proposed in literature, such as GAN or cycleGAN [13], which could provide better results thanks to their robustness to misregistration. The generator used by most of this network is however a 2D UNet, like the one used in this study. Furthermore, these networks require a larger number of patients to be trained. We think we should invest in these networks for abdominal PET/MR AC, where misregistration is definitely a challenge; in brain studies, simpler strategies may be sufficient.
In this work, the AC correction pipeline implemented on our PET/CT and PET/MR scanner was considered. Other vendors probably use slightly different pipelines, e.g. with a reduced low pass filter FWHM of with a less coarse resampling grid. Indeed, this was one of the reasons motivating our choice to compare synthetic image insertion into different points of the AC pipeline. If nearly equivalent PET was obtained generating CTs or LRAMs after 10 mm filtering, we think that equivalent PETs will be also obtained if generating less filtered LRAMs.
A further limitation is due to the smaller axial FOV of the PET/CT scanner (15.7 cm) compared to the wider axial FOV of the PET/MR scanner (25.0 cm). In order to obtain pairs of well spatially correlated CT-MR data, we necessarily had to remove the lower neck region from MR images. However, considering that the lower neck region is generally not crucial for brain neurological evaluations, this limitation reasonably seems not critical for the work.

Conclusions

In this work, we generated synthetic images for AC correction in brain PET/MR from ZTE or T1 images by means of a 2D UNet. The first objective was to understand which of the two MR sequences is able to provide more accurate AC maps and PET images. Results obtained on the 16 anatomically normal FDG patients show that ZTE provides a comparable intersubject average bias on PET images, but a lower intersubject bias variability with respect to T1. ZTE results appear also accurate on the 3 methionine patients containing cranial anatomy abnormalities due to surgery. Well knowing the PET/MR AC pipeline which does not make a full usage of the synthetic CT information but uses a reduced contrast and a reduced spatial resolution version (LRAM), as a second objective, we wanted to understand whether there are differences in generating UNet images at different points of the AC pipeline. Results obtained on both the 16 anatomically normal FDG patients and the 3 methionine patients show that you can equivalently generate synthetic CTs or LRAMs after bilinear scaling and smoothing but before coarse resampling.

Availability of Data and Material

Data are available upon reasonable request.

References

Han X. MR‐based synthetic CT generation using a deep convolutional neural network method. Medical Physics 2017;44(4):1408-1419.
Article CAS Google Scholar
Leynes A. P., Yang J., Wiesinger F., Kaushik S. S., Shanbhag D. D., Seo Y., Hope A.H., Larson P. E. Z. Zero-echo-time and Dixon deep pseudo-CT (ZeDD CT): direct generation of pseudo-CT images for pelvic PET/MR attenuation correction using deep convolutional neural networks with multiparametric MR. Journal of Nuclear Medicine 2018;59(5):852-858.
Article Google Scholar
Spuhler K. D., Gardus J., Gao Y., DeLorenzo C., Parsey R., Huang C. Synthesis of patient-specific transmission data for PET attenuation correction for PET/MR neuroimaging using a convolutional neural network. Journal of Nuclear Medicine 2019;60(4):555-560.
Article CAS Google Scholar
Blanc-Durand P., Khalife M., Sgard B., Kaushik S., Soret M., Tiss A., El Fakiri G., Habert M-O., Wiesinger F., Kas A. Attenuation correction using 3D deep convolutional neural network for brain 18F-FDG PET/MR: Comparison with Atlas, ZTE and CT based attenuation correction. PloS one 2019;14(10):e0223141.
Liu Y., Lei Y., Wang Y., Wang T., Ren L., Lin L., McDonald M., Curran W.J., Liu T., Zhou J., Yang X. MR-based treatment planning for proton radiotherapy: dosimetric validation of a deep learning-based liver synthetic CT generation method. Physics in Medicine & Biology 2019;64(14):145015.
Torrado-Carvajal A., Vera-Olmos J., Izquierdo-Garcia D., Catalano O. A., Morales M. A., Margolin J.,Soricelli A., Salvatore M., Malpica N., Catana C. Dixon-VIBE deep learning (DIVIDE) pseudo-CT synthesis for pelvis PET/MR attenuation correction. Journal of Nuclear Medicine 2019;60(3):429-435.
Article Google Scholar
Kazemifar S., McGuire S., Timmerman R., Wardak Z., Nguyen D., Park Y., Jiang S., Owrangi A. MR-only brain radiotherapy: Assessing the dosimetric accuracy of synthetic CT images generated using a deep learning approach. Radiotherapy and Oncology 2019;136: 56-63.
Article Google Scholar
Yu B., Wang Y., Wang L., Shen D., Zhou L. Medical Image Synthesis via Deep Learning 2020:23–44. In: Lee G., Fujita H. (eds) Deep Learning in Medical Image Analysis. Advances in Experimental Medicine and Biology, vol 1213. Springer, Cham.
Lee, J. S. A Review of Deep-Learning-Based Approaches for Attenuation Correction in Positron Emission Tomography. IEEE Transactions on Radiation and Plasma Medical Sciences 2020, 5(2), 160-184.
Article Google Scholar
Sekine T., Ter Voert E. E., Warnock G., Buck A., Huellner M., Veit-Haibach P., Delso G. Clinical evaluation of zero-echo-time attenuation correction for brain 18F-FDG PET/MR: comparison with atlas attenuation correction. Journal of Nuclear Medicine 2016;57(12):1927-1932.
Article CAS Google Scholar
Wiesinger F., Bylund M., Yang J., Kaushik S., Shanbhag D., Ahn S., Jonsson J.H., Lundman J.A., Hope T., Nyholm T., Larson P., Cozzini C. Zero TE‐based pseudo‐CT image conversion in the head and its application in PET/MR attenuation correction and MR‐guided radiation therapy planning. Magnetic Resonance in Medicine 2018;80(4):1440-1451.
Article Google Scholar
Arabi H., Zeng G., Zheng G., Zaidi H. Novel adversarial semantic structure deep learning for MR-guided attenuation correction in brain PET/MR. European Journal of Nuclear Medicine and Molecular Imaging 2019;46(13): 2746-2759.
Article Google Scholar
Tao, L., Fisher, J., Anaya, E., Li, X., Levin, C. S. Pseudo CT image synthesis and bone segmentation from MR images using adversarial networks with residual blocks for MR-based attenuation correction of brain PET data. IEEE Transactions on Radiation and Plasma Medical Sciences 2021, 5(2), 193-201.
Article Google Scholar
Gong K., Yang J., Kim K., El Fakhri G., Seo Y., Li Q. Attenuation correction for brain PET imaging using deep neural network based on Dixon and ZTE MR images. Physics in Medicine & Biology 2018;63(12):125011.
Gong, K., Han, P. K., Johnson, K. A., El Fakhri, G., Ma, C., Li, Q. Attenuation correction using deep Learning and integrated UTE/multi-echo Dixon sequence: evaluation in amyloid and tau PET imaging. European Journal of Nuclear Medicine and Molecular Imaging 2021, 48(5), 1351-1361.
Article CAS Google Scholar
Schramm G., Koole M., Willekens S., Rezaei A., Van Weehaeghe D., Delso G., Peeters R., Mertens N., Nuyts J., Van Laere K. Regional accuracy of ZTE-based attenuation correction in static [18 F] FDG and dynamic [18 F] PE2I brain PET/MR. Frontiers in Physics 2019;7:211.
Article Google Scholar
Ronneberger O., Fischer P., Brox T. U-Net: Convolutional Networks for Biomedical Image Segmentation. In: Navab N., Hornegger J., Wells W., Frangi A. (eds) Medical Image Computing and Computer-Assisted Intervention – MICCAI 2015. MICCAI 2015:pp 234–241. Lecture Notes in Computer Science, vol 9351. Springer, Cham.
Varrone A., Asenbaum S., Vander Borght T., Booij J.,Nobili F.,Någren K.,Darcourt J.,L Kapucu O.,Tatsch K.,Bartenstein P.,Van Laere K.,European Association of Nuclear Medicine Neuroimaging Committee. European Journal of Nuclear Medicine and Molecular Imaging 2009;36(12):2103–2110.
Friston K.J. Statistical Parametric Mapping. In:Kötter R. (eds) Neuroscience Databases. Springer, Boston, MA. Chapter 16:237–250.
Tustison N.J.,Avants B.B., Cook P.A., Zheng Y., Egan A., Yushkevich P.A., Gee J.C. N4ITK: Improved N3 Bias Correction. IEEE Trans Med Imaging 2010;29(6):1310-1320.
Article Google Scholar
Fedorov A., Beichel R., Kalpathy-Cramer J., Finet J., Fillion-Robin J-C., Pujol S., Bauer C., Jennings D., Fennessy F.M., Sonka M., Buatti J., Aylward S.R., Miller J.V., Pieper S., Kikinis R. 3D Slicer as an Image Computing Platform for the Quantitative Imaging Network. Magn Reson Imaging. 2012;30(9):1323-1341.
Article Google Scholar
Tzourio-Mazoyer N., Landeau B., Papathanassiou D., Crivello F., Etard O., Delcroix N., Mazoyer B., Joliot M. Automated anatomical labeling of activations in SPM using a macroscopic anatomical parcellation of the MNI MR single subject brain. NeuroImage 2002; 15(1):273-289.
Article CAS Google Scholar
Presotto L., Ballarini T., Caminiti S. P., Bettinardi V., Gianolli L., Perani D. Validation of 18 F–FDG-PET Single-Subject Optimized SPM Procedure with Different PET Scanners. Neuroinformatics 2017; 15(2), 151-163.
Article Google Scholar

Download references

Funding

The SIGNA PET/MR scanner has been acquired with funding from the Italian Ministry of Health “CONTO CAPITALE 2010”.

Author information

Authors and Affiliations

Nuclear Medicine Department, IRCCS San Raffaele Scientific Institute, Milan, Italy
Luca Presotto, Valentino Bettinardi, Matteo Bagnalasta, Paola Scifo, Annarita Savi, Emilia Giovanna Vanoli, Federico Fallanca, Maria Picchio, Daniela Perani & Luigi Gianolli
Vita-Salute San Raffaele University, Milan, Italy
Maria Picchio & Daniela Perani
School of Medicine and Surgery, University of Milano-Bicocca, via Cadore 48, Monza, 20900, Italy
Elisabetta De Bernardi
Bicocca Bioinformatics Biostatistics and Bioimaging Centre - B4, University of Milan-Bicocca, Monza, Italy
Elisabetta De Bernardi

Authors

Luca Presotto
View author publications
You can also search for this author in PubMed Google Scholar
Valentino Bettinardi
View author publications
You can also search for this author in PubMed Google Scholar
Matteo Bagnalasta
View author publications
You can also search for this author in PubMed Google Scholar
Paola Scifo
View author publications
You can also search for this author in PubMed Google Scholar
Annarita Savi
View author publications
You can also search for this author in PubMed Google Scholar
Emilia Giovanna Vanoli
View author publications
You can also search for this author in PubMed Google Scholar
Federico Fallanca
View author publications
You can also search for this author in PubMed Google Scholar
Maria Picchio
View author publications
You can also search for this author in PubMed Google Scholar
Daniela Perani
View author publications
You can also search for this author in PubMed Google Scholar
Luigi Gianolli
View author publications
You can also search for this author in PubMed Google Scholar
Elisabetta De Bernardi
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Elisabetta De Bernardi.

Ethics declarations

Ethics Approval

The double study protocol was approved by the IRCCS San Raffaele Hospital local ethical committee.

Consent to Participate

Patients signed a written informed consent form.

Conflict of Interest

The authors declare no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Presotto, L., Bettinardi, V., Bagnalasta, M. et al. Evaluation of a 2D UNet-Based Attenuation Correction Methodology for PET/MR Brain Studies. J Digit Imaging 35, 432–445 (2022). https://doi.org/10.1007/s10278-021-00551-1

Download citation

Received: 27 November 2020
Revised: 10 November 2021
Accepted: 16 November 2021
Published: 28 January 2022
Issue Date: June 2022
DOI: https://doi.org/10.1007/s10278-021-00551-1

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Evaluation of a 2D UNet-Based Attenuation Correction Methodology for PET/MR Brain Studies

Abstract

Similar content being viewed by others

A deep learning approach for 18F-FDG PET attenuation correction

Validation of PET/MRI attenuation correction methodology in the study of brain tumours

Evaluation of zero-echo-time attenuation correction for integrated PET/MR brain imaging—comparison to head atlas and 68Ge-transmission-based attenuation correction

Introduction