Background

Positron emission tomography (PET) is one of the most sensitive non-invasive in vivo imaging techniques [1] and has demonstrated its tremendous impact in clinical and research studies [2,3,4,5].

The high resolution research tomograph (HRRT, CTI PET Systems, Knoxville, TN, USA), introduced in the late 1990s/early 2000s, is to this day arguably one of the highest resolution PET scanners for human brain imaging [6]. The double layer of cerium-doped lutetium oxyorthosilicate (LSO) and cerium-doped lutetium-yttrium oxyorthosilicate (LYSO) crystals, which enable photon depth-of-interaction (DOI) detection, results in a spatial resolution of (~ 2.5 mm)3 at 1 cm tangential offset from center of the field of view (cFOV) and fairly uniform across the FOV [7]; the scanner can thus serve as a very useful benchmark when evaluating performance of newer scanners. Given that the HRRT is no longer commercially available, most dedicated brain imaging sites consider acquiring newer PET systems. Even though in most cases dedicated brain PET scanners would be preferred for cost, and potentially better sensitivity and resolution performance, the choice is generally practically limited to whole-body hybrid systems, i.e., whole-body PET systems in combination with computed tomography (CT) or magnetic resonance (MR) imaging, as standalone dedicated PET systems such as the HRRT are no longer available or common.

The GE SIGNA PET/MR system (GE Healthcare, Chicago, IL, USA) is the first whole-body hybrid PET/MR system based on silicon photomultipliers (SiPMs) with time-of-flight (TOF) capabilities. The PET detectors are integrated in the MR bore allowing whole-body simultaneous acquisition of PET and MR [8] with high PET detection stability [9].

The aim of this work was to perform a cross-validation study between the HRRT and the SIGNA PET/MR beyond the National Electrical Manufacturers Association (NEMA) evaluation to provide a direct and clinically relevant comparison with particular focus on brain imaging. While the benefits of simultaneous PET/MR imaging for neurological applications have been discussed elsewhere [10, 11], the present study focuses on the comparison of the image quality obtained with TOF data acquired on the PET/MR (default acquisition mode) and the non-TOF data acquired on the HRRT (by hardware default).

In a first step, phantom data were acquired to determine contrast recovery coefficients (RCs) and percent background variability (%BG), as well as percent voxel-level noise (%) for both systems. Cross-validation studies were then performed with six healthy volunteers scanned on both systems with commonly used PET tracers: [11C]dihydrotetrabenazine ([11C]DTBZ), a marker for the vesicular monoamine transporter 2 (VMAT2), the glucose analog [18F]-fluorodeoxyglucose ([18F]FDG) and [11C]raclopride, a D2/3 receptor antagonist. The choice of tracers was dictated by the requirement to examine cases of widespread tracer distribution ([18F]FDG) and tracers with more localized distribution and a wide range of count-rates ([11C]DTBZ and [11C]raclopride).

Methods

Systems description

The HRRT system is based on a dual-layer design of 2.44 × 2.44 × 10 mm3 LSO/LYSO crystals enabling DOI encoding and coupled to photomultiplier tubes (PMTs). The FOV spans 25 cm in the axial and 35 cm in the transaxial direction [7, 12].

The PET component of the SIGNA PET/MRI is based on SiPMs coupled to 4.0 × 5.3 × 25 mm3 lutetium-based scintillation crystals, which enable the use of TOF information during data reconstruction (timing resolution < 400 ps) [8]. The detector ring, spanning an axial FOV of 25 cm and a transaxial FOV of 60 cm [8, 13], is fully enclosed in the MRI scanner. The MR component is equipped with a radiofrequency transmit body coil embedded in a static 3 T magnet. All MR subject data used in this study were acquired with the GE head neck unit 12 channel coil. Table 1 lists a detailed comparison of both systems’ specifications and reported performance parameters.

Table 1 System specification and performance parameters of the Siemens HRRT and GE SIGNA PET/MR system

Phantom studies

RCs and %BG variability were determined using a cylindrical phantom (referred to as contrast phantom, Flanged Jaszczak ECT Phantom, Data Spectrum Corporation, Durham, NC, USA) scanned on the HRRT and on the PET/MR to enable a direct comparison, as the NEMA recommended phantom does not fit in the HRRT FOV. However, to enable a comparison with other systems, the NEMA recommended image quality phantom was additionally scanned on the PET/MR. Results can be found in the supplementary data.

The contrast phantom contained 6 fillable hollow spheres with inner diameters (IDs) of 9.9, 12.4, 15.4, 19.8, 24.8, and 31.3 mm. The two largest spheres were filled with water; the four smaller spheres and the background (volume 6000 ml) were filled with 18F activity in a 3.88:1 and 3.91:1 contrast ratio for the PET/MR and HRRT, respectively (total activity: ~ 29 MBq). No background activity was placed outside the FOV. List-mode data were acquired and histogrammed into one frame with activity concentration and counts matched between scanners.

A transmission scan using a rotating 137Cs source was performed on the HRRT to correct for attenuation. HRRT data were reconstructed using an ordinary Poisson-ordered subset expectation maximization (OP-OSEM) algorithm [14] with a 256 × 256 × 207 matrix resulting in a reconstructed voxel size of 1.22 × 1.22 × 1.22 mm3. Sixteen subsets and 6 iterations were used for reconstructions without post-filtering (this will be denoted as “native”) and with a 2-mm FWHM Gaussian filter (standard in-house reconstruction), and 10 iterations for reconstructions with PSF [15, 16] and with PSF and a 2-mm FWHM Gaussian filter, respectively (see Table 2). 

Table 2 Reconstruction parameters used for the HRRT phantom and healthy subject scans

In order to correct the data acquired on the PET/MR for attenuation, the transmission maps acquired on the HRRT were resliced and co-registered to the non-attenuation corrected PET/MR images. The manufacturer’s attenuation map of the PET/MR coil was then co-registered and integrated into the resliced HRRT attenuation map.

PET/MR data were reconstructed using a TOF-OSEM algorithm with 28 subsets, 2 iterations, and a 128 × 128 × 89 matrix resulting in a reconstructed voxel size of 2.781 × 2.781 × 2.780 mm3 (reconstructed in-plane FOV: 35.6 cm, GE recommendation for phantom data). Additionally, the data were reconstructed using TOF-OSEM + Gaussian post-filtering with a 3.5 mm full width at half maximum (FWHM) filter in all three dimensions (TOF + filter), TOF-OSEM + resolution modeling with point spread function (PSF, TOF + PSF) (PSF correction was implemented by the manufacturer following the approach from [17]) and TOF-OSEM + PSF + Gaussian post-filtering with a 3.5-mm FWHM filter in all three dimensions (TOF + PSF + filter, Table 3). Manufacturer supplied corrections for decay, random and scattered coincidences, normalization, and dead time were applied. PET/MR phantom data were also reconstructed without TOF information (woTOF) using the same parameter settings as previously described. Different sized post filters between the HRRT and PET/MR were applied to achieve noise reduction without significantly degrading the resolution; i.e., the FWHM of the filter is in each case slightly smaller than the smallest dimension of the detector crystal.

Table 3 Reconstruction parameters used for the PET/MR phantom and healthy subject scans

Phantom data analysis

Data analysis was performed by placing regions of interest (ROI) using the software package PMOD (version 3.602 & 4.005, PMOD Technologies Ltd., Zurich, Switzerland). A single slice ROI equal to the size of the actual inner sphere diameter was placed in the transaxial plane in which the sphere was most visible (according to the NEMA recommendations) [18]. Furthermore, a spherical VOI with the same diameter as the sphere size was placed on the volumetric sphere image. For the background, 12 ROIs with sizes corresponding to those placed on the spheres were drawn on the same plane where the single slice ROI was placed and at an axial offset of +/− 1 cm and +/− 2 cm, resulting in a total number of 60 background ROIs for each sphere, respectively. Single slice background ROIs were not extended to spherical VOIs.

RCs and %BG variability were determined according to the standardized NEMA NU 2-2007 protocol [18] using the following formulas:

$$ {\mathrm{RC}}_{hot}=\frac{\left(\raisebox{1ex}{${C}_{hot\_ sphere}$}\!\left/ \!\raisebox{-1ex}{${C}_{background\_ sphere}$}\right.\right)-1}{\left(\raisebox{1ex}{${A}_{hot}$}\!\left/ \!\raisebox{-1ex}{${A}_{background}$}\right.\right)-1}\times 100\% $$
(1)
$$ {\mathrm{RC}}_{cold}=\left(1-\frac{C_{cold\_ sphere}}{C_{background\_ sphere}}\right)\times 100\% $$
(2)
$$ \%\mathrm{BG}\ \mathrm{variability}=\frac{SD_{sphere}}{C_{background\_ sphere}}\times 100\% $$
(3)
$$ {\mathrm{SD}}_{\mathrm{sphere}}=\sqrt{\sum \limits_{k=1}^K\raisebox{1ex}{${\left({C}_{background\_ sphere,k}-{C}_{background\_ sphere}\right)}^2$}\!\left/ \!\raisebox{-1ex}{$\left(K-1\right)$}\right.},K=60 $$
(4)

where Chot _ sphere, Ccold _ sphere, and Cbackground _ sphere correspond to the mean concentration measured in ROIs placed on the images of the hot and cold sphere and on the background for the respective spheres. Ahot and Abackground correspond to the true activity concentration measured with a well-counter for the hot spheres and background, respectively.

For the contrast phantom data, the coefficient of variation between voxel values within uniform background regions was used as a measure of voxel noise (%) in the reconstructed images according to the following formula:

$$ \mathrm{voxel}\ \mathrm{noise}\ \left(\%\right)=\frac{SD_{background\_ sphere}}{C_{background\_ sphere}}\times 100\% $$
(5)

where SDbackground _ sphere describes the standard deviation measured in ROIs placed on the background for the respective spheres.

The voxel noise (%) was determined for frames with a relatively high number of prompts (~ 135 million), and a low number of prompts (~ 12 million) to mimic count statistics encountered during typical human 11C scans. High count data were analyzed based on a single realization while low count data were based on five replicates. Data were reconstructed with up to 10 iterations using the reconstruction parameters as specified above.

Human scans

Cross-validation studies were performed with six healthy volunteers (age 63.5 ± 15.18 years). The Clinical Research Ethics Board of the University of British Columbia approved the study, and informed written consent was provided by all participating subjects.

Two healthy volunteers were scanned using [11C]DTBZ, three with [18F]FDG, and one with [11C]raclopride on both scanners. The two scans of each subject were performed within 1.5 months except for one [11C]DTBZ subject whose time interval between the two scans was 8 months; this was not considered to be a confound as VMAT2 binding shows a negligible age dependence over this time frame [19]. Subjects scanned with [18F]FDG fasted for at least 6 h before tracer injection.

On the HRRT, the subjects were positioned using external lasers aligning the gantry with the inferior orbital-external metal line, and custom-fitted thermoplastic masks were applied to minimize head movement. Subject positioning on the SIGNA PET/MR scanner was performed based on an MRI localizer sequence. Intravenous administration of the tracers (see Table 4 for detailed information on injected activity amounts) over 60 s were performed using an infusion pump (Harvard Instruments, Southnatick, MA, USA; one volunteer scanned with [18F]FDG was manually injected due to more fragile veins of this subject). Injected activity between scanners was matched for each subject, respectively, with the exception of one scan, where the injected activity was lower for the HRRT scan due to technical reasons (Table 4). List-mode data were acquired for 60 min and histogrammed into 16 frames for [11C]DTBZ and [11C]raclopride (4 × 60 s, 3 × 120 s, 8 × 300 s, 1 × 600 s) and into 17 frames for [18F]FDG (4 × 60 s, 3 × 120 s, 10 × 300 s).

Table 4 Injected activity of the respective tracer for each subject at start of the acquisition

For the HRRT, a transmission scan with a 137Cs source was performed before tracer injection. Reconstruction of the HRRT data was performed using OP-OSEM with 16 subsets and 6 iterations (see Table 2). Corrections for decay, dead-time, random, attenuated and scattered coincidences, and detector normalization were applied. Reconstructed images were post-processed with a 2.0-mm FHWM Gaussian filter (standard in-house reconstruction).

For the PET/MR, a zero echo time (ZTE) approach was utilized to correct for attenuation (ZTE MRAC) with sequence parameters: echo time (TE) = 0.016 ms, repetition time (TR) = 399.564 ms, matrix size = 110 × 100 × 116, voxel size = 2.4 × 2.4 × 2.4 mm3, flip angle (FA) = 0.8°, number of excitations (NEX) = 4, acquisition time = 42 s [20]. PET/MR data were reconstructed using TOF-OSEM with 28 subsets and 4 iterations, a matrix size of 256 × 256 resulting in a reconstructed voxel size of 1.391 × 1.391 × 2.780 mm3 (see Table 3). Based on the results of the phantom study, the post-processing parameters chosen for the human data included the manufacturer supplied 3.5 mm Gaussian transaxial filter as well as a 3-point axial convolution filter and PSF resolution modeling. All manufacturer-provided corrections were applied (dead-time, decay, random, attenuation, and scattered coincidences, normalization).

Reconstructed dynamic PET images were frame-to-frame realigned based on a rigid-body transformation (Statistical Parametric Mapping (SPM), version 12, Wellcome Trust Centre for Neuroimaging, University College London, UK) to correct for potential motion during the scan. Anatomical MR images were co-registered and resampled to the mean PET images using SPM. MR sequence parameters were: 3D brain volume imaging (BRAVO) sequence, TE = 2.984 ms, TR = 7.948 ms, matrix size = 256 × 256 × 162, voxel size = 1 × 1 × 1 mm3, FA = 12°, NEX = 1, acquisition time = 6:13 min for healthy volunteer 1-4; magnetization-prepared rapid acquisition gradient echo (MPRAGE) sequence, TE = 3.168 ms, TR = 8.412 ms, MPRAGE TR = 2488 ms, matrix size = 256 × 256 × 164, voxel size = 1 × 1 × 1mm3, FA = 8°, NEX = 1, and acquisition time = 7:39 min for healthy volunteer 5-6. Predefined target and reference ROIs based on the Montreal Neurological Institute (MNI) template were eroded and applied to the PET images by using the inverse transformation of the co-registered MR images to the MNI space.

For [11C]DTBZ and [11C]raclopride scans, non-displaceable binding potentials (BPND) in the caudate (left and right) and for three putamen regions covering the entire length were calculated using the Logan graphical analysis [21] with the occipital cortex and cerebellum, respectively, as reference regions.

[18F]FDG binding ratios were determined based on the last 30 min of each dataset using the pons as reference region for the same striatal ROIs used in the evaluations of [11C]DTBZ and [11C]raclopride and additional regions including the cerebellum, left and right medial front gyrus, medulla, midbrain and occipital cortex to reflect the widespread [18F]FDG distribution across the brain.

Regional time activity curves (TACs) were determined for all subjects for two target regions (left caudate and left putamen 1) and the respective reference region (DTBZ: occipital cortex; FDG: pons; raclopride: cerebellum). Radial profiles of a single voxel width were placed on the images averaged over the entire acquisition to determine potential spatial variations between the HRRT and PET/MR data. The activity concentrations along the profiles were normalized to the injected activity.

Results

RCs and %BG variability

Figure 1 shows representative images of the contrast phantom data for different reconstruction parameters. RCs and %BG variability as a function of sphere diameter for different reconstruction settings are depicted in Fig. 2 and listed in Tables 5 and 6. Supplementary Figure 1 and 2 and Supplementary Table 1 depict the corresponding results for the image quality phantom according to NEMA recommendations. As expected, the highest RCs for all hot spheres of the contrast phantom scanned on the PET/MR were obtained from the TOF + PSF reconstruction, while the lowest RCs were obtained by applying post-filtering without PSF correction (Fig. 2a). The cold spheres revealed similar RCs for all 4 evaluated reconstruction settings. %BG variability was highest for TOF only reconstruction and lowest for TOF + PSF + filter reconstruction for most of the spheres (Fig. 2b).

Fig. 1
figure 1

Representative images of the contrast phantom data for different reconstruction parameters for the two systems. Color scales of the images were normalized based on the average background value multiplied by the respective sphere to background ratio

Fig. 2
figure 2

RCs and %BG variability as a function of sphere diameter for the contrast phantom (PET/MR: RCs (a), %BG variability (b); HRRT: RCs (c), %BG variability (d)). PET/MR data were reconstructed with TOF, TOF with filter, TOF with PSF, and TOF with PSF and filter. Two iterations and 28 subsets were used, respectively (GE recommendation for phantom data). HRRT data were reconstructed without any filters (native), with a 2-mm Gaussian filter (standard in-house reconstruction for human data) using 6 iterations and 16 subsets, respectively, with PSF, and with PSF correction and 2-mm filter using 10 iterations and 16 subsets, respectively. Note the gap between spheres marks cold vs. hot spheres

Table 5 RCs for the contrast phantom scanned on the PET/MR and HRRT
Table 6 %BG variability for the contrast phantom scanned on the PET/MR and HRRT

For the contrast phantom scanned on the HRRT (Fig. 2c), similar RCs were obtained from the PSF reconstruction with and without filter for all spheres except for the smallest one (PSF: 54.4%; PSF + filter: 49.5%). Lowest RCs for the four hot spheres were observed from the native and the 2-mm Gaussian post filter reconstruction. As expected, the relatively narrow post filter had the biggest impact on the RC of the smallest sphere. The filtered reconstruction yielded lowest %BG variability values for all spheres (Fig. 2d).

In direct comparison of the contrast phantom data, both the PET/MR TOF only and TOF + PSF + filter reconstructions revealed similar RCs for the hot spheres compared to both the HRRT PSF and PSF + filter reconstructions (Fig. 2 and Tables 5 and 6), except for the smallest sphere for the HRRT PSF reconstruction (deviations ~ 10% between scanners).  The PET/MR TOF + filter reconstruction revealed similar RC values compared to the HRRT filter only reconstruction.

RCs of the PET/MR decreased for all sphere sizes when the TOF information was not used (woTOF, Supplementary Figure 3 and Supplementary Table 2).

RCs of all hot spheres decreased for the spherical VOI analysis in comparison to the single slice ROI analysis (except for the largest hot sphere for the HRRT PSF reconstruction, Supplementary Figure 4).

RC and voxel-level noise (%)

RCs versus voxel noise (%) comparison between various reconstruction parameters, two different sphere sizes, and count statistics for the contrast phantom scanned on the PET/MR and the HRRT is illustrated in Figs. 3 and 4, respectively. A systematic increase in voxel noise (%) was observed for the low count frame in comparison to the high count frame. The highest noise levels for the PET/MR were obtained for the TOF only reconstruction, whereas the lowest noise levels were obtained when using TOF with filter and TOF with PSF and filtering. The HRRT native reconstruction revealed the highest voxel noise (%), while the lowest voxel noise (%) was determined for both filtered reconstructions.

Fig. 3
figure 3

PET/MR RCs (%) as a function of voxel noise (%) for two different sphere sizes and for a high and low count frame for the contrast phantom (19.8 mm sphere: high count frame (a), low count frame (b); 9.9 mm sphere: high count frame (c), low count frame (d)). The high count frame corresponds to ~ 135 million prompts within the frame, the low count frame to ~ 12 million prompts. Note the difference in x-axis limits. PET/MR data were reconstructed with TOF, TOF with filter, TOF with PSF, and TOF with PSF and filter using 28 subsets, respectively. High count data were analyzed based on a single realization while low count data were based on five replicates. Each point represents an OSEM iteration, and the number of iterations increases from left to right. The dotted line represents the HRRT standard in-house reconstruction (2-mm Gaussian filter) used for human data reconstruction

Fig. 4
figure 4

HRRT RCs (%) as a function of voxel noise (%) for two different sphere sizes and for a high and low count frame for the contrast phantom (19.8 mm sphere: high count frame (a), low count frame (b); 9.9 mm sphere: high count frame (c), low count frame (d)). The high count frame corresponds to ~ 135 million prompts within the frame, the low count frame to ~ 12 million prompts. Note the difference in x-axis limits. HRRT data were reconstructed without any filters (native), with a 2-mm Gaussian filter (standard in-house reconstruction for human data), with PSF, and with PSF correction and 2-mm filter using 16 subsets, respectively. High count data were analyzed based on a single realization while low count data were based on five replicates. Each point represents an OSEM iteration, and the number of iterations increases from left to right. The dotted line represents the PET/MR reconstruction (wTOF, PSF, 3.5-mm Gaussian filter) used for human data reconstruction

Figure 5 shows the RCs (%) versus voxel noise (%) for all sphere sizes based on the reconstruction parameters used for the human data. Increased voxel noise (%) for the HRRT compared to the PET/MR was found for the high and low count frame. For the PET/MR, the trajectory of RCs versus voxel noise (%) reached a relatively stable value from iteration 4 on for most of the hot spheres; reconstructing with more iterations increased noise but not RCs significantly.

Fig. 5
figure 5

RCs (%) as a function of voxel noise (%) for all sphere sizes for a high and low count frame for the contrast phantom (PET/MR (a), HRRT (b)). The high count frame corresponds to ~ 135 million prompts within the frame, the low count frame to ~ 12 million prompts. Note the difference in x-axis limits. Each point represents an OSEM iteration, and the number of iterations increases from left to right. PET/MR data were reconstructed with TOF + PSF + filter; HRRT data were reconstructed with filter only (standard in-house reconstruction for human data). High count data were analyzed based on a single realization while low count data were based on five replicates. Solid lines represent the hot spheres, dotted lines the cold spheres

Based on the phantom results, the best trade-off in terms of RCs, %BG variability, and voxel noise (%) for the PET/MR was achieved with the reconstruction setting of TOF + PSF + filter and 4 iterations (lowest %BG variability, voxel noise in % vs RCs for almost all spheres converged along with high RCs due to PSF correction), which was consequently used for the reconstruction of all human data.

Human scans

Figure 6 depicts representative images of the mean [11C]DTBZ, [18F]FDG, and [11C]raclopride tracer distribution for each tracer. Figure 7 illustrates regional time-activity curves for each subject. A voxel-wise correlation of the HRRT to PET/MR activity concentration for each subject can be found in Supplementary Figure 5.

Fig. 6
figure 6

Representative images of the mean [11C]DTBZ, [18F]FDG, and [11C]raclopride tracer distribution for each subject. Color scales of the images were adjusted based on SUV values for each tracer separately with a common maximum between scanners. PET/MR data were reconstructed with TOF with PSF and filter using 4 iterations and 28 subsets; HRRT data were reconstructed using 6 iterations and 16 subsets and post-processed with a 2.0 mm FHWM Gaussian filter (standard in-house reconstruction). Note: Subject 4 was injected with a lower amount of [18F]FDG activity for the HRRT scan compared to the other scans; hence, noise characteristics differ accordingly

Fig. 7
figure 7

Regional time activity curves for each subject for two target regions (left caudate and left putamen 1) and the reference region ([11C]DTBZ: occipital cortex, [18F]FDG: pons, [11C]raclopride: cerebellum) for all subjects. Note that the offset in time activity curves between the PET/MR and HRRT is due to different acquisition start triggering mechanisms (HRRT: manual acquisition start; PET/MR: acquisition start based on counts). Solid lines represent the PET/MR data, dotted lines the HRRT data

Line profiles

Line profiles show excellent spatial agreement between the images obtained with the two scanners (Fig. 8). [11C]DTBZ and [11C]raclopride profiles reveal partial magnitude differences, while [18F]FDG profiles were almost identical.

Fig. 8
figure 8

Line profiles ([11C]DTBZ, [18F]FDG, [11C]raclopride) of mean PET images. Profiles were normalized to the injected activity of each subject. Position of line profiles along the FOV (solid lines: typical target regions; dotted lines: typical reference regions) are illustrated on a representative slice of the mean [11C]DTBZ, [18F]FDG and [11C]raclopride image

[11C]DTBZ

The estimated BPND values obtained on the two scanners were found to be in good agreement and deviations were below 10% except for the left caudate for both subjects, the right anterior putamen for subject 1 and the right posterior putamen for subject 2 (Fig. 9a and b).

Fig. 9
figure 9

BPND of [11C]DTBZ (a) and [11C]raclopride (e) scans and ratios to reference region for [18F]FDG (c) scans. Calculated ratios of HRRT to PET/MR scans are depicted for [11C]DTBZ (b), [18F]FDG (d), and [11C]raclopride (f). Mean ratios of HRRT to PET/MR values for each tracer are depicted in (g). Dotted red line represents the ratio of 1

[18F]FDG

Overall, the ratios between different regions and pons were comparable between scanners (Fig. 9c and d). Deviations larger than 10% were detected for the anterior left and right putamen and left and right medial front gyrus for subject 3 and subject 5, respectively, and for the midbrain for subject 3 and the right putamen 2, as well as the medulla for subject 5.

[11C]raclopride

BPND values revealed deviations below 10% except for the left caudate and the right posterior putamen (Fig. 9e and f).

Figure 9g displays mean HRRT to PET/MR parameters of all subjects for each tracer, respectively. Mean [11C]DTBZ ratios revealed deviations below 10% for all investigated regions. The same was detected for the mean [18F]FDG ratios except for the left anterior putamen (10.4%) and left and right medial front gyrus (11.9% and 11.1%, respectively). [11C]raclopride ratios were determined based on a single subject.

Discussion

Cross-validation studies between the HRRT and the SIGNA PET/MR system were performed to assess the comparability with a specific focus on brain imaging.

For phantom studies, different PET/MR reconstruction parameters were tested to evaluate those most suitable for in vivo studies, though the scope of the study was not to necessarily perform a full optimization of the reconstruction and post-processing parameters, but rather to evaluate parameters that can be routinely used for human data acquisition.

As expected, the highest RCs were observed from PSF reconstructions for both PET/MR and HRRT data (Fig. 2) as has been reported by multiple studies for clinical PET systems [8, 16, 22,23,24,25]. However, since resolution modeling with PSF can cause edge artifacts (Gibbs artifacts), PSF reconstruction is not necessarily always the most quantitatively accurate method and it needs to be carefully evaluated [16, 22, 26, 27]. %BG variability was lowest when applying post-filtering for both scanners and this concurrently resulted in a decrease in RCs (Fig. 2).

Reconstruction with TOF only and with TOF, PSF correction and filter, revealed similar RCs for the PET/MR. This is likely due to fact that the improvement in RCs due the resolution modeling is reversed due to the post-processing filter with the chosen width. The net result was noise reduction without altering RCs. Applying a different filter width would likely yield differences in RCs between TOF only and TOF, PSF correction + filter.

The HRRT %BG variability was slightly higher compared to the PET/MR for most of the spheres (Fig. 2b and d). Voxel noise (%) of HRRT images was significantly higher (Fig. 5). As the HRRT uses smaller detector crystals compared to the PET/MR and the reconstructed voxel size was also smaller, higher noise levels both on a regional and voxel-level are expected due to the lower measured counts per detector pair.

Determined voxel noise (%) for the HRRT (Fig. 5) is comparable to already published results [28] within the limits of small differences in the phantom used and methodological approaches.

While the HRRT clearly outperforms the PET/MR in terms of intrinsic spatial resolution and resolution uniformity, the PET/MR TOF capability leads to significantly improved signal-to-noise ratios [29, 30]. It has been reported, that TOF reconstruction results in higher RCs at matched noise levels with faster convergence when comparing to woTOF data [31,32,33]. This is in line with our results (Fig. 5). Furthermore, when comparing RCs reconstructed with and without TOF, all spheres (Supplementary Figure 3 and Supplementary Table 2), revealed a decrease in RCs when no TOF information was used for reconstruction, as previously reported [30, 34, 35]. The cold spheres exhibited a huge decrease in RCs when TOF information was not utilized for reconstruction, which is in line with published results [8, 36]. Furthermore, comparing the woTOF PET/MR data to the HRRT data revealed lower RCs for all PET/MR reconstructions for the two smallest hot spheres due to the lower spatial resolution of the PET/MR system (Supplementary Figure 3 and Supplementary Table 2). Hence, the use of TOF information for reconstruction contributes to partially off-set the intrinsically higher spatial resolution of the HRRT system.

RCs were determined according to the NEMA protocol and are based on a single slice ROI; the single slice ROI shall be positioned on the image slice with the highest sphere activity concentration [18]. However, this might be prone to inaccuracies, as the center of the sphere might be between two adjacent slices (axial slice thickness PET/MR: 2.78 mm; HRRT: 1.22 mm). A VOI-based analysis method covering the entire sphere was proposed as a more robust analysis method [37]. Determining the recovery with a VOI-based method revealed lower RCs for all hot spheres for the PET/MR and HRRT (except for the largest hot sphere PSF reconstruction), though changes in RCs due to the analysis method were larger for the PET/MR (Supplementary Figure 4). Our results are in line with results from the literature [37], demonstrating that the analysis method clearly has a significant impact on the determination of the RCs.

Within the limitations discussed above (different reconstructed voxel sizes between scanners, spatial resolution of the scanners and reconstruction parameters (TOF vs. woTOF, etc.)), a direct comparison of the phantom data between the HRRT and PET/MR revealed comparable RCs, %BG variability, though voxel noise (%) differed significantly. The best trade-off (lowest %BG variability, voxel noise in % vs RCs for almost all spheres was converged along with high RCs due to PSF correction) between RCs, %BG variability and voxel noise (%) for the PET/MR was found with PSF correction and filtering. Hence, this reconstruction paradigm was used for reconstruction of all subject scans and compared to results from our own standard HRRT reconstruction setting used for patient scans (2 mm Gaussian filtering).

TACs of the HRRT subject scans revealed higher noise than the PET/MR TACs, due to the smaller crystals, as well as smaller reconstructed voxel sizes (Fig. 7), consistent with the phantom data. No systematic differences between line profiles were detected (Fig. 8), indicating that differences in profiles are likely due to scan-to-scan or biological variations. This also indicates that the quantitative data correction algorithms implemented on each scanner do not contribute to significant acquisition and scanner specific biases.

The variation of the [11C]DTBZ BPND was found to be within 10% (Fig. 9g). Although the time interval between both scans for subject 1 was 8 months, no influence of the extended time interval was detected, which is in line with literature reporting that VMAT2 binding shows a negligible age dependence over this time frame [19].

Similar differences as for [11C]DTBZ were found for [18F]FDG ratios for most of the investigated regions, which were spanning a wider fraction of the brain. The largest deviation was detected for the medial front gyrus possibly due to the differences in spatial resolution uniformity across the FOV [8, 12]. Considering also the fact that calculated deviations are based on 3 subjects only, we can conclude that both systems revealed very similar [18F]FDG ratios.

In general, a trend towards larger differences between both systems for the posterior putamen was observed. This region is affected most by the partial volume effect, as it is the smallest region investigated in this study [38]. As the HRRT has a higher spatial resolution, differences in BPND, as well as ratios, could be expected to be the largest for this region.

Comparison of the acquired healthy subject data might be biased due to different attenuation correction approaches used for the HRRT and PET/MR data. HRRT subject data were corrected for attenuation using a 137Cs transmission scan approach, whereas the PET/MR subject data were corrected based on a ZTE MRI scan of the respective scan. Transmission scans are in general more sensitive to noise, as the signal to noise ratio solely depends on the acquisition duration of the transmission scan and the activity of the used transmission source. Furthermore, Sousa et al. determined higher linear attenuation coefficients in brain imaging for ZTE attenuation maps compared to 68Ge attenuation maps. For the cerebellum and posterior cortical regions, a higher correlation and improved precision in standard uptake values were detected when ZTE attenuation maps were used compared to 68Ge transmission scans [39]. Multiple studies have shown that this is due to an improved estimation of the temporal bone in ZTE maps [20, 39, 40]. However, based on the evaluated line profiles as shown in Fig. 8, no significant and systematic impact due to different attenuation correction approaches was determined.

Smaller voxel sizes were chosen for the reconstruction of PET/MR subjects’ scans compared to phantom data to enable the VOI position based on a finer pixel grid. Voxel size was determined to have no impact on the RCs (deviation of RCs between 128 and 256 matrix size was below 1.6%).

Several studies have reported that variability (instrumentation and biological based) in test-retest scans of multiple applications is approximately 10% or more [41,42,43,44]. Given that the number of subjects in our study was relatively low, coupled with the outcomes of the phantom studies, it is reasonable to expect that the estimate of variability between same-subject scans performed on the two scanners may even decrease when increasing the number of subjects.

It is of interest to note that our results most likely are applicable to a comparison between the HRRT and a GE PET/CT system, as both the GE PET/MR and PET/CT systems are based on a similar detector design and reconstruction algorithms [45]. Especially the 5-ring configuration of the PET/CT system with the same axial FOV as the PET/MR will most likely demonstrate similar in vivo image quality parameters [45].

Conclusions

This work demonstrates that the whole-body hybrid SIGNA PET/MR system is well suited for high-resolution human brain imaging and that the scanner performance in a clinical setting is quite comparable to that of the HRRT system, which is still arguably the human scanner with the highest intrinsic resolution. The HRRT outperforms the PET/MR in terms of spatial resolution, but exhibits a higher voxel noise (%) due to its smaller crystals and smaller reconstructed voxel sizes and lack of TOF capability; the PET/MR TOF capability indeed contributed to off-set to some degree the higher spatial resolution of the HRRT in terms of overall image quality.

BPND of [11C]DTBZ subjects, as well as ratios of [18F]FDG scans, revealed a variability between both scans of less than 12%, which is in the range of typical test-retest variability. From a clinical point of view and based on our results, we expect the two scanners to provide similar results in regard to brain imaging. The main difference between the two scanners is the attenuation correction approach with the HRRT scanner using a transmission source scan, whereas the PET/MR is based on a ZTE MRI scan. Based on our results, this was not a confound in comparing the two scanners. A significant clear advantage of the PET/MR however remains the fact that the simultaneous imaging approach of PET and MR enables the acquisition of multiple parameters with the brain in the same physiological state, in addition to providing an anatomical reference required for several image analysis approaches.