Background

Deposition of amyloid-beta (Aβ) plaques in the brain is the earliest in vivo measurable hallmark in the development of Alzheimer’s disease (AD), which is the most common type of dementia [1,2,3]. Therefore, visualization of Aβ deposits in vivo is essential for improving early diagnosis and monitoring treatment effects [4]. To this end, various positron emission tomography (PET) amyloid tracers have been developed [5]. Among those, fluorine-18 (18F)-labeled tracers approved by the European Medicines Agency (EMA)/Food and Drug Administration (FDA) are of special interest for clinical trials due to their relatively long half-life t1/2 = 109.8 min compared to [11C]PiB (Carbon-11 Pittsburgh Compound B) and commercial availability [5, 6].

In addition to visualization, amyloid PET allows for quantification of underlying physiological processes, such as the level of Aβ plaque burden [7,8,9,10]. For diagnostic purposes, a static scan acquired at pseudo-equilibrium, using a tracer-specific approved method, has been deemed sufficient in combination with visual assessment of the images. In research settings, this simplified protocol is commonly used to calculate the standardized uptake value ratio (SUVR) [5]. SUVR, however, is only a semi-quantitative parameter that is known to be affected by both scanning time window and (changes in) blood flow [11, 12]. Given this dependency, full quantification using pharmacokinetic modeling may be required to obtain higher overall sensitivity for measuring longitudinal changes (e.g., for monitoring disease progression or treatment response), especially during the early stages of the disease when amyloid is still accumulating. Pharmacokinetic modeling, however, requires a dynamic scanning protocol, which can last for up to 2 h depending on the actual tracer. These long acquisition protocols result in lower patient comfort and lesser efficient use of both scanner and tracer batch, in addition to an increased risk of motion artifacts. Dynamic data acquisition in a dual-time-window protocol (also called “coffee-break” protocol), however, can be used to reduce overall scanning time, in which data are acquired separately for early and late phases. Such a protocol provides a resting period for the patient and, when long enough, may also allow for interleaved scanning protocols, thereby optimizing tracer batch and scanner usage (i.e., costs), while maintaining a high quantitative accuracy.

So far, some studies have used a dual-time-window protocol using static acquisition of amyloid-PET data. An early scan (i.e., 0–10 min p.i.) was proposed in addition to the (standard) late static scan, as it has been reported that the early scan may provide information on metabolism and neuronal injury, possibly circumventing the need for additional [18F]FDG imaging [12,13,14,15,16]. Recently, Bullich and colleagues demonstrated that the non-displaceable binding potential (BPND) obtained using a dual-time-window acquisition protocol (0–30 and 120–140 min p.i.) correlated well with BPND obtained using a full dynamic acquisition protocol of 140 min [17]. This [18F]florbetaben study, however, did not report details about different resting periods, nor did it assess the robustness of the dual-time-window protocol for subjects across the AD spectrum and for different noise levels (e.g., for regions of different sizes).

The purpose of the present simulation study was to define optimal dual-time-window acquisition protocols for [18F]florbetaben and [18F]flutemetamol, both in terms of patient comfort and throughput, while maintaining high quantitative accuracy. These simulations were focused on early stages of the disease, given the potential value of amyloid imaging to guide interventions aimed at secondary prevention of AD dementia.

Methods

Subjects and PET data

[18F]flutemetamol whole blood input curves, metabolite-corrected arterial plasma input curves, and time-activity curves (TACs) from 12 volumes of interest (VOIs) of three healthy controls and three probable AD subjects were obtained from Heurling et al. and Nelissen et al. [7, 18]. [18F]florbetaben metabolite-corrected and metabolite-uncorrected plasma input curves together with whole blood samples and TAC data from 13 VOIs of 10 healthy controls and 10 AD subjects were obtained from Becker et al. [8].

Kinetic models for BPND estimation

It has been shown that the reversible two-tissue compartment model (4 rate constants) with additional blood volume fraction parameter (2T4k_Vb) is the optimal plasma input model for describing both [18F]flutemetamol and [18F]florbetaben kinetics [8, 18]. In addition, several non-invasive reference tissue-based approaches have also been used: the simplified reference tissue model (SRTM) and its basis function approach (receptor parametric mapping, RPM), the multilinear reference tissue method (MRTM), and reference Logan [19,20,21,22]. In the present study, the 2T4k_Vb, SRTM and the full reference tissue model (FRTM [23]) were examined (Fig. 1 provides an overview of the kinetic models used during each step of the analysis). The main aim, however, was to verify the applicability of a reference tissue model approach given its applicability for large clinical trials.

Fig. 1
figure 1

Overview of the kinetic models used during each step of the analysis

Kinetic parameters for TAC simulations

Both reported whole blood and metabolite-corrected arterial plasma input curves were used for [18F]flutemetamol analysis [24]. For [18F]florbetaben, continuous whole blood curves were generated by scaling the continuous (non-metabolite-corrected) plasma curves using discrete whole blood samples. Subsequently, all cortical and cerebellar TACs of all subjects were analyzed using the 2T4k_Vb model, SRTM, and FRTM [19, 23, 25]. The Akaike information ciriterion was used to determine which reference tissue method best described the kinetics of the tracer [26]. Both the optimal reference tissue model and the 2T4k_Vb model were then used for estimating rate constants.

Finally, from the rate constants of the cortical regions (target tissue consisting of anterior and posterior cingulate, frontal, parietal, and lateral and medialtemporal cortex) and cerebellum gray matter (reference tissue), mean and standard deviations were calculated and used to establish the range of rate constants for composite cortical and reference tissue regions. The resulting rate constants for the 2T4k_Vb and reference tissue model can be found in Tables 1 and 2, respectively.

Table 1 Pharmacokinetic parameters for simulating TACs using 2T4k_Vb
Table 2 Pharmacokinetic parameters for simulating TACs using SRTM

TAC simulations

Plasma input-generated TACs

Noiseless target and reference tissue TACs of 110 min duration were simulated (see Table 1 for kinetic parameters used) using the 2T4k_Vb model to assess the bias in BPND estimates when fitting these TACs with SRTM. The 2T4k_Vb model was also used to generate a reference tissue TAC for the SRTM simulations described in the next section.

SRTM-generated TACs

Using SRTM, tissue target TACs of 110 min duration were simulated for the range of BPND values observed clinically (50 TACs per BPND, see Table 2 for kinetic parameters used), along the AD continuum. Various levels of typical PET noise were added to these target TACs only (coefficient of variation (COV) of 1, 2, and 5%, respectively) according to the variance model used by Yaqub et al., creating 50 TACs per noise level for each BPND [27]:

$$ {\sigma}_1^2=\alpha \bullet \mathrm{dcf}\bullet \mathrm{dcf}\ \frac{T}{L^2\ } $$
(1)

where \( {\sigma}_1^2 \) is the variance for each frame, calculated using the whole scanner true counts T, dcf is the decay correction factor, L is the frame length, and α is a proportionality constant signifying the variance level. In practice, most clinical TACs corresponded best with simulated TACs with 1 or 2% noise added, while TACs with 5% noise only corresponded with very small regions with a low BPND [7, 8].

Dual-time-window protocols

The “late frame” acquisition window of 90–110 min (used within Europe) was left intact, given that it constitutes the approved acquisition protocol for clinical use with visual analysis. Next, data points corresponding to the intervals in the dual-time-window protocols were removed from both the target and reference tissue TACs (creating “interval TACs”), ranging from 0 min (no interval) to 80 min (i.e., interval 10–90 min: a 0–10-min p.i. acquisition followed by a 90–110-min p.i. acquisition) in steps of 10 min. This resulted in a total of nine different protocols.

Estimating parameters of interest

Missing data points in the reference tissue TACs, resulting from the introduction of the interval, were interpolated using the 2T4k_Vb model, which was used to fit the interval TACs together with a typical, tracer-specific input function, since a well-defined complete reference tissue input curve is required for SRTM to estimate the kinetic parameters of interest (R1, BPND, k2) [19]. A typical input function could be used for this purpose, based on the observed negligible between-subject variation in the tail of the curve. In future applications of this protocol, either an equivalent approach or an existing population-derived input function could be used, provided that a similar injection protocol is used. In addition, boundary values were set for all kinetic parameters (Additional file 1: Tables S1a and b) and for k3 of the 2T4k_Vb interpolation of the interval (lower boundary k3 = 0.005).

All TACs were fitted with SRTM, and DVR values were calculated as DVR = BPND + 1. This additional parameter was introduced as it is frequently used to express amyloid burden in other studies [28], and it better allows for expressing any bias in percentages due to its larger values.

Evaluation of outcome parameters

Results of 2T4k_Vb and SRTM-generated TACs fitted with SRTM were checked for values that were physiologically not expected, here called outliers (≥ 1.5∙max simulated BPND). These outliers were registered and removed from the overall dataset before further analysis. Subsequently, bias as induced by the interval was assessed for all simulated dual-time-window protocols, by calculating the bias between simulated BPND \( \Big({\mathrm{BP}}_{\mathrm{ND}}^{\mathrm{sim}} \)) and corresponding mean fitted BPND (\( {\mathrm{BP}}_{\mathrm{ND}}^{\mathrm{fit}}\Big) \):

$$ \mathrm{Bias}\ {\mathrm{BP}}_{\mathrm{ND}}={\mathrm{BP}}_{\mathrm{ND}}^{\mathrm{fit}}-{\mathrm{BP}}_{\mathrm{ND}}^{\mathrm{sim}} $$
(2)

Next, for each set of 50 simulated TACs at a given \( {\mathrm{BP}}_{\mathrm{ND}}^{\mathrm{sim}} \) and noise level, an acceptability curve was computed by plotting the fraction of data versus a span of bias values ranging from 0 to 1 called the “absolute bias threshold.” These curves were generated for all \( {\mathrm{BP}}_{\mathrm{ND}}^{\mathrm{sim}} \) values across all protocols, and the area under the curve (AUC) was calculated for all dual-time-window protocols, as a global measure of reliability. Finally, the percentage bias in the interval TAC-derived DVR was assessed by comparing it to DVR derived from the full-kinetic curve:

$$ \mathrm{Bias}\ {\mathrm{DVR}}_{\mathrm{ND}}\left(\%\right)=\frac{\ {\mathrm{DVR}}_{\mathrm{dual}-\mathrm{time}\ \mathrm{window}\ \mathrm{protocol}}-{\mathrm{DVR}}_{\mathrm{full}\ \mathrm{dataset}\kern0.5em }}{{\mathrm{DVR}}_{\mathrm{full}\ \mathrm{dataset}}}\bullet 100\% $$
(3)

Finally, also the percentage bias in R1 was assessed by comparing R1 derived from the various dual-time-window protocols with R1 obtained from the full-kinetic curve (in a similar way as for DVR, see Eq. 3).

Results

Kinetic parameters for TAC simulation

The Akaike information criterion showed that for [18F]flutemetamol (62.7%) and [18F]florbetaben (79.2%) SRTM was the preferred reference tissue method. Pharmacokinetic parameters derived from existing clinical data and used for simulating TACs based on the 2T4k_Vb model and SRTM are presented in Tables 1 and 2. The range of BPND values is equally spaced and corresponds to the BPND range present in the data, with BPND I being the lowest and BPND V the highest value. Of note, as described previously, BPND estimates are always different between 2T4k_Vb and SRTM. In the present study, additional differences are present, since the first corresponds to the sum of specific binding and a slow component of non-specific binding (in the target tissue), while in the latter a correction for all non-specific binding is made, provided that it is the same in target and reference tissues [18, 28, 29].

TAC simulation

Full reference and target tissue TACs, the latter covering the range of \( {\mathrm{BP}}_{\mathrm{ND}}^{\mathrm{sim}} \) values, are shown in Fig. 2. Figure 3 shows the pattern in which the different noise levels were simulated for a target tissue TAC (global cortical region), resembling the shape of clinical TACs published previously [7, 8].

Fig. 2
figure 2

SRTM simulated target tissue TACs for the range of \( {\mathrm{BP}}_{\mathrm{ND}}^{\mathrm{sim}} \) values and 2T4k_Vb-generated reference tissue TACs for a [18F]flutemetamol and b [18F]florbetaben

Fig. 3
figure 3

SRTM simulated TACs from a global cortical region (mean TAC value ± SD, shown solid and dashed lines, respectively) for all noise levels (COV 0–5%) for a [18F]flutemetamol and b [18F]florbetaben. All TACs were simulated using a \( {\mathrm{BP}}_{\mathrm{ND}}^{\mathrm{sim}} \) III (Table 2)

Estimating parameters of interest

Plasma input-generated TACs

As can be seen in Fig. 4, SRTM-derived DVR resulted in a systematic bias when fitting the noiseless, full-kinetic curve (90–90) 2T4k_Vb-generated TACs for both tracers. For [18F]flutemetamol, this bias ranged between the 0.17 and 1.95% and for [18F]florbetaben between the 2.62 and 6.04%. Compared to the full-kinetic curves, 2T4k_Vb interval TACs showed a greater bias in SRTM-derived DVR only for the 10–90 and 20–90 interval TACs (maximum bias of 3.10 and 2.25%, respectively for [18F]flutemetamol and maximum bias of 8.73 and 10.10%, respectively for [18F]florbetaben) and comparable or smaller bias for the other interval TACs.

Fig. 4
figure 4

Percentage bias in SRTM-derived DVR of 2T4k_Vb-generated TACs. a Bias in DVR for [18F]flutemetamol. b Bias in DVR for [18F]florbetaben

Evaluation of outcome parameter

Outliers

No outliers were present when no noise was added to the TACs, and the largest number of outliers occurred at the highest noise level for both tracers (Tables 3 and 4), full colour version see Additional file 1: Tables S2 and S3. More specifically, a positive correlation was observed between percentage of outliers and interval (R2 = 0.47, p = 0.04) and between percentage outliers and noise levels (R2 = 0.91, p < 0.05) for [18F]flutemetamol. For [18F]florbetaben, outliers were mostly present at a noise level of 5% and the number increased for higher \( {\mathrm{BP}}_{\mathrm{ND}}^{\mathrm{sim}} \) values (R2 = 0.85, p = 0.03). The percentage of [18F]flutemetamol outliers exceeded the 20% for the two largest intervals, while for [18F]florbetaben it exceeded the 20% only for the 10–90 interval.

Table 3 Bias in DVR and outliers as a result of noise and the dual-time-window protocol for SRTM-generated [18F]flutemetamol TACs
Table 4 Bias in DVR and outliers as a result of noise and the dual-time-window protocol for SRTM-generated [18F]florbetaben TACs
Table 5 Absolute bias in BPND and AUC as a result of noise and the dual-time-window protocol for SRTM-generated [18F]flutemetamol TACs

SRTM-generated [18F]flutemetamol interval TACs

The interpolated TACs can be found in Additional file 1: Figure S1. Table 5 (full color version, see Additional file 1: Table S4) shows, as expected, increasing absolute bias in \( {\mathrm{BP}}_{\mathrm{ND}}^{\mathrm{fit}} \),with increasing noise levels (R2 = 0.98, p = 0.01) and longer intervals (R2 = 0.51, p = 0.03) (maximum bias of 0.128 for the 10–90 \( {\mathrm{BP}}_{\mathrm{ND}}^{\mathrm{sim}} \) II interval, at 5% noise). In addition, there was a trend towards a negative correlation between absolute bias and \( {\mathrm{BP}}_{\mathrm{ND}}^{\mathrm{sim}} \) (R2 = 0.74, p = 0.06). The AUC values calculated from the acceptability curves show a trend of smaller (poorer) AUC values at higher noise levels (R2 = 1.00, p < 0.001) and larger intervals (R2 = 0.64, p = 0.01). As expected, the full-kinetic curve provided the highest AUC except for the lowest \( {\mathrm{BP}}_{\mathrm{ND}}^{\mathrm{sim}} \). Furthermore, the 10–90 and 20–90-min intervals result in a bias in DVR of maximal 6.4%, while all other intervals showed a bias in DVR of maximum 1.6% for noise levels of up to 2%. For higher noise levels corresponding to very small regions (5%), bias in DVR was a maximum of 8.2 and 3.1%, for the 10–90 and 20–90 intervals (Table 3). Finally, Fig. 5 shows the percentage bias in R1, which was only larger than 1% for the largest interval. The bias in R1 estimates also increased with increasing noise level (R2 = 0.99, p = 0.004, COV2 ranging from 0.04 to − 0.497 and COV5 0.034 to − 3.462).

Fig. 5
figure 5

[18F]flutemetamol: percentage bias in SRTM-derived R1 across the range of \( {\mathrm{BP}}_{\mathrm{ND}}^{\mathrm{sim}} \) values for two noise levels

SRTM-generated [18F]florbetaben interval TACs

The interpolated TACs can be found in Additional file 1: Figure S2. Table 6 (full color version, see Additional file 1: Table S5) shows an increasing absolute bias for longer intervals (R2 = 0.48, p = 0.04) as well as for higher noise levels (R2 = 0.96, p = 0.02, maximum bias 1.31 for the 10–90 \( {\mathrm{BP}}_{\mathrm{ND}}^{\mathrm{sim}} \) II interval, at 5%). This positive correlation was also supported by the AUC values, where lower (poorer) values were correlated with larger intervals (R2 = 0.74, p = 0.002), higher \( {\mathrm{BP}}_{\mathrm{ND}}^{\mathrm{sim}} \) values (R2 = 0.98, p = 0.001), and higher levels of noise (R2 = 1.0, p < 0.001). As expected, the full-kinetic curve provided the highest AUC except for the lowest \( {\mathrm{BP}}_{\mathrm{ND}}^{\mathrm{sim}} \). Furthermore, the 10–90 interval showed a bias in DVR of 3.4%, all other intervals showed a bias of ≤ 1.9% for noise levels up to 2%. For higher noise levels corresponding to very small regions (5%), bias in DVR was a maximum of 9.1% for the 10–90 interval, 5.0% for the 20–90 interval, and 1.8% for all others (Table 4). The R1 bias plots (Fig. 6) show an increase in bias at higher noise levels (R2 = 0.99, p = 0.005, COV2 ranging from 0.001 to − 0.359 and COV5 0.003 to − 1.497).

Table 6 Absolute bias in BPND and AUC as a result of noise and the dual-time-window protocol for SRTM-generated [18F]florbetaben TACs
Fig. 6
figure 6

[18F]florbetaben: percentage bias in SRTM-derived R1 across the range of \( {\mathrm{BP}}_{\mathrm{ND}}^{\mathrm{sim}} \) values for two noise levels

Discussion

The present pharmacokinetic simulation study demonstrated that, for [18F]flutemetamol and [18F]florbetaben, the introduction of a break with a maximum of 60 min in a dual-time-window acquisition protocol (early interval of 0–30 min followed by a late interval of 90–110 min) results in a minimal loss in quantitative accuracy while presenting major logistic advantages as compared with full dynamic acquisitions. Therefore, this protocol could serve as suitable alternative in research or clinical trial settings where accurate and fully quantitative measurements might be required.

Analysis of the 2T4k_Vb noiseless full TACs showed a systematic bias (0.17–1.95% for [18F]flutemetamol and 2.62–6.04% for [18F]florbetaben) in SRTM-derived DVR values compared with simulated DVR values. These findings are in line with previous studies reporting that kinetics of [18F]flutemetamol and [18F]florbetaben are better described by a two-tissue compartment model in target as well as reference tissues [8, 18]. In addition, Nelissen et al. showed that there were similar levels of binding in the second reference tissue compartment for both healthy control and AD subjects and therefore concluded that this binding is likely due to (relatively slow) non-specific retention [18]. As this violates one of the assumptions of SRTM, a slight bias in DVR estimates can be expected [19, 30]. Given the aim of validating a dual-time-window protocol for a reference tissue-based approach, TACs were both generated and fitted according to the SRTM model to prevent a systematic bias in the results.

A first examination of SRTM-derived \( {\mathrm{BP}}_{\mathrm{ND}}^{\mathrm{fit}} \) values revealed that most outliers were observed for fits of the 10–90-min interval, and, to a lesser extent, also for fits of the 20–90-min interval. In addition, compared with the full-kinetic curve, the bias in DVR only exceeded previously reported [11C]PIB TRT values, of which [18F]flutemetamol is an analog, for the 10–90 and 20–90-min intervals for [18F]flutemetamol [9, 10]. For the other dual-time-window protocols, the bias remained ≤ 3.1%. Analysis of [18F]florbetaben data showed a bias of ≤ 9.1% for the 10–90-min interval and ≤ 5.0% for the 20–90 interval for the highest noise levels, while for all intervals it was ≤ 1.9%. The latter well within previously reported TRT values for [18F]florbetaben SUVR data (ranging between 2.9% HC and 6.2% AD) [31]. Reported AUC values also showed a general trend of worse values for longer breaks and higher noise level, with the exception of some extremely low BPND cases, where the performance of SRTM is known to be suboptimal [10].

Finally, the bias plots of SRTM-generated TACs demonstrated that bias in SRTM-derived R1 increased as a function of noise and interval for both tracers. More specifically, a larger error in R1 (> 3% for [18F]flutemetamol and > 1% for [18F]florbetaben) was observed for the 10–90-min intervals compared with the other intervals. For practical applications, this error would be negligible since the TRT of flow is known to be approximately 9% [32]. As expected, the results showed that the length of the interval is related to bias in \( {\mathrm{BP}}_{\mathrm{ND}}^{\mathrm{fit}} \) or DVR and the number of outliers. More specifically, results suggest that it is not advisable to use the 10–90 and 20–90-min intervals for full quantification, especially due to the relatively large percentage of outliers and larger bias in DVR compared with other intervals. Moreover, the observed larger amount of unusable data would result in smaller power to detect changes in clinical trials.

Shorter scan durations are better for the patient and, as such, longer breaks would be preferred. Since the 10–90 and 20–90-min intervals result in a large number of outliers and larger bias, the 30–90-min interval would be a good compromise. This interval would have the additional advantage of a 60-min break, which may allow for interleaved scanning protocols. Consequently, the 30–90-min interval is recommended as the optimal trade-off between patient comfort and quantitative accuracy (bias in DVR < 2% and a maximum of 18% outliers for highest noise level and \( {\mathrm{BP}}_{\mathrm{ND}}^{\mathrm{fit}} \)). This conclusion is in agreement with recent work of Bullich et al. regarding the optimal [18F]florbetaben dual-time-window protocol [17]. Based on their analysis of clinical data, which did not include the 90–110-min diagnostic window, a dual-time-window protocol of 0–30 and 120–140 min was described as optimal. However, their simulations also supported that 0–30 and 90–110-min scanning times would maintain the best compromise between quantitative accuracy and patient comfort. The present simulation study, including TACs representing the AD spectrum and different noise levels, further validated their findings.

A major advantage of a 60-min gap in the scanning protocol is that it allows for interleaved scanning protocols, in which the first scan of the second patient can be acquired within the resting period (interval) of the first patient. An interleaved scanning protocol would increase both patient throughput and efficient use of tracer batches, thereby decreasing costs. An assessment of the practical feasibility of such an interleaved scanning protocol is beyond the scope of the present study and needs to be addressed in future studies. Main limitations of the current study include the use of fixed K1 and k2 parameters for simulations, the limited sample size of the available clinical dataset, and the extrapolation of TRT variability from other radiotracers to this work. The first limits the possibility of assessing the impact of changes in cerebral blood flow on dual-time-window protocol-based quantification, but it can be expected to introduce only small additional bias over and above the one introduced by the protocol itself [11]. Regarding the second, additional clinical data would have allowed the verification of the simulation results, which remains a goal for future work once larger cohorts are available. With respect to extrapolating TRT variability, although values from other tracers might not directly translate to our data, they are expected to be in comparable ranges [24]. In addition, although outside of the scope of this study, the evaluation of parametric methods for quantification of dual-time-window-derived data is warranted, which would require imaging data in order to optimize image contrast and reduce noise and artifacts. Finally, it must be noted that the goal of this study was not to compare these two tracers, but to identify the optimal dynamic dual-time-window scanning protocol for both of them. In order to make a head-to-head comparison between tracers, PET imaging data from both tracers within the same patient would be required.

Conclusion

Accurate estimates of \( {\mathrm{BP}}_{\mathrm{ND}}^{\mathrm{fit}} \) can be obtained for both [18F]flutemetamol and [18F]florbetaben using a 60-min dual-time-window protocol, with dynamic scanning from 0 to 30 and again from 90 to 110 min. This protocol results in a limited number of outliers, and an acceptable bias in \( {\mathrm{BP}}_{\mathrm{ND}}^{\mathrm{fit}} \) and DVR estimates. Moreover, it enables interleaved scanning protocols, optimizing tracer batch usage and patient throughput, thereby reducing costs and improving patient comfort.