Background

This study aims to propose an algorithm to improve the accuracy in the quantification of receptor density for parametric imaging in positron emission tomography (PET). By applying parametric imaging methods such as Logan graphical analysis (LGA) and multilinear reference tissue models (MRTM) to PET data, we can acquire the receptor density in terms of the non-displaceable binding potential (BPND) index [1]. LGA is a time-efficient and a computationally efficient algorithm applicable to reversibly binding receptors [2, 3]. It transforms data of PET tissue time-activity curves (tTACs) into a simple linear relationship, and BPND can then be estimated from the slope of this relationship using the ordinary least-squares (OLS) method.

BPND compares the concentration of the administered radiotracer in receptor-rich to receptor-free regions[1]. It can therefore be defined as the equilibrium ratio of specifically bound radiotracer (in receptor-rich region) to nondisplaceable radiotracer (in receptor-free region) [1]. Moreover, BPND is proportional to the density of binding sites [1, 2], which denotes the concentration of target receptors in the region of interest.

Despite the advantages of the LGA, the estimated BPND values have been found to be underestimated in cases of noisy PET data. This bias has been observed to increase with an increase in the magnitude of both BPND and noise [46]. Specifically, the bias is due to noise in the X-variable (predictor variable). The noise in the predictor variable is attributed to the noisy term, C(t) (see Eq. (1)), which is the denominator in both the LGA variables [6, 7]. Accordingly, the noise due to C(t) propagates correlatively in the LGA variables, presenting a correlated errors-in-variables (EIV) problem. Thus, both the predictor and response variables of the LGA are contaminated with correlated noise, yet the OLS method only accounts for the errors in the response variable. This explicitly makes OLS unsuitable for estimating the BPND.

A variety of methods have been proposed to reduce the bias in BPND estimates. These methods range from tTAC smoothing methods, based on the kinetics of an administered radiotracer, such as the generalized linear least-squares (GLLS) [8] technique, to methods based on a rearrangement of the original LGA equation, such as the multilinear reference tissue models (MRTM 1 & 2) [7, 9]. Other methods including orthogonal distance regression (ODR) [10] simply employ a different line estimation technique. These methods, however, have been found to either only slightly reduce the bias or reduce the bias at the cost of worsened estimation deviation [4, 5]. This necessitates the need for an improved method.

It has been shown that the regression method used to estimate the Logan slope influences the resulting bias in BPND estimates [6, 10]. On this basis, the present study employs an alternative linear regression method referred to as least-squares cubic (LSC) [1113], which fully accounts for errors in both variables, to estimate BPND. LSC minimizes the squared residuals in both the predictor and response variables and incorporates the correlation of errors in these variables [12, 13]. Thus, LSC is expected to be appropriate for correlated errors in LGA variables. Furthermore, LSC has been shown to improve regression parameter estimates in the fields of biogeosciences [14] and geophysics [13].

To the best of our knowledge, LSC regression (as in [12, 13]) has not been applied to the LGA. In this study, LSC was applied to the LGA in an effort to reduce bias in the estimates of BPND. With the OLS-based LGA as the conventional method, the viability of LSC in reducing bias in BPND estimates was assessed against MRTM2, which is currently a widely accepted method.

The three methods, LSC, MRTM2 and the conventional LGA, were compared using a well known radiotracer, 11C-carfentanil (CFN), for simulation studies. To account for differing binding characteristics, for human data studies, the three methods were verified using a different and relatively new radiotracer, 11C-Pittsburgh compound B (11C-PIB), which is gaining popularity owing to its usage in the Alzheimer’s Disease Neuroimaging Initiative (ADNI) studies. Both CFN and 11C-PIB have specific binding sites in the brain. CFN binds to μ-opioid receptors, and 11C-PIB binds to beta amyloid (A β) plaques.

Methods

Logan graphical analysis

LGA is a well-established algorithm for estimating the density of specific binding sites of receptors in terms of the BPND [1]. The operational equation of the LGA is as given as [2],

$$\begin{array}{*{20}l} &\frac{\int_{0}^{t}C(u)\,du}{C(t)} = (1 + BP_{ND}) \\ &\:\:\:\:\:\:\:\:\:\:\: \cdot\left(\frac{\int_{0}^{t}C^{R}(u)\,du + \frac{1}{k^{R}_{2}}C^{R}(t)}{C(t)}\right) - \text{int}. \end{array} $$
(1)

C(t) and CR(t) represent the total radioactivity concentration in a brain tissue and a reference region, respectively. A reference region is such that it has ignorable specific binding sites and a density of nonspecific binding sites equal to that of the other brain tissues. It is defined by the biochemical aspects of the administered radiotracer, and in terms of amyloid imaging, cerebellum gray matter is used as the reference region [15].

Equation (1) is such that after some time t, the y-intercept term, int, becomes constant with time, and thus, Eq. (1) becomes a linear relationship between \({\frac {\int _{0}^{t}C(u)du}{C(t)}}\) and \({\frac {\int _{0}^{t}C^{R}(u)\,du + \frac {1}{k^{R}_{2}} C^{R}(t)}{C(t)}}\), with a slope of, 1+BPND.

We can then apply a linear regression to Eq. (1), for T>t, to compute BPND from its slope. A linear regression is fast and stable, and therefore, it is suitable for application in a voxel-by-voxel fashion to construct a BPND parametric image.

Figure 1 shows a compartmental model used to describe the kinetics of a radiotracer in brain tissues [16]. The compartments CP, CND and CS represent the concentration of the radiotracer in arterial plasma, non-displaceable (bound-free + non-specifically bound) radiotracer, and specifically bound radiotracer, respectively. The transportation of the radiotracer between neighboring compartments is described by the rate constants, K1 [mlg−1min−1] and k2–4 [min−1]. K1 denotes the delivery rate of the radiotracer from the arterial plasma to tissues. k2 is the clearance rate from the tissues back to the plasma, and k3 and k4 are the respective association and dissociation rates of the radiotracer to and from the specific binding sites. More details on the compartmental model can be found in [16].

Fig. 1
figure 1

Two-tissue compartmental model. Transportation of the administered radiotracer between the compartments, CP,CND and CS is described by the rate constants, K1, k2–4

MRTM1 reduces the noise by rearranging the original LGA equation such that the noisy term, C(t), only appears on the side of the response variable, thus allowing OLS to account for all errors in the LGA variables due to C(t) [9]. MRTM1 results in multilinear regression, with three regression parameters that have to be estimated. To further stabilize MRTM1, MRTM2 was developed by reducing the number of regression parameters from three to two [9]. BPND can then be estimated from the regression parameters obtained by MRTM2.

Least-squares cubic

LSC is an EIV-based regression method. LSC fully considers the errors in both variables, by minimizing the sum of weighted squared errors in both the involved variables, and by including the correlation of these errors [1113]. Under specific assumptions, LSC can be shown to reduce to either OLS or other common regression methods, rendering LSC to be a general solution to the linear regression problem [11]. Here, we introduce LSC regression and demonstrate how it can be used to estimate BPND from LGA.

To fit a straight line model, y=αx+β, LSC [12] estimates the regression parameters by minimizing the equation,

$$\begin{array}{*{20}l} \sum \limits_{i=1}^{N}d^{2}_{i} &= \sum \limits_{i=1}^{N} \left\{ w(X_{i})(x'_{i} -X_{i})^{2} \right. \\ & - 2r\sqrt{w(X_{i})w(Y_{i})}(x'_{i} - X_{i})(y'_{i} -Y_{i}) \\ & \left. + w(Y_{i})(y'_{i} - Y_{i})^{2} \right\}\frac{1}{1-r^{2}}, \end{array} $$
(2)

where α and β are the slope and y-intercept of the fitted straight line, respectively. (w(Xi),w(Yi)) is a pair of weights for the corresponding pair of observed data, {(Xi,Yi) | i=1,2,...,N}, and r is the correlation of the errors in the observed data. (xi′,yi′) is a pair of the estimates of the true points (xi,yi).

It can then be seen from Eq. (2) that LSC accounts for the weights (w(Xi), w(Yi)) and errors (xi′−Xi, yi′−Yi) in both variables, as well as the correlation of these errors (r). Accordingly, a regression method that does not consider either of these aspects makes assumptions about them in certain ways. For example, an unweighted method assumes equal weights of ones; minimizing only the residuals of the response variable means assuming that the predictor variable is error-free and not considering the correlation of errors means assuming that the errors in the measured data are uncorrelated.

Solving Eq. (2) for the slope (α) and y-intercept (β) gives,

$$ \alpha' = \frac{\sum \limits_{i=1}^{N}W_{i} B_{i} V_{i}}{\sum \limits_{i=1}^{N}W_{i} B_{i} U_{i}} \:\:\:\:\:\:\:\:\: \text{and}\:\:\:\:\:\:\:\:\: \beta' = \overline{Y} - \alpha'\overline{X}, $$
(3)

where,

$$ U_{i} = X_{i} - \overline{X} \:\:\:\:\:\:\:\:\: \text{and}\:\:\:\:\:\:\:\:\: V_{i} = Y_{i} - \overline{Y} $$
$$\begin{array}{*{20}l} W_{i} &= \frac{w(X_{i})w(Y_{i})}{\alpha^{'2}w(Y_{i}) + w(X_{i}) - 2\alpha'r\sqrt{w(X_{i})w(Y_{i})}}, \\ B_{i} &= W_{i} \left(\frac{U_{i}}{w(Y_{i})} + \frac{\alpha' V_{i}}{w(X_{i})} \right. \\ & \left. - (\alpha' U_{i} + V_{i})\frac{r}{\sqrt{w(X_{i})w(Y_{i})}}\right). \end{array} $$
(4)

\({\overline {X} \;\; \text {and} \;\; \overline {Y}}\) are given by,

$$ \overline{X} = \sum \limits_{i=1}^{N}W_{i} X_{i}\text{\(/\)}\sum \limits_{i=1}^{N}W_{i}; \;\;\; \overline{Y} = \sum \limits_{i=1}^{N}W_{i} Y_{i}\text{\(/\)}\sum \limits_{i=1}^{N}W_{i}. $$
(5)

A step-by-step derivation of Eqs. (3) and (4) can be found in previous papers [1113]. A more thorough derivation, including one of Eq. (2), is demonstrated in [17].

The expression for α in Eq. (3) includes α, which is contained in Wi and Bi as shown in Eq. (4). This means that LSC requires an initial guess of α; that is, it is an iterative procedure. The slope obtained by OLS can be used as the initial guess of α. Given the PET data, C(t) and CR(t), LSC can be used to estimate BPND by the slope in Eq. (3).

Weights, { w(Xi), w(Yi)}, and correlation of errors in variables (r)

For LGA, the weights, w(Xi) and w(Yi), and the correlation of errors, r, are unknown; therefore, they have to be estimated. To this end, we adopted the method of "relative error regression" [18, 19]. In relative error regression, instead of minimizing the squared-sum of the absolute errors, \({\sum (y^{\prime }_{i} - Y_{i})^{2}}\), the squared-sum of the relative errors, \({\sum [(y^{\prime }_{i} - Y_{i})/Y_{i}]^{2}}\), is minimized. In statistical prediction, relative errors are considered to be more informative than absolute errors [19]. Expressing the relative errors in this manner can help adjust measurements in a weighted linear regression with the weights given by, \({1/Y_{i}^{2}}\) [18, 19]. After adopting this technique for LSC, the X and Y weights can be respectively given as,

$$ w(X_{i}) = 1/X_{i}^{2} \:\:\:\:\:\: \text{and}\:\:\:\:\:\: w(Y_{i}) = 1/Y_{i}^{2}. $$
(6)

A detailed analysis demonstrating the viability of these weight functions for LSC-based LGA is given in Additional file 1.

The errors in the variables were unknown; thus, their correlations were estimated by the correlations between the variables themselves. These correlations were estimated by the “correlations of determination." The correlation of determination is a number between 0 and 1, and it determines the proportion of variation in the response variable that can be explained by the predictor variable [20, 21]. To iterate, the slope obtained in the first run was substituted in Wi and Bi in Eq. (4). The iteration process was repeated until the relative differences between consecutive estimates of the slope, (αi′−αi+1′)/αi+1′, were less than 10−10. A thorough step-by-step guide for using LSC can be found in [13].

Simulation studies

In order to assess the performance of LSC against that of MRTM2 and conventional LGA when estimating BPND, a set of PET data of the two-tissue compartmental model was simulated using a clinically measured plasma time-activity curve in a PET study. The kinetic parameters, K1,k2–4 and the non-displaceable distribution volume (VND) used for simulations were adopted from the range of values used in [22], mimicking CFN;

  • 11 values of BPND were set in the range of [0.0, 3.0], covering a range of [0.0, 0.35] for k3.

  • [K1k4]=[0.1835 mL cm−1 min−1 0.115/min].

  • [k2k3]=[K1/VNDBPND·k4].

  • VND =1.59 mL cm−1.

The reference region was formed with a zero noise level, zero BPND, and delivery and clearance rates equal to those of the target tissues, due to its physiological assumptions. Using these kinetic parameter values, 11 noise-free tTACs corresponding to the 11 BPND values were formed. Statistical noise was added to these noise-free tTACs, and 1024 noisy tTACs were simulated as a slice of 32-by-32 pixels for each noise-free tTAC. The BPND values were then re-estimated from the noisy tTACs by LSC, MRTM2, and conventional LGA, and the results were compared. The true value for the dissociation rate of the reference region (\({k^{R}_{2}}\)), which was used to produce the simulation data, was used for all methods.

The noise was assumed to have a zero-mean Gaussian distribution with a variance proportional to the true tTAC; this is the same as the one used in previous studies [4, 8]. The noise scaling factor was set such that the actual magnitude of the noise level in the simulated tTACs varied within a range of 0 to 30%; this range covered the actual noise observed in voxel-based PET data. The magnitudes of the noise level were calculated by the percentage of the ratios of the “standard deviation of the noisy tTACs" to the “mean of the noise-free tTAC" for the portion of the time range used for the BPND estimations. Figure 2 shows the simulated noise-free (thick lines) and their corresponding noisy (thin lines) tTACs for the 11 BPND values. The 12th blue curve at the bottom is the curve of the reference region.

Fig. 2
figure 2

Simulated tTACs. The 12 thick curves denote the noise-free tTACs. Eleven of these correspond to the 11 BPND values. The remaining one (lowermost) represents the reference region. The thin curves denote the noisy tTACs. Only one noisy tTAC is shown out of the 1024 simulated noisy tTACs for each of the noise-free tTACs

Human data studies

Two patients, of which one is A β-negative and the other is A β-positive, participated in this study. The two patients underwent a 11C-PIB PET scan for 70 minutes. The scan was carried out according to the standard ADNI 11C-PIB PET procedure. The study protocol was approved by the Ethics Committee of Kindai University Hospital, and written informed consent was obtained from the participants.

The data were collected in matrices of 128 × 128 × 47 slices, and voxel sizes of 2.1 × 2.1 × 3.4 mm3. BPND parametric images – reflecting A β deposits in the brain regions – were then generated using the three algorithms, LSC, MRTM2, and conventional LGA, implemented in MATLAB (The MathWorks, Inc., Natick, MA, United States).

Time t(30 min.), which was used for both synthetic and actual data, was graphically determined from simulated data, as demonstrated in Additional file 2.

For actual data, the values of \({k^{R}_{2}}\) that were used for all methods were estimated by MRTM1. MRTM1 allows estimation of \({k^{R}_{2}}\) with little bias [9]. Specifically for each slice, the average value, \({\bar {k}^{R}_{2}}\), calculated from the \({ \mid k^{R}_{2} \mid }\) of all voxels in the reference region, was used.

Results

Simulation studies

The results in Fig. 3 compares the percentage bias in the BPND values estimated by the three methods from the noisy tTACs. The standard deviations (in percentages) are shown as error bars.

Fig. 3
figure 3

Comparisons of the percentage bias and variability in the estimated BPND values. The percentage bias and variability in the BPND estimated from the noisy tTACs by LSC, MRTM2, and the conventional LGA, are compared for the 11 values of true BPND

In these results, LSC estimates are the least biased along the entire range of BPND, with up to a maximum of 12% and 28% bias difference in comparison with MRTM2 and conventional LGA, respectively. The error bars show that while the conventional LGA estimates have the smallest variability, LSC estimates have smaller variability than MRTM2 estimates.

Human data studies

Figure 4 shows the BPND parametric images obtained by LSC, MRTM2, and conventional LGA. These images are generated from 11C-PIB PET brain images, and therefore they reflect A β deposits in the brain regions. This is because A β is the target receptor for 11C-PIB radiotracer. The left panel shows images of three slices of the A β-negative patient, obtained by each of the three methods, whereas the right panel shows the images of the A β-positive patient.

Fig. 4
figure 4

Axial slice images of brain regions obtained by the three methods, LSC, MRTM2 and conventional LGA. These are BPND images of A β quantification of 11C-PIB brain PET data of the two patients. The left panel shows the images of the A β-negative patient, and the right panel shows the images of the A β-positive patient

The images obtained by LSC have significantly higher total BPND estimates than those of conventional LGA for both patients. The images obtained by MRTM2 have the highest total BPND estimates. For MRTM2 images, however, considerably higher levels of noise are observed. These results are consistent with those of CFN observed in the simulated data.

Table 1 shows the average computational time taken by each method for the corresponding sets of three images in Fig. 4. LSC took moderate computational time, one order of magnitude higher than the conventional LGA, for both patients’ images. This was expected of LSC because it is an iterative procedure. MRTM2 took the longest computational time amongst the three methods, two orders of magnitude higher than the conventional LGA, for both patients’ images.

Table 1 Means (± standard deviations) of computational times taken by each method for the respective slice images in Fig. 4

Discussion

In this study, LSC estimates of the BPND emerged to be less biased than those of MRTM2 and conventional LGA. Moreover, LSC estimates have smaller variations in comparison with those for MRTM2, providing a better trade-off between bias and variability.

The underestimation which is observed in the simulated data for the conventional LGA estimates is also reflected in the real data, and accordingly, the reduction of the underestimation seen in the simulated data for the LSC method is also reflected in the real data. Furthermore, the underestimations observed for the conventional LGA method in both simulation and human data studies are in consistency with previous studies [15, 22], and this strengthens the reliability of the results reported here.

Both LSC and OLS are based on the assumption that the errors in the measurements are normally distributed. However, this assumption may not always be satisfied. A visual inspection of the distributions of the LSC error values and the OLS residuals showed that the LSC error values have a better normal distribution compared with that of the OLS residuals (Additional file 3). This can be attributed to the fact that LSC takes into account the varying magnitude of errors at each data point as demonstrated in [12].

LSC considers the errors in both variables without altering the original LGA equation, thus maintaining its simplicity and inducing less variability. In addition, LSC does not only consider the errors associated with term C(t), but also the errors associated with each LGA variable as a whole. The effort to include more details in LSC has increased the number of parameters in Eq. (2), compared with the fundamental OLS regression, which only comprises the residuals of the response variable. This could be a factor in the slight increase in the variability of LSC estimates, compared with the conventional LGA estimates.

MRTM2 rearranges the conventional LGA equation into a multilinear regression form so that the noted noisy term, C(t), only appears in the response variable [7, 9]. This only considers the errors associated with C(t). However, the rest of the terms in the LGA equation are not entirely noise-free; they are less noisy compared with C(t). Not considering the noise in the rest of the terms could lead to the bias in the MRTM2 estimates.

Based on the computational times of the three methods for obtaining slice images, the conventional LGA has the highest computational efficiency, followed by LSC, and MRTM2. The computational time for LSC would depend on the number of iterations, which in turn depends on the iteration condition, with the computational time increasing with the number of iterations. All LSC images in Fig. 4 required at most three iterations. The iteration limiting condition in this study, (αi′−αi+1′)/αi+1′<10−10, was arbitrarily chosen. However, it was found that by choosing a stricter condition, say 10−11, the number of iterations did not increase.

Although the weighting formulas in Eq. (6) are inverse squares of the measurements, they will not cause the usual problem of overproportional weighting of smaller measurements. This is because they are not the direct weighting coefficients. The final weighting coefficients are appearing in Eq. (3) as WiBi. Because they appear in both the numerator and denominator, the overprortional weighting effect can be canceled out. Based on an analysis of LSC regression carried out in [12], it was shown that a set of weight functions satisfying the relation, \({\sqrt {\frac {w(X_{i})}{w(Y_{i})}} = \frac {\int _{0}^{t_{i}} C_{T}(u)\,du}{\int _{0}^{t_{i}}C^{R}(u)\,du + C^{R}(t_{i})/ k^{R}_{2}}}\), would be appropriate weights for the LSC-based LGA. The weighting formulas given in Eq. (6) satisfy this condition, therefore making them appropriate (See Additional file 1 for details).

Conclusions

The proposed method, LSC-based LGA, significantly reduced the bias in the estimates of BPND, up to 12% and 28% difference compared with MRTM2 and the conventional LGA, respectively. Considering the conventional LGA as the standard method, LSC reduced the bias at a slight expense of increased variation. However, LSC caused smaller variations and required shorter computational times than MRTM2. Thus, LSC outperformed MRTM2 in all aspects (bias, variability, and computational efficiency), making it a promising tool for the BPND estimation. These results are consitent as such for the two radiotracers respectively used for simulated and human data studies.

This study investigated the performance of LSC-based LGA mostly based on simulations, and thus, the method must be validated using a larger cohort of actual data. In addition, different weighting techniques may perform differently. Therefore, a further study is required to seek an adequate weighting technique for LGA.