Introduction

Although quantitative estimates of myocardial blood flow (MBF) and flow reserve (MFR) from positron emission tomography (PET) myocardial perfusion imaging (MPI) have widely recognized value,1,2,3 accurate estimates require high-quality scanners, as well as rigorous injection and optimized acquisition protocols.4 Dynamic series used for computation of MBF and MFR are prone to numerous quality issues including patient motion and scanner saturation. Although the former can be readily recognized and corrected,5,6,7 identification of saturation by inspection of dynamic series is not straightforward or reliable.8 Furthermore, obtaining data without saturation is near impossible with certain older scanners. Consequently, there is great interest in simpler alternatives to standard kinetic modeling approaches,9,10 which are robust to these issues. Toward this end, work by Juneau et al in the present issue of the Journal of Nuclear Cardiology investigates the estimation of MFR from the myocardial activity ratio (MAR) from static images.11

Background

Two methods of calculating MBF from PET MPI have been well validated: net retention modeling and compartmental modeling. Full compartmental modeling involves fitting observed dynamic data to a kinetic model describing the transit of tracer from the blood to tissue compartments (Figure 1A). The net retention method is a simplification of compartmental modeling wherein the tissue activity is scaled by activity in the blood pool (Figure 1B). The method proposed by Juneau et al is a further simplification of the net retention method wherein blood-pool activity is assumed to be the same at both rest and stress (Figure 1C). The authors have fit a non-linear regression formula to optimize the correlation between the activity ratio and measured MFR (which perhaps could have been done more robustly and simply by log transformation of the data). While this method avoids the need for acquisition of blood-pool phase images, it still requires a rigorous injection and acquisition protocol as the two images must result from injections of the same activity, identical delays between injection and acquisition, and identical acquisition times.

Figure 1
figure 1

Simplified overview of methods used to derive MFR. (A) 1-tissue compartment modeling fits a kinetic model to the observed tissue (red) and blood (blue) time-activity curves from dynamic series. (B) The net retention model uses area under the blood time-activity curve during the blood-pool phase and area under the tissue time-activity curve during the tissue phase to simplify calculation. Higher cardiac output leads to more rapid clearance of the blood activity, and thus, a lower area under the curve. (C) In the MAR method used by Juneau et al the blood-pool activity is assumed to be unchanged between rest and stress so MFR can be estimated as the ratio of the activity in the tissue at stress divided by rest. Partial volume effects and spillover are left out of A and B for readability. Note that the area under the blue blood-pool time-activity curve is slightly smaller at stress than rest, accounting for a modest increase in cardiac output, resulting in faster clearance

This method, first outlined more than 20 years ago,9,12 and applied to 18F-flurpiridaz in 2011,10 implicitly assumes that cardiac output (CO) does not change between rest and stress. Increase of CO at stress would result in more rapid clearance of tracer from the blood pool, violating this assumption. Juneau et al evaluated this assumption through multiple linear regression between MAR, MFR, and CO, as well as comparing CO ratios to liver uptake ratios for vasodilator stress testing. Results showed limited impact of CO on MAR calculations but require further investigation as to whether this applies to other populations and stress pharmaceuticals (dipyridamole was used in the study) and to exercise stress. Importantly, changes in ejection fraction and heart rate with stress have been shown to be prognostically important13,14 and related to the presence of CAD,15 one must wonder if the lack of relationship between CO and MAR was simply due to lack of power or technique.

Promise and Limitations

The data presented by Juneau et al show a moderate correlation between measured MFR and estimated MFR from their simplified method with R2 of 0.40 and 0.53 for 13NH3 and 82Rb, respectively. This implies only about half or less of the information in the data can be retrieved by this method. Yet, the receiver-operating analysis yielded promising results with areas under the curve (AUC) of 0.832 and 0.904, respectively, for 13NH3 and 82Rb. Using estimated MFR to detect MFR < 2 yielded a sensitivity of 73.3% and a specificity of 79.9% for 13NH3 and 77.3% and 88% for 82Rb.

How can we rationalize that only half or less of the information in a properly measured MFR is obtained in these estimates and yet the sensitivity and specificity are reasonably good? First, these results may be optimistic due to a relatively low prevalence of patients with reduced MFR. However, more likely is that the sensitivity and specificity are over-optimistic due to unblinded remeasurement in the core lab, 16 which will not be possible in real-world practice where the gold standard measurement is not available. To evaluate this, we simulated 10,000 hypothetical datasets for each tracer with the same sample size as in the Juneau et al study (N = 250), using the observed mean and standard deviation of MFR and estimated MFR, and their correlation (square root of R2) [scripts and output of the simulations are available in the electronic supplementary material]. Based on our simulations, real-world sensitivities are likely to be approximately 65.2% and 64.2%, respectively, for 13NH3 and 82Rb, markedly lower than reported and insufficient for clinical use. Conversely, real-world specificities for 13NH3 and 82Rb are likely to be 78.0% and 86.0%, respectively, closer to the reported values (Figure 2). These simulated results suggest that there may have been adjustment of false-negative studies, thus, improving the reported sensitivity. The real-world AUCs are also likely to underperform at 0.798 and 0.854 for 13NH3 and 82Rb, respectively. If one assumes that the 13NH3 and 82Rb experiments are independent, given the distribution of MFR, estimated MFR, and their correlations, the probability of obtaining results as good as reported for sensitivity, specificity, and AUC for both is less than 1 in 50000 (P = 0.000017).

Figure 2
figure 2

Scatter plots of the sensitivity and specificity derived from simulations of the study from Juneau et al for 13NH3 (panel A) and 82Rb (panel B). The red data points highlight the sensitivity/specificity pairs reported by Juneau et al. The probability of achieving or exceeding the reported sensitivity, specificity, and area under the curve of the receiver-operating characteristic plot is only 0.97% for 13NH3 and 0.18% for 82Rb, respectively

As a result, while the study shows promise, it is likely that in clinical practice, this method will not reliably reproduce MFR, yielding limits of agreement far too wide for clinical use. The method seems more applicable as a quality control tool, as discrepancies between the estimated and standard MFR larger than the limits of agreement may indicate some issue with one of the two calculations. However, even this role is potentially limited by the assumption of unchanged CO between rest and stress. Furthermore, when a discrepancy is found, it may be challenging to identify which of the two measures is reliable and the reasons to discount the other.

Waking up from the Dream

Ultimately, the work put forth by Juneau et al is of uncertain value toward simplifying MFR quantification. The clinical implementation of this method requires rigorous blinded validation to prove incremental value over relative perfusion or value as a quality control tool. To be clear, the likely adjustment of false-negative studies does not constitute unethical research behavior. Indeed, it fits with standard clinical behavior wherein physicians will remeasure or reprocess studies which do not meet their expectations. However, when this is done selectively, without blinding to the gold standard, severe biases may result.17 Therefore, unless this study is repeated with independent blinded core labs for separately processing the dynamic and static images, it is not possible to conclude that this method is ready for prime time.

A few other limitations should be considered. First, the application of the method to 13NH3 only makes sense for exercise stress, given that saturation is rarely an issue for that tracer, even on older scanners. However, exercise testing is likely to introduce markedly greater variability in the CO with a much larger increase in CO from rest to stress than with dipyridamole. Consequently, this method would require further validation, relying on a different gold standard than traditional MFR given that it is nearly impossible to collect high-quality dynamic images during exercise stress.

Although a tool to detect saturation for 82Rb MFR would be of tremendous value, the results provided stop short of demonstrating that the method works to identify saturation in this study. Indeed, the effect of saturation on MFR has yet to be fully characterized in the literature, in part due to the lack of a simple, widely accepted gold standard for saturation and for the tissue time-activity curve.

In the end, this study does offer a valuable lesson for clinical researchers. Blinding is necessary to ensure that results are not inadvertently biased while intending to maximize quality. Hopefully this work can be a steppingstone toward greater reliability, accessibility, and ease of clinical MFR assessment.