Introduction

Dynamic contrast-enhanced magnetic resonance imaging (DCE-MRI) is emerging as an imaging tool for diagnosis and assessing tumour response to novel anticancer antiangiogenic therapies [14]. The technique is based on the analysis of the temporal bio-distribution of an intravenously injected Gadolinium-based contrast agent within the imaged tumour. Pharmacokinetic (PK) modelling of such data requires a reliable measure of the arterial input function (AIF) in order to obtain robust estimates of physiological parameters characterising tumour vascular properties. The AIF represents the time course of the contrast concentration in the blood plasma pool, and its accuracy will strongly influence the output of the PK modelling. Therefore, a patient-specific measurement of the AIF per visit from a vessel feeding the tumour would in principle be a suitable approach. In practice, however, such direct measurements are heavily influenced by partial volume effects (especially for small vessels), in-flow effects, non-linear signal response, non-uniform B1 field, pulsatile flow and other confounding factors. Variations due to these effects can be minimised by using a fixed AIF, which can reasonably be derived by averaging measured AIFs over a suitable population. It is plausible that where a cohort response is of interest (e.g. in a clinical trial), a population-averaged AIF may be appropriate, whereas when measuring individual responses, a measured AIF may be more suitable. This work aims to explore these two hypotheses. The repeatability and treatment-response effects of DCE-MRI-derived PK parameters are compared using AIFs derived in four ways:

  1. 1.

    Fixed AIF (AIFPOP) [5]

  2. 2.

    Individual AIFs derived from pre-bolus MRI data (AIFPB) [6, 7]

  3. 3.

    Individual AIFs obtained from DCE-MRI data (AIFDCE) [8]

  4. 4.

    Individual AIFs derived from dynamic contrast computed tomography (DC-CT) data (AIFCT) (acquired same day as DCE-MRI)

The extended Kety model [9, 10] was used to model uptake of contrast in tumour tissues for all four AIF approaches. Baseline repeatability and treatment effect results were investigated in a subset of a cohort of patients previously shown to have a significant response to Cediranib in an early-phase trial [11].

Materials and methods

Patient cohort

Data presented here relate to a coherent and consistent subset of patients from an early-stage clinical trial comparing the efficacy of DCE-MRI and DC-CT for measuring treatment effects due to vascular-targeted therapies. This early-stage trial [11] investigated the treatment effects of Cediranib –a potent inhibitor of vascular endothelial growth factor receptor tyrosine kinases– and was shown to cause a cohort decrease in K trans, the transfer coefficient between blood plasma and extravascular extracellular space. To ensure a homogeneous cohort, only coronal MR acquisitions of patients with at least three longitudinal scans were assessed. No particular criteria, other than the usual clinical inclusion/exclusion conditions defined within the trial protocol [11], were further considered here.

As a result, our cohort consisted of 13 patients with abdominal tumours (12 liver and one left para-aortic lymph node) imaged longitudinally at three time points. The cohort comprised seven women and six men, with ages ranging from 31 to 73 (mean 56.6) years. All patients were scanned twice, 7 days apart, prior to treatment, to assess repeatability of the methods. The third scan was performed after a minimum of 7 days of Cediranib therapy to investigate treatment changes. Patients gave written informed consent, and the local ethical committee approved the research study.

MR acquisition protocol

All 13 patients were imaged coronally on a 1.5 T Avanto Siemens scanner using a sequential breath-hold technique optimised for liver lesions [12]. The MR scanning consisted of a pre-bolus acquisition (one tenth of the standard dose; i.e. 0.02 ml/kg Magnevist contrast agent), followed by the main DCE acquisition at the standard dose of 0.2 ml/kg Magnevist. Both contrast doses were delivered by power injector at 3 ml/s, followed by 20 ml of saline solution delivered at 2 ml/s. The pre-bolus protocol was: 2D fast-field echo (FFE) sequence, a single slice 20 mm thick, repetition time/echo time (TR/TE) = 5.5/1.21 ms, field of view (FoV) = 440 × 440 mm2, 128 × 128 matrix, temporal resolution 0.7 s/image, 60 pre-contrast images at 3° and 340 dynamics at 20°. The corresponding parameters for the main DCE acquisition were 3D FFE sequence, 14 slices 5-mm thick, TR/TE = 3.05/0.89 ms, FoV = 308 × 380 mm2, 208 × 256 matrix, number of sampling acquisitions (NSA) = 1, integrated parallel acquisition techniques (iPAT) = 2. Dynamic images with a flip angle (FA) of 16° were acquired at 3 s/volume, obtained as pairs (6 s breath-hold at expiration), with 6-s breathing interval between each pair, and total acquisition time 4 min. These were preceded by a calibration scan acquired in a single breath-hold (expiration) with the same parameters except FA = 3° and NSA = 4 to enable contrast quantification.

CT acquisition protocol

Corresponding DC-CT data were acquired on the same day no more than 4 h prior to MR scanning. Axial CT data were acquired with the following setup: GE Lightspeed; 0.5 ml/kg contrast agent (Omnipaque 300) followed by 20 ml saline (both at 3–5 ml/s); 5-s delay followed by breath-hold cine covering 4 × 5 mm2, at 0.5 s/volume in the centre of the lesion of interest over 55 s at 120 kV and 60 mA. Following this were 12 breath-hold acquisitions at 10-s intervals.

AIF data analysis

Pre-bolus MRI data (AIFPB)

Individual AIFs were extracted from the single-slice pre-bolus data for all patients at each of the three visits. AIFPB was measured from the abdominal aorta, a region of the aorta less affected by the pulsatile effects of the heart [13, 14]. A four-parameter AIF model was used to fit the pre-bolus AIF data and a corresponding full-dose AIF was obtained by correcting for the duration and volume of the injection of the main DCE measurement [15]. The measured AIF (whole blood) was converted to contrast concentration in plasma assuming a haematocrit of 0.42 [5].

DCE-MRI data (AIFDCE)

Individual AIFs from the main DCE-MRI data were obtained from a vessel visible in the acquired FoV, which was the aorta in all cases. The same four-parameter AIF model was used to fit the visit-specific AIFDCE.

DC-CT data (AIFCT)

The DC-CT derived AIF was also obtained from the aorta, and a model-based correction method was used to account for the different contrast agents, delivery volumes and delivery rates for the DCE-MRI and DC-CT acquisitions (see “Appendix”).

Fixed AIF (AIFPOP)

The fixed AIF was based on the population-averaged AIF presented in [5].

PK analysis of DCE-MRI tumour data

Tumour regions of interest (ROI) from four central slices of the DCE-MRI data were fitted with the extended Kety model for each of the available four categories of AIF. Estimates of K trans, volume of extravascular extracellular space (v e) and volume of vascular plasma space (v p) were obtained, and the median (K trans, v e) or mean (v p) over the tumour was assessed for repeatability and treatment effects.

Statistical analysis

All statistics were calculated using log-transform of the raw values [16] and back-transformed appropriately to give percentage values. Baseline repeatability was calculated for K trans, v e and v p estimates using the Bland-Altman statistic [17], expressed as coefficient of variation, CoV = 100 % × [exp2/2)-1]1/2, where σ2 is the variance of difference between baseline values. The limits of agreement (LoA) were derived from the Bland-Altman variance σ2 using LoA = 100 % × exp(±1.96σ), which gives the range of changes in one patient that are within the 95 % limits of the measurement repeatability. Patients with treatment changes outside the LoA were considered to have an individually significant effect, and the number of patients with changes outside the LoA was recorded for each measure. Treatment changes were computed as the difference between post-treatment values and the mean of the two pre-treatment values. The mean cohort changes were computed and back-transformed to give percentage changes; p values were obtained using two-sided paired t tests on the log-transformed values for K trans, v e and v p, and also for AIF parameters (see “Appendix”) for all three measured AIFs.

Results

The main results are presented in Table 1 and Fig. 1 for the three PK parameters.

Table 1 Repeatability and treatment effects for three main pharmacokinetic (PK) parameters as measured from our comparative analysis of arterial input function (AIF) methods
Fig. 1
figure 1

Representation of main results presented in Table 1. POP population, DCE dynamic contrast enhanced, PB pre-bolus, CT computed tomography, LoA limits of agreement, CoV coefficient of variation, K trans transfer coefficient between blood plasma and extravascular extracellular space, v e volume of extravascular extracellular space, v p volume of vascular plasma space

Measured AIFs were extracted for all 13 patients from both the pre-bolus MR data and the DC-CT data. No measurable vessel was present in the DCE-MRI data in 3/13 patients, so no tissue parameters were obtained for this AIF approach in these patients.

The best K trans repeatability was 7.5 %, which was achieved when using a population-derived AIFPOP, whereas those obtained with measured AIFPB and AIFDCE were both >20 %. At 11.8 %, AIFCT had a smaller CoV for K trans than using either of the MR-measured AIFs.

Post-treatment K trans cohort decreases ranged from 49.5 % (AIFPOP) to 65.7 % (AIFPB) and all were statistically significant (p < 0.05), with the most significant being the AIFCT (p = 0.0049). All AIF methods identified significant individual responders: AIFPOP and AIFCT identified 11/13 patients with K trans decreases outside the LoA; AIFPB identified 10/13; AIFDCE only identified 6/13.

Similar repeatability ranges were obtained for v e (10.9–22.4 %), while significantly worse repeatability was obtained for v p (50.4–86.8 %). For both v e and v p, the best CoV was with AIFDCE and the worst was with AIFPB. Post-treatment v e cohort decreases were in the range 32.6–47.9 %, but only AIFDCE and AIFCT reached significance (p < 0.05), while approximately half of the patients (4–6/13) had individual reductions outside the LoA. Only AIFPB identified a significant cohort decrease in v p (p = 0.021).

Typical patient AIF curves as derived from the four methods are shown in Fig. 2. The only significant treatment changes to AIF parameters were for AIFCT, where the amplitude of the recirculation phase a R increased by 66 % (p = 0.001), and its duration μ R decreased by 13 % (p = 0.04). Treatment changes for all parameters of AIFDCE and AIFPB and the remaining parameters for AIFCT had p > 0.1.

Fig. 2
figure 2

Typical arterial input function (AIF) curves for one patient as derived from the four methods compared. Fits of baselines and post-treatment curves together with the measured AIF curve for baseline 1 data are presented for each measurement technique. POP population, DCE dynamic contrast enhanced, MR magnetic resonance, PB pre-bolus, CT computed tomography

Discussion

Whilst using a DC-CT AIF is not a feasible approach in general, the good K trans repeatability using AIFCT is remarkable, since DC-CT data were acquired up to 4 h before DCE-MRI data and involved a different contrast agent and imaging approach. Nevertheless, the net effect of these additional sources of variability still results in a lower K trans variability with AIFCT than with AIFDCE or AIFPB, both of which were obtained during the same imaging session as the main DCE-MRI measurement. This suggests that, in principle, using a measured AIF can provide higher accuracy of parametric values but that significant improvements to AIF measurement using DCE-MRI are needed to yield an overall reduction in variability by accounting for genuine physiological AIF variability.

All four AIF methods detected similar K trans decreases on treatment, and whilst all four were significant (p < 0.05), AIFCT was an order of magnitude more significant than the other three (p = 0.0049). In contrast, AIFDCE identified only 6/13 patients with K trans decreases outside the LoA, compared with 10/13 or 11/13 patients identified when using the other three AIFs. Taken together, the K trans repeatability and treatment changes observed with AIFCT indicate that, in principle, visit-specific AIFs would be preferred, but the results with AIFDCE and AIFPB suggest that, in practice, neither of these approaches a significantly better performance than a population-averaged AIF. In addition, 3/13 cases did not yield a useable AIF from DCE-MRI data due to tumour location, which is a significant practical limitation.

Further comparisons between the various AIFs can be made from the v e and v p statistics in Table 1 and Fig. 1. The repeatability of v e shows no difference between AIFPOP and AIFDCE, whilst that for AIFPB is around twice as large. AIFDCE gives a v e treatment p value of borderline significance, whilst AIFPOP and AIFPB are not significant, indicating AIFDCE may be the best approach if only v e is of interest. None of the AIF approaches yields a v p repeatability suitable for monitoring individual or cohort changes.

There were no significant treatment changes to AIF parameters for AIFDCE or AIFPB, whilst the significant changes to CT -measured AIFs are associated with the amplitude and duration of the recirculation phase (a R and μ R), which is not visible in MR-measured AIFs. Although these changes may be genuine, it is more likely that the significant p values obtained are due to statistical fluctuations, especially since the AIFCT model contains eight parameters.

A population haematocrit (Hct) of 0.42 was assumed in this study. Preliminary tests (data not shown here) investigated Hct variability from two baseline measurements over a cohort of 21 patients. The repeatability coefficient was 3.4 %, suggesting a minimal impact of Hct variability on the K trans estimate (since K trans is approximately proportional to 1–Hct).

When assessing treatment response, an AIF measured per session would account for any changes in AIF and, in principle, this should lead to lower variability and, by extension, to improved sensitivity and specificity of K trans changes to a tumour effect. This specificity is particularly important if treatment has an effect on the AIF, since in such cases a population-derived AIF would mean K trans treatment changes would be confounded by AIF changes. In this study, no significant MRI AIF changes were detected, which is consistent with the finding that the population-derived AIF had the best overall performance in these data. Since AIF variability and changes with treatment have not previously been reported, it is unclear whether this is typical where DCE-MRI measurements are used. Therefore, given the poor performance of both MR-based AIF measurements, we suggest that using a population-derived AIF is the most appropriate method when measuring cohort treatment responses for DCE-MRI clinical trials unless there is good evidence that the treatment could affect the AIF.

Our findings reinforce the common practice of using a population-based AIF for quantitative DCE-MRI clinical trials when assessing tumour response [2, 5]. Other clinical studies have compared DCE-MRI metrics when using a population AIF (or cohort-averaged AIF) versus different approaches of measuring individual AIFs for various tumour sites, e.g. abdomen [5], osteosarcomas [18], neck [19, 20], prostate [21] or breast [22]. Overall, these studies reported no significant difference/improvement in PK estimates when measuring individual AIFs compared with a population AIF, which is in agreement with our conclusion.