Introduction

Pulmonary hypertension (PH) is a progressive, heterogeneous, potentially life-shortening pathophysiological condition diagnosed invasively by right heart catheterization (RHC) as a mean pulmonary arterial pressure (mPAP) exceeding 20 mmHg at rest [1]. The current guidelines recommend a multidimensional approach for PH treatment monitoring, including the evaluation of clinical, cardiopulmonary exercise, hematologic, and imaging parameters [2,3,4,5,6]. Serial assessment of the hemodynamic profile by RHC is recommended to guide therapy decisions particularly in case of clinical deterioration [2, 3].

Even though the role of cardiac MRI in the clinical management of PH is not yet established, the method offers various tools particularly suitable for PH treatment monitoring and patient follow-up. Allowing for an accurate evaluation of cardiac function and morphology, various quantitative MRI parameters have been reported to be significantly altered in PH [7,8,9,10,11,12,13,14,15,16,17,18,19,20] including, among others, the appearance of vortical blood flow along the main pulmonary artery as assessed by time-resolved, three-directional MR phase contrast (4D flow) imaging [19, 21,22,23,24,25,26,27,28]. Moreover, the duration of vortical blood flow was introduced as a potential 4D flow metric for accurate non-invasive estimation of elevated mPAP [26, 27], making MR 4D flow imaging particularly interesting for non-invasive mPAP tracking. Although there are sparse reports of cases featuring changes in the appearance of vortical blood flow patterns in chronic thromboembolic PH after percutaneous transluminal pulmonary angioplasty [29] and pulmonary thromboendarterectomy [30], it remains unknown whether the observed pressure relationship of vortical blood flow in the main pulmonary artery persists during PH therapy and whether differences in the duration of vortical blood flow along the main pulmonary artery can predict mPAP changes during treatment.

The purpose of the present study was therefore to compare MR 4D flow imaging and RHC-derived mPAP differences from serial investigations of patients with PH or at risk of developing PH and to analyze the potential of MR 4D flow to non-invasively predict mPAP changes during the clinical workup of such patients.

Materials and methods

Study population

Forty-four subjects with PH or at risk of developing PH (including patients with connective tissue diseases, interstitial lung disease, and portal hypertension), who underwent baseline and follow-up RHC and MR 4D flow imaging at 1.5 T or 3 T between August 2006 and November 2016, were retrospectively enrolled from two prospective studies in which consecutive patients scheduled for routine RHC were also scheduled for near-term comprehensive cardiac MRI (ClinicalTrials.gov identifier NCT00575692 and NCT01725763). The studies complied with the Declaration of Helsinki and were approved by the local ethics review board. Written informed consent was obtained from all participants. Baseline RHC and corresponding cardiac MRI were performed within 8 ± 13 days of each other. Follow-up RHC and cardiac MRI were performed within 8 ± 21 days of each other, 824 ± 839 days from the baseline investigation. No clinically relevant changes in drug treatment or disease state occurred between RHC and the corresponding cardiac MRI.

Right heart catheterization

RHC was performed with a 7-French quadruple lumen, balloon-tipped, flow-directed Swan-Ganz catheter (Baxter Healthcare Corp.) using the transjugular approach with the patient positioned supine. RHC parameters assessed included mPAP to diagnose PH, as well as the mean pulmonary artery wedge pressure (PAWP), cardiac output (CO), and the pulmonary vascular resistance (PVR), which were used for the clinical classification of PH patients.

Cardiac magnetic resonance imaging

Cardiac MRI was performed at 1.5 T (Magnetom Sonata, Siemens Healthcare) or 3 T (Magnetom Trio or Magnetom Skyra, Siemens Healthcare) with the patient in the supine position using phased-array body and spine matrix coils. In detail, for 19 patients, baseline and follow-up MRI investigations were performed at 1.5 T; for 16 patients, baseline and follow-up MRI investigations were performed at 3 T; and for 9 patients, baseline investigations at 1.5 T were followed-up by 3 T MRI.

The cardiac MRI protocol included assessment of retrospectively electrocardiographically gated (ECG-gated), two-dimensional, segmented or ECG-gated, real-time cine balanced steady-state free precession (bSSFP) imaging, covering the entire left (LV) and right (RV) ventricles by a stack of contiguous slices in short-axis orientation. 4D flow data were acquired during free breathing in right ventricular outflow tract (RVOT) orientation, covering the main pulmonary artery with gapless slices of a retrospectively ECG-gated, two-dimensional, segmented, spoiled gradient-echo-based cine phase contrast sequence with three-directional velocity encoding. Velocity encoding was set to 90 cm/s in all directions and adapted if necessary to prevent aliasing. Table 1 provides a summary of 1.5T and 3T cine bSSFP and 4D flow protocol parameters.

Table 1 Sequence parameters of the cine bSSFP and 4D flow imaging protocols at 1.5 T and 3 T. Typical scan times for cine bSSFP protocols are given for the stack of contiguous slices in short-axis orientation; the typical scan times for 4D flow protocols are given for the stack of slices covering the main pulmonary artery. GRAPPA, generalized auto calibrating partial parallel acquisition; *Not applicable due to prospective ECG-gating

Image analysis

LV and RV volumetric function parameters were evaluated from manual segmentation of end-diastolic and end-systolic LV/RV endocardial and epicardial borders in short-axis images using the Argus software (Siemens Healthcare). Myocardial trabeculations and papillary muscles were excluded from the LV and RV cavities. Volumetric function parameters assessed included end-diastolic (EDV) and end-systolic (ESV) volumes, stroke volume (SV), ejection fraction (EF), and LV/RV myocardial mass (LVM, RVM).

Baseline and follow-up 4D flow data of the main pulmonary artery were jointly evaluated using a prototype software (4D Flow, Siemens Healthcare) [28, 31]. After cropping spatially aliased structures, background phase correction was applied to all datasets. Blood flow patterns in the main pulmonary artery were analyzed with respect to the existence of vortical blood flow in consensus by two experienced readers (18 years of experience) using multi-planar reconstructed 3-dimensional vector fields in RVOT orientation (Fig. 1) and streamline representations. The duration of vortical blood flow along the main pulmonary artery (tvortex) was defined as the percentage of cardiac phases with vortical blood flow in the cardiac interval as previously described [27, 28]. 4D flow-based mPAP (mPAPMR) was calculated from tvortex using the piecewise linear model previously given in Reiter et al [27]:

Fig. 1
figure 1

Velocity color-encoded 3-dimensional vector representation of vortical blood flow along the main pulmonary artery in a patient with PH at baseline (upper panel) and 2-year follow-up (lower panel). Color encoding of both representations was set to an equal scale. %RR, percentage of the cardiac interval; #ph, cardiac phase in the RR interval; (Δ)mPAP, mean pulmonary arterial pressure (difference) measured by right heart catheterization; (Δ)mPAPMR, mean pulmonary arterial pressure (difference) estimated by 4D flow; PV, main pulmonary artery peak velocity magnitude. mPAP differences are written in red

$$ {\mathrm{mPAP}}_{\mathrm{MR}}\ \left(\mathrm{in}\ \mathrm{mmHg}\right)\kern1em {\displaystyle \begin{array}{cc}\le 16\kern5.5em &\ \mathrm{for}\ {\mathrm{t}}_{\mathrm{vortex}}=0\\ {}=16+0.63\cdotp {\mathrm{t}}_{\mathrm{vortex}}&\ \mathrm{for}\ {\mathrm{t}}_{\mathrm{vortex}}>0\end{array}} $$
(1)

Changes in mPAP and mPAPMR were calculated as the differences between follow-up and baseline values and are denoted by ΔmPAP and ΔmPAPMR. An increase (decrease) in mPAP was defined as ΔmPAP > 0 (ΔmPAP < 0) together with the condition mPAP > 16 mmHg at follow-up (mPAP > 16 mmHg at baseline) to enable the comparison with respective increases (decreases) in ΔmPAPMR in the entire study population, where, due to the absence of vortical blood flow for mPAP ≤ 16 mmHg, tvortex = 0 allows only an interval-estimate of ΔmPAPMR.

Statistical analysis

Mean values are given together with standard deviations; sensitivities and specificities are specified together with 95% confidence intervals in parentheses. Statistical analysis was performed using NCSS (Hintze, J. (2008) NCSS, LLC.). For statistical tests, a significance level of 0.05 was employed.

Differences in mean parameter changes from zero between baseline and follow-up investigations were analyzed by one-sample t test; relationships of parameter changes to ΔmPAP were analyzed by correlation analysis. The relationships between mPAP and mPAPMR as well as ΔmPAP and ΔmPAPMR were investigated by means of regression and Bland-Altman analysis. Dependencies of biases on MR field strength were analyzed by unpaired t test or analysis of variance (ANOVA) of three groups (baseline and follow-up at 1.5 T, baseline and follow-up at 3 T, baseline at 1.5 T and follow-up at 3 T). The diagnostic performance of mPAPMR in predicting PH and the diagnostic performance of ΔmPAPMR in predicting increases or decreases of mPAP were analyzed by receiver operating characteristic (ROC) curve analysis. Optimal cutoffs were defined as those that maximized the sum of sensitivity and specificity, whereas mPAPMR = 20 mmHg for diagnosing PH and ΔmPAPMR = 0 mmHg for diagnosing increases or decreases of mPAP were termed “natural cutoffs” due to the analogy to mPAP-definitions in RHC.

Results

Study population

At baseline, PH was diagnosed by RHC in 28 of 44 patients: 16 subjects were classified as having PH due to pulmonary arterial hypertension (PAH), 2 subjects as having PH due to lung diseases, 7 subjects as having chronic thromboembolic PH (CTEPH), and 3 subjects as having PH secondary to multifactorial mechanisms. At follow-up, one patient without PH at baseline developed PAH, and one patient with CTEPH at baseline revealed normal mPAP after surgical treatment. Consequently, PH was diagnosed in 28 patients at follow-up. mPAP > 16 mmHg was found in 31 patients at baseline, in 32 patients at follow-up, and mPAP > 16 mmHg was present in 30 patients at baseline and follow-up. Figure 2 illustrates the distributions of mPAP at baseline and at follow-up as well as the mPAP changes of the study population. The demographic and hemodynamic characterization of the study population is summarized in Table 2 together with LV and RV volumetric function parameters at baseline and follow-up.

Fig. 2
figure 2

Distributions of mPAP at baseline and at follow-up as well as the mPAP changes of the study population. nPH,BL, number of subjects with PH at baseline; nnoPH,BL, number of subjects without PH at baseline; nPH,FU, number of subjects with PH at follow-up; nnoPH,FU, number of subjects without PH at follow-up; nPH, number of subjects with PH at baseline and follow-up

Table 2 Summary of demographic, right heart catheterization (RHC), and cardiac MR imaging parameters of the study population at baseline and follow-up, together with differences in parameters between baseline and follow-up, and the correlations of differences with mPAP changes (rΔmPAP). Significant differences from 0 are indicated by *, significant correlations by +. n, number of subjects at baseline and follow-up; nPH, number of subjects with PH at baseline and follow-up; nnon-PH, number of subjects without PH at baseline and follow-up

MR 4D flow-based prediction of PH and mPAP increase/decrease

Using mPAPMR to predict PH, the area under the curve (AUC) was 1.00 at both baseline and follow-up. At baseline, optimal and natural cutoff coincided, such that the cutoff of mPAPMR = 20 mmHg resulted in an accuracy of 100% (92–100%), a sensitivity of 100% (88–100%), and a specificity of 100% (79–100%) for diagnosing PH. At follow-up, the optimal (23 mmHg) and natural cutoffs did not coincide; the cutoff of mPAPMR = 20 mmHg resulted in an accuracy of 95% (84–99%), a sensitivity of 100% (88–100%), and a specificity of 88% (62–98%) for diagnosing PH.

AUCs for the prediction of increase or decrease of mPAP between baseline and follow-up based on ΔmPAPMR were 0.92 (0.74–0.96) and 0.93 (0.75–0.98), respectively (Fig. 3a, b). In both cases, the optimal and natural (ΔmPAPMR = 0 mmHg) cutoffs coincided. The accuracy, sensitivity, and specificity for predicting mPAP increase were 91% (78–97%), 85% (56–98%), and 94% (79–99%), respectively, while the corresponding values for predicting mPAP decrease were 91% (78–97%), 89% (65–99%), and 92% (75–99%), respectively.

Fig. 3
figure 3

ROC curves for the prediction of mPAP increase (a) and decrease (b) based on ΔmPAPMR for the entire study population, as well as for mPAP increase (c) and decrease (d) for patients with PH at baseline RHC

Restricting the study population to PH patients, the AUC values for the prediction of increase or decrease of mPAP were 0.94 (0.77–0.99) and 0.94 (0.76–0.98), respectively. The natural cutoff ΔmPAPMR = 0 mmHg resulted in an accuracy of 89% (72–98%), sensitivity of 90% (56–100%), and specificity of 89% (65–99%) for predicting mPAP increase, and an accuracy of 86% (67–96%), sensitivity of 88% (64–99%), and specificity of 82% (48–98%) for predicting mPAP decrease (Fig. 3c, d).

MR 4D flow-based assessment of mPAP and ΔmPAP

The correlation between mPAP and mPAPMR for patients with mPAP > 16 mmHg was strong at both baseline (r = 0.95) and follow-up (r = 0.97). Bias to RHC as the reference method was not found at either baseline or follow-up (0.2 mmHg, p = 0.83 and - 0.1 mmHg, p = 0.90 for baseline and follow-up, respectively). Standard deviations of differences were 4.0 mmHg for the baseline and 3.6 mmHg for the follow-up investigation (Fig. 4). Differences between mPAP and mPAPMR did not depend significantly on field strength (p = 0.73 for the baseline and p = 0.19 for the follow-up investigation).

Fig. 4
figure 4

Linear regression and Bland-Altman plots of mean pulmonary arterial pressures measured at right heart catheterization (mPAP) and estimated from vortex duration (mPAPMR) at baseline (a, b) and follow-up (c, d). Light gray shading indicates the area between 95% limits of agreement, dark gray shadings indicate 95% confidence intervals of bias and 95%-limits of agreement. nmPAP > 16,BL, number of subjects with mPAP > 16 mmHg at baseline; nmPAP > 16,FU, number of subjects with mPAP > 16 mmHg at follow-up; r, correlation coefficient

In patients with mPAP > 16 mmHg at baseline and follow-up, ΔmPAP correlated strongly with ΔmPAPMR (r = 0.91). There was no significant bias (- 0.3 mmHg, p = 0.71), and the standard deviation of differences was 5.1 mmHg (Fig. 5). The differences between ΔmPAP and ΔmPAPMR did not depend significantly on field strength(s) used at baseline and follow-up (p = 0.07).

Fig. 5
figure 5

Linear regression (a) and Bland-Altman (b) plot of mean pulmonary arterial pressure differences between baseline and follow-up measured by right heart catheterization (ΔmPAP). Light gray shading indicates the area between 95% limits of agreement, dark gray shadings indicate 95% confidence intervals of bias and 95% limits of agreement. nmPAP > 16, number of subjects with mPAP > 16 mmHg at baseline and follow-up

Discussion

The most important finding of this study is that MR 4D flow imaging of the main pulmonary artery during PH treatment allows both accurate prediction of the direction of mPAP changes and precise, non-invasive estimation of the magnitude of mPAP changes in patients with mPAP > 16 mmHg.

Stabilization and improvement of pulmonary hemodynamics are major objectives in the management of patients with PH [3, 32]. Doppler echocardiography is currently the primary non-invasive method for the evaluation of RV function and pulmonary arterial pressure during PH therapy [6, 24]. The technique, however, has limitations with respect to reproducibility [5, 33,34,35,36,37] and the accuracy of pulmonary arterial pressure estimation [36,37,38,39], potentially limiting the reliability of echocardiography for serial assessments of individual patients. Cardiac MRI has increasingly been used in PH studies as a reference standard for the evaluation of ventricular function [40,41,42,43]. These studies have identified progressive RV dilatation and decreased RV EF at follow-up cardiac MRI as important predictors of poor long-term outcome and treatment failure. Moreover, van de Veerdonk et al [41] reported increasing RV volumes and decreased RV EF as early signs of clinical worsening in a group of severe PAH patients, suggesting that RV remodeling may precede disease progression even in patients appearing clinically stable. Notably, significant correlations were also found between RV ESV, RV EF, and RV SV changes and ΔmPAP in the present study.

Even though various routine cardiac MRI parameters and parameter models have been used for the evaluation of mPAP in PH showing their potential of mPAP prediction with standard deviations in the range of 7–10 mmHg [8, 12, 44], cardiac MRI is not yet established as a tool for the non-invasive estimation of mPAP. Reiter et al [27] demonstrated a more accurate 4D flow-based estimation of mPAP > 16 mmHg and high accuracy in the diagnosis of PH from the duration of vortical blood flow in the main pulmonary artery using a threshold of mPAP ≥ 25 mmHg, irrespective of the etiology of PH. Employed with the recently introduced threshold of mPAP > 20 mmHg for the diagnosis of PH [1] in the present study, the technique revealed the accuracy for the non-invasive diagnosis of PH similar to that which it yielded with the former threshold mPAP ≥ 25 mmHg [26,27,28]. Using a piecewise linear model for non-invasive estimation of mPAP > 16 mmHg [27], mPAP was—in accordance with [27, 28] —predicted without bias and with small standard deviations from RHC at baseline as well as follow-up cardiac MRI. As the differences between mPAPMR and mPAP did not depend on field strength and a main difference in 4D flow protocols at different field strength was the measured temporal resolution, it could be speculated that the measured temporal resolution of the 4D flow acquisition does not play a central role for the accuracy of mPAP estimation.

The standard deviation for the prediction of pressure differences in the present study was lower than anticipated by the law of error propagation, which would result in a standard deviation of \( \sqrt{3.6^2+{4.0}^2}=5.4\ \mathrm{mmHg} \). This can be understood to be due to the comparative interpretation of vortical blood flow along the main pulmonary artery observed at baseline and follow-up, which increases the likelihood that a combined over- or underestimation of vortex duration-derived mPAP at baseline and follow-up will be canceled out in the difference (ΔmPAPMR). The standard deviation between invasively measured and non-invasively estimated ΔmPAP, however, cannot be expected to be substantially smaller than 5 mmHg, because RHC and MRI measurements were not acquired simultaneously: Assuming a spontaneous mPAP variability of approximately 3 mmHg [26, 45, 46], deviations between RHC and MRI of at least \( \sqrt{3^2+{3}^2}=4.2\ \mathrm{mmHg} \) have to be expected, even without taking into account the limited time resolution of vortex duration assessment.

The high accuracy for the prediction of mPAP increases and decreases in PH patients can be interpreted as a natural consequence of the strong correlation between ΔmPAP from RHC and MR. Considering mPAP changes within the range mPAP ≤ 16 mmHg as clinically not relevant (that is—as defined in the present study—no mPAP change), the accuracy of predicted mPAP changes in the whole population was comparable to that in the PH group. This result implies that the absence of vortical blood flow along the main pulmonary artery at baseline and follow-up in patients at risk of developing PH indicates persistent normal mPAP ≤ 16 mmHg [47].

Several limitations of the current study need to be acknowledged. As this was a retrospective analysis of two prospective studies, follow-up of patients was not planned and the follow-up times varied between 1 and 10 years. No patients with PH due to left heart diseases were included in the data analysis, as these patients were not repeatedly followed up by RHC. The study population included patients across a wide mPAP range, which limits the comparability of differences in the diagnosis of PH from the duration of vortical blood flow in the main pulmonary artery based on different mPAP cutoffs for PH. Moreover, only adult patients were recruited. Although protocols used for the assessment of 4D flow at 1.5 T and 3 T MRI differed, the bias of resulting pressure differences to RHC did not differ, indicating that results were not directly affected by the acquisition strategy.

In conclusion, MR 4D flow imaging allows accurate non-invasive prediction and quantification of mPAP changes in adult patients with PH or at risk of developing PH.