Introduction

Pulmonary hypertension (PH) is a hemodynamic condition defined as a pathological increase of the mean pulmonary arterial pressure (mPAP) at rest measured by right heart catheterisation (RHC) [1]. The World Health Organization (WHO) classification of PH differentiates five groups based on similarities in aetiology, hemodynamic profile, clinical findings and treatment strategy [2]. Hemodynamically, PH can be subdivided into pre-capillary, post-capillary or combined pre- and post-capillary PH [3, 4].

PH due to left heart disease (WHO group 2 PH), a form of post-capillary PH, is a frequent complication of left heart disease (LHD) and associated with worse prognosis [5, 6]. Management of group 2 PH differs from that of other PH groups and should focus on treatment of the underlying cardiac disorder [3]. The diagnosis of group 2 PH is primarily defined by an elevated pulmonary arterial wedge pressure (PAWP) > 15 mmHg at rest measured by RHC [1]. However, guidelines recommend that estimation of clinical pre-test probability for LHD based on non-invasive parameters should precede invasive catheterisation for improved differentiation of group 2 PH from other causes and improved indication of invasive right and left heart catheterisation [3].

Computed tomography (CT), especially CT pulmonary angiography (CTPA), is frequently acquired in patients with suspected PH to rule out pulmonary embolism and parenchymal lung disease. Previous studies suggested good diagnostic accuracy of enlarged left atrial diameter and area measured on CTPA for identification of group 2 PH [7,8,9]. However, volumetry of cardiac chambers might be more accurate and reliable in detecting cardiac chamber enlargement [10]. Technical advances have enabled automated volumetry of cardiac chambers based on CTPA [11,12,13], alleviating the need for time-consuming manual processing. Only a limited number of studies investigated the ability of cardiac chamber volumes to differentiate group 2 PH from other causes [7, 14, 15] without comparison to manual diameter measurements within the same patient cohort, to our knowledge.

The purpose of this study was to assess the diagnostic accuracy of automated 3D volumetry of the cardiac chambers based on CTPA for the differentiation of WHO group 2 PH from other PH groups (non-group 2 PH). Furthermore, we aimed to test our hypothesis that automated 3D volumetry yields higher diagnostic accuracy than manual diameter measurements of cardiac chambers.

Materials and methods

Patients

The retrospective, single-centre study was approved by the institutional review board, and patient informed consent was waived. Records of all patients who underwent RHC and CTPA for the diagnostic workup of PH in our institution between August 2013 and February 2016 were reviewed. CTPA was performed to diagnose or rule out underlying causes, in particular chronic thromboembolic disease. Diagnosis and classification were made according to the current ESC/ERS guidelines for the diagnosis and treatment of pulmonary hypertension [1]. All patients with PH confirmed by RHC were included. Exclusion criteria were an interval between CTPA and RHC of more than 100 days, non-diagnostic CTPA and failure of 3D segmentation (Fig. 1). RHC measurements (mPAP and PAWP), WHO classification and patient characteristics (sex, height, weight, age) were extracted from the records. Body surface area (BSA) was calculated according to the formula by Du Bois and Du Bois [16].

Fig. 1
figure 1

Flow chart of the study

Right heart catheterisation

RHC was performed according to guidelines [1] by two interventional pulmonologists each with more than 7 years of experience in RHC. A 7-French pulmonary artery catheter was introduced via an 8-French introducer sheath in the right internal jugular vein. Pulmonary arterial pressure measurements were recorded after zero levelling according to the guidelines [1].

CTPA acquisition

Non-gated CTPA data were acquired using a 64-detector CT scanner (Somatom Definition AS 64, Siemens Healthineers) in inspiratory breath-hold in supine position. Protocol settings were as follows: automated tube voltage selection (80–140 kVp), automated tube current modulation (reference tube current of 100 mAs at 120 kVp), bolus tracking in the main pulmonary artery, 50 ml of iodinated contrast agent (Ultravist 300, Bayer HealthCare) followed by a saline bolus of 50 ml with identical injection rate of 3–5 ml/s depending on venous access, collimation of 64 × 0.6 mm, iterative reconstruction kernel I40f/3 and reconstructed slice thickness of 1 mm with 0.7-mm increments. Scan length and field of view were adjusted to include the whole chest of each patient.

Image analysis

Automated segmentation and volumetry of cardiac chambers was performed using a commercially available, model-based algorithm previously validated on gated CT scans [13, 17] (CT Pulmonary Artery Analysis, Intellispace Portal V11, Philips Healthcare). The algorithm automatically computes 3D segmentations of left and right atria (LA, RA) and ventricles (LV, RV). Pulmonary veins and the left atrial appendage were automatically excluded from the LA segmentation. Two board-certified radiologists with 6 and 8 years of experience in cardiovascular imaging reviewed segmentations in an individual random order and blinded to any clinical data, and manually corrected visible deviations from the chambers’ contours using the brush tools provided by the same software. The average of the measurements by the two radiologists was used for further analysis. To assess intraobserver agreement, one of the two radiologists repeated the image analysis after 12 months in half of the patients.

Diameter measurements of all cardiac chambers on axial slices of the CTPA data were performed by one of the board-certified radiologists in a different random order, again blinded to any other measurements and clinical data. LV and RV diameters were measured perpendicular to the interventricular septum on the slice on which the respective ventricle appeared largest [18]. LA diameter was determined as the largest anterior-posterior-diameter on the slice in the middle 50% of the craniocaudal extension of the left atrium [19]. RA diameter was measured as the largest diameter parallel to the tricuspid valve plane, excluding right atrial appendage and coronary sinus [20]. Additionally, contrast attenuation in Hounsfield units (HU) of the LA was assessed by placing a 300 mm2 circular region of interest in the centre of the LA on the same slice as the diameter measurement.

Statistical analysis

Normal distribution was tested using the Shapiro-Wilk test and QQ-plots. Normally distributed continuous variables are described by mean ± standard deviation and 95% confidence intervals (CI), non-normally distributed continuous variables by median and interquartile range (IQR). Categorical variables are given as numbers and percentages. Interobserver and intraobserver agreements were assessed by calculation of intraclass correlation coefficients using a two-way mixed model testing for absolute agreement. Correlation between LA contrast attenuation and the segmentation error was analysed using Pearson’s correlation.

Comparisons between group 2 PH and non-group 2 PH patients were performed using the chi-square test for categorical variables, the two-sided t-test for normally distributed continuous variables and the Mann-Whitney U test for non-normally distributed continuous variables. ROC analyses were performed to assess the ability to differentiate group 2 PH from non-group 2 PH, and AUCs were calculated. Statistical differences of AUCs between volumetric and axial measurements were assessed using the two-sided test by DeLong et al for paired data [21]. All analyses were also performed for BSA-corrected volume and diameter measurements.

Linear correlation with the logit of each cardiac chamber measurement was confirmed and binary multiple logistic regression analysis was conducted for each cardiac chamber measurement separately, including age, sex and mPAP as potentially confounding covariates. After exclusion of significant collinearity between covariates, a binary logistic regression analysis including age and cardiac chamber volumes was performed using stepwise inclusion according to the likelihood ratio. AUC of the resulting model for the identification of group 2 PH was calculated. Sensitivities, specificities and positive and negative predictive values based on the prevalence of group 2 PH in our study cohort were calculated for the threshold values yielding the maximum Youden index for each measurement.

Using the propensity score method, pairwise matching of patients with group 2 PH to non-group 2 PH patients based on age, sex and mPAP using the nearest neighbour method was performed and group analysis repeated as described above.

A p value < 0.05 was considered statistically significant. All statistical analyses were accomplished using SPSS version 27.0 (SPSS Inc.) and R version 4.0.2 (R Foundation for Statistical Computing) including packages MatchIt version 4.1.0 [22] and pROC version 1.16.2 [23].

Sample size-estimation was based on Power Analysis and Sample Size (PASS 11, NCSS, LLC) [24] using the following prerequisites and assumptions: alpha level 0.05, power 0.8, prevalence of group 2 PH among the study population 20%, null hypothesis specificity 70%, alternative hypothesis specificity 90%. The calculated sample size was 39 patients including a minimum of 8 patients with group 2 PH.

Results

Study cohort

The patient characteristics of the final study population of 91 patients are summarised in Table 1. Nineteen patients (20.9%) were classified as group 2 PH, of which 18 (94.7%) were diagnosed with heart failure with preserved ejection fraction and 1 (5.3%) with heart failure with reduced ejection fraction. Seventy-two patients (79.1%) were identified having non-group 2 PH, of which 37 patients (51.4%) were classified as group 1 PH, 15 patients (20.8%) as group 3 PH and 20 patients (27.8%) as group 4 PH.

Table 1 Clinical characteristics and cardiac chamber volumes, diameters and ratios for pulmonary hypertension patients

Cardiac chamber volumes and diameters

Volumes and diameters of LA, LV and RA were significantly larger in patients with group 2 PH compared to non-group 2 PH, while RV/LV, RA/LA and RV/LA volume and diameter ratios were significantly lower (Table 1, Figs. 2, 3 and 4).

Fig. 2
figure 2

Upper row (A, B) shows four-chamber view, lower row (C, D) 3D visualisations of representative cardiac chamber segmentations in a 79-year-old female patient with group 2 pulmonary hypertension (A, C) compared to a 63-year-old female patient with group 3 pulmonary hypertension (B, D). A particular enlargement of the LA is visible in the patient with group 2 pulmonary hypertension. LA, left atrium; LV, left ventricle; mPAP, mean pulmonary arterial pressure; PH, pulmonary hypertension; RA, right atrium; RV, right ventricle

Fig. 3
figure 3

Example of manual diameter measurements of cardiac chambers on standard axial CTPA slices. LA, left atrium; LV, left ventricle; RA, right atrium; RV, right ventricle

Fig. 4
figure 4

Boxplot diagrams of volumes (A) and axial diameters (B) of all cardiac chambers for group 2 pulmonary hypertension patients and non-group 2 pulmonary hypertension patients. LA, left atrium; LV, left ventricle; RA, right atrium; RV, right ventricle

Interobserver and intraobserver agreements of cardiac chamber volumetry were excellent for all cardiac chambers, with intraclass correlation coefficients of 0.951/0.926 for LV volume (95% CI 0.923–0.968/0.872–0.958, both p < 0.001), 0.978/0.980 for RV volume (95% CI 0.967–0.986/0.954–0.990, both p < 0.001), 0.969/0.983 for LA volume (95% CI 0.954–0.980/0.970–0.990, both p < 0.001) and 0.999/0.998 for RA volume (95% CI 0.998–0.999/0.997–0.999, both p < 0.001). Manual adjustments of automatic segmentations were performed in 38 of 91 patients (41.8%)/96 of 364 cardiac chambers (26.4%) by reader 1 and 45 patients (49.5%)/102 cardiac chambers (28.0%) by reader 2, yielding fully automatic segmentations of 72.8% of cardiac chambers without the need of manual adjustment. In cases of manual adjustment, average time for adjusting the segmentation of one cardiac chamber was 86.7 ± 34.2 s. The segmentation error, i.e. the difference between corrected and fully automatic volumes, was small (95% CI for LA (−3.4)–(+0.4)ml/(−6.9)–(+0.4%), for LV (−3.6)–(+2.0)ml/(−5.9)–(+1.2%), for RA (−2.7)–( −0.2)ml/(−1.3)–( −0.2%) and for RV (+7.0)–(+17.9)ml/(+2.5)–(+6.0%)).

Mean LA contrast attenuation was 288.0 HU (95% CI 267.8–308.2 HU). There was only a very weak positive, statistically non-significant correlation between LA contrast attenuation and the segmentation error for LA volume (r2 = 0.021, p = 0.17).

Identification of WHO group 2 pulmonary hypertension

ROC analysis showed excellent ability of LA volume to differentiate group 2 PH from non-group 2 PH with an AUC of 0.908 (95% CI 0.835–0.981) and good ability of LV volume, as well as RV/LV, RA/LA and RV/LA volume ratios (Table 2, Supplementary Table A). Cardiac chamber volumes demonstrated higher AUC values compared to axial diameters, with LA volume showing statistically significant superiority compared to LA diameter (p = 0.009, Table 2). Multiple logistic regression analysis demonstrated that the significance of enlarged LA volume (p < 0.001), enlarged LV volume (p = 0.001), lower RV/LV volume ratio (p = 0.04) and lower RV/LA volume ratio (p = 0.003) as predictors for group 2 PH persisted after adjustment for age, sex and mPAP (Table 3). The same was true for the respective axial diameter values (Table 3).

Table 2 ROC analysis of cardiac chamber volumes, diameters and ratios to differentiate group 2 pulmonary hypertension patients from non-group 2 pulmonary hypertension patients
Table 3 Cardiac chamber volumes, diameters and ratios as predictors for group 2 pulmonary hypertension after adjustment for age, sex and mean pulmonary arterial pressure

The stepwise logistic regression analysis for identification of group 2 PH patients resulted in the final model with LA volume (p < 0.001) and RV/LV volume ratio (p = 0.02) as covariates:

$$ \mathrm{Logit}\left({\mathrm{probability}}_{\mathrm{Group}\ 2\ \mathrm{PH}}\right)=0.64\times \mathrm{LA}\ \mathrm{volume}-1.39\times \mathrm{RV}/\mathrm{LV}\ \mathrm{volume}\ \mathrm{ratio}-4.954 $$

ROC analysis demonstrated an AUC of 0.936 (95% CI 0.879–0.993). The corresponding sensitivity, specificity and positive and negative predictive values using a cut-off probability of > 0.40 yielding the maximum Youden index of 1.73 were 78.9%, 94.4%, 78.9% and 94.4%, respectively (Table 4). For LA volume/LA diameter alone, sensitivity, specificity and positive and negative predictive values using a cut-off value of ≥ 110 ml/≥ 41 mm were 78.9%/89.5%, 90.3%/62.5%, 68.2%/38.6% and 94.2%/95.7%, respectively (Table 4).

Table 4 Diagnostic accuracy for the differentiation of group 2 pulmonary hypertension from non-group 2 pulmonary hypertension

The above-mentioned analyses were also conducted after adjusting cardiac chamber volumes and diameters for BSA. This did not have a relevant influence on the major findings. For example, ROC analysis to differentiate group 2 PH from non-group 2 PH yielded AUC values of 0.893 for LA volumeBSA (95% CI 0.819–0.967), 0.761 for LV volumeBSA (95% CI 0.636–0.886), 0.636 for RA volumeBSA (95% CI 0.491–0.781) and 0.447 for RV volumeBSA (95% CI 0.280–0.613).

Finally, to account for differences in clinical characteristics between the two groups, pairwise matching of group 2 (n = 19) and non-group 2 PH patients (n = 19) was performed using the propensity score method (Table 5). Analyses after pairwise matching confirmed that volumes and diameters of LA, LV and RA were significantly larger in group 2 PH patients compared to matched non-group 2 PH patients, while RV/LA volume and diameter ratios were significantly lower (Table 5).

Table 5 Clinical characteristics and cardiac chamber volumes, diameters and ratios for group 2 pulmonary hypertension patients and propensity score–matched non-group 2 pulmonary hypertension patients

Discussion

This study assessed the diagnostic accuracy of automated 3D volumetry of the cardiac chambers based on CTPA for the differentiation of WHO group 2 PH from other PH groups and demonstrated excellent ability of LA volume to differentiate group 2 PH from non-group 2 PH with an AUC of 0.908 and corresponding sensitivity, specificity and positive and negative predictive values at the maximum Youden index of 78.9%, 90.3%, 68.2% and 94.2%, respectively. LA volume was a significantly better predictor of group 2 PH compared to LA diameter as measured on axial CTPA slices. Diagnostic accuracy could be slightly improved by combining LA volume and RV/LV volume ratio resulting in an AUC of 0.936.

Correct diagnosis of group 2 PH can be challenging, especially in cases of heart failure with preserved ejection fraction, and the diagnostic pathway should be based on the pre-test probability of LHD [3]. As CTPA is frequently acquired in addition to echocardiography during diagnostic workup [3, 25, 26], including CTPA-based parameters into the estimation of pre-test probability of LHD appears to be a promising approach for translation into clinical practice and may improve confidence when deciding which patient may require invasive catheterisation. LA area and volume can also be estimated by transthoracic echocardiography using planar measurements and geometric assumptions [3, 25]. However, LA volume estimation by echocardiography has been shown to be less accurate compared to CT-based volume measurements [26].

One previous study applied the software for cardiac chamber volume analysis also used in our study in a similar clinical setting and patient cohort size [14]. This previous study found a similar AUC value of 0.92 using LA volume for identification of group 2 PH, but did not perform diameter measurements for comparison [14]. Our study not only provides confirmatory evidence for the diagnostic accuracy of LA volume in a different institution and patient cohort but also demonstrates its superiority compared to LA axial anteroposterior diameter in an intra-individual comparison. LA anteroposterior diameter is the most widely used LA measurement in clinical practice and PH research in both echocardiography and CT according to international recommendations and literature [7, 19, 27,28,29,30]. Previous studies reported AUC values of 0.71–0.83 for identification of group 2 PH by LA anteroposterior diameter, compared to 0.83 in our study cohort [7, 8].

LA transverse diameter represents an alternative but has only been investigated in a single study regarding identification of group 2 PH [8]. LA area measurements have also been proposed to differentiate group 2 PH from non-group 2 PH with reported AUC values of 0.73–0.85 being slightly higher compared to LA anteroposterior diameter, but required time-consuming manual delineation of the LA thus hampering its clinical use [7,8,9]. In our study, the software provided robust segmentations without the need of manual adjustment in 72.8% of cardiac chambers and tools for manual adjustments with little time and effort. Fully automatic segmentation without any manual adjustment resulted in only marginally lower diagnostic accuracy with an AUC of 0.90 for LA volume.

The advantage of volume measurements over one- or two-dimensional measurements has also been demonstrated for other body regions [31,32,33]. This may be attributable to the lower susceptibility to geometric variances in asymmetrical geometries, as has also been discussed for the left atrium [10]. Prospective studies are needed to confirm our findings.

This study is subject to some limitations. A retrospective study might be prone to confounders and selection bias. However, we aimed to control for potential confounders by including respective covariates in multiple regression analysis and for potential group differences in the baseline characteristics by performing propensity score matching. Diagnosis and classification of PH groups in this study were based on the current ESC/ERS guidelines [1]. While the 6th World Symposium on Pulmonary Hypertension suggested some changes to the PH definitions, these suggestions have not yet been integrated into the ESC/ERS guidelines [4]. The findings of our study might differ to some degree should the new recommendations be adopted by the guidelines in the future. Furthermore, non-gated CTPA was used for cardiac chamber segmentation. The model-based algorithm was previously evaluated on gated and non-gated CT scans and found to be accurate and highly reliable [13, 14, 17]. Gated CT scans, however, are less frequently acquired in clinical routine, may have higher radiation exposure and be less beneficial for LA volume measurements given the smaller amount of LA motion during the cardiac cycle compared to LV or RV motion. Contrast enhancement of the cardiac chambers was sufficient for reproducible measurements as demonstrated by the excellent interobserver and intraobserver agreements, as well as the small segmentation error and its non-significant correlation with LA contrast attenuation. The 95% CI of LA attenuation (267.8–308.2 HU) was well above the threshold of 210 HU for MPA attenuation considered as a diagnostic CTPA scan [34]. Of note, we observed a slight tendency of the algorithm to underestimate RV volume in PH patients. Finally, in accordance with previous studies, the present study did not address patients with isolated post-capillary PH and patients with combined pre- and post-capillary PH separately [7,8,9].

In conclusion, automated 3D volumetry of cardiac chambers based on non-gated CTPA, in particular LA volume, can differentiate WHO group 2 PH patients from other PH groups with good diagnostic accuracy. LA volume yields significantly higher diagnostic accuracy than LA diameter as measured on axial CTPA slices. If proved reliable in prospective studies, cardiac chamber volumetry based on CTPA may improve non-invasive classification of PH due to LHD and thereby optimise diagnostic pathways.