Introduction

Myocardial fibrosis is a characteristic pathological manifestation of hypertrophic cardiomyopathy (HCM). Myocardial fibrosis in patients with HCM is a predictor of poor prognosis, including sudden cardiac death, heart failure, and ventricular arrhythmia [1, 2]. Cardiac magnetic resonance (CMR) is the gold standard non-invasive modality for cardiac structure and function assessment. Late gadolinium enhancement (LGE) sequence can detect fibrosis and quantify scarred myocardium through post-processing techniques [3]. However, patients with HCM are often younger than the general population undergoing enhanced CMR scanning and need multiple CMRs to assess lifetime disease progression, nearly 1/3–1/2 of these patients are estimated to have no fibrosis [2, 4, 5]. Several studies have shown that gadolinium should be used with caution in patients with chronic kidney disease and is contraindicated in those with gadolinium allergy. Repeated gadolinium injections might result in its deposition in the central nervous system [6, 7]. Therefore, it is important to find a personalized strategy to identify patients with HCM who require LGE scanning. While patients at high risk of fibrosis should undergo LGE for further quantitative and prognostic analysis, exposure to a contrast agent could be minimized in those at low risk of fibrosis, reducing scan time and healthcare costs.

The application and exploration of extensive data mining in clinical medicine have vastly grown in recent years. Radiomics has become increasingly widespread in heart diseases as it can characterize the myocardial tissue [8,9,10,11]. Neisius et al demonstrated that radiomic analysis of native T1 mapping could outperform global native T1 values in discriminating between HCM and hypertensive heart disease as well as identifying LGE(−) patients for avoiding gadolinium administration [12, 13]. However, unlike cine sequence, mapping technology is unavailable in most hospitals. Schofield et al analyzed texture features of the mid-left ventricular slice in cine imaging, showing that radiomics could distinguish the various left ventricular (LV) hypertrophy etiologies [14]. Nevertheless, a comparison between radiomics studies based on the entire LV myocardium and a LV myocardial slice has not been reported.

The study aimed to establish models that fuse image features and texture characteristics based on cine imaging to detect fibrosis in patients with HCM and compare the performance of models constructed using a single slice and the entire LV myocardium.

Material and methods

Study population

The ethics committee of our hospital approved this study and waived the requirement for informed consent due to its retrospective nature. The study included patients with HCM who underwent CMR scans between January 2013 and December 2020 (Fig. 1). HCM was defined referred to previous guideline: maximal LV thickness on CMR images ≥ 15 mm or ≥ 13 mm with a documented family history of HCM [2]. The exclusion criteria were as follows: poor or incomplete images, previous radiofrequency or alcohol ablation, myocardial infarction, and myocardial hypertrophy due to other causes such as hypertensive cardiomyopathy, poorly controlled hypertension, cardiac amyloidosis, valvular disease, and Fabry disease.

Fig. 1
figure 1

Flowchart of patient selection in this study

Cardiac magnetic resonance imaging

CMR scans were performed on 1.5-T magnetic resonance scanners (GE Signa Excite HD or Siemens MAGNETOM Avanto) with a phased-array body coil. Short-axis cine images covering the entire LV and long-axis cine images (two/ three/ four-chamber) were acquired using a retrospective electrocardiogram-gated and balanced steady-state free precession sequence. The respective parameters were as follows: slice thickness/gap, 8 mm/2 mm; repetition time (TR), 3.5 and 2.64 ms; echo time (TE), 1.5 and 1.11 ms; flip angle, 45° and 56°; field of view (FOV), 360 × 360 and 340 × 276 mm2; acquired matrix, 224 × 224 and 192 × 125 pixels2; reconstructed matrix, 512 × 512 and 192 × 156 pixels2; temporal resolution, 49 and 47.52 ms. The array spatial and sensitivity encoding technique (acceleration factor = 2) was used in the GE scanner, while the generalized auto-calibrating partially parallel acquisition (acceleration factor = 2) in the Siemens scanner. LGE images were obtained 8–10 min after intravenous injection of 0.15–0.2 mmol/kg of gadopentetate dimeglumine (Beilu) with the following parameters: slice thickness/gap, 8 mm/2 mm; TR, 6.6 and 2.5 ms; TE, 3.1 and 1.09 ms; flip angle, 20° and 40°; FOV, 360 × 324 and 340 × 255 mm2 (Supplementary Table S1).

Imaging analysis

Cardiac function and strain analysis were performed using CVI42, Version 5.13.5 (Circle Cardiovascular Imaging Inc.). LV function, including LV ejection fraction (LVEF), LV end-diastolic volume index (LVEDVI), LV end-systolic volume index (LVESVI), systolic volume index, and LV mass index (LVMI), was calculated automatically by loading short-axis images into the short-3D module. A stack of long- (two-/three-/four-chamber) and short-axis images were loaded into the tissue tracking module for two-dimensional strain analysis. Endocardial and epicardial borders at end-diastolic LV were automatically drawn to track the myocardium throughout the heart cycle and manually adjusted when necessary for optimal tracking. Then, peak values of myocardial strains and strain rates were derived: global radial, circumferential, and longitudinal strains (GRS, GCS, GLS); global radial, circumferential, and longitudinal strain rates (GRSr, GCSr, GLSr); early global radial, circumferential, and longitudinal strain of diastolic rates (GRSDr, GCSDr, GLSDr). The end-diastolic maximum wall thickness (MWT) was measured in the short-axis images. Left atrial anteroposterior diameter was measured on a three-chamber view.

LGE analysis was obtained in the tissue characteristic module. Radiologist A, with 4 years of CMR experience, visually estimated LGE in short- and long-axis views. Patients were classified as LGE(+) if they had myocardial enhancement in at least one segment. If uncertain, a senior radiologist was consulted, reaching a consensus through discussion. LGE extent was quantified as the percentage of total LV mass (%LGE) using a grayscale threshold of > 5 SDs from remote normal myocardium. The HCM cohort was divided into LGE (+) and LGE (−) groups.

Radiomic features extraction

Radiologist A manually delineated the entire LV myocardium at the end of diastole in short-axis cine views as the region of interest using ITK-SNAP, version 3.8.0 (http://www.itksnap.org/). Before feature extraction, cine images were interpolated to voxel sizes of 1 × 1 × 4 mm3 using a SimpleITK nearest-neighbor resampling filter to reduce within-image variability. A min-max normalization was performed for all images to map the pixel intensity to 0–255. The Pyradiomics package, version 3.0.1 (https://github.com/Radiomics/pyradiomics), extracted 1130 and 1688 features from the MWT slice and the entire LV myocardium, respectively, including four categories: 18 first-order, 14 shape (2D/3D), and 75 texture features for both, and 1023 and 1581 high-order features, respectively.

Radiomic features selection

Radiologist A delineated images from 30 randomly selected patients with HCM twice one month apart to test intraclass reproducibility. The 30 patients were also assessed by radiologist B, who had 5 years of experience in CMR, to assess interclass reproducibility. Features with intra- and interclass correlation coefficient (ICC) > 0.85 were considered reproducible and retained for further analysis. Subsequently, the Boruta algorithm reduced the dimensionality of the total radiomic features [15]. The importance of each feature was calculated and ranked and the top 100 features were retained. Extreme gradient boosting (XGBoost) was selected as the machine learning technique for prediction.

Model construction and validation

CMR imaging parameters with significant differences were selected in the training set using logistic regression and collinearity analysis. These were used to build the CMR model. R1 and R2 models were constructed based on the MWT slice and entire LV myocardium, respectively, using radiomic features selected by XGBoost. ICMR+R1 and ICMR+R2 models were built by integrating CMR model and R1 or R2 model through logistic regression. Receiver operating characteristic (ROC) curves evaluated the models’ diagnostic performance. Calibration curve and Hosmer-Lemeshow test evaluated the calibration performance and goodness of fit. Decision curve analysis (DCA) assessed the net benefits of models. Net reclassification improvement (NRI) and integrated discrimination improvement (IDI) evaluated the incremental value of ICMR+R2 for identifying fibrosis. Figure 2 shows the study methodology workflow.

Fig. 2
figure 2

Radiomics workflow in this study

Statistical analysis

Data analysis was performed using IBM SPSS Statistics for Windows, version 23.0 (IBM Corp.) and R studio (version 4.1.2, http://www.r-project.org/). Continuous variables were expressed as mean ± SD or median (interquartile range). Shapiro Wilk test evaluated the normality of data distributions. Student’s t-test or Mann–Whitney test was used appropriately to assess differences between groups. Categorical data were expressed as frequencies (percentages), and the chi-square test or Fisher’s exact test compared differences between groups. Statistical significance was set at p < 0.05.

Results

Clinical data and CMR variables of HCM patients

This study included 273 patients with HCM (training set, 70%; test set, 30%). Patients and CMR characteristics are summarized in Table 1. In the training set, the median age was 55 years, 67.54% were male, and 147 (76.96%) were LGE(+). More patients had a history of non-sustained ventricular tachycardia and LVEF < 50% in the LGE(+) than in the LGE(–) group, showing higher MWT and LVMI and lower strains, including GRS, GCS, GLS, GRSr, GCSr, GLSr, GRSDr, and GLSDr (all p < 0.05; Table 1). In the test set, the median age was 53.5 years, most were male (73.17%) and 67 (81.70%) were LGE(+). LVEDVI, LVESVI, MWT, LVMI, GRS, GCS, GLS, GLSr, and GLSDr were significantly different between LGE(–) and LGE(+) groups (all p < 0.05; Table 1). The groups were similar for all other variables.

Table 1 Clinical characteristics and CMR parameters of HCM patients in the training and test sets

Radiomic feature selection

The radiomics workflow is illustrated in Fig. 2. A total of 1688 and 1130 features were extracted from the entire LV myocardium and MWT slice respectively. Among them, 1041 and 546 features had good repeatability with ICC > 0.85. The XGBoost algorithm selected the most important and robust features. Supplementary Figure S1 depicts the importance level of selected features from the MWT slice and entire LV myocardium. The predictive score was much higher in the LGE(+) group than in the LGE(–) group (Supplementary Figure S2). The top three radiomic features in R1, the top five radiomic features in R2, and correlated interpretations in LGE(+) with patients are shown in Supplementary Table S2.

Model construction and evaluation

Univariate and multivariate logistic regression and collinearity analyses revealed that MWT, LVMI, and GLS were independent predictors of LGE(+) (Fig. 3). Therefore, we established five prediction models: the CMR model, two radiomic models (R1 and R2), and two integrated models (ICMR+R1 and ICMR+R2). In the training set, R2, ICMR+R1, and ICMR+R2 models could better distinguish between LGE(+) and LGE(–) groups (AUC values were 0.909, 0.901, and 0.930, respectively) than CMR and R1 models (Fig. 4a). In the test set, R2 and ICMR+R2 models showed an excellent predictive ability for LGE(+) patients (AUC values were 0.906 and 0.898, respectively; Fig. 4b). The radar plots (Fig. 5) and Table 2 show that the ICMR+R2 model had high diagnostic accuracy, sensitivity, NPV, and F1 score for differentiating LGE(+) from LGE(–) patients in the training set (92.15%, 97.28%, 89.19%, and 95.02%, respectively) and test set (89.02%, 92.54%, 68.75%, and 93.23%, respectively). The R2 model achieved a specificity of 93.33% in the test cohort. Supplementary Figure S3 illustrates myocardial appearance in cine imaging by radiomic analysis to reveal fibrosis.

Fig. 3
figure 3

Multivariate logistic regression analysis indicated maximum wall thickness (MWT), left ventricular mass index (LVMI), and global longitudinal strain (GLS) as independent predictors of LGE(+) patients

Fig. 4
figure 4

Receiver operating characteristic curves of five models for identifying LGE(+) patients in the training (a) and test (b) sets

Fig. 5
figure 5

Radar plots showing the diagnostic efficacy of five models in the training (a) and test (b) sets

Table 2 Diagnostic performance of CMR model, radiomic models, and integrated models in the training and test sets

Furthermore, we evaluated the calibration of the five models and their impact on clinical decision-making. The calibration plots indicated that ICMR+R2 model was well calibrated, with a mean predicted probability for each group close to the observed probability (Fig. 6a, b). This was further proved by the Hosmer-Lemeshow test with p values > 0.05 in both cohorts, suggesting no reason to reject the null hypothesis of no difference between the predicted and observed LGE(+) probability in each group. The DCA results comparing the performance of five models are shown in Fig. 6c and d. The results suggested the use of unenhanced CMR based on the indications of R2 and ICMR+R2 models had better net benefits than other models for threshold probabilities ranging from 40 to 90%. Furthermore, ICMR+R2 model yielded significantly positive NRI and IDI compared to the CMR model in the training (0.854 and 0.302, respectively) and test set (0.498 and 0.136, respectively), indicating that the ICMR+R2 model could improve the discrimination and reclassification for fibrosis in HCM. Moreover, we performed a subgroup analysis of R2 and ICMR+R2 models. AUC values in the GE and Siemens scanners were similar to the mixed data in both datasets (Supplementary Table S3), implying that our models worked equally well on data from the two scanners.

Fig. 6
figure 6

Calibration curves and decision curve analyses of the various models (a, c: training set; b, d: test set). Calibration curves depict the calibration performance of models in terms of an agreement between predicted and observed LGE(+). The closer the curve is to the dotted black line, the better the model’s predictive accuracy is. For decision curves, the vertical axis measures standardized net benefit, while the horizontal axis shows the corresponding risk threshold. The decision curves show that performing unenhanced CMR based on the indications of R2 and ICMR+R2 models had better net benefits than other models for threshold probabilities ranging from 40 to 90%

Discussion

In this study, we constructed CMR, radiomics, and integrated models based on cine imaging to identify fibrosis in patients with HCM. The results in the independent test set showed that ICMR+R2 model could effectively detect 62 (92.54%) of the LGE(+) patients, while R2 model screened out 14 (93.33%) of LGE(–) patients. A model including radiomic features derived from the entire LV myocardium may provide a reliable and stable tool to identify patients with HCM at high risk of fibrosis and screen out those at low risk, avoiding unnecessary contrast-enhanced CMR scanning.

The apparent manifestation of HCM is myocardial thickening [1, 2]. LV wall thickness (MWT ≥ 15 mm) is one of the most important criteria for HCM diagnosis [2]. MWT is one of several clinical features used when recommending implantable cardioverter defibrillator implantation [16, 17]. Previous studies have demonstrated that myocardial hypertrophy and fibrosis can lead to reduced LV strain [18, 19]. This study found three imaging features independently associated with fibrosis. A CMR model including MWT, LVMI, and GLS had an AUC value of 0.847 to identify LGE(+) patients in the test cohort. However, the CMR model showed only mild accuracy (70.73%), sensitivity (65.67%), and F1 score (78.57%). In clinical practice, we often encounter situations where the LV wall is thickened but shows no LGE hyperintensity [20]. Thus, identifying fibrosis based on imaging features alone has a limited value.

Radiomics can conduct an in-depth analysis of medical images and extract a large number of features that the naked eyes cannot observe [8]. Baessler et al proved that radiomics of nonenhanced cine images could accurately diagnose subacute and chronic myocardial infarction [21]. Our study analyzed the radiomics of the MWT slice and entire LV myocardium. We found that almost all radiomic features in the R2 model were high-order features changed by various algorithms, which allowed identification of nonuniform and coarse myocardial texture associated with fibrosis. The results indicated that high-order features could provide rich and accurate differential diagnosis information [22, 23]. Among the five models, CMR, R1, and ICMR+R1 models had similar AUC values for identifying LGE(+) patients, suggesting that radiomic models based on a single myocardial slice have no advantage over imaging features in diagnosing fibrosis in HCM. In comparison, R2 and ICMR+R2 models had better discrimination power with AUC values of 0.906 and 0.898 in the test cohort. ICMR+R2 model correctly identified most LGE(+) patients (92.54%) who would benefit from LGE scanning. At least 93.33% of patients without fibrosis could avoid repeated enhanced CMR based on the R2 model. Therefore, after a comprehensive evaluation of the ROC curves, confusion matrix, decision curves, and reclassification indices, we believe that the ICMR+R2 model is the optimal tool to identify LGE(+) patients, while the R2 model is more suitable for screening out LGE(–) patients. In clinical practice, enhanced CMR is used most frequently to evaluate fibrosis. LGE scanning is recommended for patients with HCM without contraindications, regardless of the probability of fibrosis. This study proposed a personalized strategy, namely, performing gadolinium-free CMR in patients with HCM at low risk of fibrosis and LGE scanning in those at high risk of fibrosis. A multi-center study on 1099 patients with HCM established a deep learning model that combined morphology (MWT and radiomic features) and function (ventricular wall thickening) [24]. Among the 82 patients determined to have no fibrosis by the model in the validation set, 73 (89%) were true negative and could avoid enhanced CMR. Our study results show some differences. GLS, the final functional variable selected, was proved to be a relatively stable and convenient deformation factor [25, 26]. Although ICMR+R2 model could only identify 68.75% of patients without fibrosis, it still exhibited satisfactory discriminability and stability and excellent sensitivity (92.54%) and F1 score (93.23%), even with limited data.

Previous studies have shown that radiomics or deep learning based on CT and other MR sequences can identify patients with HCM at high risk of fibrosis. Qin et al [27] demonstrated that a radiomic model based on CT angiography achieved an AUC of 0.78 to assess the presence of fibrosis. However, CTA exposes patients to radiation and increases the risk of allergy to contrast agent. Zhang et al [28] proposed a deep learning method on cine images that could detect the presence, position, and size of chronic myocardial infarction. Zhang et al [29] showed that virtual native enhancement imaging could generate LGE-equivalent images using deep learning to fuse native T1 maps and cine images. Such images achieved high agreement with LGE in detecting the distribution and quantification of fibrosis. However, T1 mapping is only carried out in some large-scale tertiary hospitals, mostly for scientific research, with only a few centers adding mapping sequence to routine scans. Additionally, deep learning still has the issue of complex technology being regarded as a “black box.” Our ICMR+R2 model was constructed by radiomics based on routine cine images, presenting great discriminative performance and considerable clinical benefit, which would help promote its application in clinical practice.

This study had some limitations. First, it was a single-center study with a small sample size. A multi-center study with a larger sample is required to confirm the model effectiveness. Second, the results were based on two MR scanners. Future studies should test models for generality across diverse populations and scanners. Third, our study lacked a quantitative scar analysis. We are trying to collect multi-center data for further quantitative and prognostic studies, hoping to soon implement gadolinium-free scans in selected patients with HCM. Fourth, we assessed end-diastolic but not end-systolic cine images. This was because MWT data available for HCM diagnosis and CMR model training were acquired at the end of diastole. Finally, our study included slightly more patients with LGE(+) than in other studies, possibly because the patients referred to our center were screened by lower-level hospitals, resulting in a selection bias.

Conclusion

A predictive model that fused radiomics from the entire LV myocardium had good diagnostic performance, robustness, and clinical utility. This model may represent an economical and feasible alternative follow-up plan for patients who do not have to undergo repeated enhanced CMR, avoiding unnecessary contrast injection.