Differential diagnosis of common etiologies of left ventricular hypertrophy using a hybrid CNN-LSTM model

Hwang, In-Chang; Choi, Dongjun; Choi, You-Jung; Ju, Lia; Kim, Myeongju; Hong, Ji-Eun; Lee, Hyun-Jung; Yoon, Yeonyee E.; Park, Jun-Bean; Lee, Seung-Pyo; Kim, Hyung-Kwan; Kim, Yong-Jin; Cho, Goo-Yeong

doi:10.1038/s41598-022-25467-w

Differential diagnosis of common etiologies of left ventricular hypertrophy using a hybrid CNN-LSTM model

Article
Open access
Published: 05 December 2022

Volume 12, article number 20998, (2022)
Cite this article

Download PDF

You have full access to this open access article

Scientific Reports

Differential diagnosis of common etiologies of left ventricular hypertrophy using a hybrid CNN-LSTM model

Download PDF

In-Chang Hwang^1,2^na1,
Dongjun Choi³^na1,
You-Jung Choi⁴,
Lia Ju¹,
Myeongju Kim³,
Ji-Eun Hong³,
Hyun-Jung Lee^2,5,
Yeonyee E. Yoon^1,2,
Jun-Bean Park^2,5,
Seung-Pyo Lee^2,5,
Hyung-Kwan Kim^2,5,
Yong-Jin Kim^2,5 &
…
Goo-Yeong Cho^1,2

3058 Accesses
17 Citations
3 Altmetric
Explore all metrics

Abstract

Differential diagnosis of left ventricular hypertrophy (LVH) is often obscure on echocardiography and requires numerous additional tests. We aimed to develop a deep learning algorithm to aid in the differentiation of common etiologies of LVH (i.e. hypertensive heart disease [HHD], hypertrophic cardiomyopathy [HCM], and light-chain cardiac amyloidosis [ALCA]) on echocardiographic images. Echocardiograms in 5 standard views (parasternal long-axis, parasternal short-axis, apical 4-chamber, apical 2-chamber, and apical 3-chamber) were obtained from 930 subjects: 112 with HHD, 191 with HCM, 81 with ALCA and 546 normal subjects. The study population was divided into training (n = 620), validation (n = 155), and test sets (n = 155). A convolutional neural network-long short-term memory (CNN-LSTM) algorithm was constructed to independently classify the 3 diagnoses on each view, and the final diagnosis was made by an aggregate network based on the simultaneously predicted probabilities of HCM, HCM, and ALCA. Diagnostic performance of the algorithm was evaluated by the area under the receiver operating characteristic curve (AUC), and accuracy was evaluated by the confusion matrix. The deep learning algorithm was trained and verified using the training and validation sets, respectively. In the test set, the average AUC across the five standard views was 0.962, 0.982 and 0.996 for HHD, HCM and CA, respectively. The overall diagnostic accuracy was significantly higher for the deep learning algorithm (92.3%) than for echocardiography specialists (80.0% and 80.6%). In the present study, we developed a deep learning algorithm for the differential diagnosis of 3 common LVH etiologies (HHD, HCM and ALCA) by applying a hybrid CNN-LSTM model and aggregate network to standard echocardiographic images. The high diagnostic performance of our deep learning algorithm suggests that the use of deep learning can improve the diagnostic process in patients with LVH.

Using deep learning method to identify left ventricular hypertrophy on echocardiography

Article Open access 10 November 2021

Multi-channel deep learning model-based myocardial spatial–temporal morphology feature on cardiac MRI cine images diagnoses the cause of LVH

Article Open access 24 April 2023

Echocardiography-based machine learning algorithm for distinguishing ischemic cardiomyopathy from dilated cardiomyopathy

Article Open access 26 September 2023

Introduction

Echocardiography is widely accepted as an essential diagnostic tool for cardiovascular evaluation. Most measurements on echocardiography can be automated using machine learning techniques^1,2,3. However, the value of echocardiography also includes differential diagnosis and clinical decision making. Echocardiography specialists make judgements based on the visual information from echocardiographic images, along with knowledge and experience. Because of complex and diverse medical situations, the interpretation of echocardiographic images and resulting decision still remain as dependent on the clinician’s expertise.

The differential diagnosis of “unexplained” left ventricular hypertrophy (LVH) on echocardiography is important, but challenging⁴. LVH is most commonly a physiologic consequence of increased afterload by hypertension (i.e. hypertensive heart disease [HHD])⁵. However, some patients demonstrate hypertrophied myocardium without an increased afterload; the differential diagnosis in such patients includes hypertrophic cardiomyopathy (HCM) and infiltrative cardiomyopathy, such as light-chain cardiac amyloidosis (ALCA)^4,6,7,8,9. The differential diagnosis of LVH requires a series of expensive, invasive, and time-consuming procedures, such as cardiac magnetic resonance imaging (CMR), endomyocardial biopsy (EMB), and genetic testing⁴. In particular, CMR is useful in the differentiation of LVH of unknown etiology based on the well-established typical CMR features of HCM and ALCA, but is expensive and sometimes unavailable, and does not confirm the diagnosis^10,11. For confirmation of the diagnosis, EMB is useful, especially for ALCA. However, EMB has limitations such as invasiveness, lower diagnostic yield at the right ventricle (RV), difficulty in approaching to the LV myocardium, and a lack of specific histologic markers for HHD^12,13. Genetic testing can be useful for the detection of HCM, but the results are often inconclusive and sometimes do not provide confirmative diagnostic information¹⁴. Due to these limitations, patients with LVH of unknown etiology require additional tests, which necessitate substantial time and cost. More importantly, these tests often need to be performed simultaneously or sequentially, as the findings of each test might not provide confirmative results. If the echocardiographic findings can narrow the differential diagnosis of LVH of unknown etiology, then the time and cost required for diagnostic process can be reduced, and patients can avoid unnecessary tests. However, although echocardiography plays a role in screening for the suggestion of differential diagnosis of “unexplained” LVH, this imaging modality might not be correct, and may mislead or complicate the diagnostic process¹⁵. Therefore, the presumptive diagnosis by expert cardiologists must be improved in terms of accuracy, and there is a clinical need for higher diagnostic accuracy on echocardiography for a more efficient diagnostic process.

Considering that machine learning can objectively evaluate imaging data without prejudice, and construct a decision from information that is difficult for human eyes to comprehend, it can be assumed that a machine learning approach would be helpful for the differential diagnosis of LVH on echocardiography. Therefore, in the present study, we aimed to differentiate common LVH etiologies (HHD, HCM, and ALCA) on standard echocardiographic images by using a hybrid convolutional neural network-long short-term memory (CNN-LSTM) algorithm.

Methods

The overall scheme of the study is depicted in Fig. 1 and more detailed methods are available in the Supplementary Methods.

Study design and cohort

This study conformed to the principles outlined in the Declaration of Helsinki and was approved by the Seoul National University Bundang Hospital Institutional Review Board (IRB No. B-2105-687-107) in May 2021. The requirement for informed consent was waived by the Seoul National University Bundang Hospital Institutional Review Board because of the retrospective nature of the study and minimal expected risk to the subjects. This study was conducted and described according to the Proposed Requirements for Cardiovascular Imaging-Related Machine Learning Evaluation, as suggested by the American College of Cardiology Healthcare Innovation Council¹⁶.

From the echocardiography databases of Seoul National University Bundang Hospital (n = 755) and Seoul National University Hospital (n = 175), we retrospectively identified 930 subjects (112 patients with HHD, 191 with HCM, 81 with ALCA, and 546 normal subjects). The diagnostic criteria for HHD, HCM and ALCA are described below.

HHD

Patients with a history of hypertension, who met the diagnostic criteria for LVH on echocardiography (LV mass index [LVMI] > 115 g/m² in men, and > 95 g/m² in women) were included^17,18. The following additional criteria were required for a specific diagnosis of HHD: (1) end-diastolic maximal LV wall thickness (LVWTmax) ≥ 12 mm, (2) regression of LVH after appropriate blood pressure control, and (3) exclusion of other causes of LVH (such as HCM, infiltrative cardiomyopathy, metabolic cardiomyopathy, etc.).

HCM

Patients who met the diagnostic criteria of HCM (LVWTmax ≥ 15 mm on echocardiography, in the absence of abnormal loading conditions that could sufficiently explain the LVH) were included^19,20. For a specific and accurate diagnosis of HCM, definite evidence of HCM on CMR or a typical gene mutation on genetic analysis were required.

ALCA

According to clinical guidelines, ALCA on echocardiography was suggested when the LVWTmax was > 12 mm²¹. Other typical features on echocardiography, such as (1) symmetrical LV thickening; (2) right ventricular (RV) free wall thickening; (3) small pericardial effusion; (4) thickening of the atrioventricular valves and interatrial septum; (5) abnormal myocardial texture characterized as a speckled appearance; (6) voltage-mass discrepancy; (7) base-to-apex strain gradient or relative apical sparing of longitudinal strain; and (8) typical findings on CMR (patchy, subendocardial circumferential, or diffuse fuzzy late gadolinium enhancement [LGE] of the LV), were used for clinical suspicion and detection of ALCA^21,22. For a specific and accurate labeling, definite evidence of amyloid involvement on EMB was required. Due to the small number of patients with transthyretin amyloidosis and potential differences in myocardial texture, we included patients with ALCA, and excluded those with transthyretin amyloidosis.

Normal subjects

Inclusion criteria for normal subjects were as follows: (1) no clinical history of cardiovascular disease or diabetes; (2) normal blood pressure (≤ 130/80 mm Hg); (3) body mass index ≤ 30 kg/m²; (4) normal sinus rhythm at 50–85 beats/min without conduction abnormalities; (5) normal LV wall thickness, LV wall motion, and left atrial volume (< 27 mL/m² using the biplane method of discs); (6) no mitral valve prolapse; and (7) no more than trivial valve regurgitation.

Exclusion criteria

Patients were excluded if they had (1) significant LV dysfunction (LV ejection fraction < 40%), (2) active malignancy (or receiving chemotherapy), (3) end-stage renal disease, (4) prior coronary revascularization, (5) significant valve disease, (6) regional wall motion abnormality, (7) no evidence of LVH or LVWTmax < 11 mm, or (8) other metabolic or infiltrative cardiomyopathies, such as Fabry disease, Danon disease, mitochondrial encephalopathy lactic acidosis and stroke-like episodes (MELAS), and PRKAG2 cardiomyopathy.

Echocardiography

All images were obtained using a standard ultrasound device with a 2.5-MHz probe, in accordance with the guidelines of the American Society of Echocardiography¹⁷. Echocardiograms comprised 1 cardiac cycle, obtained in 5 standard views (parasternal long-axis [PLAX], parasternal short-axis [PSAX], apical 4-chamber [A4C], apical 2-chamber [A2C], and apical 3-chamber [A3C]).

Image processing for the deep learning algorithm

Echocardiogram videos were downloaded as Digital Imaging and Communications in Medicine (DICOM) files from the picture archiving and communication system, and anonymized (Fig. 1). Because of differences in heart rate and echocardiographic frame rate, the number of images in the cardiac cycle differed among patients and views. Therefore, 12 images were extracted at the same interval for each view. The extracted images were cropped to 12 × 12cm² based on each center point to remove parts not related to the region of interest. The cropped images were resized to 256 × 256 pixels using bilinear interpolation. Pydicom (python package, version 2.1.0) was used to preprocess the DICOM files.

Deep learning model development

The development of the deep learning model is detailed in the Supplementary Methods. Briefly, the total study population (n = 930) was divided into training (n = 620), validation (n = 155), and test sets (n = 155). Using the training set, a deep learning algorithm based on a CNN-LSTM for the differential diagnosis of LVH was developed in two major steps (Fig. 1). The first step comprised the development of a CNN-LSTM network^23,24. The same CNN was applied to the 12 DICOM images extracted from each standard echocardiographic view. Because we aimed to combine the CNN’s feature extraction from the DICOM images and the LSTM’s temporal information, we opted to extract 12 images/cardiac cycle, in order to avoid exhaustive amount of computing time from various lengths of input videos, while maintaining clinical relevance²⁵. Then, in order to reflect the temporal and spatial connectivity between the 12 DICOM images, a bi-directional convolutional LSTM layer was applied. Finally, a multi-label classification block was applied to predict HHD, HCM, and ALCA independently, on each view. The second step comprised the development of a neural network that aggregated the results obtained in the first step. This neural network was developed to decide the final “most-likely” diagnosis among 4 categories (normal, HHD, HCM, and ALCA) from the 5 standard views of each patient; in real-world clinical practice, the evaluation of a patient’s echocardiographic images should lead to a single clinical diagnosis. The outputs obtained from the 5 independent CNN-LSTM networks were concatenated to compose the input. Binary cross entropy was used as an objective function to train the first and second steps, and He-initialization was used to initialize the weights²⁶. The region to which the deep learning algorithm reacted sensitively in images was detected using class activation mapping²⁷. Network development was implemented using the Tensorflow framework (version 2.3) and graphic processing unit (NVIDIA GeForce RTX 2080 Ti) in Linux (Ubuntu 16.04) with NVIDIA CUDA/cuDNN (versions 10.1 and 7.6, respectively).

Study outcomes

The study outcomes were the area under the receiver operating characteristic curve (AUC) for the differentiation of the 4 categories (normal, HHD, HCM, and ALCA) and the diagnostic accuracy as calculated by the confusion matrix. For the latter, the final diagnosis made by the deep learning algorithm was compared to the ground-truth labeling. Additionally, using the test set, the diagnostic performance of the CNN-LSTM model was evaluated by comparing the final diagnosis made by the deep learning algorithm to the visual interpretation of expert cardiologists (I-C Hwang and G-Y Cho, who have more than 10 and 25 years of experience in echocardiography, respectively).

Statistical analysis

The AUC was used to measure the classification performance. Sensitivity, specificity, positive and negative predictive values, and positive and negative likelihood ratios of the deep learning algorithm were calculated for each disease. The optimal cutoff for each of the 3 diseases was calculated in advance using the Youden’s J statistic of the validation set²⁸. If the probabilities for HHD, HCM, or ALCA were smaller than the corresponding optimal cutoff, the diagnosis was “normal”. Otherwise, the highest value among the probabilities for HHD, HCM, and ALCA decided the final diagnosis. Cohen’s \(\kappa\) coefficient and the confusion matrix were calculated to compare the diagnostic performance between the deep learning algorithm and the expert clinicians²⁹. Diagnostic accuracy based on the confusion matrix was calculated as (true positives + true negatives)/(true positives + true negatives + false positives + false negatives). All statistical analyses were performed using R statistical software version 4.1.1 (The R Foundation for Statistical Computing, Vienna, Austria). p-values < 0.05 were considered statistically significant.

Results

Baseline characteristics

In total, 4650 echocardiograms from 930 subjects (5 standard echocardiographic views for each subject) were analyzed. Baseline characteristics of the study population are summarized in Table 1. The LVWTmax and LVMI were significantly larger in patients with LVH than in normal subjects, but there were no significant differences between HHD, HCM, and CA subgroups. Details regarding the composition of the training, validation, and test sets are provided in Table 2.

Table 1 Baseline characteristics.

Full size table

Table 2 Splitting of the data into training, validation and test sets.

Full size table

Diagnostic accuracy

The diagnostic accuracy of the developed algorithm was assessed at each step of the algorithm development. First, the AUCs for the differential diagnosis of HHD, HCM, and ALCA were obtained from the CNN models without LSTM network, and were compared with the AUCs obtained from the CNN-LSTM model (Supplementary Table S1). In overall, the AUCs for the diagnosis of HHD, HCM and ALCA were higher with the combined CNN-LSTM model compared to the CNN models: the averaged AUCs of the CNN models were around 0.9, but further improved by the addition of LSTM network. Details regarding the diagnostic performance for each view are provided in Supplementary Table S2, comparing the diagnosis made by the expert cardiologists and that by CNN-LSTM algorithm.

Second, the AUCs obtained from the combined CNN-LSTM model of each echocardiographic views and those of the final AUCs from the final aggregate network of the 5 standard views were assessed in the validation and test sets (Table 3, Supplementary Fig. S1). In the validation set, the AUCs of the final aggregate network of the 5 standard echocardiographic views were 0.958, 0.988, and 0.993, for the diagnosis of HHD, HCM, and ALCA, respectively (Table 3, Fig. 2A). The AUCs were similar in the test set (0.962, 0.982 and 0.996, respectively) (Table 3, Fig. 2B). The AUCs obtained from the final aggregate network were higher than those from each echocardiographic view. Details on the sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) for each LVH etiology, provided by the expert cardiologists and CNN-LSTM algorithm, are compared in Supplementary Table S3.

Table 3 AUCs for the differential diagnosis of LVH.

Full size table

The diagnostic performance of the CNN-LSTM model and aggregate network was compared according to the number of echocardiographic images extracted from 1 cardiac cycle, which was one of the major hyperparameters of our deep learning algorithm (Supplementary Methods and Supplementary Table S4). In the developed model, the number of images/cardiac cycle was determined empirically: 12 DICOM images were extracted from 1 cardiac cycle, considering the various heart rates and frame rates of the included echocardiogram videos. The AUCs of the algorithm based on 12 images/cardiac cycle were comparable to the models based on 4, 8, or 16 images/cardiac cycle. In order to reflect the entire cardiac cycle in echocardiogram videos of various heart rates and frame rates in routine clinical practice, the 12 images/cardiac cycle was maintained for the deep learning algorithm. Further, the AUCs were compared between the 2-dimensional (2D) image-based CNN-LSTM model with aggregate network and the 3-dimensional CNN (3D-CNN) model, which was suggested in a recent study³⁰. In this analysis, we extracted 12 images/cardiac cycle or 16 images/cardiac cycle for the 3D-CNN model for comparability with our 2D-CNN-LSTM model, and found that the AUCs of the 3D-CNN model are not significantly different compared to our algorithm (Supplementary Table S5).

Echocardiographic features used in the differential diagnosis

Class activation mapping demonstrated that well-established typical echocardiographic findings for the differential diagnosis of LVH were utilized in the deep learning algorithm (Fig. 3 and Supplementary Table S6). In PLAX views, the highlighted regions comprised the anteroseptum, ascending aorta, and basal inferolateral segment with posterior mitral valve leaflet (Fig. 3A,F,K). In PSAX views, the septum and papillary muscle were highlighted in all 4 categories, and for the differentiation of ALCA, the pericardium at the LV posterior side was highlighted (Fig. 3B,G,L). The inferoseptum and papillary muscle were typically highlighted in A4C images (Fig. 3C,H,M); the LV inferior wall and LA wall were highlighted in A2C images (Fig. 3D,I,N); and the anteroseptum, inferolateral wall, and the pericardium at the LV posterior side were highlighted in A3C images (Fig. 3E,J,O). The frequencies of the highlighted regions in each echocardiographic view are summarized in Supplementary Table S6.

Comparison with expert interpretation

As shown in Supplementary Table S2, the diagnostic performance of expert cardiologists on a single echocardiographic view was not satisfactory: the sensitivity ranged from 14 to 78% and the PPV from 26 to 77%. Although the diagnostic performance of expert cardiologists was improved when the 5 standard echocardiographic views were combined for decision, the sensitivity, specificity, PPV and NPV for each LVH etiology were lower than those provided by the deep learning algorithm using the hybrid CNN-LSTM model and aggregate network (Supplementary Table S3). The overall diagnostic accuracy of the deep learning algorithm was 92.3% and the Cohen’s \(\kappa\) was 0.869 (p < 0.001), which were significantly higher than those of the two expert cardiologists (expert 1: accuracy, 80%; Cohen’s \(\kappa\), 0.674; p < 0.001; expert 2: accuracy, 80.6%; Cohen’s \(\kappa\), 0.687; p < 0.001) (Fig. 4).

Discussion

In the present study, we developed a deep learning algorithm based on 5 standard echocardiographic views from 930 subjects to differentiate the common etiologies of LVH on echocardiography using a hybrid CNN-LSTM model and aggregate network. The deep learning algorithm showed excellent diagnostic performance in the differentiation of LVH, which was significantly greater than that based on expert cardiologists’ interpretations of the echocardiogram. These findings suggest that deep learning-assisted interpretation of the echocardiogram can improve the accuracy of the differential diagnosis of LVH, and improve the overall diagnostic process.

Etiologies of LVH and challenges for differential diagnosis

LVH is often a physiologic adaptation to an increased afterload, with a prevalence reaching 10% to 15% in the echocardiography laboratory³¹. However, the etiology of LVH is not limited to hypertension, but includes a wide range of disease conditions. According to previous echocardiographic studies, the common causes of LVH other than HHD include HCM and CA⁹. HCM is a genetic disease with an approximate prevalence of 1:200–1:500, and the patients with HCM show significant LVH due to myocardial fiber disorganization/disarray^19,20. Light-chain CA is a hematologic malignant disease, in which abnormally increased amyloid protein production leads to a profound infiltration of amyloid protein in the myocardium, resulting in significant LVH³².

The differential diagnosis between these conditions is important because of differences in the treatment and prognosis. While the management of HHD mainly focuses on blood pressure control, the management of HCM and ALCA is much more complex and multifactorial. In patients with HCM, the treatment strategy includes sudden cardiac death risk assessment; primary or secondary prevention of sudden death; management of combined arrhythmia, heart failure, or LV outflow tract obstruction; and family counseling/screening^19,20. The management of ALCA includes cytotoxic chemotherapy and stem cell transplantation, along with the management of cardiovascular complications such as arrhythmia and heart failure²¹. Furthermore, the overall life expectancy of patients with HCM is comparable to that of the general population, but 30–40% of patients will experience adverse events¹⁹. In contrast, patients with light-chain ALCA have a very poor prognosis, with a median survival from the initial diagnosis of only 24 months^22,32.

Although the underlying LVH pathophysiology differs between HHD (increased afterload), HCM (sarcomere mutation and myofibril disarray/disorganization), and ALCA (amyloid protein infiltration), the differential diagnosis is often difficult on echocardiography. This is because of morphologic similarities on echocardiography, and the high prevalence of hypertension in patients with HCM or ALCA^6,7,15. The differential diagnosis of HCM is especially problematic when patients show diffuse or mixed-type HCM. A comprehensive echocardiography examination can improve the diagnostic accuracy in the detection of ALCA, which paradoxically suggests that the visual assessment has limited use in the differential diagnosis^8,33,34. The difficulties in the differential diagnosis on echocardiography leads to the subsequent use of numerous noninvasive and invasive tests, such as CMR, EMB, and genetic testing. However, despite the limited diagnostic accuracy in many clinical situations, these tests often require additional cost, time, and invasiveness^{10,11,12,13,14}. Thus, improvements in the differential diagnosis of LVH etiologies by echocardiography can facilitate the efficient diagnostic process, and further lead to a timely application of disease-specific treatment.

Relevance of an artificial intelligence-supported differential diagnosis

Our deep learning algorithm showed excellent diagnostic accuracy for the differential diagnosis of LVH using 5 standard echocardiographic views. It might be argued that the differences in the LV wall thickness might be the determinant of the differential diagnosis, given that the patients with ALCA may have less LVH than HHD or HCM. However, in the present study, the inclusion criterion of LVWTmax was > 12 mm for both HHD and ALCA, and the mean LV wall thickness did not differ between the two groups. Due to innate characteristics of the deep learning process, as well as relatively small study population, we cannot provide detailed reasons for this improvement or delicate sensitivity analyses; however, the class activation mapping results provided clues. For the diagnosis of HHD, the class activation map highlighted regions at the ascending aorta on PLAX views, RV insertion site on PSAX views, and RV apex and LV inferior/inferolateral wall on apical views (A4C, A2C and A3C) (Fig. 3 and Supplementary Table S6). Patients with HHD show concentric or eccentric LVH, but specific echocardiographic findings differentiating HHD from other causes of LVH are largely unknown. However, a previous CMR study reported that patients with HHD demonstrate LGE at RV insertion points, and limited aortic distensibility, which might have been utilized in our deep learning algorithm³⁵. The diagnosis of HCM was mainly based on highlighted regions at the basal septum and basal inferolateral wall on PLAX views, and the LV septum and inferior wall on apical views, all of which are typically hypertrophied in patients with HCM^20,36. For the diagnosis of ALCA, highlighted regions typically included the anterior mitral valve leaflet, left atrial wall, and LV basal inferior/inferolateral segments with the adjacent pericardial space. Patients with ALCA often demonstrate thickened valve leaflets and atrial wall due to amyloid protein infiltration, and a small amount of pericardial effusion³⁷. Although not sufficiently pathognomonic to exclude other possible differential diagnoses, these highlighted regions show typical features for the clinical suspicion and determination of LVH etiologies on echocardiography.

Furthermore, it can be assumed that different myocardial textures and motions were also utilized in the deep learning algorithm, as suggested in a recent study by Fei Yu et al.³⁸. In particular, the microscopic features of LVH etiologies significantly differ, due to different underlying pathophysiology. Patients with HHD have hypertrophied cardiomyocytes with diffuse myocardial fibrosis, whereas patients with HCM typically have disorganized myocardial fibers with marked fibrosis, and those with ALCA have infiltrated amyloid proteins. These pathophysiologic differences are also utilized in the visual assessment of echocardiographic images (e.g. increased echogenicity in HCM, and a granular sparkling appearance in ALCA). However, visual interpretation of these morphologic features is subjective to the observer’s discretion, and thus, is not specific. In the current class activation mapping results, a thickened LV myocardium was highlighted in most echocardiographic images, suggesting that the myocardial texture was utilized as an important indicator in the differential diagnosis.

In the present study, it was noted that the PPV values of the CNN-LSTM algorithms for each standard echocardiographic view were low, ranging from 30 to 70% (Supplementary Table S2). Thus, we applied the aggregate network in order to concatenate the results obtained from the CNN-LSTM models of 5 echocardiographic views. The concatenated outputs from the aggregate network significantly improved the overall diagnostic performance, as well as the PPV values. The use of aggregate network resembles in part the clinical decision by human experts, in which a full series of echocardiographic images are integrated. Given that the highlighted regions on class activation mapping differed between the 5 echocardiographic views, it can be inferred that the aggregate network could improve diagnostic accuracy through integration of features from the 5 different views. In addition, the diagnostic accuracy of our deep learning algorithm was significantly higher than that for the echocardiography specialists. In real-world practice, the overall diagnostic process for unexplained LVH is guided by the decisions of echocardiography specialists. Thus, the higher diagnostic accuracy, especially the excellent NPV and specificity, of our deep learning algorithm can contribute to a more efficient process, reducing the time and effort required for a final diagnosis of the LVH etiology. Although a deep learning algorithm-assisted diagnosis on echocardiography cannot yet replace the current confirmative diagnostic tools, this approach can help attending physicians go straight to confirmative testing, avoiding inconclusive results and uncertain debates regarding the diagnosis.

Machine-learning approaches for differential diagnosis of LVH etiologies

The application of deep learning in echocardiography has been considered as challenging, because of the various view orientations and inter-view differences as well as the variability within a single view³⁹. However, several landmark studies demonstrated accurate view classification with segmentation, cardiac structure identification, and cardiac phase detection, all of which enabled the accurate automated measurement of cardiac structures and functional parameters^2,3,40,41,42. These can contribute to the accurate measurement of echocardiographic parameters while reducing human errors. On top of these, the deep learning algorithms demonstrated promising results in the detection of certain echocardiographic features, such as the presence of LVH or regional wall motion abnormalities^39,43, and furthermore, differential diagnosis on echocardiographic images to aid clinical decision-making, which was previously believed to require complex and sophisticated clinical reasoning by specialists. In particular, several studies focused on the differential diagnosis of LVH and demonstrated meaningful results.

A study by Xiang Yu et al. also developed a deep learning algorithm for detection of LVH and its differential diagnosis of HHD, HCM, and ALCA⁴⁴, but we found that the methodology is different compared to our study. The study by Xiang Yu et al. obtained 2 still images from PLAX and A4C views of each patient, utilized the ResNet and U-net ++ for the algorithm development, and performed manual delineation of LV myocardium as the ground truth. In contrast, we obtained 5 standard echocardiogram videos (PLAX, PSAX, A4C, A2C, and A3C) and extracted 12 images from 1 cardiac cycle, in order to reflect the motion of cardiac structures. Our deep learning algorithm did not require the manual delineation of cardiac structures, but provided excellent diagnostic accuracy and demonstrated that relevant echocardiographic features were utilized for the decision, as shown in the class activation map. In addition, we tried our best effort to improve the diagnostic accuracy of our deep learning algorithm, avoiding the use of images from repetitive echocardiograms from a same patient, which is another difference compared to the study by Xiang Yu et al.⁴⁴. Furthermore, we confirmed that each step of the algorithm development, such as the application of LSTM network and the use of aggregate network, improved the diagnostic accuracy. Indeed, the combined CNN-LSTM algorithm was adopted to appropriately reflect the myocardial texture, along with myocardial systolic and diastolic motions. The LSTM algorithm is a novel and efficient type of recurrent neural network, and has strengths in time series prediction, such as in movie frames. Because the myocardial systolic and diastolic motions can differ between HHD, HCM, and ALCA, these features might have been utilized in our deep learning algorithm.

More recently, Duffy et al. developed a deep learning workflow for measurement of LV geometry and diagnosis of LVH etiologies, using a large-scale cohort of 23,745 patients³⁰. In that landmark study, a deep learning model for measurement of LV dimensions and wall thickness was developed using PLAX videos, and a video-based CNN model for identification of the etiology was developed using A4C videos. One of the important differences of the study by Duffy et al. compared to our study is the use of 3D-CNN with spatiotemporal convolutions. In contrast, we designed 2D-based CNN for 12 images extracted from 1 cardiac cycle in order to extract echocardiographic features for differential diagnosis. Then, an LSTM layer was applied to the 12 CNNs, to reflect the temporal and special changes of the heart during the cardiac cycle. Given the relevance of both methods (2D-CNN-LSTM and 3D-CNN) for acquisition of spatiotemporal data, we compared the diagnostic performance of these methods using our dataset (Supplementary Table S5), and found that the AUCs were not different. These findings infer that, echocardiographic features including geometry, myocardial texture, and cardiac systolic/diastolic motions, can be reflected in both 2D-CNN-LSTM algorithm and 3D-CNN algorithm. Another important difference is the echocardiographic views used in the study. The deep learning algorithm developed by Xiang Yu et al. utilized 2 still images of echocardiogram (PLAX and A4C)⁴⁴, and the algorithm by Duffy et al. utilized only A4C videos for the differential diagnosis of LVH³⁰. In contrast, we utilized 5 standard echocardiographic views (PLAX, PSAX, A4C, A2C and A3C) for the development of CNN-LSTM algorithms, which were concatenated to provide a single most-likely diagnosis. Although it can be assumed that the integration of various aspects of cardiac structure and function may improve the diagnostic accuracy, future studies on the direct comparison of these algorithms are required. Furthermore, given the potential benefits of a deep learning-assisted differential diagnosis, prospective studies or clinical trials are warranted to assess whether its use can reduce the time, costs, and number of tests deemed as necessary, compared to that for echocardiography specialists alone.

Limitations

The present study has several limitations. First, we did not include rare LVH etiologies, such as Fabry disease, MELAS, Danon syndrome, PRKAG2 cardiomyopathy, and transthyretin amyloidosis. The exclusion of these rare diseases was inevitable to ensure a sufficient number of patients for each LVH etiology. However, future multi-center studies are warranted to include the rare LVH etiologies in the deep learning algorithm. Second, we excluded patients with valvular heart disease or chronic kidney disease, as there is a possibility that these conditions overlap with the LVH etiologies included in the present study. Nonetheless, the overlap of these conditions cannot be strictly classified into a specific label, and it is impossible to clearly distinguish the proportion of each causative factor of LVH. Third, we excluded patients with other overt echocardiographic abnormalities, such as regional wall motion abnormalities or significant LV dysfunction. As the presence of these pathologic conditions indicate a poor prognosis in patients with LVH, future studies are warranted to develop a comprehensive deep learning algorithm that includes a wide range of complex cardiac conditions. Finally, our deep learning algorithm was developed using echocardiographic images from 2 tertiary hospitals in South Korea, but was not validated in external datasets from other ethnicities. For further validation, as well as for facilitation of deep learning approaches in cardiovascular imaging, the full code for our algorithm was released (https://github.com/djchoi1742/Echo_LVH).

Conclusion

We developed a deep learning algorithm for the differential diagnosis of common LVH etiologies (HHD, HCM, and ALCA) by applying a hybrid CNN-LSTM model and aggregate network to standard echocardiographic images. The high diagnostic performance of our deep learning algorithm suggests that the use of deep learning can improve the diagnostic process in patients with LVH.

Data availability

The datasets of echocardiographic images generated during and/or analysed during the current study are available from the corresponding author on reasonable request. The full code for our algorithm was released (https://github.com/djchoi1742/Echo_LVH).

References

Narula, S., Shameer, K., Salem Omar, A. M., Dudley, J. T. & Sengupta, P. P. Machine-Learning Algorithms to Automate Morphological and Functional Assessments in 2D Echocardiography. J Am Coll Cardiol. 68(21), 2287–95 (2016).
Article Google Scholar
Zhang, J. et al. Fully automated echocardiogram interpretation in clinical practice. Circulation 138(16), 1623–1635 (2018).
Article Google Scholar
Lara Hernandez, K. A., Rienmuller, T., Baumgartner, D. & Baumgartner, C. Deep learning in spatiotemporal cardiac imaging: A review of methodologies and clinical usability. Comput. Biol. Med. 130, 104200 (2021).
Article Google Scholar
Yilmaz, A. & Sechtem, U. Diagnostic approach and differential diagnosis in patients with hypertrophied left ventricles. Heart 100(8), 662–671 (2014).
Article Google Scholar
Drazner, M. H. The progression of hypertensive heart disease. Circulation 123(3), 327–334 (2011).
Article Google Scholar
Doi, Y. L. et al. Echocardiographic differentiation of hypertensive heart disease and hypertrophic cardiomyopathy. Br. Heart J. 44(4), 395–400 (1980).
Article CAS Google Scholar
Sun, J. P. et al. Differentiation of hypertrophic cardiomyopathy and cardiac amyloidosis from other causes of ventricular wall thickening by two-dimensional strain imaging echocardiography. Am. J. Cardiol. 103(3), 411–415 (2009).
Article Google Scholar
Liu, D. et al. Effect of combined systolic and diastolic functional parameter assessment for differentiation of cardiac amyloidosis from other causes of concentric left ventricular hypertrophy. Circ. Cardiovasc. Imaging 6(6), 1066–1072 (2013).
Article Google Scholar
Weidemann, F., Niemann, M., Ertl, G. & Stork, S. The different faces of echocardiographic left ventricular hypertrophy: Clues to the etiology. J. Am. Soc. Echocardiogr. 23(8), 793–801 (2010).
Article Google Scholar
Grajewski, K. G., Stojanovska, J., Ibrahim, E. H., Sayyouh, M. & Attili, A. Left ventricular hypertrophy: Evaluation With cardiac MRI. Curr. Probl. Diagn. Radiol. 49(6), 460–475 (2020).
Article Google Scholar
Nordin, S., Dancy, L., Moon, J. C. & Sado, D. M. Clinical applications of multiparametric CMR in left ventricular hypertrophy. Int. J. Cardiovasc. Imaging 34(4), 577–585 (2018).
Article Google Scholar
Yoshizawa, S., Uto, K., Nishikawa, T., Hagiwara, N. & Oda, H. Histological features of endomyocardial biopsies in patients undergoing hemodialysis: Comparison with dilated cardiomyopathy and hypertensive heart disease. Cardiovasc. Pathol. 49, 107256 (2020).
Article CAS Google Scholar
Chimenti, C. & Frustaci, A. Contribution and risks of left ventricular endomyocardial biopsy in patients with cardiomyopathies: A retrospective study over a 28-year period. Circulation 128(14), 1531–1541 (2013).
Article Google Scholar
Ho, C. Y. et al. Genotype and lifetime burden of disease in hypertrophic cardiomyopathy: Insights from the Sarcomeric Human Cardiomyopathy Registry (SHaRe). Circulation 138(14), 1387–1398 (2018).
Article Google Scholar
Magnusson, P., Palm, A., Branden, E. & Morner, S. Misclassification of hypertrophic cardiomyopathy: Validation of diagnostic codes. Clin. Epidemiol. 9, 403–410 (2017).
Article Google Scholar
Sengupta, P. P. et al. Proposed requirements for cardiovascular imaging-related machine learning evaluation (PRIME): A checklist: Reviewed by the American College of Cardiology Healthcare Innovation Council. JACC Cardiovasc. Imaging 13(9), 2017–2035 (2020).
Article Google Scholar
Lang, R. M. et al. Recommendations for cardiac chamber quantification by echocardiography in adults: An update from the American Society of Echocardiography and the European Association of Cardiovascular Imaging. J. Am. Soc. Echocardiogr. 28(1), 1-39 e14 (2015).
Article Google Scholar
Marwick, T. H. et al. Recommendations on the use of echocardiography in adult hypertension: A report from the European Association of Cardiovascular Imaging (EACVI) and the American Society of Echocardiography (ASE). J. Am. Soc. Echocardiogr. 28(7), 727–754 (2015).
Article Google Scholar
Ommen, S. R. et al. 2020 AHA/ACC guideline for the diagnosis and treatment of patients with hypertrophic cardiomyopathy: A report of the American College of Cardiology/American Heart Association Joint Committee on Clinical Practice Guidelines. Circulation 142(25), e558–e631 (2020).
Google Scholar
Elliott, P. M. et al. 2014 ESC guidelines on diagnosis and management of hypertrophic cardiomyopathy: The task force for the diagnosis and management of hypertrophic cardiomyopathy of the European Society of Cardiology (ESC). Eur. Heart J. 35(39), 2733–79 (2014).
Article Google Scholar
Gertz, M. A. et al. Definition of organ involvement and treatment response in immunoglobulin light chain amyloidosis (AL): A consensus opinion from the 10th International Symposium on Amyloid and Amyloidosis, Tours, France, 18–22 April 2004. Am. J. Hematol. 79(4), 319–328 (2005).
Article Google Scholar
Hwang, I. C. et al. Time trajectory of cardiac function and its relation with survival in patients with light-chain cardiac amyloidosis. Eur. Heart J. Cardiovasc. Imaging 22(4), 459–469 (2021).
Article Google Scholar
Krizhevsky, A. & Sutskever, I. & Hinton G. E. Imagenet classification with deep convolutional neural networks. Commun. ACM 60(6), 84–90 (2017).
Article Google Scholar
Shi, X. et al. Convolutional LSTM network: A machine learning approach for precipitation nowcasting. Adv. Neural Inf. Process. Syst. 1, 802–810 (2015).
Google Scholar
Phan, H., Andreotti, F., Cooray, N., Chen, O. Y. & De Vos, M. SeqSleepNet: End-to-end hierarchical recurrent neural network for sequence-to-sequence automatic sleep staging. IEEE Trans. Neural Syst. Rehabil. Eng. 27(3), 400–410 (2019).
Article Google Scholar
He, K., Zhang, X., Ren, S., Sun, J. Delving deep into rectifiers: Surpassing human-level performance on imagenet classification. Proc. of the IEEE International Conference on Computer Vision (2015).
Zhou, B., Khosla, A., Lapedriza, A., Oliva, A., Torralba, A. Learning deep features for discriminative localization. Proc. of the IEEE Conference on Computer Vision and Pattern Recognition (2016).
Youden, W. J. J. C. Index for rating diagnostic tests. Cancer 3(1), 32–35 (1950).
Article CAS Google Scholar
McHugh, M. L. Interrater reliability: The kappa statistic. Biochem. Med. 22(3), 276–82 (2012).
Article MathSciNet Google Scholar
Duffy, G. et al. High-throughput precision phenotyping of left ventricular hypertrophy with cardiovascular deep learning. JAMA Cardiol. 7(4), 386–395 (2022).
Article Google Scholar
Schirmer, H., Lunde, P. & Rasmussen, K. Prevalence of left ventricular hypertrophy in a general population; The Tromso Study. Eur. Heart J. 20(6), 429–438 (1999).
Article CAS Google Scholar
Garcia-Pavia, P. et al. Diagnosis and treatment of cardiac amyloidosis: A position statement of the ESC Working Group on Myocardial and Pericardial Diseases. Eur. Heart J. 42(16), 1554–1568 (2021).
Article Google Scholar
Boldrini, M. et al. Multiparametric echocardiography scores for the diagnosis of cardiac amyloidosis. JACC Cardiovasc. Imaging 13(4), 909–920 (2020).
Article Google Scholar
Baccouche, H. et al. Differentiating cardiac amyloidosis and hypertrophic cardiomyopathy by use of three-dimensional speckle tracking echocardiography. Echocardiography 29(6), 668–677 (2012).
Article Google Scholar
Rodrigues, J. C. et al. Prevalence and predictors of asymmetric hypertensive heart disease: Insights from cardiac and aortic function with cardiovascular magnetic resonance. Eur. Heart J. Cardiovasc. Imaging 17(12), 1405–1413 (2016).
Article Google Scholar
Klues, H. G., Schiffers, A. & Maron, B. J. Phenotypic spectrum and patterns of left ventricular hypertrophy in hypertrophic cardiomyopathy: Morphologic observations and significance as assessed by two-dimensional echocardiography in 600 patients. J. Am. Coll. Cardiol. 26(7), 1699–1708 (1995).
Article CAS Google Scholar
Selvanayagam, J. B., Hawkins, P. N., Paul, B., Myerson, S. G. & Neubauer, S. Evaluation and management of the cardiac amyloidosis. J. Am. Coll. Cardiol. 50(22), 2101–2110 (2007).
Article CAS Google Scholar
Yu, F. et al. Artificial intelligence-based myocardial texture analysis in etiological differentiation of left ventricular hypertrophy. Ann. Transl. Med. 9(2), 108 (2021).
Article CAS Google Scholar
Madani, A., Ong, J. R., Tibrewal, A. & Mofrad, M. R. K. Deep echocardiography: Data-efficient supervised and semi-supervised deep learning towards automated diagnosis of cardiac disease. NPJ Digit. Med. 1, 59 (2018).
Article Google Scholar
Ouyang, D. et al. Video-based AI for beat-to-beat assessment of cardiac function. Nature 580(7802), 252–256 (2020).
Article ADS CAS Google Scholar
Ghorbani, A. et al. Deep learning interpretation of echocardiograms. NPJ Digit. Med. 3, 10 (2020).
Article Google Scholar
Madani, A., Arnaout, R., Mofrad, M. & Arnaout, R. Fast and accurate view classification of echocardiograms using deep learning. NPJ Digit. Med. 1, 1–8 (2018).
Article Google Scholar
Kusunose, K. et al. A deep learning approach for assessment of regional wall motion abnormality from echocardiographic images. JACC Cardiovasc. Imaging 13(2 Pt 1), 374–381 (2020).
Article Google Scholar
Yu, X. et al. Using deep learning method to identify left ventricular hypertrophy on echocardiography. Int. J. Cardiovasc. Imaging 38, 759–769 (2021).
Article Google Scholar

Download references

Acknowledgements

The authors sincerely thank Ki-Ryong Jung, a Registered Diagnostic Cardiac Sonographer (RDCS), for his contribution in the data collection.

Funding

This work was supported by the Center for Artificial intelligence in Healthcare of Seoul national University Bundang Hospital (SNUBH) and a grant from the SNUBH Research Fund (No. 18–2020-0013). The funder had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Author information

These authors contributed equally: In-Chang Hwang and Dongjun Choi.

Authors and Affiliations

Cardiovascular Center, Seoul National University Bundang Hospital, 82 Gumi-Ro-173-Gil, Bundang, Seongnam, Gyeonggi, 13620, South Korea
In-Chang Hwang, Lia Ju, Yeonyee E. Yoon & Goo-Yeong Cho
Department of Internal Medicine, Seoul National University College of Medicine, Seoul, South Korea
In-Chang Hwang, Hyun-Jung Lee, Yeonyee E. Yoon, Jun-Bean Park, Seung-Pyo Lee, Hyung-Kwan Kim, Yong-Jin Kim & Goo-Yeong Cho
Center for Artificial Intelligence in Healthcare, Seoul National University Bundang Hospital, Songnam, Gyeonggi, South Korea
Dongjun Choi, Myeongju Kim & Ji-Eun Hong
Division of Cardiology, Cardiovascular Center, Korea University Guro Hospital, Seoul, South Korea
You-Jung Choi
Cardiovascular Center and Department of Internal Medicine, Seoul National University Hospital, Seoul, South Korea
Hyun-Jung Lee, Jun-Bean Park, Seung-Pyo Lee, Hyung-Kwan Kim & Yong-Jin Kim

Authors

In-Chang Hwang
View author publications
You can also search for this author in PubMed Google Scholar
Dongjun Choi
View author publications
You can also search for this author in PubMed Google Scholar
You-Jung Choi
View author publications
You can also search for this author in PubMed Google Scholar
Lia Ju
View author publications
You can also search for this author in PubMed Google Scholar
Myeongju Kim
View author publications
You can also search for this author in PubMed Google Scholar
Ji-Eun Hong
View author publications
You can also search for this author in PubMed Google Scholar
Hyun-Jung Lee
View author publications
You can also search for this author in PubMed Google Scholar
Yeonyee E. Yoon
View author publications
You can also search for this author in PubMed Google Scholar
Jun-Bean Park
View author publications
You can also search for this author in PubMed Google Scholar
Seung-Pyo Lee
View author publications
You can also search for this author in PubMed Google Scholar
Hyung-Kwan Kim
View author publications
You can also search for this author in PubMed Google Scholar
Yong-Jin Kim
View author publications
You can also search for this author in PubMed Google Scholar
Goo-Yeong Cho
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Conception of research idea and study design: I.C.H., D.C., L.J.; data acquisition: I.C.H., D.C., Y.J.C., L.J.; data analysis and interpretation: I.C.H., D.C., Y.J.C., L.J., M.K., J.E.H., H.J.L., Y.E.Y., J.B.P., S.P.L., H.K.K., Y.J.K., G.Y.C.; statistical analysis: I.C.H., D.C.; supervision and mentorship: S.P.L., H.K.K., Y.J.K., G.Y.C. All authors revised the manuscript critically for important intellectual content, and read and approved the final manuscript version.

Corresponding author

Correspondence to In-Chang Hwang.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Hwang, IC., Choi, D., Choi, YJ. et al. Differential diagnosis of common etiologies of left ventricular hypertrophy using a hybrid CNN-LSTM model. Sci Rep 12, 20998 (2022). https://doi.org/10.1038/s41598-022-25467-w

Download citation

Received: 28 December 2021
Accepted: 30 November 2022
Published: 05 December 2022
DOI: https://doi.org/10.1038/s41598-022-25467-w
Springer Nature Limited

This article is cited by

Dual spin max pooling convolutional neural network for solar cell crack detection
- Sharmarke Hassan
- Mahmoud Dhimish
Scientific Reports (2023)

Differential diagnosis of common etiologies of left ventricular hypertrophy using a hybrid CNN-LSTM model

Abstract

Similar content being viewed by others

Using deep learning method to identify left ventricular hypertrophy on echocardiography

Multi-channel deep learning model-based myocardial spatial–temporal morphology feature on cardiac MRI cine images diagnoses the cause of LVH

Echocardiography-based machine learning algorithm for distinguishing ischemic cardiomyopathy from dilated cardiomyopathy

Introduction

Methods

Study design and cohort

HHD

HCM

ALCA

Normal subjects

Exclusion criteria

Echocardiography

Image processing for the deep learning algorithm

Deep learning model development

Study outcomes

Statistical analysis

Results

Baseline characteristics

Diagnostic accuracy

Echocardiographic features used in the differential diagnosis

Comparison with expert interpretation

Discussion

Etiologies of LVH and challenges for differential diagnosis

Relevance of an artificial intelligence-supported differential diagnosis

Machine-learning approaches for differential diagnosis of LVH etiologies

Limitations

Conclusion

Data availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Supplementary Information.

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Dual spin max pooling convolutional neural network for solar cell crack detection

Search

Navigation