A novel multi-task machine learning classifier for rare disease patterning using cardiac strain imaging data

Siva, Nanda K.; Singh, Yashbir; Hathaway, Quincy A.; Sengupta, Partho P.; Yanamala, Naveena

doi:10.1038/s41598-024-61201-4

A novel multi-task machine learning classifier for rare disease patterning using cardiac strain imaging data

Article
Open access
Published: 09 May 2024

Volume 14, article number 10672, (2024)
Cite this article

Download PDF

You have full access to this open access article

Scientific Reports

A novel multi-task machine learning classifier for rare disease patterning using cardiac strain imaging data

Download PDF

Nanda K. Siva^1,2,
Yashbir Singh^2,3,
Quincy A. Hathaway^1,2,
Partho P. Sengupta⁴ &
…
Naveena Yanamala^4,5

634 Accesses
4 Altmetric
Explore all metrics

Abstract

To provide accurate predictions, current machine learning-based solutions require large, manually labeled training datasets. We implement persistent homology (PH), a topological tool for studying the pattern of data, to analyze echocardiography-based strain data and differentiate between rare diseases like constrictive pericarditis (CP) and restrictive cardiomyopathy (RCM). Patient population (retrospectively registered) included those presenting with heart failure due to CP (n = 51), RCM (n = 47), and patients without heart failure symptoms (n = 53). Longitudinal, radial, and circumferential strains/strain rates for left ventricular segments were processed into topological feature vectors using Machine learning PH workflow. In differentiating CP and RCM, the PH workflow model had a ROC AUC of 0.94 (Sensitivity = 92%, Specificity = 81%), compared with the GLS model AUC of 0.69 (Sensitivity = 65%, Specificity = 66%). In differentiating between all three conditions, the PH workflow model had an AUC of 0.83 (Sensitivity = 68%, Specificity = 84%), compared with the GLS model AUC of 0.68 (Sensitivity = 52% and Specificity = 76%). By employing persistent homology to differentiate the “pattern” of cardiac deformations, our machine-learning approach provides reasonable accuracy when evaluating small datasets and aids in understanding and visualizing patterns of cardiac imaging data in clinically challenging disease states.

Machine Learning Outcome Prediction in Dilated Cardiomyopathy Using Regional Left Ventricular Multiparametric Strain

Article 01 October 2020

Diagnostic signature for heart failure with preserved ejection fraction (HFpEF): a machine learning approach using multi-modality electronic health record data

Article Open access 26 December 2022

Current Challenges and Recent Updates in Artificial Intelligence and Echocardiography

Article 21 January 2020

Find the latest articles, discoveries, and news in related topics.

Introduction

Effectively interpreting the vast amount of medical data generated daily in healthcare is paramount to improving clinical decision-making. Machine learning has been employed in the healthcare field to discover meaningful trends. For cardiac imaging, advances in artificial intelligence have improved both the speed and accuracy of image interpretation as well as have facilitated detection of subtle changes in cardiac structure and function. Many studies have demonstrated the value of artificial intelligence in cardiac imaging, deep learning to classify left ventricular hypertrophy in echocardiography images¹, structured random forests to trace myocardium borders in 3D echocardiograph volumes², and convolution neural network model to develop high resolution images from 2D magnetic resonance image stacks³.

However, the large number of attributes present in many datasets must be reduced to prevent overwhelming the machine learning algorithms. For example, data derived from cardiac deformation analyses capture left ventricular wall motion in both 2- and 3-dimensional planes, generating a significant amount of data for machine learning applications⁴. Although we have a vast amount of data, today’s clinical evaluations usually depend on just one data point. The methods we use to gain insights from all the data haven’t shown to be more helpful in guiding patient care⁵.

Dimensionality reduction techniques, such as Principal Component Analysis and Linear Discriminant Analysis, are commonly applied techniques to better aggregate large feature sets. Another dimensionality reduction technique growing in prominence is Topological Data Analysis (TDA), which extracts data features based on local geometry and global topology encoded in the distribution of data points.

Many studies have incorporated general TDA principles in identifying patterns in nature, e.g., zebrafish pattern variability⁶, stability of protein folding⁷, and outcomes in preclinical traumatic brain injury and spinal cord injury⁸. Other groups have specifically utilized Persistent Homology from the TDA toolbox to understand human anatomy and physiology, e.g., to study brain artery branching and looping⁹, gait signals in patients with neurodegenerative diseases¹⁰, and MRI liver images in patients with Primary Sclerosing Cholangitis⁵. In this study, we propose a workflow for applying persistent homology to study high-dimensional data where the ratio of sample to features is very low.

The purpose of this study is to determine if topological data analysis can help in identifying patterns in echocardiography images to improve diagnostic accuracy for rare cardiac conditions. We provide a use-case scenario of our workflow that harnesses both deformation patterns and global topological information of the left ventricle to characterize cardiovascular diseases (Figs. 1, 2, 3).

Results

Average strain pattern motifs

The average strain patterns motifs for each cardiac condition are shown (Fig. 4). From visual inspection, general trends can be identified, e.g., RCM patients generally have higher intensity values limited to lower persistence pixels while having lower intensity values at higher persistence pixels. This indicates a restrictive pattern within these patients as it suggests that a fully connected component in the H0 dimension is formed at a lower scale parameter. Alternatively, for both CP and normal groups, the average motifs showed a much wider spread in their strain persistence pixel intensities, indicating more spacing in these patients’ data points than the general constraint seen in RCM.

Feature selection

While the number of features was considerably reduced from 14,700 to 900 (18 by 50), this data still exhibited a high feature to sample ratio. The selected features through Boruta feature selection were used to develop predictive models for distinguishing between CP, RCM, and normal patients.

Machine learning classifiers

To determine if the features extracted through our pipeline helped distinguish the cardiac conditions, we developed three binary class classifiers for CP vs. RCM, CP vs. normal, and RCM vs. normal. The combined dataset from Amaki et al.¹¹ and Sengupta et al.¹² was evaluated using tenfold cross-validation. Finally, we compared the performance of these models with a baseline performance achieved by logistic regression models using average peak longitudinal strain from the 4Ch view. This peak value approximates the global longitudinal strain that clinicians typically extract from cardiac strain imaging data (Fig. 5).

Our CP vs RCM logistic regression classifier showed a statistically significant improvement compared to the GLS model (PH AUC = 0.94; GLS AUC = 0.69; p = 1.4 × 10^–6). Our CP vs normal logistic regression classifier demonstrated improvement as well (PH AUC = 0.83; GLS AUC = 0.66; p = 0.019). Our RCM vs normal random forest classifier showed a statistically significant improvement compared to the GLS model (PH AUC = 0.91; GLS AUC = 0.82; p = 0.028). We created a multi-class random forest classifier to discriminate between all conditions; the average across all classes AUC, sensitivity (Sn), and specificity (Sp) were improved in comparison to the baseline model. Our PH model achieved AUC = 0.83 (Sn = 68% and Sp = 84%) whereas GLS model achieved AUC = 0.68 (Sn = 52% and Sp = 76%).

Interpretable artificial intelligence

We show the interpretable artificial intelligence results as Shapley additive explanation plots indicating the top ten features integral in distinguishing each class from the others (Fig. 6). Thus, a combination of feature trends is responsible for the model to output a particular prediction. Moreover, these results allow better comprehension of the average strain motifs produced (Fig. 4). To understand these patterns, we can refer to the original phase space reconstruction point clouds for septal longitudinal strain; for convenience, a few example patients from each disease group are depicted in Supplementary Figs. 1 and 2.

Discussion

Compared with traditional echocardiography approaches, our methodology uses subclinical features identified using topological data extraction of segmental strain analysis to more clearly delineate clinically similar cardiac phenogroups, such as restrictive cardiomyopathy (RCM) and constrictive pericarditis (CP). Global and local structural deformations of the cardiac myocardium were captured with persistent homology, enabling us to predict the presence of CP (AUC: 0.83) and RCM (AUC: 0.91) from normal patients. Additionally, when directly differentiating between the two types of heart failure, CP and RCM (AUC: 0.94), our model demonstrated a comparable combination of sensitivity (92%) and specificity (81%) compared to the sensitivity (87%) and specificity (91%) shown in a study by Welch et al. at the Mayo Clinic investigating the conventional evaluation of constrictive pericarditis from restrictive myocardial disease or severe tricuspid regurgitation based on five principal echocardiographic features, including respiration related ventricular septal shift, maintained or greater medial mitral annular e′ velocity, and expiratory diastolic reversal ratio of the hepatic vein¹³. To provide context of our workflow’s comparison to another non-invasive method assessing clinically similar cardiac phenogroups, in a study by Masui et al. cardiac MRI was used to differentiate CP from RCM, with a sensitivity of 88% and specificity of 100%; our model performed worse in terms of specificity but better in terms of sensitivity¹⁴. While our TDA/echocardiography model performs similarly, it is important to note that ultrasound-based techniques offer a wider range of accessibility to patients than either CT or MRI.

In this small cohort, we highlight the ability of our TDA model to make accurate predictions of rare disease presentations as a use-case example for future cardiovascular applications. In diseases with low prevalence, this presents an obvious advantage to traditional approaches that may require a specific threshold of cases before allowing appropriate stratification. Focal involvement, either decompensation or compensation, of the myocardium will be captured in our current workflow in the appropriate wall region which would be useful in both common and rare cardiac conditions, such as myocardial infarction and apical hypertrophic cardiomyopathy, respectively. However, in order to gain more granularity to exactly which portion the deviation originates from, our workflow could subgroup the left ventricular wall into the American Heart Association 18–19 segments¹⁵, which would also require expanding our patient specific motif.

Previous investigations have endeavored to apply segmental strain analysis to predict structural and/or functional outcomes in the heart. Tabassian et al. used principal component analysis to represent the complex spatio-temporal nature of stress–strain curves and utilized machine learning to classify patients with myocardial infarctions¹⁶. Senapati et al. proposed a relative regional strain ratio (a metric of relative longitudinal strain sparing in the apex) to provide prognostic information in cardiac amyloidosis patients¹⁷. While these applications provide meaningful interpretation of segmental strain data, their translation to diverse cardiac phenotypes is likely limited by the lack of integration of the ultrastructural component of the left ventricle (LV). Another limitation of segmental strain data is high variability in measurements between vendors¹⁸; we attempt to reduce the effects of this issue in our workflow through standardizing patients’ cardiac cycles with spline interpolation and by aggregating individual segments into larger functional wall regions. The EACVI-ASE Strain Standardization Task Force recommends utilizing segmental strain pattern analysis rather than single segmental strain values¹⁹. We believe application of our protocol inherently accomplishes this as persistent homology is a topological data analysis tool that describes the shape of data by extracting its topological invariants²⁰.

TDA has been increasingly applied to areas of biomedical research^10,21,22 but only recently been evaluated in cardiovascular medicine. Specifically, diagnostic tests such as the ECG have provided the first applications of TDA in converting simple waveforms to numeric data^23,24. More recently, TDA has been proposed as a method for the assessment of vascular diseases²⁵ and has even provided improved predictive capacity for detecting acute coronary syndrome or revascularization in patients with coronary plaques than through the use of more commonly used clinical markers, such as risk factors, stenosis, and high-risk plaque features²⁶.

Our work can be translated into clinical applications such as a medical decision support system for physicians, AI virtual assistant for patients, or an automated image analysis software. To facilitate accessibility for physicians and scientists with varying levels of expertise/understanding, we plan to provide the visual motifs with annotations and labels; providing comparison images of disease states and normal motifs will also enhance comprehension for a broader audience.

The limitation inherent to many studies investigating rare diseases is the relatively small sample size, which can be due to practical and resource constraints. However, a key strength of topological data analysis is its ability to find patterns in small groups of data²⁷. For further studies, increasing the sample size collected will help address most data/performance errors. The inclusion of other cardiac pathologies, such as dilated cardiomyopathy, ischemic cardiomyopathy, and valvular heart diseases, along with the integration of other input data types, such as ratios of regional strains²⁸, cardiac MRI, or patient specific demographics, can enhance the versatility of the workflow in the clinical setting. A limitation to the current study is the analysis of only RCM and CP; without consideration of other cardiac pathologies and how their strain parameters may help, or interfere, with correct classification it is unclear how this will generalize to other uncommon pathologies.

The current application of our use-case scenario highlights the ability of TDA, and more specifically persistent homology, to correctly stratify unique cardiovascular anomalies from segmental stress strain analysis.

Materials and methods

Study population

In this retrospective case study, we utilized a merged cohort from two previously published datasets^11,12, with a total of 54 constrictive pericarditis (CP), 49 restrictive cardiomyopathies (RCM), and 55 no structural heart failure control patients (normal).

The institutional review board at the Mayo Foundation approved the protocol outlined by Amaki et al.¹¹. Between July 2005 and January 2007, 37 consecutive patients with CP that were scheduled for pericardiectomy treatment and 22 heart failure patients diagnosed with RCM through transthoracic echocardiography; due to suboptimal 2D-echocradiography image quality, seven patients with CP were excluded. Of the remaining patients, 26 with CP and 19 with RCM provided informed consent for participation in the study. Additionally, there was recruitment of 21 control subjects without cardiovascular disease and no evidence of left ventricular dysfunction or significant valvular heart disease observed with echocardiography.

The institutional review board at the Mount Sinai Medical Center approved the protocol by Sengupta et al.¹². 92 patients (28 with CP, 30 with RCM, and 34 control with no structural/functional abnormality) who underwent transthoracic echocardiography imaging were retrospectively identified.

Three CP patients, 2 RCM patients, and 2 normal patients were excluded from analysis due to incomplete data of all investigated strain values; the remaining dataset utilized in this study had a total of 51 CP, 47 RCM, 53 and control (normal) patients.

Data security

To maintain confidentiality and integrity of study data, all data was generated, stored, and transmitted using protected encryption measures. Specifically, stored on encrypted drives with industry standard encryption algorithms to prevent unauthorized access. All transmission of data both onsite and offsite was performed using secure protocols to prevent interception and tampering.

Proposed framework

We propose a persistent homology workflow based on topological data analysis techniques to identify disease patterns from functional physiologic signals. The pipeline is outlined as follows (Fig. 1):

1.
Data preprocessing—data is converted to an n-dimensional point cloud.
2.
Persistent homology
1. a.
  TDA filtration—simplicial complexes are built upon the point cloud, from which topological invariants are extracted.
2. b.
  Persistent image—birth, and death features are transformed into a persistent image to develop feature vectors
3.
Patient-specific motif—features are stored in a visual representation that can be directly interpreted by physicians/scientists or serve as input for machine learning
1. a.
  Direct physician analysis—doctors develop general understanding of underlying patterns visually apparent in motif
2. b.
  Machine learning modeling—various techniques applied for feature selection and classification of patients

Speckle tracking echocardiography

Grayscale images from the apical 4-chamber (4ch) and midventricular parasternal short-axis views were evaluated with 2-dimensional speckle-tracking echocardiography (STE) by a licensed professional as described previously^11,12. Stress–strain analysis can be segmentally divided into anatomically unique locations, 48 points for short-axis view and 49 points for apical four chamber view, that comprise the entirety of the left ventricular myocardium²⁹. At each spatial location, various features, including longitudinal, circumferential, and radial strains and strain rates, were measured over one cardiac cycle. These measurements were stored in two text files, one for the 4ch view and one for the mid view, which serve as the raw data for this proof-of-concept study (Fig. 2).

Pre-processing

For our purposes, we combined the 48 (short-axis view) and 49 (4ch) segmental strain locations into three functional groupings. The short axis was grouped into the anterior septal, inferior septal, and lateral wall; the apical four chamber view was grouped into the lateral wall, apex, and interventricular septum. Cubic spline interpolation of all strain tracings was performed to standardize patients’ time points within one cardiac cycle. Analysis of segmental strain waveforms as aggregates instead of individual segments has been previously shown³⁰. This approach attempts to remove the stochastic nature that analysis of each separate segment would precipitate. Instead, grouping by functional domains allows for averaging curves through a more physiologically relevant manner, specifically regarding the contractile nature and ultrastructural properties of cardiomyocytes within the myocardium³¹. Mean strain curves for each region were created by averaging the corresponding ventricular regions. Each mean strain curve was transformed using phase space reconstruction³² using Python library pypsr (v0.0.1) (https://github.com/hsharrison/pypsr) with embedding dimension d = 3 and time delay τ = 2. This reconstruction transformed the echocardiography strain time series into a point cloud in a higher dimensional space (phase space) for subsequent TDA processing, representing a more complete picture of the dynamic system’s linear time signal as a geometric shape, shown in Supplementary Figs. 1 and 2.

TDA filtration and persistent image

The topological data analysis technique utilized in this study was persistent homology (PH), which is a TDA tool that describes the shape of data by extracting its topological invariants; the mathematical basis of PH is shown in works by Zomorodian and Carlsson³³, Edelsbrunner et al.³⁴, Ghirst et al.³⁵, Bubenik et al.³⁶ and Adams et al.³⁷. The PH concepts relevant to our workflow (Fig. 3) are described briefly in Supplementary File 1.

To accomplish the TDA filtration and conversion to persistent image, we performed this experiment on Python using TDA libraries, ripser (v0.6.0)³⁸ and persim (v0.2.0), which are both freely available. To analyze the point clouds, simplicial complex filtration was built on the data points using ripser. The PH of the filtration was extracted as birth and death values for dimension 0. To utilize this information in downstream machine learning tasks, a linear persistent image was created with pixel resolution of [1, 50] and variance of 0.005; the image bounds were automatically selected by the persim algorithm. A total of 18 persistent images each with 50 pixels were created, one for each combination of strain type and wall region; stacking these 18 linear images together generated a specific motif for each patient with a total of 18 by 50 pixels. The feature vector utilized in this study contained the intensities of these 900 persistent image pixels. The computational time for this workflow was approximately 1 min per patient data performed on a MacBook Pro laptop with 2.7 GHz with 16 GB ram without distributed computing.

Patient-specific motif

Our workflow produced a visual representation indicative of the initial input that can interpreted directly by physicians/scientists while also being capable of feeding into downstream machine learning tasks. The patient-specific motifs showcase the general trends of the disease conditions while maintaining individual patient characteristics, allowing the patients to be monitored for cardiac function changes.

Statistical analysis

Orange data mining software (v3.28.0) was used for statistical analyses. The models were trained and evaluated using tenfold cross validation. Each model’s receiver operating characteristic (ROC) curve, the area under the curve (AUC), sensitivity, and specificity were calculated to evaluate discriminatory power. To determine the significance of the AUC, a p-value less than 0.05 was considered statistically significant when using the pROC package in R (v4.0.3). To avoid an overfitting problem, Boruta³⁹ feature selection was performed through R (v4.0.3) statistical suite that applies a random forest algorithm to determine meaningful features to retain.

Ethics declarations

All studies were in accordance with the ethical standards of the institutional and national research committee and with the 1964 Helsinki Declaration. For one patient cohort, the institutional review board at Mount Sinai Medical Center approved the protocol, and for second patient cohort, the institutional review board at Mayo Foundation approved the protocol. Participants were included regardless of gender, race, ethnicity, or other demographic factors.

Data availability

The datasets and computer code produced in this study can be provided upon request to the corresponding author(s). A link to the data is provided: https://github.com/qahathaway/TDA_Persistent_Homology.

Abbreviations

4ch:: 4-Chamber
AUC:: Area under curve
CP:: Constrictive pericarditis
GLS:: Global longitudinal strain
PH:: Persistent homology
PI:: Persistent image
RCM:: Restrictive cardiomyopathy
Sn:: Sensitivity
Sp:: Specificity
STE:: Speckle-tracking echocardiography
TDA:: Topological data analysis

References

Madani, A., Ong, J. R., Tibrewal, A. & Mofrad, M. R. K. Deep echocardiography: data-efficient supervised and semi-supervised deep learning towards automated diagnosis of cardiac disease. NPJ Digit. Med. 1, 59 (2018).
Article PubMed PubMed Central Google Scholar
Domingos, J. S., Stebbing, R. V., Leeson, P. & Noble, J. A. Structured Random Forests for Myocardium Delineation in 3D Echocardiography in Machine Learning in Medical Imaging: 5th International Workshop 215–222 (Springer, 2014).
Google Scholar
Oktay, O. et al. Multi-input cardiac image super-resolution using convolutional neural networks. In Medical Image Computing and Computer-Assisted Intervention—MICCAI 2016 (eds Ourselin, S. et al.) 246–254 (Springer, 2016).
Google Scholar
Voigt, J. U. & Cvijic, M. 2- and 3-Dimensional myocardial strain in cardiac health and disease. JACC Cardiovasc. Imaging 12, 1849–1863 (2019).
Article PubMed Google Scholar
Singh, Y. et al. Algebraic topology-based machine learning using MRI predicts outcomes in primary sclerosing cholangitis. Eur. Radiol. Exp. 6, 58 (2022).
Article PubMed PubMed Central Google Scholar
McGuirl, M. R., Volkening, A. & Sandstede, B. Topological data analysis of zebrafish patterns. Proc. Natl. Acad. Sci. U.S.A. 117, 5113–5124 (2020).
Article ADS MathSciNet CAS PubMed PubMed Central Google Scholar
Xia, K. & Wei, G. W. Persistent homology analysis of protein structure, flexibility, and folding. Int. J. Numer. Methods Biomed. Eng. 30, 814–844 (2014).
Article MathSciNet Google Scholar
Nielson, J. L. et al. Topological data analysis for discovery in preclinical spinal cord injury and traumatic brain injury. Nat. Commun. 6, 8581 (2015).
Article ADS CAS PubMed Google Scholar
Bendich, P., Marron, J. S., Miller, E., Pieloch, A. & Skwerer, S. Persistent homology analysis of brain artery trees. Ann. Appl. Stat. 10, 198–218 (2016).
Article MathSciNet PubMed PubMed Central Google Scholar
Yan, Y. et al. Gait rhythm dynamics for neuro-degenerative disease classification via persistence landscape- based topological representation. Sensors (Basel). https://doi.org/10.3390/s20072006 (2020).
Article PubMed PubMed Central Google Scholar
Amaki, M. et al. Diagnostic concordance of echocardiography and cardiac magnetic resonance-based tissue tracking for differentiating constrictive pericarditis from restrictive cardiomyopathy. Circ. Cardiovasc. Imaging 7, 819–827 (2014).
Article PubMed Google Scholar
Sengupta, P. P. et al. Disparate patterns of left ventricular mechanics differentiate constrictive pericarditis from restrictive cardiomyopathy. JACC Cardiovasc. Imaging 1, 29–38 (2008).
Article PubMed Google Scholar
Welch, T. D. et al. Echocardiographic diagnosis of constrictive pericarditis: Mayo Clinic criteria. Circ. Cardiovasc. Imaging 7, 526–534 (2014).
Article PubMed Google Scholar
Masui, T., Finck, S. & Higgins, C. B. Constrictive pericarditis and restrictive cardiomyopathy: Evaluation with MR imaging. Radiology 182, 369–373 (1992).
Article CAS PubMed Google Scholar
Cerqueira, M. D. et al. Standardized myocardial segmentation and nomenclature for tomographic imaging of the heart. A statement for healthcare professionals from the Cardiac Imaging Committee of the Council on Clinical Cardiology of the American Heart Association. Circulation 105, 539–542 (2002).
Article PubMed Google Scholar
Tabassian, M. et al. Machine learning of the spatio-temporal characteristics of echocardiographic deformation curves for infarct classification. Int. J. Cardiovasc. Imaging 33, 1159–1167 (2017).
Article PubMed Google Scholar
Senapati, A. et al. Prognostic implication of relative regional strain ratio in cardiac amyloidosis. Heart 102, 748–754 (2016).
Article PubMed Google Scholar
Negishi, K. et al. What is the primary source of discordance in strain measurement between vendors: Imaging or analysis? Ultrasound Med. Biol. 39, 714–720 (2013).
Article PubMed Google Scholar
Mirea, O. et al. Variability and reproducibility of segmental longitudinal strain measurement: A report from the EACVI-ASE strain standardization task force. JACC Cardiovasc. Imaging 11, 15–24 (2018).
Article PubMed Google Scholar
Edelsbrunner, Letscher, & Zomorodian,. Topological persistence and simplification. Discret. Comput. Geom. 28, 511–533 (2002).
Article MathSciNet Google Scholar
Chung, Y. M., Hu, C. S., Lo, Y. L. & Wu, H. T. A persistent homology approach to heart rate variability analysis with an application to sleep-wake classification. Front. Physiol. 12, 637684 (2021).
Article PubMed PubMed Central Google Scholar
Saggar, M. et al. Towards a new approach to reveal dynamical organization of the brain using topological data analysis. Nat. Commun. 9, 1399 (2018).
Article ADS PubMed PubMed Central Google Scholar
Dindin, M., Umeda, Y. & Chazal, F. Topological data analysis for arrhythmia detection through modular neural networks. In Canadian AI 2020—33rd Canadian Conference on Artificial Intelligence, Ottawa, Canada (2020).
Ignacio, P. S., Dunstan, C., Escobar, E., Trujillo, L. & Uminsky, D. Classification of single-lead electrocardiograms: TDA informed machine learning. In 18th IEEE International Conference on Machine Learning and Applications (ICMLA), Boca Raton 1241–1246 (2019).
Nicponski, J. & Jung, J.-H. Topological data analysis of vascular disease: A theoretical framework. Front. Appl. Math. Stat. https://doi.org/10.3389/fams.2020.00034 (2020).
Article Google Scholar
Hwang, D. et al. Topological data analysis of coronary plaques demonstrates the natural history of coronary atherosclerosis. JACC Cardiovasc. Imaging. https://doi.org/10.1016/j.jcmg.2020.11.009 (2021).
Article PubMed Google Scholar
Singh, Y. et al. Topological data analysis in medical imaging: Current state of the art. Insights Imaging 14, 58 (2023).
Article PubMed PubMed Central Google Scholar
Yang, Z. et al. Left ventricular strain-curve morphology to distinguish between constrictive pericarditis and restrictive cardiomyopathy. ESC Heart Fail. 8, 4863–4872 (2021).
Article PubMed PubMed Central Google Scholar
Dandel, M., Lehmkuhl, H., Knosalla, C., Suramelashvili, N. & Hetzer, R. Strain and strain rate imaging by echocardiography—Basic concepts and clinical applicability. Curr. Cardiol. Rev. 5, 133–148 (2009).
Article PubMed PubMed Central Google Scholar
Hensel, F., Moor, M. & Rieck, B. A survey of topological machine learning methods. Front. Artif. Intell. 4, 681108 (2021).
Article PubMed PubMed Central Google Scholar
Sengupta, P. P. et al. Left ventricular structure and function: Basic science for cardiac imaging. J. Am. Coll. Cardiol 48, 1988–2001 (2006).
Article PubMed Google Scholar
Kennel, M. B., Brown, R. & Abarbanel, H. D. I. Determining embedding dimension for phase-space reconstruction using a geometrical construction. Phys. Rev. A 45, 3403 (1992).
Article ADS CAS PubMed Google Scholar
Zomorodian, A. & Carlsson, G. Computing persistent homology. Discret. Comput. Geom. 33, 249–274 (2005).
Article MathSciNet Google Scholar
Edelsbrunner, H., Letscher, D. & Zomorodian, A. Topological persistence and simplification. Discret. Comput. Geom. 28, 511–533 (2000).
Article MathSciNet Google Scholar
Ghrist, R. Barcodes: The persistent topology of data. Bull. Am. Math. Soc. 45, 61–75 (2008).
Article MathSciNet Google Scholar
Bubenik, P. Statistical topological data analysis using persistence landscapes. J. Mach. Learn. Res. 16, 77–102 (2015).
MathSciNet Google Scholar
Adams, H. et al. Persistence images: A stable vector representation of persistent homology. J. Mach. Learn. Res. 18, 1–35 (2017).
MathSciNet Google Scholar
Tralie, C., Saul, N. & Bar-On, R. Ripser py: A lean persistent homology library for python. J. Open Source Softw. https://doi.org/10.21105/joss.00925 (2018).
Article Google Scholar
Kursa, M. B. & Rudnicki, W. R. Feature selection with the boruta package. J. Stat. Softw. 36, 1–13 (2010).
Article Google Scholar

Download references

Author information

Authors and Affiliations

School of Medicine, West Virginia University, Morgantown, WV, USA
Nanda K. Siva & Quincy A. Hathaway
Division of Cardiology, Heart and Vascular Institute, West Virginia University, Morgantown, WV, USA
Nanda K. Siva, Yashbir Singh & Quincy A. Hathaway
Department of Radiology, Mayo Clinic, Rochester, MN, USA
Yashbir Singh
Division of Cardiovascular Disease and Hypertension, Rutgers Robert Wood Johnson Medical School, 125 Patterson St, New Brunswick, NJ, 08901, USA
Partho P. Sengupta & Naveena Yanamala
Institute for Software Research, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA
Naveena Yanamala

Authors

Nanda K. Siva
View author publications
You can also search for this author in PubMed Google Scholar
Yashbir Singh
View author publications
You can also search for this author in PubMed Google Scholar
Quincy A. Hathaway
View author publications
You can also search for this author in PubMed Google Scholar
Partho P. Sengupta
View author publications
You can also search for this author in PubMed Google Scholar
Naveena Yanamala
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Designing research studies (NKS, YS, QAH, PPS, NY), conducting experiments (NKS, YS, NY), acquiring data (NKS, YS, NY), analyzing data (NKS, YS, QAH, PPS, NY), writing the manuscript (NKS, YS, QAH, PPS, NY). The author Nanda K. Siva had full access to all the data in the study and takes responsibility for the integrity of the data and the accuracy of the data analysis. All authors read and approved the final manuscript.

Corresponding authors

Correspondence to Partho P. Sengupta or Naveena Yanamala.

Ethics declarations

Competing interests

Partho P. Sengupta is a consultant to Heart Sciences, Ultromics, and Kencor Health. Quincy A. Hathaway is the Chief Science Officer for Aspirations LLC. The other authors have nothing to disclose.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Supplementary Legends.

Supplementary Figure 1.

Supplementary Figure 2.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Siva, N.K., Singh, Y., Hathaway, Q.A. et al. A novel multi-task machine learning classifier for rare disease patterning using cardiac strain imaging data. Sci Rep 14, 10672 (2024). https://doi.org/10.1038/s41598-024-61201-4

Download citation

Received: 21 June 2023
Accepted: 02 May 2024
Published: 09 May 2024
DOI: https://doi.org/10.1038/s41598-024-61201-4
Springer Nature Limited

A novel multi-task machine learning classifier for rare disease patterning using cardiac strain imaging data

Abstract

Similar content being viewed by others

Machine Learning Outcome Prediction in Dilated Cardiomyopathy Using Regional Left Ventricular Multiparametric Strain

Diagnostic signature for heart failure with preserved ejection fraction (HFpEF): a machine learning approach using multi-modality electronic health record data

Current Challenges and Recent Updates in Artificial Intelligence and Echocardiography

Explore related subjects

Introduction

Results

Average strain pattern motifs

Feature selection

Machine learning classifiers

Interpretable artificial intelligence

Discussion

Materials and methods

Study population

Data security

Proposed framework

Speckle tracking echocardiography

Pre-processing

TDA filtration and persistent image

Patient-specific motif

Statistical analysis

Ethics declarations

Data availability

Abbreviations

References

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Supplementary Information.

Supplementary Legends.

Supplementary Figure 1.

Supplementary Figure 2.

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation