Artificial intelligence assessment for early detection and prediction of renal impairment using electrocardiography

Kwon, Joon-myoung; Kim, Kyung-Hee; Jo, Yong-Yeon; Jung, Min-Seung; Cho, Yong-Hyeon; Shin, Jae-Hyun; Lee, Yoon-Ji; Ban, Jang-Hyeon; Lee, Soo Youn; Park, Jinsik; Oh, Byung-Hee

doi:10.1007/s11255-022-03165-w

Artificial intelligence assessment for early detection and prediction of renal impairment using electrocardiography

Nephrology - Original Paper
Open access
Published: 11 April 2022

Volume 54, pages 2733–2744, (2022)
Cite this article

Download PDF

You have full access to this open access article

International Urology and Nephrology Aims and scope Submit manuscript

Artificial intelligence assessment for early detection and prediction of renal impairment using electrocardiography

Download PDF

Joon-myoung Kwon ORCID: orcid.org/0000-0001-6754-1010^1,2,3,4,
Kyung-Hee Kim^2,5,
Yong-Yeon Jo¹,
Min-Seung Jung¹,
Yong-Hyeon Cho¹,
Jae-Hyun Shin¹,
Yoon-Ji Lee¹,
Jang-Hyeon Ban³,
Soo Youn Lee^2,5,
Jinsik Park⁵ &
…
Byung-Hee Oh⁵

2074 Accesses
3 Citations
Explore all metrics

Abstract

Purpose

Although renal failure is a major healthcare burden globally and the cornerstone for preventing its irreversible progression is an early diagnosis, an adequate and noninvasive tool to screen renal impairment (RI) reliably and economically does not exist. We developed an interpretable deep learning model (DLM) using electrocardiography (ECG) and validated its performance.

Methods

This retrospective cohort study included two hospitals. We included 115,361 patients who had at least one ECG taken with an estimated glomerular filtration rate measurement within 30 min of the index ECG. A DLM was developed using 96,549 ECGs of 55,222 patients. The internal validation included 22,949 ECGs of 22,949 patients. Furthermore, we conducted an external validation with 37,190 ECGs of 37,190 patients from another hospital. The endpoint was to detect a moderate to severe RI (estimated glomerular filtration rate < 45 ml/min/1.73m²).

Results

The area under the receiver operating characteristic curve (AUC) of a DLM using a 12-lead ECG for detecting RI during the internal and external validation was 0.858 (95% confidence interval 0.851–0.866) and 0.906 (0.900–0.912), respectively. In the initial evaluation of 25,536 individuals without RI patients whose DLM was defined as having a higher risk had a significantly higher chance of developing RI than those in the low-risk group (17.2% vs. 2.4%, p < 0.001). The sensitivity map indicated that the DLM focused on the QRS complex and T-wave for detecting RI.

Conclusion

The DLM demonstrated high performance for RI detection and prediction using 12-, 6-, single-lead ECGs.

Deep learning-based electrocardiographic screening for chronic kidney disease

Article Open access 26 May 2023

Artificial Intelligence in Predicting Kidney Function and Acute Kidney Injury

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Renal impairment (RI), including chronic kidney disease and acute kidney injury, is an important contributor to morbidity and mortality. Globally, in 2017, over 1.2 million people died from RI [1]. The treatment cost for RI increases with the availability of renal replacement techniques, resulting in a life-saving but expensive treatment in the long term for patients with end-stage kidney disease [1]. The number of people currently receiving renal replacement therapy exceeds 2.5 million and is projected to double to approximately 5.4 million by 2030 [2]. RI is emerging as a major healthcare burden worldwide, and 2.3–7.1 million adults die from a lack of access to renal replacement therapy [2]. The cornerstone to prevent irreversible progression of RI and initiate appropriate treatment is the early detection of RI [3, 4].

However, most cases of mild kidney function decline are asymptomatic, and the symptoms for the progression of the disease are vague and nonspecific [5]. A diagnostic test for renal failure includes a laboratory examination to measure the creatinine and blood urea nitrogen and calculate the glomerular filtration rate [6]. Laboratory tests are invasive, expensive, and require specialized equipment and infrastructure, such as trained medical staff for blood sampling and a hematologic analysis machine for assessment with biochemical reagents. Therefore, detecting RI in daily living is impossible, and screening for RI is difficult in low-income countries [7].

RI is associated with electrolyte imbalance, volume overload, and hypertension and also affects cardiac function [8,9,10]. RI is a known cause of diastolic dysfunction, left ventricular hypertrophy, arrhythmia, and heart failure and is associated with increased cardiovascular mortality [11, 12]. In several studies, RI was shown to change the morphology of an electrocardiogram (ECG), and researchers suggested that the alteration of cardiac function and electrolyte imbalance affects an ECG [13,14,15]. However, it is not easy to detect such subtle and non-linear ECG changes; hence, the current state of the ECG is not useful for detecting RI. Screening and detecting RI with an ECG would be useful because patients suspected to have RI could be referred for confirmatory laboratory tests.

In this study, we aimed to develop and validate a deep learning-based artificial intelligence model (DLM) for detecting RI using ECG. Deep learning has previously been used in the medical field to identify lesions and is currently used to analyze ECGs to diagnose heart failure, valvular heart disease, anemia, and coronary artery disease [16,17,18,19,20,21,22,23,24]. We hypothesized that a DLM could effectively screen for RI.

Methods

Study design and population

We conducted a retrospective, multicenter, diagnostic study in which a DLM was developed using ECGs, and then, it was internally and externally validated. We excluded individuals with missing demographic, ECG, and laboratory examination information. Data from Sejong General Hospital (SGH) were used for development and internal validation. In SGH, we identified patients with at least one standard digital 10-s 12-lead ECG acquired in the supine position within the study period (October 1, 2016 to August 31, 2020) and at least one renal laboratory panel for serum creatinine and blood urea nitrogen obtained within 30 min of the index ECG. The individuals who visited SGH for inpatient, outpatient, emergency, and health checkup clinic were the study population for the development and internal validation datasets of the DLM. As shown in Supplementary Figure S1, patients who underwent a follow-up laboratory examination after an initial evaluation were assigned to an internal validation dataset. Patients who had no follow-up laboratory exam were assigned to a development dataset that was used to develop the DLM. Subsequently, we evaluated the accuracy of the DLM using the internal validation dataset. Data from Mediplex Sejong Hospital (MSH) were used for external validation. We identified the patients who were admitted to MSH during the study period (March 1, 2017 to August 31, 2020) and who had at least one ECG and at least one renal laboratory panel for serum creatinine and blood urea nitrogen obtained within 30 min of the index ECG. Because the purpose of the validation data was to assess the accuracy of the algorithm, we used only one ECG of each patient for the internal and external validation datasets, i.e., the ECG obtained closest to the patient’s laboratory exam during the study period.

This study was approved by the institutional review boards of the SGH and MSH. Clinical data, including digitally stored ECGs, the laboratory examination results of the renal panel, age, and sex of patients were obtained from both hospitals. Both institutional review boards waived the need for informed consent because of the retrospective nature of the study using fully anonymized ECG and health data and causing minimal harm.

Procedures

The predictor variables used were ECG, age, and sex. Digitally stored 12-lead ECG data, amounting to 5000 data points for each lead, were recorded for 10 s (500 Hz). We removed 1 s each at the beginning and end of each ECG because they had more artifacts than other parts. Therefore, the length of each ECG was 8 s (4000 data points). We created a dataset using the entire 12-lead ECG data. In addition, we used partial datasets from 12-lead ECG data, such as limb six-lead and single lead (I). We selected the sets of leads because these leads could easily be recorded by wearable and pad devices in contact with the hands and legs. Consequently, when we developed and validated an DLM using 12-lead ECGs, we used a dataset of two-dimensional (2D) data of 12 × 4000 data points. When we developed and validated an algorithm using six-lead ECGs, we used datasets of 6 × 4000 data points, and when using single-lead ECGs, we used datasets of 1 × 4000 data points.

The primary endpoint of this study was a moderate to severe RI, which was defined as the estimated glomerular filtration rate (eGFR) under 45 ml/min per 1.73 m². The eGFR was calculated using the modification of diet in the renal disease study equation (eGFR = 175 × (serum creatinine)^−1.154 × (age)^−0.203 × 0.742 [if female] × 1.212 [if Black]) [25, 26]. The secondary endpoint was a mild to severe RI, which was defined as an eGFR under 60 ml/min per 1.73 m².

The DLM was developed using several hidden layers of neurons to learn complex hierarchical non-linear representations from the data. A residual block with six stages included two convolution layers, two batch normalizations, one max-pooling, and one dropout layer repeated, as shown in Fig. 1. We used 1 × 4 max-pooling layers between blocks 1 and 4 and 2 × 4 max-pooling layers between blocks 4 and 6. The last convolutional layer of the residual block was connected to a flattened layer, which was fully connected to the one-dimensional (1D) layer composed of 256 nodes. The input layer of epidemiology data (age and sex) was concatenated with the 1D layer. Two fully connected 1D layers were connected to the output node, which was composed of one node. The output node used a softmax function as an activation function because the output of the softmax function was between 0 and 1. The architecture of the DLM was evaluated and verified using a grid search. We developed additional DLM using limb six-lead and single-lead (I) ECGs.

Statistical analysis

Continuous variables were presented as mean values (standard deviation, SD) and compared using the unpaired Student’s t-test or Mann–Whitney U test (if variables were found to be not normally distributed). We checked the homogeneity of the variance when using the unpaired Student’s t-test. Categorical variables were expressed as frequencies and percentages and compared using the χ2 test. At each input (ECG) of validation data, the DLM calculated the possibility of a primary endpoint in the range from 0 (a non-moderate to severe RI) to 1 (a moderate to severe RI). To verify the DLM performance, we compared the possibility calculated by the DLM with the presence of a moderate to severe RI in the internal and external validation datasets. To achieve this, we used the area under the receiver operating characteristic curve (AUC). The performance of the DLM for detecting the secondary endpoint, i.e., a mild to severe RI, was similarly verified using AUC. We applied the cutoff point to internal and external validation data to calculate sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV). Sensitivity, specificity, PPV, and NPV were confirmed at the operating point from Youden J statistics in the development data [27]. Exact 95% confidence intervals (CIs) were used for all measures of diagnostic performances, except for AUC. The CIs for AUC were determined based on the Sun and Su optimization of the De-long method using the pROC package in R (The R Foundation for Statistical Computing, Vienna, Austria). We evaluated the p-value of the difference between the AUCs using the bootstrap methods. The bootstrap operation for p-value was performed with non-parametric resampling and the percentile method. The number of bootstrap replicates was 2000, which was recommended by Carpenter and Bithell [28]. A significant difference in patient characteristics was defined as a two-sided p value of less than 0.001. We also calculated effect size of results. The effect size was calculated using the bootstrap method. We defined effect sizes of 0.2, 0.5, 0.8 as indicative of small, moderate, and large clinical changes [29]. Statistical analyses were computed using R software, version 3.4 In addition, we used PyTorch’s open-source software library as the backend and Python (version 3.6) for the analysis.

Visualizing developed XDM for interpretation

To understand the developed model and compare it to existing medical knowledge, it was necessary to identify a region that had a significant effect on the decision of the developed DLM. We employed a sensitivity map using a saliency method [30, 31]. The map was computed using the first-order gradients of the classifier probabilities with respect to the input signals; if the probability of a classifier was sensitive to a specific region of the signal, the region would be considered as significant in the model. In other words, we verified the region of ECG that was associated with RI using a sensitivity map. We used a gradient class activation map as a sensitivity map, and we guided the gradient backpropagation method. Further, we verified the variable importance of ECG features, age, and sex in logistic regression, random forest, and deep learning using the deviance difference, mean decreased Gini, and relative importance based on Garson’s algorithm, respectively [32]. A logistic regression model was derived using the maximum likelihood method to calculate coefficients via “glmulti” packages in R (R Development Core Team, Vienna, Austria). We used iteratively reweighted least squares (IWLS) to fit the final model. In logistic regression, R-squared was calculated using the Cox and Snell method. The random forest model consisted of 20,000 decision trees using the “randomForest” package in R. Additionally, the AUROC between the prognostic score and classification of RI in logistic regression and random forest in the test dataset were also confirmed. In logistic regression model, residual standard error and adjusted R-squared were 0.2366 and 0.0654.

Verifying DLM performance to predict RI development as subgroup analysis

We hypothesized that the ECGs would display subtle abnormal patterns in the pre-RI phase and that the developed DLM would classify certain cases as abnormal, yielding a false positive test (a study subject classified as having RI but considered as non-RF) as the initial result. We conducted a subgroup analysis of patients who underwent follow-up laboratory examinations in the internal and external validation datasets. The difference in date between the initial and follow-up echocardiography data was over 14 days. Among those patients, we verified the development of RI in patients who were initially considered non-RI, whose eGFR was 60 ml/min per 1.73 m² or over. The DLM was categorized into high- and low-risk groups based on the risk score using cutoff values, which were determined using the Youden’s J statistic with the development dataset [27]. We used the Kaplan–Meier method to analyze the RI development over 24 months.

Results

The eligible population included 78,188 and 37,201 patients from SGH and MSH, respectively. We excluded 17 and 11 patients (from SGH and MSH, respectively) because of missing clinical information (age and sex), laboratory evaluation information, or ECG data (Supplementary Figure S1). The study included a total of 115,361 patients, of which 7362 patients had a moderate to severe RI. The DLM was developed using a development dataset of 96,549 ECGs of 55,222 patients from SGH (47.9%). Then, the performance of the algorithm was verified using 22,949 ECGs of 22,949 patients from SGH (19.9%) in the internal validation dataset and 37,190 ECGs of 37,190 patients from MSH (32.2%) in the external validation dataset. In moderate to severe RI, the ECGs had prolonged QRS duration, prolonged QTc, rightward T-wave axis, prolonged PR interval, and tachycardia (Table 1).

Table 1 Baseline characteristics

Full size table

During internal and external validations, the AUC of the DLM for detecting a moderate to severe RI as the primary endpoint using 12-lead ECGs was 0.858 (95% CI 0.851–0.866) and 0.906 (95% CI 0.900–0.912), respectively (Fig. 2). The AUC of the DLM for detecting a moderate to severe RI using six-lead ECGs during internal and external validations was 0.852 (95% CI 0.845–0.859) and 0.901 (95% CI 0.895–0.908), respectively. The AUC of the DLM using single-lead ECGs during internal and external validations were 0.842 (95% CI 0.834–0.850) and 0.892 (95% CI 0.886–0.899), respectively. During internal and external validation, the p-values for differences of AUC of the DLM for detecting a moderated to severe RI using 12 leads ECG and other leads ECG were <0.001. During internal and external validation, the effect size between AUCs of the DLM for detecting a moderated to severe RI between the 12-lead ECG model and 6-lead ECG model was 0.014 and 0.008, respectively. During internal and external validation, the effect size between AUCs of the DLM for detecting a moderated to severe RI between the 12-lead ECG model and 6-lead ECG model was 0.026 and 0.023, respectively. During internal and external validations, the AUC of the DLM for detecting a mild to severe RI as the secondary endpoint using 12-lead ECGs was 0.846 (95% CI 0.840–0.852) and 0.901 (95% CI 0.896–0.906), respectively (Fig. 2).

The DLM described the important ECG region for RI detection. As shown in Fig. 3, the DLM focused on the QRS complex and T wave for detecting RI. As shown in Table 2, the variable importance differed for each prognostic model. The logistic regression and random forest used the T-wave axis and the DLM used the QT interval as an important predictive variable. In the logistic regression model, the residual standard error and adjusted R-squared were 0.2366 and 0.0654, respectively.

Table 2 Variable importance for detecting renal impairment

Full size table

Our study comprised 30,865 patients (22,949 and 7916 patients in the internal and external validation datasets, respectively) with follow-up laboratory results. Among them, 25,536 patients were normal (non-RI) at initial laboratory examination. We conducted a subgroup analysis of RI development after initial laboratory examination in these 25,536 patients, of whom 1,826 developed RI within 24 months. The high-risk group of the DLM demonstrated a significantly higher hazard (Fig. 4) and higher development rate of RI than the low-risk group (17.2% vs. 2.4%, respectively, p < 0.001).

Discussion

We developed and validated a DLM based on an ensemble network for RI detection using 12-, six-, and single-lead ECGs and demonstrated reasonable performance. Subsequently, we visualized our DLM to determine the regions and characteristics of the ECG that were used for RI detection and verified the important variable for the decision in diverse statistical methods, such as logistic regression, random forest, and DLM. We conducted a subgroup analysis for patients with non-RI (normal) at the initial laboratory examination; it was demonstrated that the DLM could predict the development of RI. To our knowledge, this study is the first to develop a DLM for detecting and predicting RI and demonstrating interpretable patterns of decision making using the DLM. In a previous study, Rahman et al. showed the possibility that cardio-renal syndrome patients could be detected using ECG with a machine learning model (support vector machine). However, this study used data from a small population with renal disease. In our study, we developed a deep learning model (DLM) using big data and confirmed the accuracy using both internal and external validation datasets.

Developing a reliable screening tool for detecting and predicting RI is the cornerstone for early diagnosis of RI and preventing irreversible disease progression for end stage renal disease, which requires renal replacement treatment. Most RI patients were asymptomatic and had nonspecific symptoms. Diagnostic examinations are laboratory tests that require invasive blood sampling and cannot be conducted in daily living and low-income countries. Therefore, a new technology is required for detecting RI with simple and noninvasive methods that could be adopted in daily living. As ECG is a non-invasive test and is changed with RI, we developed a DLM for detecting RI using ECG.

The most important aspect of deep learning is its ability to extract features and develop an algorithm using various types of data, such as images, 2D data, and waveforms [33]. In previous studies, Attia and colleagues and our study group developed a DLM to screen for heart failure, arrhythmia, valvular heart disease, left ventricular hypertrophy, electrolyte imbalance, and anemia [16,17,18,19,20,21,22,23,24]. Deep learning is criticized for its unreliable outcomes because of the low transparency of the process, the so-called black box. Therefore, we adopted a sensitivity map to describe the abnormal findings that affected the decision of the DLM for detecting RI and describing the variable importance of ECG features. Using this method, we verified an ECG region and features that were associated with RI. In conventional methods, the research process began based on the hypothesis of researchers. For example, in the association between RI and ECG, researchers hypothesized based on their experience of dictating the ECG of RI. This method limited the opportunity to discover knowledge in human perception. In deep learning methods, such as DLM and sensitivity mapping in this study, the findings were not based on previous medical knowledge of humans, but on the data itself. Therefore, we had the opportunity to discover new knowledge from the data itself without human prejudice. Deep learning could discover the complex hierarchical non-linear representation that could not be discovered using conventional statistical methods, such as logistic regression. In this study, we verified the important ECG region for detecting RI from waveform data. We verified that RI could be detected and predicted using ECG based on a DLM. Further, we verified that specific ECG features, such as the QRS duration, T-wave axis, and corrected QT interval, were correlated with detecting RI. These findings were in agreement with the results of previous studies. Bignotto et al. and Stewart et al. demonstrated that left ventricular hypertrophy was identified in 50–80% of the RI population [34, 35]. Shafi et al. demonstrated that a widening of the QRS complex and prolonged QTc was identified in RI populations [36]. Deo et al. verified that ECG metrics, such as the PR interval, QRS duration, and QTc, were independent risk markers for cardiovascular death [13].

We developed and experimented DLMs using diverse format of ECG, such as 12, 6, and single lead ECG. We also showed the differences of AUC of the DLM for detecting RI using 12 lead ECG and other lead ECG during internal and external validation. Although the p-values for differences of AUC of the DLM for detecting a moderated to severe RI using 12 lead ECG and other leads ECG were <0.001, the effect sizes were 0.008–0.023. In medical big data research, since the p-value could be significant due to the large sample size, it is important to interpret results in consideration of the effect size.

There were several limitations to this study. First, we validated the DLM using retrospective data; however, it is necessary to validate DLM with prospective studies and data from daily living. Studies related to the clinical significance of the new technology are required for application in clinical practice. In our next study, we will verify DLM performance and significance with a prospective study in daily clinical practice. Second, this study was conducted only in two hospitals in Korea; hence, it is necessary to validate the DLM with patients in other countries. Third, although we compared the variable importance ranking in several machine learning and DLMs, we could not confirm the exact statistics between the importance results due to the lack of machine learning and deep learning statistical methods. Several improvements in the machine learning area helped find new methods for comparing the importance results more precisely.

Conclusion

The DLM demonstrated accurate performance in detecting RI using ECG. The DLM successfully demonstrated the abnormality of ECG, which was correlated with RI. The application of artificial intelligence technologies based on the DLM to ECG could enable screening for RI and predict the development of RI.

Data availability

The data underlying this article will be shared on reasonable request to the corresponding author.

Code availability

The data underlying this article will be shared on reasonable request to the corresponding author.

Abbreviations

AUC:: Area under the receiver operating characteristic curve
CI:: Confidence interval
DLM:: Deep learning model
ECG:: Electrocardiography
eGFR:: Estimated glomerular filtration rate
MSH:: Mediplex Sejong Hospital
NPV:: Negative predictive value
PPV:: Positive predictive value
RI:: Renal impairment
SD:: Standard deviation
SGH:: Sejong General Hospital
2D:: 2 Dimensional
1D:: 1 Dimensional

References

Himmelfarb J, Ikizler TA (2010) Hemodialysis. N Engl J Med 363:1833–1845. https://doi.org/10.1056/NEJMra0902710
Article CAS PubMed Google Scholar
Liyanage T, Ninomiya T, Jha V, Neal B, Patrice HM, Okpechi I, Zhao M, Lv J, Garg AX, Knight J, Rodgers A, Gallagher M, Kotwal S, Cass A, Perkovic V (2015) Worldwide access to treatment for end-stage kidney disease: a systematic review. Lancet 385:1975–1982. https://doi.org/10.1016/S0140-6736(14)61601-9
Article PubMed Google Scholar
Ruggenenti P, Cravedi P, Remuzzi G (2012) Mechanisms and treatment of CKD. J Am Soc Nephrol 23:1917–1928. https://doi.org/10.1681/ASN.2012040390
Article CAS PubMed Google Scholar
Trivedi HS, Pang MMH, Campbell A, Saab P (2002) Slowing the progression of chronic renal failure: economic benefits and patients’ perspectives. Am J Kidney Dis 39:721–813. https://doi.org/10.1053/ajkd.2002.31990
Article PubMed Google Scholar
Webster AC, Nagler EV, Morton RL, Masson P (2017) Chronic kidney disease. Lancet 389:1238–1252. https://doi.org/10.1016/S0140-6736(16)32064-5
Article PubMed Google Scholar
Chen TK, Knicely DH, Grams ME (2019) Chronic kidney disease diagnosis and management. JAMA 322:1294. https://doi.org/10.1001/jama.2019.14745
Article CAS PubMed PubMed Central Google Scholar
Stanifer JW, Jing B, Tolan S, Helmke N, Mukerjee R, Naicker S, Patel U (2014) The epidemiology of chronic kidney disease in sub-Saharan Africa: a systematic review and meta-analysis. Lancet Glob Heal 2:e174–e181. https://doi.org/10.1016/S2214-109X(14)70002-6
Article Google Scholar
Dhondup T, Qian Q (2017) Acid-base and electrolyte disorders in patients with and without chronic kidney disease: an update. Kidney Dis 3:136–148. https://doi.org/10.1159/000479968
Article Google Scholar
Hung S, Lai Y, Kuo K, Tarng D (2015) Volume overload and adverse outcomes in chronic kidney disease: clinical observational and animal studies. J Am Heart Assoc 4:e001918. https://doi.org/10.1161/JAHA.115.001918
Article CAS PubMed PubMed Central Google Scholar
McAlister FA, Ezekowitz J, Tonelli M, Armstrong PW (2004) Renal insufficiency and heart failure. Circulation 109:1004–1009. https://doi.org/10.1161/01.CIR.0000116764.53225.A9
Article PubMed Google Scholar
Sarnak MJ, Levey AS, Schoolwerth AC, Coresh J, Culleton B, Hamm LL, McCullough PA, Kasiske BL, Kelepouris E, Klag MJ, Parfrey P, Pfeffer M, Raij L, Spinosa DJ, Wilson PW (2003) Kidney disease as a risk factor for development of cardiovascular disease. Hypertension 42:1050–1065. https://doi.org/10.1161/01.HYP.0000102971.85504.7c
Article CAS PubMed Google Scholar
Matsushita K, Coresh J, Sang Y, Chalmers J, Fox C, Guallar E, Jafar T, Jassal SK, Landman GWD, Muntner P, Roderick P, Sairenchi T, Schöttker B, Shankar A, Shlipak M, Tonelli M, Townend J, van Zuilen A, Yamagishi K, Yamashita K, Gansevoort R, Sarnak M, Warnock DG, Woodward M, Ärnlöv J (2015) Estimated glomerular filtration rate and albuminuria for prediction of cardiovascular outcomes: a collaborative meta-analysis of individual participant data. Lancet Diabetes Endocrinol 3:514–525. https://doi.org/10.1016/S2213-8587(15)00040-6
Article PubMed PubMed Central Google Scholar
Deo R, Shou H, Soliman EZ, Yang W, Arkin JM, Zhang X, Townsend RR, Go AS, Shlipak MG, Feldman HI (2016) Electrocardiographic measures and prediction of cardiovascular and noncardiovascular death in CKD. J Am Soc Nephrol 27:559–569. https://doi.org/10.1681/ASN.2014101045
Article CAS PubMed Google Scholar
Kestenbaum B, Rudser KD, Shlipak MG, Fried LF, Newman AB, Katz R, Sarnak MJ, Seliger S, Stehman-Breen C, Prineas R, Siscovick DS (2007) Kidney function, electrocardiographic findings, and cardiovascular events among older adults. Clin J Am Soc Nephrol 2:501–508. https://doi.org/10.2215/CJN.04231206
Article PubMed Google Scholar
Dobre M, Brateanu A, Rashidi A, Rahman M (2012) Electrocardiogram abnormalities and cardiovascular mortality in elderly patients with CKD. Clin J Am Soc Nephrol 7:949–956. https://doi.org/10.2215/CJN.07440711
Article PubMed Google Scholar
Attia ZI, Kapa S, Lopez-Jimenez F, McKie PM, Ladewig DJ, Satam G, Pellikka PA, Enriquez-Sarano M, Noseworthy PA, Munger TM, Asirvatham SJ, Scott CG, Carter RE, Friedman PA (2019) Screening for cardiac contractile dysfunction using an artificial intelligence–enabled electrocardiogram. Nat Med 25:70–74. https://doi.org/10.1038/s41591-018-0240-2
Article CAS PubMed Google Scholar
Attia ZI, Friedman PA, Noseworthy PA, Lopez-Jimenez F, Ladewig DJ, Satam G, Pellikka PA, Munger TM, Asirvatham SJ, Scott CG, Carter RE, Kapa S (2019) Age and sex estimation using artificial intelligence from standard 12-lead ECGs. Circ. Arrhythmia Electrophysiol. 12:e007284. https://doi.org/10.1161/CIRCEP.119.007284
Article Google Scholar
Galloway CD, Valys AV, Shreibati JB, Treiman DL, Petterson FL, Gundotra VP, Albert DE, Attia ZI, Carter RE, Asirvatham SJ, Ackerman MJ, Noseworthy PA, Dillon JJ, Friedman PA (2019) Development and validation of a deep-learning model to screen for hyperkalemia from the electrocardiogram. JAMA Cardiol 4:428. https://doi.org/10.1001/jamacardio.2019.0640
Article PubMed PubMed Central Google Scholar
Attia ZI, Noseworthy PA, Lopez-Jimenez F, Asirvatham SJ, Deshmukh AJ, Gersh BJ, Carter RE, Yao X, Rabinstein AA, Erickson BJ, Kapa S, Friedman PA (2019) An artificial intelligence-enabled ECG algorithm for the identification of patients with atrial fibrillation during sinus rhythm: a retrospective analysis of outcome prediction. Lancet 394:861–867. https://doi.org/10.1016/S0140-6736(19)31721-0
Article PubMed Google Scholar
Cho Y, Kwon J-M, Kim K-H, Medina-Inojosa JR, Jeon K-H, Cho S, Lee SY, Park J, Oh B-H (2020) Artificial intelligence algorithm for detecting myocardial infarction using six-lead electrocardiography. Sci Rep 10:20495. https://doi.org/10.1038/s41598-020-77599-6
Article CAS PubMed PubMed Central Google Scholar
Jo Y-Y, Cho Y, Lee SY, Kwon J, Kim K-H, Jeon K-H, Cho S, Park J, Oh B-H (2020) Explainable artificial intelligence to detect atrial fibrillation using electrocardiogram. Int J Cardiol. https://doi.org/10.1016/j.ijcard.2020.11.053
Article PubMed Google Scholar
Kwon J, Cho Y, Jeon K-H, Cho S, Kim K-H, Baek SD, Jeung S, Park J, Oh B-H (2020) A deep learning algorithm to detect anaemia with ECGs: a retrospective, multicentre study. Lancet Digit Heal 2:e358–e367. https://doi.org/10.1016/S2589-7500(20)30108-4
Article Google Scholar
Kwon J, Lee SY, Jeon K, Lee Y, Kim K, Park J, Oh B, Lee M (2020) Deep learning-based algorithm for detecting aortic stenosis using electrocardiography. J Am Heart Assoc 9:e014717. https://doi.org/10.1161/JAHA.119.014717
Article PubMed PubMed Central Google Scholar
Myoung Kwon J, Kim KH, Medina-Inojosa J, Jeon KH, Park J, Oh BH (2020) Artificial intelligence for early prediction of pulmonary hypertension using electrocardiography. J Hear Lung Transplant 39:805–814. https://doi.org/10.1016/j.healun.2020.04.009
Article Google Scholar
Levey AS, Coresh J, Greene T, Stevens LA, Zhang Y, Hendriksen S, Kusek JW, Van Lente F (2006) Using Standardized serum creatinine values in the modification of diet in renal disease study equation for estimating glomerular filtration rate. Ann Intern Med 145:247–254. https://doi.org/10.7326/0003-4819-145-4-200608150-00004
Article CAS PubMed Google Scholar
Cheung AK, Chang TI, Cushman WC, Furth SL, Hou FF, Ix JH, Knoll GA, Muntner P, Pecoits-Filho R, Sarnak MJ, Tobe SW, Tomson CRV, Mann JFE (2021) KDIGO 2021 clinical practice guideline for the management of blood pressure in chronic kidney disease. Kidney Int 99:S1–S87. https://doi.org/10.1016/j.kint.2020.11.003
Article Google Scholar
Schisterman EF, Perkins NJ, Liu A, Bondell H (2005) Optimal cut-point and its corresponding Youden index to discriminate individuals using pooled blood samples. Epidemiology 16:73–81. https://doi.org/10.1097/01.ede.0000147512.81966.ba
Article PubMed Google Scholar
Carpenter J, Bithell J (2000) Bootstrap confidence intervals: when, which, what? A practical guide for medical statisticians. Stat Med 19:1141–1164. https://doi.org/10.1002/(SICI)1097-0258(20000515)19:9%3c1141::AID-SIM479%3e3.0.CO;2-F
Article CAS PubMed Google Scholar
Lai MHC (2021) Bootstrap confidence intervals for multilevel standardized effect size. Multivariate Behav Res 56:558–578. https://doi.org/10.1080/00273171.2020.1746902
Article PubMed Google Scholar
R.R. Selvaraju, M. Cogswell, A. Das, R. Vedantam, D. Parikh, D. Batra, Grad-CAM: visual explanations from deep networks via gradient-based localization, in: Proc. IEEE Int. Conf. Comput. Vis., 2017: pp. 1;618–626. https://doi.org/10.1109/ICCV.2017.74.
Selvaraju RR, Cogswell M, Das A, Vedantam R, Parikh D, Batra D (2020) Grad-CAM: visual explanations from deep networks via gradient-based localization. Int J Comput Vis 128:336–359. https://doi.org/10.1007/s11263-019-01228-7
Article Google Scholar
Zhang Z, Beck MW, Winkler DA, Huang B, Sibanda W, Goyal H (2018) Opening the black box of neural networks: methods for interpreting neural network models in clinical applications. Ann Transl Med 6:216–216. https://doi.org/10.21037/atm.2018.05.32
Article PubMed PubMed Central Google Scholar
LeCun Y, Bengio Y, Hinton G (2015) Deep learning. Nature 521:436–444. https://doi.org/10.1038/nature14539
Article CAS PubMed Google Scholar
Bignotto LH, Kallás ME, Djouki RJT, Sassaki MM, Voss GO, Soto CL, Frattini F, Medeiros FSR (2012) Electrocardiographic findings in chronic hemodialysis patients. J Bras Nefrol 34:235–242. https://doi.org/10.5935/0101-2800.20120004
Article PubMed Google Scholar
Stewart GA, Gansevoort RONT, Mark PB, Rooney E, Mcdonagh TA, Dargie HJ, Stuart R, Rodger C, Jardine AG (2005) Electrocardiographic abnormalities and uremic cardiomyopathy. Kidney Int 67:217–226. https://doi.org/10.1111/j.1523-1755.2005.00072.x
Article PubMed Google Scholar
Shafi S, Saleem M, Anjum R, Abdullah W, Shafi T (2017) ECG abnormalities in patients with chronic kidney disease. J Ayub Med Coll Abbottabad 29:61–64
PubMed Google Scholar

Download references

Acknowledgements

This research was results of a study on the "High Performance Computing Support" and “AI voucher” Project, supported by the ‘Ministry of Science and ICT and National IT Industry Promotion Agency of South Korea.

Funding

This work was supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIT) (No. 2020R1F1A1073791).

Author information

Joon-myoung Kwon, Kyung-Hee Kim, amd Yong-Yeon Jo contributed equally to this work.

Authors and Affiliations

Medical Research Team, Medical AI, co., Seoul, South Korea
Joon-myoung Kwon, Yong-Yeon Jo, Min-Seung Jung, Yong-Hyeon Cho, Jae-Hyun Shin & Yoon-Ji Lee
Artificial Intelligence and Big Data Research Center, Sejong Medical Research Institute, Bucheon, South Korea
Joon-myoung Kwon, Kyung-Hee Kim & Soo Youn Lee
Medical R&D Center, Body Friend, co, Seoul, South Korea
Joon-myoung Kwon & Jang-Hyeon Ban
Department of Critical Care and Emergency Medicine, Incheon Sejong Hospital, 20, Gyeyangmunhwa-ro, Gyeyang-gu, Incheon, Republic of Korea
Joon-myoung Kwon
Division of Cardiology, Department of Internal Medicine, Incheon Sejong Hospital, Cardiovascular Center20, Gyeyangmunhwa-ro, Gyeyang-gu, Incheon, Republic of Korea
Kyung-Hee Kim, Soo Youn Lee, Jinsik Park & Byung-Hee Oh

Authors

Joon-myoung Kwon
View author publications
You can also search for this author in PubMed Google Scholar
Kyung-Hee Kim
View author publications
You can also search for this author in PubMed Google Scholar
Yong-Yeon Jo
View author publications
You can also search for this author in PubMed Google Scholar
Min-Seung Jung
View author publications
You can also search for this author in PubMed Google Scholar
Yong-Hyeon Cho
View author publications
You can also search for this author in PubMed Google Scholar
Jae-Hyun Shin
View author publications
You can also search for this author in PubMed Google Scholar
Yoon-Ji Lee
View author publications
You can also search for this author in PubMed Google Scholar
Jang-Hyeon Ban
View author publications
You can also search for this author in PubMed Google Scholar
Soo Youn Lee
View author publications
You can also search for this author in PubMed Google Scholar
Jinsik Park
View author publications
You can also search for this author in PubMed Google Scholar
Byung-Hee Oh
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

YYJ and KHK performed data analysis and verified the clinical coding. MSJ, YHC, HJS, YJL, and JHB contributed to the study idea and design as well as data collection, performed data analysis, and contributed to subsequent drafts. SYL, JP, and BHO contributed to data collection and revised the manuscript. JK is the principal investigator and contributed to the study idea and design, data analysis, verified the clinical coding, and contributed to subsequent drafts.

Corresponding authors

Correspondence to Joon-myoung Kwon or Kyung-Hee Kim.

Ethics declarations

Conflicts of interest

KHK, SYL, and BHO declare that they have no competing interests. JK and JP are co-founder and YYJ, MSJ, YJL, YHC, and JHS are researchers of Medical AI Co., a medical artificial intelligence company. JK and JHB are researchers of Body friend Co. There are no products in development or marketed products to declare. This does not alter our adherence to Journal.

Ethics approval

The institutional review boards of Sejong General Hospital (2019–0355) and Mediplex Sejong Hospital (2019–065) approved this study protocol.

Consent to participate

The institutional review boards of Sejong General Hospital (2019–0355) and Mediplex Sejong Hospital (2019–065) waived the need for informed consent because of the impracticality and minimal harm involved.

Consent for publication

The authors do hereby declare that all illustrations and figures in the manuscript are entirely original and do not require reprint permission.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file1 (DOCX 108 KB)

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Kwon, Jm., Kim, KH., Jo, YY. et al. Artificial intelligence assessment for early detection and prediction of renal impairment using electrocardiography. Int Urol Nephrol 54, 2733–2744 (2022). https://doi.org/10.1007/s11255-022-03165-w

Download citation

Received: 27 January 2021
Accepted: 28 February 2022
Published: 11 April 2022
Issue Date: October 2022
DOI: https://doi.org/10.1007/s11255-022-03165-w

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Artificial intelligence assessment for early detection and prediction of renal impairment using electrocardiography

Abstract

Purpose

Methods

Results

Conclusion

Similar content being viewed by others

Deep learning-based electrocardiographic screening for chronic kidney disease

Artificial Intelligence in Predicting Kidney Function and Acute Kidney Injury

Artificial Intelligence in Predicting Kidney Function and Acute Kidney Injury

Introduction

Methods

Study design and population

Procedures

Statistical analysis

Visualizing developed XDM for interpretation

Verifying DLM performance to predict RI development as subgroup analysis

Results

Discussion

Conclusion

Data availability

Code availability

Abbreviations

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Conflicts of interest

Ethics approval

Consent to participate

Consent for publication

Additional information

Publisher's Note

Supplementary Information

Supplementary file1 (DOCX 108 KB)

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation