Applying the Temporal Abstraction Technique to the Prediction of Chronic Kidney Disease Progression
- 182 Downloads
Chronic kidney disease (CKD) has attracted considerable attention in the public health domain in recent years. Researchers have exerted considerable effort in attempting to identify critical factors that may affect the deterioration of CKD. In clinical practice, the physical conditions of CKD patients are regularly recorded. The data of CKD patients are recorded as a high-dimensional time-series. Therefore, how to analyze these time-series data for identifying the factors affecting CKD deterioration becomes an interesting topic. This study aims at developing prediction models for stage 4 CKD patients to determine whether their eGFR level decreased to less than 15 ml/min/1.73m2 (end-stage renal disease, ESRD) 6 months after collecting their final laboratory test information by evaluating time-related features. A total of 463 CKD patients collected from January 2004 to December 2013 at one of the biggest dialysis centers in southern Taiwan were included in the experimental evaluation. We integrated the temporal abstraction (TA) technique with data mining methods to develop CKD progression prediction models. Specifically, the TA technique was used to extract vital features (TA-related features) from high-dimensional time-series data, after which several data mining techniques, including C4.5, classification and regression tree (CART), support vector machine, and adaptive boosting (AdaBoost), were applied to develop CKD progression prediction models. The results revealed that incorporating temporal information into the prediction models increased the efficiency of the models. The AdaBoost+CART model exhibited the most accurate prediction among the constructed models (Accuracy: 0.662, Sensitivity: 0.620, Specificity: 0.704, and AUC: 0.715). A number of TA-related features were found to be associated with the deterioration of renal function. These features can provide further clinical information to explain the progression of CKD. TA-related features extracted by long-term tracking of changes in laboratory test values can enable early diagnosis of ESRD. The developed models using these features can facilitate medical personnel in making clinical decisions to provide appropriate diagnoses and improved care quality to patients with CKD.
KeywordsChronic kidney disease Delay progression Time-series data Temporal abstraction Data mining
Compliance with Ethical Standards
Conflicts of Interest
Authors declare that they have no conflict of interest.
All procedures performed in studies involving human participants were in accordance with the ethical standards of the institutional and/or national research committee and with the 1964 Helsinki declaration and its later amendments or comparable ethical standards.
Written consent from the study was unavailable because the dataset comprises only de-identified secondary data for research purposes, and the Institutional Review Board of St. Martin de Porres Hospital issued a formal written waiver of the need for consent and approved the study.
- 2.National Kidney Foundation, About chronic kidney disease. Available: http://www.kidney.org/kidneydisease/aboutckd.cfm, 2015.
- 5.Abbasi, M. A., Chertow, G. M., and Hall, Y. N., End-stage renal disease. Clin. Evid., 2010.Google Scholar
- 7.Taiwan Society of Nephrology, Available: http://www.tsn.org.tw/, 2015.
- 10.Hsu, C.C., Hwang, S.J., Wen, C.P., Chang, H.Y., Chen, T., Shiu, R.S., et al., High prevalence and low awareness of CKD in Taiwan: A study on the relationship between serum creatinine and awareness from a nationally representative survey. Am. J. Kidney Dis. 48(5):727–738, 2006.CrossRefPubMedGoogle Scholar
- 11.Haroun, M.K., Jaar, B.G., Hoffman, S.C., Comstock, G.W., Klag, M.J., and Coresh, J., Risk factors for chronic kidney disease: A prospective study of 23,534 men and women in Washington County. Maryland. Journal of the American Society of Nephrology. 14(11):2934–2941, 2003.CrossRefPubMedGoogle Scholar
- 13.Peralta, C.A., Shlipak, M.G., Fan, D., Ordonez, J., Lash, J.P., Chertow, G.M., et al., Risks for end-stage renal disease, cardiovascular events, and death in Hispanic versus non-Hispanic white adults with chronic kidney disease. J. Am. Soc. Nephrol. 17(10):2892–2899, 2006.CrossRefPubMedGoogle Scholar
- 18.Chen, L., Li, X., Yang, Y., Kurniawati, H., Sheng, Q.Z., Hu, H.Y., and Huang, N., Personal health indexing based on medical examinations: A data mining approach. Decis. Support. Syst. 81:54–65, 2016.Google Scholar
- 20.Breiman, L., Friedman, J.H., Olshen, R.A., and Stone, C.J., Classification and regression trees. Wadsworth & Brooks, Monterey, 1984.Google Scholar
- 21.Cortes, C., and Vapnik, V., Support-vector networks. Mach. Learn. 20(3):273–297, 1995.Google Scholar
- 22.Freund, Y., and Schapire, R.E., A short introduction to boosting introduction to AdaBoost. Journal of Japanese Society for Artificial Intelligence. 14:771–780, 1999.Google Scholar
- 31.Bala, S., and Kumar, K., A literature review on kidney disease prediction using data mining classification technique. International Journal of Computer Science and Mobile Computing. 3:960–967, 2014.Google Scholar
- 32.Vijayarani, S., and Dhayanand, M.S., Data mining classification algorithms for kidney disease prediction. Int. J. Cybern. Inf. 4(4), 2015.Google Scholar
- 35.Altintas, Y.Y., Gokcen, H., Ulgu, M., and Demirel, N., Analysing interactions of risk factors according to risk levels for hemodialysis patients in Turkey: A data mining application. Gazi University Journal of Science. 24(4):829–839, 2011.Google Scholar
- 47.Ishani, A., Grandits, G.A., Grimm, R.H., Svendsen, K.H., Collins, A.J., Prineas, R.J., et al., Association of single measurements of dipstick proteinuria, estimated glomerular filtration rate, and hematocrit with 25-year incidence of end-stage renal disease in the multiple risk factor intervention trial. J. Am. Soc. Nephrol. 17(5):1444–1452, 2006.CrossRefPubMedGoogle Scholar