Skip to main content

Performance-Based Prediction of Chronic Kidney Disease Using Machine Learning for High-Risk Cardiovascular Disease Patients

  • Chapter
  • First Online:
Nature-Inspired Computation in Data Mining and Machine Learning

Part of the book series: Studies in Computational Intelligence ((SCI,volume 855))

Abstract

People at high-risk of cardiovascular disease are most likely vulnerable to chronic kidney diseases, and historical medical records can help avert complicated kidney problems. In this paper, 12 supervised machine learning algorithms were used to analyses a retrospective electronic medical data on chronic kidney disease. The study targeted 544 outpatients although 48 failed to meet the inclusion criteria and some other 21 cases had missing values and were excluded from the study. The profiling and the preliminaries result established that 88.5% of the cases were labeled as advance CKD while 11.5% were labelled as early-stage CKD cases. The classification task and the subsequent evaluation of the models were based on the correct classification of the two groups. Of the evaluated algorithms, decision tree boosted decision tree, and CN2 rule induction was the least accurate ones. However, logistic regression (Ridge and Lasso), neural network (logistic and stochastic gradient descent), and support vector machine (Radial Basis Function and Polynomial) had very high accuracies and efficiency. With an efficiency of 93.4% and a classification accuracy of 91.7%, Polynomial Support Vector Machine algorithm was the most efficient and accurate. The model suggested 253 2-dimensional combinations of factors with a history of vascular diseases and smoking as the most influential factors. The other combinations can provide information that can be used to predict or detect chronic kidney disease based on historical records. Future research prospects should consider using discretized Glomerular Filtration Rate to ensure that the classification integrates the five stages of the CKD.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

eBook
USD 16.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 119.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Al-Shamsi, S., Regmi, D., Govender, R.: Chronic kidney disease in patients at high risk of cardiovascular disease in the United Arab Emirates: a population-based study. PLoS ONE 13, e0199920 (2018). https://doi.org/10.1371/journal.pone.0199920

    Article  Google Scholar 

  2. Jain, D., Singh, V.: Feature selection and classification systems for chronic disease prediction: a review. Egypt. Inform. J. (2018). https://doi.org/10.1016/j.eij.2018.03.002

    Article  Google Scholar 

  3. Kumar, M.: Prediction of chronic kidney disease using random forest machine learning algorithm. Int. J. Comput. Sci. Mob. Comput. 5(2), 24–33 (2016)

    Google Scholar 

  4. Sharma, S., Sharma, V., Sharma, A.: Performance-based evaluation of various machine learning classification techniques for chronic kidney disease diagnosis. arXiv preprint arXiv:1606.09581, 28 June 2016

  5. Sinha, P., Sinha, P.: Comparative study of chronic kidney disease prediction using KNN and SVM. Int. J. Eng. Res. Technol. 4(12), 608–612 (2015)

    Google Scholar 

  6. Pagán, J., Risco-Martín, J.L., Moya, J.M., Ayala, J.L.: Modeling methodology for the accurate and prompt prediction of symptomatic events in chronic diseases. J. Biomed. Inform. 1(62), 136–147 (2016)

    Article  Google Scholar 

  7. Natarajan, B.: Machine Learning. Elsevier Science, Amsterdam (2014)

    Google Scholar 

  8. Ahmad, A.: Decision tree ensembles based on kernel features. Appl. Intell. 41(3), 855–869 (2014)

    Article  Google Scholar 

  9. Clark, P., Niblett, T.: The CN2 induction algorithm. Mach. Learn. 3, 261–283 (1989)

    Google Scholar 

  10. Zhang, D., Tsai, J.: Machine Learning Applications in Software Engineering. World Scientific, Hackensack, NJ (2005)

    Book  Google Scholar 

Download references

Acknowledgements

We are grateful to the UCI team for granting access to the data used in the study. We acknowledge and appreciate the Dr. P. Soundarapandian, L. Jerlin Rubini, and Dr. P. Eswaran of the Department of Computer Science and Engineering, Alagappa University and Apollo Hospitals for collecting and sharing the dataset with UCI.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Mohamed Alloghani .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Alloghani, M., Al-Jumeily, D., Hussain, A., Liatsis, P., Aljaaf, A.J. (2020). Performance-Based Prediction of Chronic Kidney Disease Using Machine Learning for High-Risk Cardiovascular Disease Patients. In: Yang, XS., He, XS. (eds) Nature-Inspired Computation in Data Mining and Machine Learning. Studies in Computational Intelligence, vol 855. Springer, Cham. https://doi.org/10.1007/978-3-030-28553-1_9

Download citation

Publish with us

Policies and ethics