Skip to main content
Log in

Privacy-preserving predictive modeling for early detection of chronic kidney disease

  • Original Article
  • Published:
Network Modeling Analysis in Health Informatics and Bioinformatics Aims and scope Submit manuscript

Abstract

With the phenomenal growth of machine learning, privacy protection has become a major concern in medical science. Sensitive medical information, such as chronic kidney disease (CKD) data, is subject to privacy protections and cannot be shared with others without patients’confidentiality and data security. This research study aims to design a secure and effective methodology for developing a predictive model for the early detection of chronic kidney disease. In our research study, we addressed two major problems: accurate diagnosis of chronic kidney disease and data security. The workflow in this research paper is divided into two stages. In the initial phase, to prioritize shorter processing times and improve the model’s performance, we applied two meta-heuristic algorithms, the genetic algorithm and the bat algorithm, as feature selection methods to identify the most relevant features for accurate CKD diagnosis. In the subsequent phase of our study, we proposed a privacy-preserving logistic regression-based chronic kidney disease inference model to perform predictive analysis on encrypted chronic kidney disease data. In this model, we used the Paillier homomorphic cryptosystem that provides strong security assurances, ensuring secure communication and processing of medical data, which maintains patients’confidentiality during diagnosis. Our proposed model, utilizing the genetic algorithm feature selection approach, achieves higher accuracy at 98.75% compared to the original features and bat algorithm selected features. We also observed that this model had less computation time compared to the original features and bat algorithm-selected features. To confirm the efficiency of our proposed model, we compared its performance with those obtained by applying the logistic regression algorithm to plain text. We found that, in all cases, we achieved the same performance as observed with encrypted data.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Algorithm 1
Algorithm 2
Algorithm 3
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10

Similar content being viewed by others

Data availability

This article uses public datasets, which are publicly available at (https://archive.ics.uci.edu/ml).

References

  • Almasoud M, Ward TE (2019) Detection of chronic kidney disease using machine learning algorithms with least number of predictors. International Journal of Advanced Computer Science and Applications(IJACSA), 10(8). https://doi.org/10.14569/IJACSA.2019.0100813

  • Ani R, Sasi G, Sankar UR, Deepa O (2016) Decision support system for diagnosis and prediction of chronic renal failure using random subspace classification. In: Paper presented at the 2016 International Conference on Advances in Computing, Communications and Informatics (ICACCI), IEEE, 1287–1292

  • Chahar V, Katoch S, Chauhan S (2021) A review on genetic algorithm: past, present, and future. Multimed Tools Appl 80. https://doi.org/10.1007/s11042-020-10139-6

  • Chen TK, Knicely DH, Grams ME (2019) Chronic kidney disease diagnosis and management: a review. JAMA 322(13):1294–1304. https://doi.org/10.1001/jama.2019.14745

    Article  Google Scholar 

  • Dare AJ, Fu SH, Patra J, Rodriguez PS, Thakur JS, Jha P (2017). Million Death Study Collaborators. Renal failure deaths and their risk factors in India 2001-13: nationally representative estimates from the Million Death Study. Lancet Glob Health. 5(1):e89-e95. https://doi.org/10.1016/S2214-109X(16)30308-4

  • Dua D, Graff C (2017). UCI Machine Learning Repository. https://archive.ics.uci.edu/ml

  • Eroglu K, Palabas T, Ieee. (2016). The Impact on the Classification Performance of the Combined Use of Different Classification Methods and Different Ensemble Algorithms in Chronic Kidney Disease Detection. New York: Paper presented at the 2016 National Conference on Electrical, Electronics, and Biomedical Engineering (ELECO), IEEE, pp. 512–516

  • Ethiopia (2020) Kidney disease. https://www.worldlifeexpectancy.com/ethiopia-kidney-disease. Accessed 07

  • GBD Chronic Kidney Disease Collaboration (2020) Global, regional, and national burden of chronic kidney disease, 1990–2017: a systematic analysis for the Global Burden of Disease Study 2017. Lancet. 395(10225):709–733. https://doi.org/10.1016/S0140-6736(20)30045-3

  • George C, Mogueo A, Okpechi I, Echouffo-Tcheugui JB, Kengne AP (2017) Chronic kidney disease in low-income to middle-income countries: the case for increased screening. BMJ Glob Health 2(2):e000256. https://doi.org/10.1136/bmjgh-2016-000256

    Article  Google Scholar 

  • Han X, Zheng X, Wang Y, Sun X, Xiao Y, Tang Y, Qin W (2019). Random forest can accurately predict the development of end-stage renal disease in immunoglobulin nephropathy patients. Ann Transl Med. 7(11):234. https://doi.org/10.21037/atm.2018.12.11

  • Hill NR, Fatoba ST, Oke JL, Hirst JA, O’Callaghan CA, Lasserson DS, Hobbs FD (2016) Global Prevalence of Chronic Kidney Disease - A Systematic Review and Meta-Analysis. PLoS ONE 11(7):e0158765. https://doi.org/10.1371/journal.pone.0158765

    Article  Google Scholar 

  • Hossain MM, Detwiler RK, Chang EH et al (2019) Mechanical Anisotropy Assessment in Kidney Cortex Using ARFI Peak Displacement: Preclinical Validation and Pilot In Vivo Clinical Results in Kidney Allografts. IEEE Trans Ultrason Ferroelectr Freq Control 66(3):551–562. https://doi.org/10.1109/tuffc.2018.2865203

    Article  Google Scholar 

  • Ifraz G.M, Rashid M.H, Tazin T, Bourouis S, Khan M.M (2021). Comparative Analysis for Prediction of Kidney Disease Using Intelligent Machine Learning Methods. Computational and Mathematical Methods in Medicine, 2021. vol. 2021, Article ID 6141470, 10 pages, 2021. https://doi.org/10.1155/2021/6141470

  • Institute of Medicine (US) Committee on Health Research and the Privacy of Health Information: The HIPAA Privacy Rule. Beyond the HIPAA Privacy Rule: Enhancing Privacy, Improving Health Through Research. Nass SJ, Levit LA, Gostin LO, editors. Washington (DC): National Academies Press (US); 2009

  • Jeyasingh S, Veluchamy M (2017 ). Modified Bat Algorithm for Feature Selection with the Wisconsin Diagnosis Breast Cancer (WDBC) Dataset. Asian Pac J Cancer Prev.18(5):1257-1264. https://doi.org/10.22034/APJCP.2017.18.5.1257

  • Krishnamurthy S, Ks K, Dovgan E, Luštrek M, Gradišek Piletič B, Srinivasan K, Li YJ, Gradišek A, Syed-Abdul S (2021) Machine Learning Prediction Models for Chronic Kidney Disease Using National Health Insurance Claim Data in Taiwan. Healthcare (Basel). 9(5):546. https://doi.org/10.3390/healthcare9050546

    Article  Google Scholar 

  • Levey AS, Atkins R, Coresh J, Cohen EP, Collins AJ, Eckardt KU, Nahas ME, Jaber BL, Jadoul M, Levin A, Powe NR, Rossert J, Wheeler DC, Lameire N, Eknoyan G (2007) Chronic kidney disease as a global public health problem: approaches and initiatives - a position statement from Kidney Disease Improving Global Outcomes. Kidney Int 72(3):247–59. https://doi.org/10.1038/sj.ki.5002343

    Article  Google Scholar 

  • Mallappallil M, Friedman EA, Delano BG, McFarlane SI, Salifu MO (2014) Chronic kidney disease in the elderly: evaluation and management. Clin Pract (Lond). 11(5):525–535. https://doi.org/10.2217/cpr.14.46

    Article  Google Scholar 

  • Moahmmed S (2019) A Self-learning Knowledge based System for Diagnosis and Treatment of Chronic Kidney Disease. International Journal of Education and Management Engineering. 9:44–58. https://doi.org/10.5815/ijeme.2019.02.05

    Article  Google Scholar 

  • Murti Della, Pujianto Utomo, Wibawa Aji, Akbar Muhammad (2019). K-Nearest Neighbor (K-NN) based Missing Data Imputation. 83-88. https://doi.org/10.1109/ICSITech46713.2019.8987530

  • Nandimath OV (2009) Consent and medical treatment: The legal paradigm in India. Indian J Urol. 25(3):343–7. https://doi.org/10.4103/0970-1591.56202

    Article  Google Scholar 

  • Noroozi Z, Orooji A, Erfannia L (2023) Analyzing the impact of feature selection methods on machine learning algorithms for heart disease prediction. Sci Rep 13:22588. https://doi.org/10.1038/s41598-023-49962-w

    Article  Google Scholar 

  • Ogburn M, Turner C, Dahal P (2013) Homomorphic Encryption. Procedia Computer Science. 20:502–509. https://doi.org/10.1016/j.procs.2013.09.310

    Article  Google Scholar 

  • Paillier Pascal (1999). Public-Key Cryptosystems Based on Composite Degree Residuosity Classes. Advances in Cryptology–EUROCRYPT’99, 223–238 (Springer, Berlin, 1999). https://doi.org/10.1007/3-540-48910-X_16

  • Papademetriou V, Nylen ES, Doumas M, Probstfield J, Mann JFE, Gilbert RE, Gerstein HC (2017) Chronic Kidney Disease, Basal Insulin Glargine, and Health Outcomes in People with Dysglycemia: The ORIGIN Study. Am J Med 130(12):1465.e27-1465.e39. https://doi.org/10.1016/j.amjmed.2017.05.047

    Article  Google Scholar 

  • Peng J, Lee K, Ingersoll G (2002) An Introduction to Logistic Regression Analysis and Reporting. Journal of Educational Research - J EDUC RES. 96:3–14. https://doi.org/10.1080/00220670209598786

    Article  Google Scholar 

  • Polat H, Danaei Mehr H, Cetin A (2017) Diagnosis of Chronic Kidney Disease Based on Support Vector Machine by Feature Selection Methods. J Med Syst 41(4):55. https://doi.org/10.1007/s10916-017-0703-x

    Article  Google Scholar 

  • Poonia RC, Gupta MK, Abunadi I, Albraikan AA, Al-Wesabi FN, Hamza MA (2022 ). B T. Intelligent Diagnostic Prediction and Classification Models for Detection of Kidney Disease. Healthcare (Basel). 10(2):371. https://doi.org/10.3390/healthcare10020371

  • Priyanka K (2019) Science BC. Chronic kidney disease prediction based on naive Bayes technique. p. 1653–9

  • Pudjihartono N, Fadason T, Kempa-Liehr AW, O’Sullivan JM (2022) A Review of Feature Selection Methods for Machine Learning-Based Disease Risk Prediction. Front Bioinform. 27(2):927312. https://doi.org/10.3389/fbinf.2022.927312

    Article  Google Scholar 

  • Qin Jiongming, Chen Lin, Liu Yuhua, Liu Chuanjun, Feng Changhao, Chen Bin (2019). A Machine Learning Methodology for Diagnosing Chronic Kidney Disease. IEEE Access. PP. 1-1. https://doi.org/10.1109/ACCESS.2019.2963053

  • Radhakrishnan J, Mohan S (2017) KI Reports and World Kidney Day. Kidney Int Rep. 2(2):125–126. https://doi.org/10.1016/j.ekir.2017.01.014

    Article  Google Scholar 

  • Rady El Houssainy, Anwar Ayman (2019) Prediction of kidney disease stages using data mining algorithms. Informatics in Medicine Unlocked. 15. https://doi.org/10.1016/j.imu.2019.100178

  • Rostami M, Berahmand K, Forouzandeh S (2021) A novel community detection based genetic algorithm for feature selection. J Big Data 8:2. https://doi.org/10.1186/s40537-020-00398-3

    Article  Google Scholar 

  • Salekin A, Stankovic J (2016). Detection of Chronic Kidney Disease and Selecting Important Predictive Attributes, 2016 IEEE International Conference on Healthcare Informatics (ICHI), Chicago, IL, USA, pp. 262-270. https://doi.org/10.1109/ICHI.2016.36

  • Tekale S, Shingavi P, Wandhekar S, Chatorikar A (2018) Prediction of chronic kidney disease using machine learning algorithm. Disease. 7(10):92–6

    Google Scholar 

  • Varkey B (2021) Principles of Clinical Ethics and Their Application to Practice. Med Princ Pract 30(1):17–28. https://doi.org/10.1159/000509119

    Article  Google Scholar 

  • Wickramasinghe M.P.N.M, Perera D.M. ,Kahandawaarachchi K.A.D.C.P (2017). Dietary prediction for patients with Chronic Kidney Disease (CKD) by considering blood potassium level using machine learning algorithms. 300-303

  • Yang Xin-She (2013) Bat Algorithm: Literature Review and Applications. International Journal of Bio-Inspired Computation. https://doi.org/10.1504/IJBIC.2013.055093

  • Yang W, Wang S, Cui H, Tang Z, Li Y (2023) A Review of Homomorphic Encryption for Privacy-Preserving Biometrics. Sensors (Basel). 23(7):3566. https://doi.org/10.3390/s23073566

    Article  Google Scholar 

  • Yashfi Shanila , Islam Md , Pritilata, Sakib Nazmus , Islam Tanzila, Shahbaaz Mohammad, Pantho Sadaf (2020). Risk Prediction Of Chronic Kidney Disease Using Machine Learning Algorithms. 1-5. https://doi.org/10.1109/ICCCNT49239.2020.9225548

  • Zhang L, Wang F, Wang L, Wang W, Liu B, Liu J, Chen M, He Q, Liao Y, Yu X, Chen N, Zhang JE, Hu Z, Liu F, Hong D, Ma L, Liu H, Zhou X, Chen J, Pan L, Chen W, Wang W, Li X, Wang H (2012) Prevalence of chronic kidney disease in China: a cross-sectional survey. Lancet 379(9818):815–22. https://doi.org/10.1016/S0140-6736(12)60033-6

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Prokash Gogoi.

Ethics declarations

Conflict of interest

We authors declare that there are no conflicts of interest relevant to this manuscript.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Gogoi, P., Valan, J.A. Privacy-preserving predictive modeling for early detection of chronic kidney disease. Netw Model Anal Health Inform Bioinforma 13, 16 (2024). https://doi.org/10.1007/s13721-024-00452-7

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s13721-024-00452-7

Keywords

Navigation