Abstract
The current healthcare system is designed using advanced competence technologies such as big data analytics, machine learning, and information technology to provide progressive and intelligent healthcare services. The available healthcare services suffer from various issues such as lack of healthcare data handling, data filtering, and right treatment due to which patients are suffering worldwide. Therefore, this paper proposes a Healthcare multi-phase architecture (HCMP) for predicting chronic kidney disease. The HCMP architecture works on six different layers namely: data-collection, data-storage, data-management, data-processing, data-analysis, and report-generation. The data-storage and data-management layers were performed on heterogeneous Hadoop cluster and the profiling methods were used to consider three situations for calculating the capacity ratio of each DataNode in the cluster. MapReduce is used for parallel data processing and the MySymptom algorithm has been used to filter the kidney dataset of patients according to their symptoms at the data processing layer. Furthermore, horizontal scaling is performed in the Hadoop cluster, and the performance of every DataNode is investigated based on a capacity ratio. The data-analysis layer has performed classification tasks using a Decision tree, K Nearest Neighbors (KNN) classification, Kernel distributed Naïve Bayes, Simple distributed Naïve Bayes, Random forest, and Random tree. Among these classifiers, Kernel distributed Naïve Bayes has produced the best results. Comparing the architecture and results of the present study with the available methods, the empirical results showed an improvement. The significant improvisation of classification performance parameters such as accuracy, precision, and AUC have been improved by 2.2%, 2.3%, and 0.3%, respectively.
This is a preview of subscription content, access via your institution.










References
Abdelaziz A, Elhoseny M, Salama AS, Riad AM (2018) A machine learning model for improving healthcare services on cloud computing environment. Measurement 119:117–128. https://doi.org/10.1016/j.measurement.2018.01.022
Arulanthu P, Perumal E (2020) An intelligent IoT with cloud centric medical decision support system for chronic kidney disease prediction. Int J Imaging Syst Technol 30(3):815–827. https://doi.org/10.1002/ima.22424
Asghari P, Rahmani AM, Haj SJH (2019) A medical monitoring scheme and health-medical service composition model in cloud-based IoT platform. Trans Emerg Telecommun Technol 30(6):1–25. https://doi.org/10.1002/ett.3637
Boukenze B, Haqiq A, Mousannif H (2017) Predicting chronic kidney failure disease using data mining techniques. In: El-Azouzi R, Menasche D, Sabir E, De Pellegrini F, Benjillali M (eds) Advances in ubiquitous networking 2 UNet 2016. Springer International Publishing, Singapore, pp 701–712. https://doi.org/10.1007/978-981-10-1627-1_55
Bradley AP (1997) The use of the area under the ROC curve in the evaluation of machine learning algorithms. Pattern Recogn 30(7):1145–1159. https://doi.org/10.1016/S0031-3203(96)00142-2
Breese JS, Heckerman D, Kadie C (2013) Empirical analysis of predictive algorithms for collaborative filtering. In: 2013 proceedings of the fourteenth conference on uncertainty in artificial intelligence, pp 43–52
Business standard (2013) 70% Indians live in rural areas: census. Press Trust of India http://www.business-standard.com/article/economy-policy/70-indians-live-in-rural-areas-census-111071500171_1.html. Accessed 23 Mar 2019
Dua D, Graff C (2019) UCI machine learning repository. Irvine, CA: university of california, school of information and computer science, 2019. http://archive.ics.uci.edu/ml. Accessed 11 Jan 2019
Elhoseny M, Shankar K, Uthayakumar J (2019) Intelligent diagnostic prediction and classification system for chronic kidney disease. Sci Rep 9(1):1–14. https://doi.org/10.1038/s41598-019-46074-2
Goldberg D, Nichols D, Oki BM, Terry D (1992) Using collaborative filtering to weave an information tapestry. Commun ACM 35(12):61–70. https://doi.org/10.1145/138859.138867
Harimoorthy K, Thangavelu M (2021) Multi-disease prediction model using improved SVM-radial bias technique in healthcare monitoring system. J Ambient Intell Humaniz Comput 12(3):3715–3723. https://doi.org/10.1007/s12652-019-01652-0
Hermida A (2016) The five es of journalism. NiemanLab. https://www.niemanlab.org/2015/12/the-five-es-of-journalism-in-2016/. Accessed 27 Mar 2019
Hwang E, Kim S, Kim JS, Hwang S, Choi YR (2016) On the role of application and resource characterizations in heterogeneous distributed computing systems. Clust Comput 19(4):2225–2240. https://doi.org/10.1007/s10586-016-0638-x
India organ (2014). A study of the deceased organ donation environment in Delhi/NCR. In: An initiative of the Parashar foundation in partnership with MOHAN Foundation. https://www.organindia.org/wp-content/uploads/2014/11/ORGAN-Research-Report.pdf. Accessed 14 Apr 2019
Jain D, Singh V (2018) Feature selection and classification systems for chronic disease prediction: a review. Egypt Inf J 19(3):179–189. https://doi.org/10.1016/j.eij.2018.03.002
Jongbo OA, Adetunmbi AO, Ogunrinde RB, Badeji-Ajisafe B (2020) Development of an ensemble approach to chronic kidney disease diagnosis. Sci Afr 8:1–8. https://doi.org/10.1016/j.sciaf.2020.e00456
Jose N (2016) Challenges in organ transplantation-an indian scenario. Indian J Surg Nurs 5(1):9–32. https://doi.org/10.21088/ijsn.2277.467X.5116.2
Kriplani H, Patel B, Roy S (2019) Prediction of chronic kidney diseases using deep artificial neural network technique. In: Peter J, Fernandes S, Eduardo Thomaz C, Viriri S (eds) Computer aided intervention and diagnostics in clinical and medical images. Springer International Publishing, Cham, pp 179–187. https://doi.org/10.1007/978-3-030-04061-1_18
Kumar V, Recupero DR, Riboni D, Helaoui R (2020a) Ensembling classical machine learning and deep learning approaches for morbidity identification from clinical notes. IEEE Access 9:7107–7126. https://doi.org/10.1109/ACCESS.2020.3043221
Kumar V., Mishra BK, Mazzara M, Thanh DNH, Verma A (2020b) Prediction of malignant and benign breast cancer: a data mining approach in healthcare applications. In: Borah S, Emilia Balas V, Polkowski Z (eds) Advances in data science and management. Springer International Publishing, Singapore, pp 435–442. https://doi.org/10.1007/978-981-15-0978-0_43
Langarizadeh M, Moghbeli F (2016) Applying naive bayesian networks to disease prediction: a systematic review. Acta Inf Med 24(5):364–369. https://doi.org/10.5455/aim.2016.24.364-369
Lee CW, Hsieh KY, Hsieh SY, Hsiao HC (2014) A dynamic data placement strategy for Hadoop in heterogeneous environments. Big Data Res 1:14–22. https://doi.org/10.1016/j.bdr.2014.07.002
Ling CX, Huang J, Zhang H (2003a) AUC: a better measure than accuracy in comparing learning algorithms. In: 2003a conference of the Canadian society for computational studies of intelligence. Springer, Berlin, Heidelberg, pp 329–341. https://doi.org/10.1007/3-540-44886-1_25
Ling CX, Huang J, Zhang H (2003b) AUC: a statistically consistent and more discriminating measure than accuracy. In Ijcai 3:519–524
Makino M, Yoshimoto R, Ono M et al (2019) Artificial intelligence predicts the progression of diabetic kidney disease using big data machine learning. Sci Rep 9(1):1–9. https://doi.org/10.1038/s41598-019-48263-5
Manogaran G, Lopez D (2018) Disease surveillance system for big climate data processing and dengue transmission. In: I Management Association (eds) Climate change and environmental concerns: breakthroughs in research and practice. IGI Global, pp 427–446. https://doi.org/10.4018/978-1-5225-5487-5.ch022
Manyika J, Chui M, Brown B, Bughin J, Dobbs R, Roxburgh C, Byers AH (2011) Big data: The next frontier for innovation, competition, and productivity. McKinsey Global Institute. Accessed 17 Jan 2019
Nishanth A, Thiruvaran T (2017) Identifying important attributes for early detection of chronic kidney disease. IEEE Rev Biomed Eng 11:208–216. https://doi.org/10.1109/RBME.2017.2787480
Princy RJP, Parthasarathy S, Jose PSH, Lakshminarayanan AR, Jeganathan S (2020) Prediction of cardiac disease using supervised machine learning algorithms. In: 2020 4th international conference on intelligent computing and control systems (ICICCS), pp 570–575. https://doi.org/10.1109/ICICCS48265.2020.9121169
Rady EHA, Anwar AS (2019) Prediction of kidney disease stages using data mining algorithms. Inf Med Unlocked 15:1–7. https://doi.org/10.1016/j.imu.2019.100178
Ren Y, Fei H, Liang X, Ji D, Cheng M (2019) A hybrid neural network model for predicting kidney disease in hypertension patients based on electronic health records. BMC Med Inf Decis Mak 19(2):131–138. https://doi.org/10.1186/s12911-019-0765-4
Resnick P, Varian HR (1997) Recommender systems. Commun ACM 40(3):56–58. https://doi.org/10.1145/245108.245121
Sossi Alaoui S, Aksasse B, Farhaoui Y (2018) Statistical and predictive analytics of chronic kidney disease. 2018 international conference on advanced intelligent systems for sustainable development. Springer, Cham, pp 27–38
Tikoo AV (2017) Transplantation of human organs: the Indian scenario. ILI Law Rev 1:147–174. https://ili.ac.in/pdf/paper1017.pdf
Tresp V, Overhage JM, Bundschus M, Rabizadeh S, Fasching PA, Yu S (2016) Going digital: a survey on digitalization and large-scale data analytics in healthcare. In: Proceedings of the IEEE, pp 2180–2206. https://doi.org/10.1109/JPROC.2016.2615052
Vasquez-Morales GR, Martinez-Monterrubio SM, Moreno-Ger P, Recio-Garcia JA (2019) Explainable prediction of chronic renal disease in the colombian population using neural networks and case-based reasoning. IEEE Access 7:152900–152910. https://doi.org/10.1109/ACCESS.2019.2948430
Wang J, Bao B, Shen P et al (2019) Using electronic health record data to establish a chronic kidney disease surveillance system in China: protocol for the China Kidney Disease Network (CK-NET)-Yinzhou Study. BMJ Open 9(8):1–8. https://doi.org/10.1136/bmjopen-2019-030102
White T (2012) Hadoop: the definitive guide. O’Reilly Media Inc, London
Yu J, Ping P, Wang L, Kuang L, Li X, Wu Z (2018) A novel probability model for lncRNA–disease association prediction based on the naïve bayesian classifier. Genes 9(7):1–21. https://doi.org/10.3390/genes9070345
Zhang Z, Wang B, Ahmed F, Ramakrishnan IV, Zhao R, Viccellio A, Mueller K (2013) The five Ws for information visualization with application to healthcare informatics. IEEE Trans Visual Comput Graphics 19(11):1895–1910. https://doi.org/10.1109/TVCG.2013.89
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Below is the link to the electronic supplementary material.
Rights and permissions
About this article
Cite this article
Sisodia, A., Jindal, R. An effective model for healthcare to process chronic kidney disease using big data processing. J Ambient Intell Human Comput (2022). https://doi.org/10.1007/s12652-022-03817-w
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s12652-022-03817-w
Keywords
- Big data
- Hadoop
- Health monitoring
- Machine learning
- MapReduce
- MySymptom