Abstract
The paper addresses the challenge of imbalanced classification in the context of cerebrovascular diseases, including stroke, transient ischemic attack (TIA), and vascular dementia. The imbalanced nature of cerebrovascular disease datasets poses significant challenges to conventional machine learning algorithms, making precise diagnosis and effective management difficult. The aim of the paper is to propose a novel approach, the INTEL_SS algorithm, which combines ensemble learning techniques with Support Vector Machine-Synthetic Minority Over-sampling Technique (SVM-SMOTE) to effectively handle the imbalanced nature of cerebrovascular disease datasets. The goal is to improve the accuracy of diagnosis and management of cerebrovascular diseases through advanced machine learning techniques. The proposed methodology involves several key steps, including preprocessing, SVM-SMOTE, and ensemble learning. Preprocessing techniques are used to improve the quality of the dataset, SVM-SMOTE is employed to address class imbalance, and ensemble learning methods such as bagging, boosting, and stacking are utilized to improve overall classification performance. The experimental results demonstrate that the INTEL_SS algorithm outperforms existing methods in terms of accuracy, precision, recall, F1-score, and AUC-ROC. Performance metrics are used to assess the effectiveness of the proposed approach, and the results consistently show the superiority of INTEL_SS compared to state-of-the-art imbalanced classification algorithms. The paper concludes that the INTEL_SS algorithm has the potential to enhance the diagnosis and management of cerebrovascular diseases, offering new opportunities to apply machine learning techniques to improve healthcare outcomes.
Similar content being viewed by others
Availability of data and materials
Data sharing not applicable to this article as no datasets were generated or analyzed during the current study.
References
Abdullahi SD, Muhammad SA (2022) Early prediction of cerebrovascular disease using boosting machine learning algorithms to assist clinicians. J Appl Sci Environ Manag 26(6):1031–1037
Annamalai M, Muthiah PB (2022) An early prediction of tumor in heart by cardiac masses classification in echocardiogram images using robust back propagation neural network classifier. Braz Arch Biol Techno 65. https://doi.org/10.1590/1678-4324-2022210316
Ajay D, Malik SK (2021) Artificial bee colony optimized deep neural network model for handling imbalanced stroke data: ABC-DNN for prediction of stroke. Int J E-Health Med Commun (IJEHMC) 12(5):67–83
Al-Mekhlafi ZG, Senan EM, Rassem TH, Mohammed BA, Makbol NM, Alanazi AA et al (2022) Deep learning and machine learning for early detection of stroke and haemorrhage. Comput Mater Contin 72(1):775–796
Ali R, Manikandan A, Xu J (2023) A novel framework of adaptive fuzzy-GLCM segmentation and fuzzy with capsules network (F-CapsNet) classification. Neural Comput & Applic. https://doi.org/10.1007/s00521-023-08666-y
Ghosh M (2021) An enhanced stroke prediction scheme using SMOTE and machine learning techniques. In: 2021 12th International conference on computing communication and networking technologies (ICCCNT), pp 1–6
Islam MM, Akter S, Rokunojjaman MR, Rony JH, Amin A, Kar S (2021) Stroke prediction analysis using machine learning classifiers and feature technique. Int J Electron Commun Syst 1(2):17–22
Karthik Chandran C, Rajalakshmi M, Mohanty SN, Chowdhury S, Chandrasekaran R (2023) Machine learning for healthcare systems, 1st edn. River Publisher, Denmark
Karpagalakshmi R, Tensing D, Kalpana A (2016) Image localization using deformable model and its application in health informatics. J Med Imaging & Health Infor 6:1972–1976. https://doi.org/10.1166/jmihi.2016.1959
Kerber KA, Brown DL, Lisabeth LD, Smith MA, Morgenstern LB (2006) Stroke among patients with dizziness, vertigo, and imbalance in the emergency department: a population-based study. Stroke 37(10):2484–2487
Kolli S, AV PK, Ashok J, Manikandan A (2023) Internet of things for pervasive and personalized healthcare: Architecture, technologies, components, applications, and prototype development. https://doi.org/10.4018/978-1-6684-8913-0.ch008
Liu T, Fan W, Wu C (2019) A hybrid machine learning approach to cerebral stroke prediction based on imbalanced medical dataset. Artif Intell Med 101:101723
Liu N, Li X, Qi E, Xu M, Li L, Gao B (2020) A novel ensemble learning paradigm for medical diagnosis with imbalanced data. IEEE Access 8:171263–171280
Manikandan, A., & Sakthivel, J. (2017a). Recognizable Proof of Biometric System With Even Distorted And Rectification States. Journal of Advanced Research in Dynamical and Control Systems, 9(2), 1393–1398
Manikandan, Annamalai, Bala MP (2023) Intracardiac mass detection and classification using double convolutional neural network classifier. J Eng Res. 11(2A):272-280. https://doi.org/10.36909/jer.12237.
Mridha K, Ghimire S, Shin J, Aran A, Uddin MM, Mridha MF (2023) Automated stroke prediction using machine learning: an explainable and exploratory study with a web application for early intervention. IEEE Access 11:52288–52308
Palaniappan M, Annamalai M (2019) Advances in signal and image processing in biomedical applications. https://doi.org/10.5772/intechopen.88759
Rajinikanth V, Yassine S, Bukhari SA (2023) Hand-sketchs based Parkinson’s disease screening using lightweight deep-learning with two-fold training and fused optimal features. Int J Math Stat Comput Sci 2:9–18
Seyala N, Abdullah SN (2023) cluster analysis on longitudinal data of patients with kidney dialysis using a smoothing cubic B-spline model. Int J Math Stat Comput Sci 2:85–95
Sheikdavood K, Surendar P, Manikandan A. Certain (2016) Investigation on latent fingerprint improvement through multi-scale patch based sparse representation. Indian J Eng 13(31):59-64
Su P-Y, Wei Y-C, Luo H, Liu C-H, Huang W-Y, Chen K-F et al (2022) Machine learning models for predicting influential factors of early outcomes in acute ischemic stroke: registry-based study. JMIR Med Inform 10(3):e32508
Thippa Reddy G, Bhattacharya S, Maddikunta PKR, Hakak S et al (2022) Antlion re-sampling based deep neural network model for classification of imbalanced multimodal stroke dataset. Multimedia Tools Appl 81:41429–41453
Venmathi AR, David S, Govinda E, Ganapriya K, Dhanapal R, Manikandan A (2023) “An automatic brain tumors detection and classification using deep convolutional neural network with VGG-19,” 2023 2nd International Conference on Advancements in Electrical, Electronics, Communication, Computing and Automation (ICAECA), India, pp. 1–5. https://doi.org/10.1109/ICAECA56562.2023.10200949
Zhang X, Chen S, Lai K, Chen Z, Wan J, Xu Y (2022) Machine learning for the prediction of acute kidney injury in critical care patients with acute cerebrovascular disease. Ren Fail 44(1):43–53
Funding
No funding received by any government or private concern.
Author information
Authors and Affiliations
Contributions
Nithya R, was responsible for the research’s conception and design, data collection and analysis, and original article writing. Dr. Kokilavani T, oversaw the investigation, contributed data analytic skills, and revised the final draft of the paper before submission. Dr. Lucia T, updated the text for significant intellectual content as well as contributed to the data interpretation and critical intellectual input.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no competing interests.
Research involving human participants and/or animals
This article does not contain any studies involving Human Participants and/or Animals performed by any of the authors.
Informed consent
Not applicable.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Nithya, R., Kokilavani, T. & Beena, T.L.A. Balancing cerebrovascular disease data with integrated ensemble learning and SVM-SMOTE. Netw Model Anal Health Inform Bioinforma 13, 12 (2024). https://doi.org/10.1007/s13721-024-00447-4
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1007/s13721-024-00447-4