Skip to main content
Log in

Review on Improved Machine Learning Techniques for Predicting Chronic Diseases

  • Published:
Optical Memory and Neural Networks Aims and scope Submit manuscript

Abstract

Healthcare industry is a stage which is presented with tremendous innovative headways consistently. Parkinson disease (PD) has become a critical overall general clinical issue starting late. To provide the solution for this problem, in this paper, use fusion of machine learning and federated learning techniques for processing electronically collected patients’ health record (PD dataset) in accurate manner. The PD dataset are constantly gathered and sorted out to give a point by point history of patients, their sicknesses and determination plans. The medical PD dataset contains 43 400 electronic records of potential patients which includes normal, Ischemic and Hemorrhagic stroke. Cleaning, finding feature correlation and imputing missing values in the PD has to be performed by preprocessing & normalization approach. For further processing, using Random over sampling (ROS) methods the imbalanced PD dataset will be converted into balanced. From the balanced PD datasets the stroke prediction accuracy will be validated using Decision Tree, Logistic Regression, Random Forest and Improved LSTM (Imp-LSTM) machine learning algorithms. Using distinct experiments of executing performance measurements the accuracy rate from our prediction classifiers for the patient with smokes category will be 62.29, 71.36, 96.51 and 99.56% respectively as like the patient with never smoked category dataset the accuracy will be 70.49, 75.86, 96.49 and 99.58% respectively. The proposed Imp-LSTM algorithm in this research will effectively produce high overall accuracy in both the datasets, which means a successful decrease in the misdiagnosis rate for stroke prediction.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1.
Fig. 2.
Fig. 3.
Fig. 4.
Fig. 5.
Fig. 6.
Fig. 7.
Fig. 8.
Fig. 9.
Fig. 10.
Fig. 11.
Fig. 12.
Fig. 13.
Fig. 14.
Fig. 15.
Fig. 16.
Fig. 17.
Fig. 18.
Fig. 19.
Fig. 20.
Fig. 21.
Fig. 22.
Fig. 23.

Similar content being viewed by others

REFERENCES

  1. O’Donnell, M.J., Chin, S.L., Rangarajan, S., Xavier, D., Liu, L., Zhang, H., Rao-Melacini, P., Zhang, X., Pais, P., Agapay, S., et al., Global and regional effects of potentially modifiable risk factors associated with acute stroke in 32 countries (interstroke): A case-control study, Lancet, 2016, vol. 388, no. 10046, pp. 761–775.

    Article  Google Scholar 

  2. Khosla, A., Cao, Y., Lin, C.C.-Y., Chiu, H.-K., Hu, J., and Lee, H., An integrated machine learning approach to stroke prediction, in Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, ACM, 2010, pp. 83–192.

  3. Çomak, A. Arslan and Türkoğlu, İ., A decision support system based on support vector machines for diagnosis of the heart valve diseases, Comput. Biol. Med., 2007, vol. 37, no. 1, pp. 21–27.

    Article  Google Scholar 

  4. Zhang, X., Song, S., Wu, C., Robust, bayesian classification with incomplete data, Cognit. Comput., 2013, vol. 5, no. 2, pp. 170–187.

    Article  Google Scholar 

  5. Haixiang, G., Yijing, L., Shang, J., Mingyun, G., Yuanyue, H., and Bing, G., Learning from class-imbalanced data: Review of methods and applications, Expert Syst. Appl., 2017, vol. 73, pp. 220–239.

    Article  Google Scholar 

  6. Chawla, N., Japkowicz, N., and Kotcz, A., Editorial: Special issue on learning from imbalanced data sets, Sigkdd explore news, 2004, vol. 6, pp. 1–6.

  7. Yoo, I., Alafaireet, P., Marinov, M., Pena-Hernandez, K., Gopidi, R., Chang, J.-F., and Hua, L., Data mining in healthcare and biomedicine: a survey of the literature, J. Med. Syst., 2012, vol. 36, no. 4, pp. 2431–2448.

    Article  Google Scholar 

  8. Richter, A.N. and Khoshgoftaar, T.M., A review of statistical and machine learning methods for modeling cancer risk using structured clinical data, Artif. Intell. Med., 2018, vol. 90, pp. 1–14.

    Article  Google Scholar 

  9. Liton Chandra Paul, Abdulla Al Suman, and Nahid Sultan, Methodological analysis of principal component analysis (PCA) method, Int. J. Comput. Eng. Manage., 2013, vol. 16, no. 2, pp. 32–37.

    Google Scholar 

  10. Gopalakrishnan, C. and Iyapparaja, M., Active contour with modified Otsu method for automatic detection of polycystic ovary syndrome from ultrasound image of ovary, Multimedia Tools and Applications, 2019, pp. 1–24.

  11. Meenakshisundaram, I. and Sreedharan, S., Intelligent risk analysis model for mining adaptable reusable component, Int. Arab J. Inf. Technol. (IAJIT), 2015, p. 12.

  12. Chawla, N.V., Bowyer, K.W., Hall, L.O., and Kegelmeyer, W.P., Smote: Synthetic minority over-sampling technique, J. Artif. Intell. Res., 2002, vol. 16, pp. 321–357.

    Article  Google Scholar 

  13. Wagacha, P.W., Induction of decision trees, Found. Learn. Adapt. Syst., 2003, no. 12, pp. 1–14.

  14. Karsmakers, P., Pelckmans, K., and Suykens, J.A.K., Multi-class kernel logistic regression: A fixed-size implementation, 2007 International Joint Conference on Neural Networks, Orlando, FL, 2007, pp. 1756–1761.

  15. Yekkala, S. Dixit and Jabbar, M.A., Prediction of heart disease using ensemble learning and Particle Swarm Optimization, 2017 International Conference On Smart Technologies For Smart Nation (SmartTechCon), Bangalore, 2017, pp. 691–698.

  16. Kuang Junwei, Hangzhou Yang, Liu Junjiang, and Yan Zhijun, Dynamic prediction of cardiovascular disease using improved LSTM, Int. J. Crowd Sci., 2019, vol. 3, no. 1, pp. 14–25.

    Article  Google Scholar 

Download references

ACKNOWLEDGMENTS

This survey was supported by Karthikeyan J. I thank my supervisor from Vellore Institute of Technology, Vellore who provided invaluable insights and expertise that greatly assisted the research.

Funding

This work was supported by ongoing institutional funding. No additional grants to carry out or direct this particular research were obtained.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to J. Karthikeyan.

Ethics declarations

CONFLICT OF INTEREST

The authors of this work declare that they have no conflicts of interest.

FUTURE SCOPE

In future, the complete classification process can be improved by incorporating time threshold values into to all the gates of LSTM algorithm and enhance this with the advancement of soft computing technologies.

Additional information

Publisher’s Note.

Allerton Press remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Abirami, L., Karthikeyan, J. Review on Improved Machine Learning Techniques for Predicting Chronic Diseases. Opt. Mem. Neural Networks 33, 28–46 (2024). https://doi.org/10.3103/S1060992X24010028

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.3103/S1060992X24010028

Keywords:

Navigation