Skip to main content

Advertisement

Log in

Disease analysis using machine learning approaches in healthcare system

  • Original Paper
  • Published:
Health and Technology Aims and scope Submit manuscript

A Correction to this article was published on 11 August 2022

This article has been updated

Abstract

This paper addresses disease analysis using machine learning approaches in healthcare system. Several approaches have been used to identify various disease as their corresponding model, but generic model for detecting disease is a challenging task. Thus, this paper proposed the model for disease detection using machine learning approaches with various methodologies such as support vector machine (SVM), K-nearest neighbours, random forests, artificial neural networks (ANNs), and logistic regression. This paper is also used an evaluation matrix with different parameters for performance analysis. The experimental performance is identified as per proposed model through the evaluation matrix. The outcomes disclose that the ANNs method performed good compare to others based on accuracy (97.94%), precision (96.78%), and F1-score (97.87%), respectively. The correlation approach also determined the number of attributes with very close diagonal values i.e., 100%. The comparative approaches are strongly analysed in experiments for clarity of performance.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4

Similar content being viewed by others

Availability of data and material

Data used in this study are available from the first author upon request.

Code availability

The Programming codes implemented in this study are available from the first author upon request.

Change history

References

  1. Park SH, Han K. Methodologic guide for evaluating clinical performance and effect of artificial intelligence technology for medical diagnosis and prediction. Radiol Soc N Am. 2018;286(3):800–9.

    MathSciNet  Google Scholar 

  2. Breast Cancer: Statistics. Approved by the Cancer.Net Editorial Board. [Online]. Available: http://www.cancer.net/cancer-types/breast-cancer/statistics. 2017. Accessed 26 Aug 2018.

  3. Azar AT, El-Said SA. Probabilistic neural network for breast cancer classification. Neural Comput Appl. 2013;23(6):1737–51.

    Article  Google Scholar 

  4. Nagashima T, Suzuki M, Yagata H, Hashimoto H, Shishikura T, Imanaka N, Miyazaki M. Dynamic-enhanced MRI predicts metastatic potential of invasive ductal breast cancer. Springer. 2002;9(3):226–30.

    Google Scholar 

  5. Park CS, Kim SH, Jung NY, Choi JJ, Kang BJ, Jung HS. Interobserver variability of ultrasound elastography and the ultrasound BI-RADS lexicon of breast lesions. Springer. 2013;22(2):153–60.

    Google Scholar 

  6. Ayon SI, Islam MM, Hossain MR. Coronary artery heart disease prediction: a comparative study of computational intelligence techniques. IETE J Res. 2020. https://doi.org/10.1080/03772063.2020.1713916.

  7. Muhammad LJ, Islam MM, Usman SS, Ayon SI. Predictive data mining models for novel coronavirus (COVID-19) infected patients’ recovery. SN Comput Sci. 2020;1(4):206.

    Article  Google Scholar 

  8. Bhuyan HK, Reddy CV (2018) Madhusudan: Sub-feature selection for novel classification. International Conference on Inventive Communication and Computational Technologies (ICICCT). IEEE Explore 20–21. https://doi.org/10.1109/ICICCT.2018.8473206.

  9. Bhuyan HK, Ravi VK. Analysis of Sub-feature for Classification in Data Mining. IEEE Trans Eng Manag. 2021. (Early Published).

  10. Ayon SI, Islam MM. Diabetes prediction: a deep learning approach. Int J Inf Eng Electron Bus (IJIEEB). 2019;11(2):21–7.

    Google Scholar 

  11. Bhuyan HK, Chakraborty C, Shelke Y, Pani SK. COVID‐19 diagnosis system by deep learning approaches. Expert Syst. 2021. (Early Published).

  12. Hasan MK, Islam MM, Hashem MMA. Mathematical model development to detect breast cancer using multigene genetic programming. In: 2016 5th International Conference on Informatics, Electronics and Vision (ICIEV). 2016;574–579.

  13. Sakri SB, Rashid NBA, Zain ZM. Particle swarm optimization feature selection for breast cancer recurrence prediction. IEEE Access. 2018;6:29637–47.

    Article  Google Scholar 

  14. Juneja K, Rana C. An improved weighted decision tree approach for breast cancer prediction. Int J Inf Technol. 2018.

  15. Yue W, et al. Machine learning with applications in breast cancer diagnosis and prognosis. Designs. 2018;2(2):13.

    Article  Google Scholar 

  16. Chiu H-J, Li T-H. And Ping-Huan Kuo. Breast Cancer-Detection System Using PCA, Multilayer Perceptron, Transfer Learning, and Support Vector Machine. 2020;8:204309–24.

    Google Scholar 

  17. Islam M, Haque M, Iqbal H, Hasan M, Hasan M, Kabir MN. Breast cancer prediction: a comparative study using machine learning techniques. SN Computer Science. 2020;1:290. https://doi.org/10.1007/s42979-020-00305-w.

  18. Banu AB, Subramanian PT. Comparison of Bayes classifiers for breast cancer classification. Asian Pac J Cancer Prev (APJCP). 2018;19(10):2917–20.

    Google Scholar 

  19. Chaurasia V, Pal S, Tiwari B. Prediction of benign and malignant breast cancer using data mining techniques. J Algorithms Comput Technol. 2018;12(2):119–26.

    Article  Google Scholar 

  20. Azar AT, El-Metwally SM. Decision tree classifiers for automated medical diagnosis. Neural Comput Appl. 2012;23(7–8):2387–403.

    Google Scholar 

  21. Senapati MR, Mohanty AK, Dash S, Dash PK. Local linear wavelet neural network for breast cancer recognition. Neural Comput Appl. 2013;22(1):125–31.

    Article  Google Scholar 

  22. Senapati MR, Panda G, Dash PK. Hybrid approach using KPSO and RLS for RBFNN design for breast cancer detection. Neural Comput Appl. 2014;24(3–4):745–53.

    Article  Google Scholar 

  23. Hasan MK, Islam MM, Hashem MMA. Mathematical model development to detect breast cancer using multigene genetic programming. In: Proc. 5th International Conference on Informatics, Electronics and Vision (ICIEV) Dhaka 2016;574–579.

  24. Azar AT, El-Said SA. Performance analysis of support vector machines classifiers in breast cancer mammography recognition. Neural Comput Appl. 2013;24(5):1163–77.

    Article  Google Scholar 

  25. Ferreira P, Dutra I, Salvini R, Burnside E. Interpretable models to predict Breast Cancer. In: Proc. IEEE International Conference on Bioinformatics and Biomedicine (BIBM), Shenzhen 2016;1507–1511.

  26. Jhajharia S, Verma S, Kumar R. A cross-platform evaluation of various decision tree algorithms for prognostic analysis of breast cancer data. In: Proc. International Conference on Inventive Computation Technologies (ICICT), Coimbatore 2016;1–7.

  27. Islam MM, Rahaman A, Islam MR. Development of smart healthcare monitoring system in IoT environment. SN Comput Sci. 2020;1(3):185.

    Article  Google Scholar 

  28. Rahaman A, Islam M, Islam M, Sadi M, Nooruddin S. Developing IoT based smart health monitoring systems: a review. Rev d’Intell Artif. 2019;33(6):435–40.

    Google Scholar 

  29. James G, Witten D, Hastie T, Tibshirani R. An introduction to statistical learning. 1st ed. New York: Springer; 2013.

    Book  Google Scholar 

  30. Scholkopf B, Mika S, Burges CJC, Knirsch P, Müller K-R, Ratsch G, Smola AJ. ‘Input space versus feature space in kernel-based methods.’ IEEE Trans Neural Netw. 1999;10(5):1000–17.

    Article  Google Scholar 

  31. Burges CJC. ‘A tutorial on support vector machines for pattern recognition.’ Data Min Knowl Disc. 1998;2(2):121–67.

    Article  Google Scholar 

  32. Bhuyan HK, Kamila NK, Pani SK. Individual privacy in data mining using fuzzy optimization. Eng Optim. 2021. (Early Published).

  33. Bordes A, Ertekin S, Weston J, Bottou L. ‘Fast kernel classifiers with online and active learning.’ J Mach Learn Res. 2005;6:1579–619.

    MathSciNet  MATH  Google Scholar 

  34. Bhuyan HK, Pani SK. Impact of world pandemic “COVID-19” and an assessment of world health management and economics. The Internet of Medical Things: Enabling Technologies and Emerging Applications, IET. 2022;pp 55.

  35. Bhuyan HK, Dash SK, Roy S, Swain DK. Privacy preservation with penalty in decentralized network using multiparty computation. Int J Adv Res Technol 2012;4(1):297–303

  36. Bhuyan HK, Kumar LR, Reddy KR. Optimization model for Sub-feature selection in data mining. 2nd International Conference on Smart Systems and Inventive Technology (ICSSIT 2019). IEEE Explore. 2019. 

  37. Dwivedi AK. Performance evaluation of different machine learning techniques for prediction of heart disease. Neural Comput Appl. 2016;29(10):685–93.

    Article  Google Scholar 

  38. Hosmer DW Jr, Lemeshow S. Applied logistic regression. New York: Wiley; 2004. 36. Schumacher M, Roner R, Vach W. Neural networks and logistic regression: part I. Comput Stat Data Anal. 1996;21(6):661–82.

  39. Hajmeer M, Basheer I. Comparison of logistic regression and neural network-based classifiers for bacterial growth. Food Microbiol. 2003;20(1):43–55.

    Article  Google Scholar 

  40. Hornik K, Stinchcombe M, White H. ‘Multilayer feedforward networks are universal approximators.’ Neural Netw. 1989;2(5):359–66.

    Article  Google Scholar 

  41. Street WN, Wolberg WH, Mangasarian OL. Nuclear feature extraction for breast tumor diagnosis. Proc SPIE. 1993;1905:861–870.

  42. Al-Antari MA, Al-Masni MA, Park SU, Park J, Metwally MK, Kadah YM, Han SM, Kim TS.  Automatic computer-aided diagnosis of breast cancer in digital mammogramsvia deep belief network, Global Conference on Engineering and AppliedScience (GCEAS), Japan. 2016;1306–1314.

  43. Al-Antari MA, Al-Masni MA, Park SU, Park J, Metwally MK, Kadah YM, Han SM, Kim TS. An automatic computer-aided diagnosis system for breastcancer in digital mammograms via deep belief network. J Med Biol Eng. 2017;38(no. 3):443–456. https://doi.org/10.1007/s40846-017-0321-6.

  44. Bhuyan HK, Huque MDS. Sigmoid function based classification in data mining. Int J Appl Sci Comput. 2019;6(2):908–912.

  45. Bhuyan HK, Mohanty M, Das SR. Privacy preserving for feature selection in data mining using centralized network. International Journal Computer Science and Issues (IJCSI). 2012;9(3):434 - 440. ISSN (Online version): 1694–0784, ISSN (Print version): 1694–0814.

  46. Alshammari MM, Almuhanna A, Alhiyafi J. Mammography image-based diagnosis of breast cancer using machine learning: a pilot study. Sensors. 2022;22:203.

    Google Scholar 

  47. Chauhan A, Kharpate H, Narekar Y, Gulhane S, Virulkar T, Hedau Y. Breast cancer detection and prediction using machine learning. 2021 Third International Conference on Inventive Research in Computing Applications (ICIRCA). 2021. https://doi.org/10.1109/ICIRCA51532.2021.9544687.

  48. Breast Cancer Wisconsin (Original) Data set, [Online]. https://archive.ics.uci.edu/ml/machine-learning-databases/breast-cancer-wisconsin/breast-cancer-wisconsin.data. Accessed Jan 2022

  49. Islam M, Haque M, Iqbal H, Hasan M, Hasan M, Kabir MN. Breast cancer prediction: a comparative study using machine learning techniques. SN Comp Sci. 2020.

  50. Bemando C, Miranda E, Aryuni M. Machine-learning-based prediction models of coronary heart disease using naïve bayes and random forest algorithms. In Proceedings of the 2021 International Conference on Software Engineering & Computer Systems and 4th International Conference on Computational Science and Information Management (ICSECS-ICOCSIM), Pekan, Malaysia 2021;24–28:232–237.

  51. Kumar RR, Polepaka S. Performance comparison of random forest classifier and convolution neural network in predicting heart diseases. In ICCII 2018, Proceedings of the Third International Conference on Computational Intelligence and Informatics; Springer: Singapore, 2020:683–691.

  52. Singh H, Navaneeth N, Pillai G. Multisurface proximal SVM based decision trees for heart disease classification. In Proceedings of the TENCON 2019–2019 IEEE Region 10 Conference (TENCON), Kerala, India 2018;17–20:13–18.

  53. Desai SD, Giraddi S, Narayankar P, Pudakalakatti NR, Sulegaon S. Back-propagation neural network versus logistic regression in heart disease classification. In Advanced Computing and Communication Technologies; Springer: Berlin/Heidelberg, Germany 2019:133–144.

  54. Safdar S, Rizwan M, Gadekallu TR, Javed AR, Rahmani MK, Jawad K, Bhatia S. Bio-Imaging-Based Machine Learning Algorithm for Breast Cancer Detection. Diagnostics (Basel). 2022;12(5):1134. https://doi.org/10.3390/diagnostics12051134.

    Article  Google Scholar 

  55. Abbas S, Jalil Z, Javed AR, Batool I, Khan MZ, Noorwali A, Gadekallu TR, Akbar A. BCD-WERT: a novel approach for breast cancer detection using whale optimization based efficient features and extremely randomized tree algorithm. Peer J Comput Sci. 2021;12(7):e390. https://doi.org/10.7717/peerj-cs.390. eCollection 2021.

Download references

Author information

Authors and Affiliations

Authors

Contributions

All authors have equal contributions in the current work.

Corresponding author

Correspondence to Vinayakumar Ravi.

Ethics declarations

Ethics approval

Not applicable.

Consent to participate

Not applicable.

Consent for publication

Not applicable.

Conflicts of interest

The authors declare no conflict of interest.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

The original online version of this article was revised: The correct author name is “Biswajit Brahma”.

Appendix

Appendix

Table 10 Twenty sample of data set as [48]
Fig. 5
figure 5

Only Benign and Malignant class for experiments

Fig. 6
figure 6

Binary form of class

Rights and permissions

Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Bhuyan, H.K., Ravi, V., Brahma, B. et al. Disease analysis using machine learning approaches in healthcare system. Health Technol. 12, 987–1005 (2022). https://doi.org/10.1007/s12553-022-00687-2

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s12553-022-00687-2

Keywords

Navigation