Skip to main content
Log in

Prediction analysis for Parkinson disease using multiple feature selection & classification methods

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

Diagnosis of Parkinson's disease (PD) is a difficult undertaking that requires a variety of testing and clinical trials. Despite all these testing and trials, there is a considerable risk of misdiagnosis of Parkinson's disease, which causes a delay in making decisions about the best treatment for the patients. Computer aided diagnosis can improve the experiences and helps the doctors to predict PD patients. Four sorts of classification algorithms with various types of feature selection approaches are proposed in this paper for successful PD diagnosis. Support Vector Machine-SVM, Nave Bayes-NB, K-Nearest Neighbor-KNN, and Random Forest-RF are the categorization techniques employed. All the dataset's features are not necessary for classification. To choose the relevant characteristics, four feature selection approaches are used. Least Absolute Shrinkage and Selection Operator (LASSO), Backward-forward, rough set, and tree-based feature selection techniques are used. Experimentation is carried out using the Parkinson's Progressive Markers Initiative (PPMI) dataset. Four classification algorithms are constructed, together with and without feature selection procedures, and comparative research is conducted. To measure the performance of the classifiers, various evaluation methodologies are used. Overall Random Forest classifier with four feature selection methods’ gives best results with average accuracy is 96% and rough set feature selection for all three classifiers’ average accuracy is 97%.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6

Similar content being viewed by others

Data availability

Based on the request to corresponding author data will be provided.

References

  1. Almuallim H, Dietterich TG (1991) Learning with many irrelevant features. In: Proc. AAAI-91, Anaheim, CA, pp 547–552

  2. Beal MF (2001) Experimental models of Parkinson’s disease. Nat Rev Neurosci 2:325–334

    Article  Google Scholar 

  3. Betarbet R, Sherer TB, Greenamyre JT (2002) Animal models of Parkinson’s disease. BioEssays 24:308–318

    Article  Google Scholar 

  4. Blum AI, Langley P (1997) Selection of relevant features and examples in machine learning. Artif Intell 97:245–271

    Article  MathSciNet  MATH  Google Scholar 

  5. Bron EE, Smits M, Niessen WJ, Klein S (2015) Feature selection based on the SVM weight vector for classification of dementia. IEEE J Biomed Health Inform 19(5):1617–1626

    Article  Google Scholar 

  6. Chandrashekar G, Sahin F (2014) A survey on feature selection methods. Comput Electr Eng 40(1):16–28

    Article  Google Scholar 

  7. Das R (2010) A comparison of multiple classification methods for diagnosis of Parkinson disease. Expert Syst Appl 37(2):1568–1572

    Article  Google Scholar 

  8. Dash S, Verma S, Khan MS, Wozniak M, Shafi J, Ijaz MF (2021) A hybrid method to enhance thick and thin vessels for blood vessel segmentation. Diagnostics 11(11):2017

    Article  Google Scholar 

  9. Dash S, Verma S, Bevinakoppa S, Wozniak M, Shafi J, Ijaz MF (2022) Guidance image-based enhanced matched filter with modified thresholding for blood vessel extraction. Symmetry 14(2):194

    Article  Google Scholar 

  10. Grifoni P, Caschera MC, Ferri F (2021) Evaluation of a dynamic classification method for multimodal ambiguities based on Hidden markov models. Evol Syst 12(2):377–395

    Article  Google Scholar 

  11. Guo H, Zhang F, Chen J, Xu Y, Xiang J (2017) Machine learning classification combining multiple features of a hyper-network of fMRI data in Alzheimer’s disease. Front Neurosci 11:615

    Article  Google Scholar 

  12. Gupta I, Sharma V, Kaur S, Singh AK (2022) PCA-RF: an efficient Parkinson's Disease prediction model based on random forest classification. arXiv preprint arXiv:2203.11287

  13. Guyon I, Elisseeff A (2003) An introduction to variable and feature selection. J Mach Learn Res 3(Mar):1157–1182

    MATH  Google Scholar 

  14. Hassanien AE, Ali JM (2004) Rough set approach for generation of classification rules of breast cancer data. Informatica 15(1):23–38

    Article  MATH  Google Scholar 

  15. Ijaz MF, Alfian G, Syafrudin M, Rhee J (2018) Hybrid prediction model for type 2 diabetes and hypertension using DBSCAN-based outlier detection, synthetic minority over sampling technique (SMOTE), and random forest. Appl Sci 8(8):1325

    Article  Google Scholar 

  16. Ijaz MF, Attique M, Son Y (2020) Data-driven cervical cancer prediction model with outlier detection and over-sampling methods. Sensors 20(10):2809

    Article  Google Scholar 

  17. Khemphila A, Boonjing V (2011) Heart disease classification using neural network and feature selection. In: 2011 21st international conference on systems engineering. IEEE, pp 406–409

  18. Khemphila A, Boonjing V (2012) Parkinsons disease classification using neural network and feature selection. World Acad Sci Eng Technol 64:15–18

    Google Scholar 

  19. Kira K, Rendell LA (1992) The feature selection problem: tradional methods and a new algorithm. In: Proc. AAAI-92, San Jose, CA, pp 122–126

  20. Kohavi R, John GH (1997) Wrappers for feature subset selection. Artif Intell 97:273–324

    Article  MATH  Google Scholar 

  21. Koller D, Sahami M (1996) Toward optimal feature selection. Stanford InfoLab.

  22. Kumar Y, Koul A, Singla R, Ijaz MF (2022) Artificial intelligence in disease diagnosis: a systematic literature review, synthesizing framework and future research agenda. J Ambient Intell Humaniz Comput:1–28

  23. Lee SH, Lim JS (2012) Parkinson’s disease classification using gait characteristics and wavelet-based feature extraction. Expert Syst Appl 39(8):7338–7344

    Article  Google Scholar 

  24. Li J, Cheng K, Wang S, Morstatter F, Trevino RP, Tang J, Liu H (2017) Feature selection: a data perspective. ACM Comput Surv 50(6):1–45

    Article  Google Scholar 

  25. Mafarja M, Mirjalili S (2018) Whale optimization approaches for wrapper feature selection. Appl Soft Comput 62:441–453

    Article  Google Scholar 

  26. Mafarja MM, Eleyan D, Jaber I, Hammouri A, Mirjalili S (2017) Binary dragonfly algorithm for feature selection. In: 2017 International Conference on New Trends in Computing Sciences (ICTCS). IEEE, pp 12–17

  27. Mafarja M, Aljarah I, Heidari AA, Hammouri AI, Faris H, Ala’M AZ, Mirjalili S (2018) Evolutionary population dynamics and grasshopper optimization approaches for feature selection problems. Knowl-Based Syst 145:25–45

    Article  Google Scholar 

  28. Mandal M, Singh PK, Ijaz MF, Shafi J, Sarkar R (2021) A tri-stage wrapper-filter feature selection framework for disease classification. Sensors 21(16):5571

    Article  Google Scholar 

  29. Mitra P, Murthy CA, Pal SK (2002) Unsupervised feature selection using feature similarity. IEEE Trans Pattern Anal Mach Intell 24(3):301–312

    Article  Google Scholar 

  30. Mittal V, Sharma RK (2021) Machine learning approach for classification of Parkinson disease using acoustic features. J Reliable Intell Environ 7(3):233–239

    Article  Google Scholar 

  31. Mostafa SA, Mustapha A, Mohammed MA, Hamed RI, Arunkumar N, Abd Ghani MK … Khaleefah SH (2019) Examining multiple feature evaluation and classification methods for improving the diagnosis of Parkinson’s disease. Cognit Syst Res 54:90–99

  32. Nilashi M, Ibrahim O, Ahmadi H, Shahmoradi L, Farahmand M (2018) A hybrid intelligent system for the prediction of Parkinson’s Disease progression using machine learning techniques. Biocybern Biomed Eng 38(1):1–15

    Article  Google Scholar 

  33. Novaković J (2016) Toward optimal feature selection using ranking methods and classification algorithms. Yugosl J Oper Res 21(1):119–135

    Article  MathSciNet  MATH  Google Scholar 

  34. Polat K (2012) Classification of Parkinson’s disease using feature weighting method on the basis of fuzzy C-means clustering. Int J Syst Sci 43(4):597–609

    Article  MathSciNet  MATH  Google Scholar 

  35. Prashanth R, Roy SD (2018) Early detection of Parkinson’s disease through patient questionnaire and predictive modelling. Int J Med Informatics 119:75–87

    Article  Google Scholar 

  36. Priya SJ, Rani AJ, Subathra MSP, Mohammed MA, Damaševičius R, Ubendran N (2021) Local pattern transformation based feature extraction for recognition of Parkinson’s disease based on gait signals. Diagnostics 11(8):1395

    Article  Google Scholar 

  37. Sharanyaa S, Renjith PN, Ramesh K (2022) An exploration on feature extraction and classification techniques for dysphonic speech disorder in Parkinson’s Disease. In: Inventive communication and computational technologies. Springer, Singapore, pp 33–48

  38. Singh N, Pillay V, Choonara YE (2007) Advances in the treatment of Parkinson’s disease. Prog Neurobiol 81:29–44

    Article  Google Scholar 

  39. Srinivasu PN, Ahmed S, Alhumam A, Kumar AB, Ijaz MF (2021) An AW-HARIS based automated segmentation of human liver using CT images. Comput Mater Contin 69(3):3303–3319

    Google Scholar 

  40. Srinivasu PN, SivaSai JG, Ijaz MF, Bhoi AK, Kim W, Kang JJ (2021) Classification of skin disease using deep learning neural networks with MobileNet V2 and LSTM. Sensors 21(8):2852

    Article  Google Scholar 

  41. Sujatha J, Rajagopalan SP (2017) Performance evaluation of machine learning algorithms in the classification of Parkinson disease using voice attributes. Int J Appl Eng Res 12(21):10669–10675

    Google Scholar 

  42. Tahir NM, Manap HH (2012) Parkinson Disease gait classification based on machine learning approach. J Appl Sci Faisalabad (Faisalabad) 12:180–185

    Google Scholar 

  43. Vulli A, Srinivasu PN, Sashank MSK, Shafi J, Choi J, Ijaz MF (2022) Fine-tuned DenseNet-169 for breast cancer metastasis prediction using FastAI and 1-cycle policy. Sensors 22(8):2988

    Article  Google Scholar 

  44. Weston J, Mukherjee S, Chapelle O, Pontil M, Poggio T, Vapnik V (2001) Feature selection for SVMs. In: Advances in neural information processing systems, pp 668–674

  45. Win TZ, Kham NSM (2019) Information gain measured feature selection to reduce high dimensional data. Seventeenth International Conference on Computer Applications (ICCA 2019)

  46. Yadav G, Kumar Y, Sahoo G (2012) Predication of Parkinson's disease using data mining methods: a comparative analysis of tree, statistical and support vector machine classifiers. In: 2012 national conference on computing and communication systems. IEEE, pp 1–8

  47. Yasar A, Saritas I, Sahman MA, Cinar AC (2019) Classification of Parkinson disease data with artificial neural networks. In: IOP conference series: materials science and engineering, vol 675, no 1. IOP Publishing, pp 012031

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to V. Pandimurugan.

Ethics declarations

Conflict of interest

The authors whose names are listed immediately below certify that they have NO affiliations with or involvement in any organization or entity with any financial interest (such as honoraria; educational grants; participation in speakers’ bureaus; membership, employment, consultancies, stock ownership, or other equity interest; and expert testimony or patent-licensing arrangements), or non-financial interest (such as personal or professional relationships, affiliations, knowledge or beliefs) in the subject matter or materials discussed in this manuscript.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Hema, M.S., Maheshprabhu, R., Reddy, K.S. et al. Prediction analysis for Parkinson disease using multiple feature selection & classification methods. Multimed Tools Appl 82, 42995–43012 (2023). https://doi.org/10.1007/s11042-023-15280-6

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-023-15280-6

Keywords

Navigation