Parkinson’s disease is a neurodegenerative disorder that progresses slowly and its symptoms appear over time, so its early diagnosis is not easy. A neurologist can diagnose Parkinson's by reviewing the patient's medical history and repeated scans. Besides, body movement analysts can diagnose Parkinson's by analyzing body movement. Recent research work has shown that changes in speech can be used as a measurable indicator for early Parkinson’s detection. In this work, the authors propose a speech signal-based hybrid Parkinson's disease diagnosis system for its early diagnosis. To achieve this, the authors have tested several combinations of feature selection approaches and classification algorithms and designed the model with the best combination. To formulate various combinations, three feature selection methods such as mutual information gain, extra tree, and genetic algorithm and three classifiers namely naive bayes, k-nearest-neighbors, and random forest have been used. To analyze the performance of different combinations, the speech dataset available at the UCI (University of California, Irvine) machine learning repository has been used. As the dataset is highly imbalanced so the class balancing problem is overcome by the synthetic minority oversampling technique (SMOTE). The combination of genetic algorithm and random forest classifier has shown the best performance with 95.58% accuracy. Moreover, this result is also better than the recent work found in the literature.
This is a preview of subscription content,to check access.
Access this article
Similar content being viewed by others
Almeida, J. S., Rebouças Filho, P. P., Carneiro, T., Wei, W., Damaševičius, R., Maskeliūnas, R., & de Albuquerque, V. H. C. (2019). Detecting Parkinson’s disease with sustained phonation and speech signals using machine learning techniques. Pattern Recognition Letters, 125, 55–62. https://doi.org/10.1016/j.patrec.2019.04.005.
Chandrashekar, G., & Sahin, F. (2014). A survey on feature selection methods. Computers & Electrical Engineering, 40(1), 16–28. https://doi.org/10.1016/j.compeleceng.2013.11.024.
Emamzadeh, F. N., & Surguchov, A. (2018). Parkinson’s disease: Biomarkers, treatment, and risk factors. Frontiers in Neuroscience, 12, 612. https://doi.org/10.3389/fnins.2018.00612.
Grover, S., Bhartia, S., Yadav, A., & Seeja, K. R. (2018). Predicting severity of Parkinson’s disease using deep learning. Procedia Computer Science, 132, 1788–1794. https://doi.org/10.1016/j.procs.2018.05.154.
Gunduz, H. (2019). Deep learning-based Parkinson’s disease classification using vocal feature sets. IEEE Access, 7, 115540–115551. https://doi.org/10.1109/ACCESS.2019.2936564.
Gupta, D., Julka, A., Jain, S., Aggarwal, T., Khanna, A., Arunkumar, N., & de Albuquerque, V. H. C. (2018). Optimized cuttlefish algorithm for diagnosis of Parkinson’s disease. Cognitive Systems Research, 52, 36–48. https://doi.org/10.1016/j.cogsys.2018.06.006.
Hemphill, E., Lindsay, J., Lee, C., Măndoiu, I. I., & Nelson, C. E. (2014). Feature selection and classifier performance on diverse bio-logical datasets. BMC bioinformatics, BioMed Central, 15(13), S4. https://doi.org/10.1186/1471-2105-15-S13-S4.
Jain, A., Tiwari, S., & Sapra, V. (2019). Two-phase heart disease diagnosis system using deep learning. International Journal of Control and Automation, 12(5), 558–573.
Lahmiri, S., & Shmuel, A. (2019). Detection of Parkinson’s disease based on voice patterns ranking and optimized support vector machine. Biomedical Signal Processing and Control, 49, 427–433. https://doi.org/10.1016/j.bspc.2018.08.029.
Lamba, R., Gulati, T., Al-Dhlan, K. A., & Jain, A. (2021). A systematic approach to diagnose Parkinson’s disease through kinematic features extracted from handwritten drawings. Journal of Reliable Intelligent Environments, 1–10. https://doi.org/10.1007/s40860-021-00130-9.
Lamba, R., Gulati, T., & Jain, A. (2020). Comparative analysis of Parkinson’s disease diagnosis system. Advances in Mathematics Scientific Journal, 9(6), 3399–3406. https://doi.org/10.37418/amsj.9.6.20.
Little, M., McSharry, P., Hunter, E., Spielman, J., & Ramig, L. (2008). Suitability of dysphonia measurements for telemonitoring of Parkinson’s disease. Nature Precedings. https://doi.org/10.1038/npre.2008.2298.1.
Mostafa, S. A., Mustapha, A., Mohammed, M. A., Hamed, R. I., Arunkumar, N., Ghani, M. K. A., Jaber, M. M., & Khaleefah, S. H. (2019). Examining multiple feature evaluation and classification methods for improving the diagnosis of Parkinson’s disease. Cognitive Systems Research, 54, 90–99. https://doi.org/10.1016/j.cogsys.2018.12.004.
Murthy, Y. S., & Koolagudi, S. G. (2018). Classification of vocal and non-vocal segments in audio clips using genetic algorithm based feature selection (GAFS). Expert Systems with Applications, 106, 77–91. https://doi.org/10.1016/j.eswa.2018.04.005.
Naranjo, L., Pérez, C. J., Campos-Roca, Y., & Martín, J. (2016). Addressing voice recording replications for Parkinson’s disease detection. Expert Systems with Applications, 46, 286–292. https://doi.org/10.1016/j.eswa.2015.10.034.
Nissar, I., Rizvi, D., Masood, S. and Mir, A. (2019). Voice-Based Detection of Parkinson’s Disease through Ensemble Machine Learning Approach: A Performance Study. EAI Endorsed Transactions on Pervasive Health and Technology, 5(19). doi: https://doi.org/10.4108/eai.13-7-2018.162806
Olivares, R., Munoz, R., Soto, R., Crawford, B., Cárdenas, D., Ponce, A., & Taramasco, C. (2020). An optimized brain-based algorithm for classifying Parkinson’s disease. Applied Sciences, 10(5), 1827. https://doi.org/10.3390/app10051827.
Polat, K. (2019). A hybrid approach to Parkinson disease classification using speech signal: The combination of SMOTE and random forests. In 2019 scientific meeting on electrical-electronics & biomedical engineering and computer science (EBBT) IEEE, 1–3. https://doi.org/10.1109/EBBT.2019.8741725
Rani, P., Kumar, R., Ahmed, N. M. O. S., & Jain, A. (2021a). A decision support system for heart disease prediction based upon machine learning. Journal of Reliable Intelligent Environments. https://doi.org/10.1007/s40860-021-00133-6.
Rani, P., Kumar, R., & Jain, A. (2021b). Multistage Model for Accurate Prediction of Missing Values Using Imputation Methods in Heart Disease Dataset. In J. S. Raj, A. M. Iliyasu, R. Bestak, & Z. A. Baig (Eds.), Innovative Data Communication Technologies and Application, Lecture Notes on Data Engineering and Communications Technologies (pp. 637–653). Singapore: Springer. https://doi.org/10.1007/978-981-15-9651-3_53.
Rani, P., Kumar, R., Jain, A., & Lamba, R. (2020). Taxonomy of Machine Learning Algorithms and its Applications. Journal of Computational and Theroretical Nanoscience, 17(6), 2509–2514. https://doi.org/10.1166/jctn.2020.8922.
Reich, S. G., & Savitt, J. M. (2018). Parkinson Disease. Medical Clinics of North America. https://doi.org/10.1016/j.mcna.2018.10.014.
Remeseiro, B., & Bolon-Canedo, V. (2019). A review of feature selection methods in medical applications. Computers in Biology and Medicine. https://doi.org/10.1016/j.compbiomed.2019.103375.
Sakar, C. O., Serbes, G., Gunduz, A., Tunc, H. C., Nizam, H., Sakar, B. E., Tutuncu, M., Aydin, T., Isenkul, M. E., & Apaydin, H. (2019). A comparative analysis of speech signal processing algorithms for Parkinson’s disease classification and the use of the tunable Q-factor wavelet transform. Applied Soft Computing, 74, 255–263. https://doi.org/10.1016/j.asoc.2018.10.022.
Sarker, I. H., Kayes, A. S. M., & Watters, P. (2019). Effectiveness analysis of machine learning classification models for predicting personalized context-aware smartphone usage. Journal of Big Data, 6(1), 57. https://doi.org/10.1186/s40537-019-0219-y.
Senturk, Z. K. (2020). Early diagnosis of Parkinson’s disease using machine learning algorithms. Medical Hypotheses, 138, 109603. https://doi.org/10.1016/j.mehy.2020.109603.
Sharma, P., Jain, R., Sharma, M., & Gupta, D. (2019). Parkinson’s diagnosis using ant-lion optimisation algorithm. International Journal of Innovative Computing and Applications, 10(3–4), 138–146. https://doi.org/10.1504/IJICA.2019.103370.
Sharma, P., Sundaram, S., Sharma, M., Sharma, A., & Gupta, D. (2019). Diagnosis of Parkinson’s disease using modified grey wolf optimization. Cognitive Systems Research, 54, 100–115. https://doi.org/10.1016/j.cogsys.2018.12.002.
Solana-Lavalle, G., Galán-Hernández, J. C., & Rosas-Romero, R. (2020). Automatic Parkinson disease detection at early stages as a pre-diagnosis tool by using classifiers and a small set of vocal features. Biocybernetics and Biomedical Engineering, 40(1), 505–516. https://doi.org/10.1016/j.bbe.2020.01.003.
Tuncer, T., & Dogan, S. (2019). A novel octopus based Parkinson’s disease and gender recognition method using vowels. Applied Acoustics, 155, 75–83. https://doi.org/10.1016/j.apacoust.2019.05.019.
Tuncer, T., Dogan, S., & Acharya, U. R. (2020). Automated detection of Parkinson’s disease using minimum average maximum tree and singular value decomposition method with vowels. Biocybernetics and Biomedical Engineering, 40(1), 211–220. https://doi.org/10.1016/j.bbe.2019.05.006.
Uddin, S., Khan, A., Hossain, M. E., & Moni, M. A. (2019). Comparing different supervised machine learning algorithms for disease prediction. BMC Medical Informatics and Decision Making, 19(1), 1–16. https://doi.org/10.1186/s12911-019-1004-8.
Xiong, Y., & Lu, Y. (2020). Deep Feature Extraction From the Vocal Vectors Using Sparse Autoencoders for Parkinson’s Classification. IEEE Access, 8, 27821–27830. https://doi.org/10.1109/ACCESS.2020.2968177.
Yaman, O., Ertam, F., & Tuncer, T. (2020). Automated Parkinson’s disease recognition based on statistical pooling method using acoustic features. Medical Hypotheses, 135, 109483. https://doi.org/10.1016/j.mehy.2019.109483.
Zahid, L., Maqsood, M., Durrani, M. Y., Bakhtyar, M., Baber, J., Jamal, H., Mehmood, I., & Song, O. Y. (2020). A Spectrogram-Based Deep Feature Assisted Computer-Aided Diagnostic System for Parkinson’s Disease. IEEE Access, 8, 35482–35495. https://doi.org/10.1109/ACCESS.2020.2974008.
Zesiewicz, T. A., Bezchlibnyk, Y., Dohse, N., & Ghanekar, S. D. (2019). Management of Early Parkinson Disease. Clinics in geriatric medicine. https://doi.org/10.1016/j.cger.2019.09.001.
Conflict of interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Lamba, R., Gulati, T., Alharbi, H.F. et al. A hybrid system for Parkinson’s disease diagnosis using machine learning techniques. Int J Speech Technol 25, 583–593 (2022). https://doi.org/10.1007/s10772-021-09837-9