Abstract
An early and accurate diagnosis of Alzheimer’s disease (AD) has been progressively attracting more attention in recent years. One of the main problems of AD is the loss of language skills. This paper presents a computational framework for classifying AD patients compared to healthy control subjects using information from spontaneous speech signals. Spontaneous speech data are obtained from 30 AD patients and 30 healthy controls. Because of the nonlinear and dynamic nature of speech signals, higher order spectral features (specifically bispectrum) were used for analysis. Four classifiers (k-Nearest Neighbor, Support Vector Machine, Naïve Bayes and Decision tree) were used to classify subjects into three different levels of AD and healthy group based on their performance in terms of the HOS-based features. Ten-fold cross-validation method was used to test the reliability of the classifier results. The results showed that the proposed method had a good potential in AD diagnosis. The proposed method was also able to diagnose the earliest stage of AD with high accuracy. The method has the great advantage of being non-invasive, cost-effective, and associated with no side effects. Therefore, the proposed method can be a spontaneous speech directed test for pre-clinical evaluation of AD diagnosis.
Similar content being viewed by others
References
Acharya UR, Chua CK, Ng EY, Yu W, Chee C (2008) Application of higher order spectra for the identification of diabetes retinopathy stages. Journal of medical system 32:481–488
Azarbarzin A, Moussavi Z (2011) Nonlinear properties of snoring sounds. In: 2011 IEEE international conference on acoustics, speech and signal processing (ICASSP), pp 4316–4319
Banbrook M, Mclughlin S (1994) Is speech chaotic? Invariant geometrical measures for speech data. In: IEEE colloquium on exploiting chaos in signals processing, 8/1–8/10, 1994
Breiman L, Friedman JH, Olshen RA, Stone CJ (1984) Classification and Regression Trees. Wadsworth International Group, Belmont
Buiza C (2010) Evaluación y tratamiento de los trastornosdellenguaje. MatiaFundazioa, Donostia
Childers DG (ed) (1978) Modern spectrum analysis. IEEE Press, New York
Christianini N, Taylor J (2000) Support vector machines and other Kernal-based learning methods. Cambridge University Press, Cambridge
Chua KC, Chandran V, Acharya UR, Lim CM (2008) Cardiac state diagnosis using higher order spectra of heart rate variability. J Med Eng Technol 32:145–155
Chua CK, Chandran V, Acharya RU, Min LC (2009) Cardiac health diagnosis using higher order spectra and support vector machine. Open Med Inform J 3:1–8
Chua KC, Chandran V, Acharya UR, Lim CM (2010) Application of higher order statistics/spectra in biomedical signals—a review. Med Eng Phys 32:679–689
Dauwels J, Vialatte F, Cichocki A (2010) Diagnosis of Alzheimer’s disease from EEG signals: where are we standing? Curr Alzheimer Res 7:487–505
Dogan MC, Mendel JM (1992) Real time robust pitch detector. In International conference on acoustics, speech and signal processing, San Francisco, USA, pp I129–I132
Gdoura IJ, Louzou P, Spanias A (1993) Speech processing using higher order statistics. In: Proceeding of the IEEE international symposium on circuits and systems, Chicago, pp 160–163
Good IJ (1965) The estimation of probabilities: an essay on modern Bayesian methods. M.I.T. Press, Cambridge
Han J, Kamber M (2006) Data mining: concepts and techniques, 2nd edn. Morgan Kaufmann, San Francisco
Hu WT, McMillan C, Libon D, Leight S, Forman M, Lee VMY, Trojanowski JQ, Grossman M (2010) Multimodal predictors for Alzheimer’s disease in non fluent primary progressive aphasia. Neurology 75:595–602
Indrebo KM, Povinelli RJ, Johnson MT (2004) A comparison of reconstructed phase spaces and cepstral coefficients for multi-band phoneme classification. In: Proceedings 7th international conference on signal processing, 2004. Proceedings. ICSP ‘04
Jack CR Jr, Holtzman DM (2013) Biomarker modeling of Alzheimer’s disease. Neuron 80:1347–1358
Jia P, Dai J, Pan Y, Zhu M (2005) Novel algorithm for attribute reduction based on mutual-information gain ratio. J Zhejiang Univ (Eng Ed) 40:1041–1044
Kippenhan JS, Barker WW, Nagel J, Grady C, Duara R (1994) Neural-network classification of normal and Alzheimer’s disease subjects using high-resolution and low-resolution PET cameras. J Nucl Med 35:7–15
König A, Satt A, Sorin A, Hoory R, Toledo-Ronen O, Derreumaux A, Manera V, Verhey F, Aalten P, Robert PH, David R (2015) Automatic speech analysis for the assessment of patients with predementia and Alzheimer’s disease. Alzheimer’s Dement Diagn Assess Disease Monit 1:112–124
Kumar A, Mullick SK (1990) Attractor dimension, entropy and modeling of speech time series. Electron Lett 26:1790–1791
Langley P, Iba W, Thompson K (1992) An analysis of Bayesian classifiers. In Proceedings of the tenth national conference on artificial intelligence. AAAI Press and MIT Press, pp 223–228
Li Z, Wu Z, He Y, Fulei C (2005) Hidden Markov model-based fault diagnostics method in speed-up and speed-down process for rotating machinery. Mech Syst Signal Process 19:329–339
López de Ipiña K et al (2013a) Automatic analysis of emotional response based on non-linear speech modeling oriented to Alzheimer disease diagnosis. In: IEEE 17th international conference on intelligent engineering systems, 19–21 June 2013
López de Ipiña K et al (2013b) Feature extraction approach based on fractal dimension for spontaneous speech modelling oriented to Alzheimer disease diagnosis. Adv Nonlinear Speech Process 7911:144–151
López de Ipiña K, Solé-Casals J, Eguiraun H, Alonso JB, Travieso CM, Ezeiza A, Barroso N, Ecay-Torres M, Martinez-Lage P, Beitia B (2015) Feature selection for spontaneous speech analysis to aid in Alzheimer’s disease diagnosis: a fractal dimension approach. Comput Speech Lang 30:43–60
Martinez F, Garcia J, Perez E, Carro J, Anara JM (2012) Patrones de Prosodiaexpresiva en pacientes con enfermedadde Alzheimer. Psicothema 24:16–21
Martis RJ, Acharya UR, Mandana KM, Ray AK, Chakraborty C (2013) Cardiac decision making using higher order spectra. Biomed Signal Process Control 8:193–203
McKeith IG, Galasko D, Kosaka K, Perry EK, Dickson DW, Hansen LA, Salmon DP, Lowe J, Mirra SS, Byrne EJ, Lennox G, Quinn NP, Edwardson JA, Ince PG, Bergeron C, Burns A, Miller BL, Lovestone S, Collerton D, Jansen EN, Ballard C, de Vos RA, Wilcock GK, Jellinger KA, Perry RH (1996) Consensus guideline for the clinical and pathological diagnosis of dementia with lewy bodies (DLB): report of the consortium on DLB international workshop. Neurology 47:1113–1124
McKhann GM et al (2011) The diagnosis of dementia due to Alzheimer’s disease: recommendations from the National Institute on Aging-Alzheimer’s Association workgroups on diagnostic guidelines for Alzheimer’s disease. Alzheimer’s Dement 7:263–269
Mendal JM (1991) Tutorial on higher-order statistics (spectra) in signal processing and system theory: theoretical results and some application. Proc IEEE 79:278–305
Mohammadpoory Z, Haddadnia J (2014) Speech enhancement using Laplacian mixture model under signal presence uncertainty. IJE Trans C Asp 27:1367–1376
Mookiah MRK, Acharya UR, Lim CM, Petznick A, Suri JS (2012) Data mining technique for automated diagnosis of glaucoma using higher order spectra and wavelet energy features. Knowl-Based Syst 33:73–82
Moreno A, Fonollosa JAR (1992) Pitch determination of noisy speech using HOS. In: International conference on acoustics, speech and signal processing, San Francisco, USA, pp 133–136
Moreno A, Fonollosa JAR, Vidal J (1993) Vocoder design based on HOS. In: Eurospeech ‘93, Berlin, Germany, pp 519–522
Muller KR, Mika S, Ratsch G, Tsuda K, Scholkopf B (2001) An introduction to Kernal based learning algorithms. IEEE Trans Neural Netw 12:181–201
Muthuswamy J, Sharma A (1996) A study of electroencephalographic descriptors and end-tidal concentration in estimating depth of anesthesia. J Clin Monit 12:353–364
Nasrolahzadeh M, Haddadnia J (2016) Poincaré plots of Spontaneous Speech Signals during Alzheimer’s disease. Mitteilungen Saechsischer Entomologen 119:358–365
Nasrolahzadeh M, Mohhamadpoori Z, Haddadnia J (2014) Optimal way to find the frame length of the speech signal for diagnosis of Alzheimer’s disease with PSO. Asian J Math Comput Res 2:33–41
Nasrolahzadeh M, Mohhamadpoori Z, Haddadnia J (2015a) Adaptive neuro-fuzzy inference system for classification of speech signals in Alzheimer’s disease using acoustic and non-linear characteristics. Asian J Math Comput Res 3:122–131
Nasrolahzadeh M, Mohhamadpoori Z, Haddadnia J (2015b) Alzheimer’s disease diagnosis using spontaneous speech signals and hybrid features. Asian J Math Comput Res 7:322–331
Nasrolahzadeh M, Mohhamadpoori Z, Haddadnia J (2016a) Analysis of mean square error surface and its corresponding contour plots of spontaneous speech signals in Alzheimer’s disease with adaptive wiener filter. Comput Hum Behav 61:364–371
Nasrolahzadeh M, Mohhamadpoory Z, Haddadnia J (2016b) A novel method for early diagnosis of Alzheimer’s disease based on higher-order spectral estimation of spontaneous speech signals. Cognit Neurodyn 10:495–503
Nikias CL, Petropulu A (1993) Higher-order spectral analysis: a nonlinear signal processing framework. Prentice Hall, Englewood Cliffs
Nikias CL, Raghuveer MR (1987) Bispectrum estimation: a digital signal processing framework. Proc IEEE 75:869–891
Ning T, Bronzino JD (1990) Autoregressive and Bispectral analysis techniques: EEG applications. IEEE Eng Med Biol 9:47–50
Ortiz A, Górriz JM, Ramírez J, Martínez-Murcia FJ (2013) LVQ-SVM Based CAD tool applied to structural MRI for the diagnosis of the Alzheimer’s disease. Pattern Recognit Lett 34:1725–1733
Pai PF, Hsu MF, Wang MC (2011) A support vector machine based model for detecting top management fraud. Knowl-Based Syst 24:314–321
Paliwal KK, Sondhi MM (1991) Recognition of noisy speech using cumulant based linear prediction analysis. In: International conference on acoustics, speech and signal processing, Toronto, Canada, pp 429–432
Quinlan JR (1993) C4.5: programs for machine learning. Morgan Kaufmann Publishers, Inc, Los Altos. ISBN 1-55860-238-0
Rangoussi M, Delopoulos A, Tsatsanis M (1993a) On the use of higher-order statistics for robust endpoint detection of speech. In: IEEE signal processing workshop on higher-order statistics, Lake Tahoe, California, USA. IEEE. pp 56–60
Rangoussi M, Bakamidis S, Carayannis G (1993b) Robust endpoint detection of speech in the presence of noise. In: Eurospeech ‘93, Berlin, Germany. ESCA, pp 649–652
Refaeilzadeh P, Tang L, Liu H (2009) Cross-validation. In: Encyclopedia of database systems, pp 532–538
Ren J (2012) ANN vs. Svm: which one performs better in classification of Mccs in mammogram imaging. Knowl-Based Syst 26:144–153
Reynolds A (2013) Alzheimer disease: focus on computed tomography. Radiol Technol 2085:187CT–211CT
Rohrer JD (2012) Structural brain imaging in frontotemporal dementia. Biochem Biophys Acta 1822:325–332
Salas-Gonzalez D, Gorriz JM, Ramírez J, Lopez M, Alvarez I, Segovia F, Chaves R, Puntonet CG (2010) Computer-aided diagnosis of Alzheimer’s disease using support vector machines and classification trees. Phsy Med Biol 5:2807–2817
Salavedra JM, Masgrau E, Moreno A, Jove X (1993a) A speech enhancement system using higher order AR estimation in real environments. In: Eurospeech ‘93, Berlin, Germany, pp 223–226
Salavedra JM, Masgrau E, Moreno A, Jove X (1993b) Comparison of different order cumulants in a speech enhancement system by adaptive wiener filtering. In: IEEE signal processing workshop on higher-order statistics, Lake Tahoe, California, USA. IEEE, pp 61–65
Sigl JC, Chamoun NG (1994) An introduction to bispectral analysis for the electroencephalogram. J Clin Monit 10:392–404
Silipo R, Deco G, Vergassola R, Bartsch H (1998) Dynamics extraction in multivariate biomedical time series. Biol Cybern 79:15–27
SubbaRao T, Gabr MM (1984) An introduction to bispectral analysis and bilinear time series models (Lecture notes in statistics), vol 24. Springer, New York. ISBN 0-387-96039-2
Teodorescu HN, Grigorasand F, Apppei V (1996) Nonlinear and non stationary processes in speech production. Int J Chaos Theor Appl 5:1453–1457
Todder D, Avissar S, Schreiber G (2013) Non-linear dynamic analysis of inter- word time intervals in psychotic speech. IEEE J Transl Eng Health Med 1:2200107. https://doi.org/10.1109/JTEHM.2013.2268850
Xianda Z (1995) Modern signal processing. Tsinghua University Press, Beijing, pp 373–433
Yuvaraj R, Murugappan M, Ibrahim NM, Sundaraj K, Omar MI, Mohamad K, Palaniappan R (2014) Detection of emotions in Parkinson’s disease using higher order spectral features from brain’s electrical activity. Biomed Signal Process Control 14:108–116
Zhang Y, Dong Z, Phillips P, Wang S, Ji G, Yang J, Yuan TF (2015a) Detection of subjects and brain regions related to Alzheimer’s disease using 3D MRI scans based on eigenbrain and machine learning. Front Comput Neurosci 9:66. https://doi.org/10.3389/fncom.2015.00066.eCollection2015
Zhang Y, Wang S, Phillips P, Dong Z, Ji G, Yang J (2015b) Detection of Alzheimer’s disease and mild cognitive impairment based on structural volumetric MR images using 3D-DWT and WTA-KSVM trained by PSOTVAC. Biomed Signal Process Control 21:58–73
Zhang Z, Zheng H, Liang K, Wang H, Kong S, Hu J, Wu F, Sun G (2015c) Functional degeneration in dorsal and ventral attention systems in amnestic mild cognitive impairment and Alzheimer’s disease: an fMRI study. Neurosci Lett 585:160–165
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Nasrolahzadeh, M., Mohammadpoory, Z. & Haddadnia, J. Higher-order spectral analysis of spontaneous speech signals in Alzheimer’s disease. Cogn Neurodyn 12, 583–596 (2018). https://doi.org/10.1007/s11571-018-9499-8
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11571-018-9499-8