Abstract
Major Depressive Disorder (MDD) is a largely extended mental health disorder commonly associated with a hesitant and monotonous speech. This study analyses a speech corpus from a database acquired on 40 MDD patients and 40 matched controls (CT). During the recordings, individuals experienced different levels of cognitive stress when performing Stroop color test that includes three tasks with increasingly level of difficulty. Speech features based on the fundamental frequency (F0), and the speech ratio (SR), which measures the speech to silence ratio, are used for characterising depressive mood and stress responsiveness. Results show that SR is significantly lower in MDD subjects compared to healthy controls for all the tasks, decreasing as the difficulty of the cognitive tasks, and thus the stress level, increases. Moreover F0 related parameters (median and interquartile range) show higher values within the same subject in tasks with increased difficulty level for both groups. It can be concluded that speech features could be used for characterising depressive mood and assessing different levels of stress.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Vos T (2017) Global, regional, and national incidence, prevalence, and years lived with disability for 328 disease and injuries, 195 countries, 1990–2016: a systematic analysis for the Global Burden of Disease Study 2016. Lancet 390:1211–1259
World Health Organisation (WHO) (2011) Depression: let’s talk. In: Website of World Health Association. Disorders Management, Depression. https://www.who.int/news-room/detail/30-03-2017--depression-let-s-talk-says-who-as-depression-tops-list-of-causes-of-ill-health. Accessed Jan 2020
American Psychiatric Association (1994) Diagnosis and Statistical Manual of Mental Disorders (DSM). 4th edn. Washington DC
Sperry SH, Kwapil TR, Eddington KM et al (2018) Psychopathology, everyday behaviours, and autonomic activity in daily life: An ambulatory impedance cardiography study of depression, anxiety, and hypomaniac traits. Int J Psychophysiol 129:67–75
Kräpelin E (1921) Manic-depressive insanity and paranoia, 2nd edn. Livingstone, Edinburgh
Cummins N, Scherer S, Krajewski J et al (2015) A review of depression and suicide risk assessment using speech analysis. Speech Commun 71:10–49
Hönig F et al (2014) Automatic modelling of depressed speech: relevant features and relevance of gender. In: 15th Proceedings of Interspeech, Singapore, 14–18 September 2014
Cannizzaro M, Harel B, Reilly N et al (2004) Automatic modelling of depressed speech: voice acoustical measurement of the severity of major depression. Brain Cogn 56:30–35
France DJ, Shiavi RG, Silverman S et al (2000) Acoustical properties of speech as indicator of depression and suicidal risk. IEEE T Bio Med Eng 47:309–319
Mundt JC, Snyder PJ, Cannizzaro MS et al (2007) Voice acoustic measures of depression severity and treatment response collected via interactive voice response (IVR) technology. J Neurolinguist 20:50–64
Taguchi T, Tachikawa H, Nemoto K, Suzuki M et al (2017) Major depressive disorder discrimination using vocal acoustic features. J Affect Disorders 225:214–220
Quatieri TF et al (2012) Vocal-source biomarkers for depression: a link to psychomotor activity. In: 13th Proceedings of Interspeech, Portland, OR, USA, 9–13 September 2012
Mundt JC, Vogel AP, Feltner DE et al (2012) Vocal acoustic biomarkers of depression severity and treatment response. Biol Psychiatry 72:580–587
Stroop JR (1992) Studies of interference in serial verbal reactions. J Exp Psychol 121:15–23
Resch B, Nilsson M, Ekman A et al (2007) Estimation of the Instantaneous Pitch of Speech. IEEE T Audio Speech 15:813–822
Eyben F, Wöllmer M, Schuller B (2010) openSMILE - the munich versatile and fast open-source audio feature extractor. In: Proceedings of the 18th ACM international conference on multimedia, Firenze, Italy, 25–29 October 2010
Ramírez J, Górriz JM, Segura JC (2007) Voice activity detection. Fundamentals and speech recognition system robustness. In: Grimm M, Kroschel K (eds) Robust speech recognition and understanding. InTech
Klatt DH, Klatt LC (1990) Analysis, synthesis and perception of voice quality variations among female and male talkers. J Acoust Soc Am 87:820–857
Schuller B et al (2014) The INTERSPEECH 2014 computational paralinguistics challenge: cognitive & physical load. In: 15th Proceedings of Interspeech, Singapore, 14–18 September 2014
Yin B et al (2008) Speech-based cognitive load monitoring system. In: 2008 IEEE international conference on acoustics, speech, and signal processing, Las Vegas, NV, USA, 31 March–4 April 2008
Yap TF, Epps J, Ambikairajah E et al (2001) Formant frequencies under cognitive load: effects and classification. EURASIP J Adv Sig Pr
Williamson JR et al (2014) Vocal and facial biomarkers of depression based on motor incoordination and timing. In: AVEC 2014 Proceedings of the 4th international workshop on audio/visual emotion challenge, Orlando, Florida, USA, November 2014
Lam RW, Kennedy SH, McIntyre RS et al (2014) Cognitive dysfunction in major depressive disorder: effects on psychosocial functioning and implications for treatment. Can J Psychiatry 59:614–654
Scarpina F, Tagini S (2017) The stroop color and word test. Front Psychol 8:557
Videbech P, Ravnkilde B, Gammelgaard L et al (2014) The danish PET/depression project: performance on Stroop’s test linked to white matter lesions in the brain. Psychiatry Res 130:117–130
Kontaxis S, Orini M, Gil E, Posadas-de Miguel M, Bernal ML, Aguiló J, de la Cámara C, Laguna P, Bailón R (2018) Heart rate variability analysis guided by respiration in major depressive disorder. In: 45th International conference of computing in cardiology, Maastricht, The Netherlands, 23–26 September 2018
Acknowledgements
This work has been supported by AEI and FEDER under the projects RTI2018-097723-B-I00 and 2014–2020 “Building Europe from Aragón”, by CIBER de Bioingeniería, Biomateriales y Nanomedicina, and CIBERSAM, through Instituto de Salud Carlos III, by LMP44-18, BSICoS group (T39-20R), ViVoLab group (T36-20R) and a personal grant to S. Kontaxis funded by Gobierno de Aragón; and by Spanish Ministry of Economy and Competitiveness and the European Social Fund (TIN2017-85854-C4-1-R). The computation was performed by the ICTS ‘NANBIOSIS’, more specifically by the High Performance Computing Unit of the CIBER in Bioengineering, Biomaterials & Nanomedicne (CIBERBBN).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this chapter
Cite this chapter
Martínez, C. et al. (2021). Analysis of Prosodic Features During Cognitive Load in Patients with Depression. In: D'Haro, L.F., Callejas, Z., Nakamura, S. (eds) Conversational Dialogue Systems for the Next Decade. Lecture Notes in Electrical Engineering, vol 704. Springer, Singapore. https://doi.org/10.1007/978-981-15-8395-7_14
Download citation
DOI: https://doi.org/10.1007/978-981-15-8395-7_14
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-8394-0
Online ISBN: 978-981-15-8395-7
eBook Packages: EngineeringEngineering (R0)