Analysis of Prosodic Features During Cognitive Load in Patients with Depression

Martínez, Carmen; Kontaxis, Spyridon; Posadas-de Miguel, Mar; García, Esther; Siddi, Sara; Aguiló, Jordi; Haro, Josep Maria; de la Cámara, Concepción; Bailón, Raquel; Ortega, Alfonso

doi:10.1007/978-981-15-8395-7_14

Carmen Martínez³⁷,
Spyridon Kontaxis³⁸,
Mar Posadas-de Miguel³⁹,
Esther García⁴⁰,
Sara Siddi⁴¹,
Jordi Aguiló⁴⁰,
Josep Maria Haro⁴¹,
Concepción de la Cámara³⁹,
Raquel Bailón³⁸ &
…
Alfonso Ortega³⁸

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 704))

865 Accesses

Abstract

Major Depressive Disorder (MDD) is a largely extended mental health disorder commonly associated with a hesitant and monotonous speech. This study analyses a speech corpus from a database acquired on 40 MDD patients and 40 matched controls (CT). During the recordings, individuals experienced different levels of cognitive stress when performing Stroop color test that includes three tasks with increasingly level of difficulty. Speech features based on the fundamental frequency (F0), and the speech ratio (SR), which measures the speech to silence ratio, are used for characterising depressive mood and stress responsiveness. Results show that SR is significantly lower in MDD subjects compared to healthy controls for all the tasks, decreasing as the difficulty of the cognitive tasks, and thus the stress level, increases. Moreover F0 related parameters (median and interquartile range) show higher values within the same subject in tasks with increased difficulty level for both groups. It can be concluded that speech features could be used for characterising depressive mood and assessing different levels of stress.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 109.00; Price excludes VAT (USA)

Softcover Book: USD 139.99; Price excludes VAT (USA)

Hardcover Book: USD 199.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Detecting subtle signs of depression with automated speech analysis in a non-clinical sample

Article Open access 27 December 2022

Evaluation of Depression Severity in Speech

On the Significance of Speech Pauses in Depressive Disorders: Results on Read and Spontaneous Narratives

References

Vos T (2017) Global, regional, and national incidence, prevalence, and years lived with disability for 328 disease and injuries, 195 countries, 1990–2016: a systematic analysis for the Global Burden of Disease Study 2016. Lancet 390:1211–1259
Article Google Scholar
World Health Organisation (WHO) (2011) Depression: let’s talk. In: Website of World Health Association. Disorders Management, Depression. https://www.who.int/news-room/detail/30-03-2017--depression-let-s-talk-says-who-as-depression-tops-list-of-causes-of-ill-health. Accessed Jan 2020
American Psychiatric Association (1994) Diagnosis and Statistical Manual of Mental Disorders (DSM). 4th edn. Washington DC
Google Scholar
Sperry SH, Kwapil TR, Eddington KM et al (2018) Psychopathology, everyday behaviours, and autonomic activity in daily life: An ambulatory impedance cardiography study of depression, anxiety, and hypomaniac traits. Int J Psychophysiol 129:67–75
Article Google Scholar
Kräpelin E (1921) Manic-depressive insanity and paranoia, 2nd edn. Livingstone, Edinburgh
Google Scholar
Cummins N, Scherer S, Krajewski J et al (2015) A review of depression and suicide risk assessment using speech analysis. Speech Commun 71:10–49
Article Google Scholar
Hönig F et al (2014) Automatic modelling of depressed speech: relevant features and relevance of gender. In: 15th Proceedings of Interspeech, Singapore, 14–18 September 2014
Google Scholar
Cannizzaro M, Harel B, Reilly N et al (2004) Automatic modelling of depressed speech: voice acoustical measurement of the severity of major depression. Brain Cogn 56:30–35
Article Google Scholar
France DJ, Shiavi RG, Silverman S et al (2000) Acoustical properties of speech as indicator of depression and suicidal risk. IEEE T Bio Med Eng 47:309–319
Article Google Scholar
Mundt JC, Snyder PJ, Cannizzaro MS et al (2007) Voice acoustic measures of depression severity and treatment response collected via interactive voice response (IVR) technology. J Neurolinguist 20:50–64
Article Google Scholar
Taguchi T, Tachikawa H, Nemoto K, Suzuki M et al (2017) Major depressive disorder discrimination using vocal acoustic features. J Affect Disorders 225:214–220
Article Google Scholar
Quatieri TF et al (2012) Vocal-source biomarkers for depression: a link to psychomotor activity. In: 13th Proceedings of Interspeech, Portland, OR, USA, 9–13 September 2012
Google Scholar
Mundt JC, Vogel AP, Feltner DE et al (2012) Vocal acoustic biomarkers of depression severity and treatment response. Biol Psychiatry 72:580–587
Article Google Scholar
Stroop JR (1992) Studies of interference in serial verbal reactions. J Exp Psychol 121:15–23
Article Google Scholar
Resch B, Nilsson M, Ekman A et al (2007) Estimation of the Instantaneous Pitch of Speech. IEEE T Audio Speech 15:813–822
Article Google Scholar
Eyben F, Wöllmer M, Schuller B (2010) openSMILE - the munich versatile and fast open-source audio feature extractor. In: Proceedings of the 18th ACM international conference on multimedia, Firenze, Italy, 25–29 October 2010
Google Scholar
Ramírez J, Górriz JM, Segura JC (2007) Voice activity detection. Fundamentals and speech recognition system robustness. In: Grimm M, Kroschel K (eds) Robust speech recognition and understanding. InTech
Google Scholar
Klatt DH, Klatt LC (1990) Analysis, synthesis and perception of voice quality variations among female and male talkers. J Acoust Soc Am 87:820–857
Article Google Scholar
Schuller B et al (2014) The INTERSPEECH 2014 computational paralinguistics challenge: cognitive & physical load. In: 15th Proceedings of Interspeech, Singapore, 14–18 September 2014
Google Scholar
Yin B et al (2008) Speech-based cognitive load monitoring system. In: 2008 IEEE international conference on acoustics, speech, and signal processing, Las Vegas, NV, USA, 31 March–4 April 2008
Google Scholar
Yap TF, Epps J, Ambikairajah E et al (2001) Formant frequencies under cognitive load: effects and classification. EURASIP J Adv Sig Pr
Google Scholar
Williamson JR et al (2014) Vocal and facial biomarkers of depression based on motor incoordination and timing. In: AVEC 2014 Proceedings of the 4th international workshop on audio/visual emotion challenge, Orlando, Florida, USA, November 2014
Google Scholar
Lam RW, Kennedy SH, McIntyre RS et al (2014) Cognitive dysfunction in major depressive disorder: effects on psychosocial functioning and implications for treatment. Can J Psychiatry 59:614–654
Article Google Scholar
Scarpina F, Tagini S (2017) The stroop color and word test. Front Psychol 8:557
Article Google Scholar
Videbech P, Ravnkilde B, Gammelgaard L et al (2014) The danish PET/depression project: performance on Stroop’s test linked to white matter lesions in the brain. Psychiatry Res 130:117–130
Article Google Scholar
Kontaxis S, Orini M, Gil E, Posadas-de Miguel M, Bernal ML, Aguiló J, de la Cámara C, Laguna P, Bailón R (2018) Heart rate variability analysis guided by respiration in major depressive disorder. In: 45th International conference of computing in cardiology, Maastricht, The Netherlands, 23–26 September 2018
Google Scholar

Download references

Acknowledgements

This work has been supported by AEI and FEDER under the projects RTI2018-097723-B-I00 and 2014–2020 “Building Europe from Aragón”, by CIBER de Bioingeniería, Biomateriales y Nanomedicina, and CIBERSAM, through Instituto de Salud Carlos III, by LMP44-18, BSICoS group (T39-20R), ViVoLab group (T36-20R) and a personal grant to S. Kontaxis funded by Gobierno de Aragón; and by Spanish Ministry of Economy and Competitiveness and the European Social Fund (TIN2017-85854-C4-1-R). The computation was performed by the ICTS ‘NANBIOSIS’, more specifically by the High Performance Computing Unit of the CIBER in Bioengineering, Biomaterials & Nanomedicne (CIBERBBN).

Author information

Authors and Affiliations

University of Zaragoza, Campus Río Ebro, C/María de Luna 1, 50018, Zaragoza, Spain
Carmen Martínez
University of Zaragoza, Campus Río Ebro, C/María de Luna 1, 50018, Zaragoza, Spain
Spyridon Kontaxis, Raquel Bailón & Alfonso Ortega
Hospital Clínico de Zaragoza, Zaragoza, Spain
Mar Posadas-de Miguel & Concepción de la Cámara
Autonomous University of Barcelona, Barcelona, Spain
Esther García & Jordi Aguiló
Parc Sanitari Sant Joan de Déu, Barcelona, Spain
Sara Siddi & Josep Maria Haro

Authors

Carmen Martínez
View author publications
You can also search for this author in PubMed Google Scholar
Spyridon Kontaxis
View author publications
You can also search for this author in PubMed Google Scholar
Mar Posadas-de Miguel
View author publications
You can also search for this author in PubMed Google Scholar
Esther García
View author publications
You can also search for this author in PubMed Google Scholar
Sara Siddi
View author publications
You can also search for this author in PubMed Google Scholar
Jordi Aguiló
View author publications
You can also search for this author in PubMed Google Scholar
Josep Maria Haro
View author publications
You can also search for this author in PubMed Google Scholar
Concepción de la Cámara
View author publications
You can also search for this author in PubMed Google Scholar
Raquel Bailón
View author publications
You can also search for this author in PubMed Google Scholar
Alfonso Ortega
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Carmen Martínez .

Editor information

Editors and Affiliations

Speech Technology Group - Information Processing and Telecommunications Center (IPTC), Universidad Politécnica de Madrid, Madrid, Spain
Luis Fernando D'Haro
Department of Languages and Computer Systems, Universidad de Granada, CITIC-UGR, Granada, Spain
Zoraida Callejas
Information Science, Nara Institute of Science and Technology, Ikoma, Japan
Satoshi Nakamura

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Martínez, C. et al. (2021). Analysis of Prosodic Features During Cognitive Load in Patients with Depression. In: D'Haro, L.F., Callejas, Z., Nakamura, S. (eds) Conversational Dialogue Systems for the Next Decade. Lecture Notes in Electrical Engineering, vol 704. Springer, Singapore. https://doi.org/10.1007/978-981-15-8395-7_14

Download citation

DOI: https://doi.org/10.1007/978-981-15-8395-7_14
Published: 25 October 2020
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-8394-0
Online ISBN: 978-981-15-8395-7
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics

Analysis of Prosodic Features During Cognitive Load in Patients with Depression

Abstract

Access this chapter

Similar content being viewed by others

Detecting subtle signs of depression with automated speech analysis in a non-clinical sample

Evaluation of Depression Severity in Speech

On the Significance of Speech Pauses in Depressive Disorders: Results on Read and Spontaneous Narratives

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Publish with us

Navigation

Analysis of Prosodic Features During Cognitive Load in Patients with Depression

Abstract

Access this chapter

Similar content being viewed by others

Detecting subtle signs of depression with automated speech analysis in a non-clinical sample

Evaluation of Depression Severity in Speech

On the Significance of Speech Pauses in Depressive Disorders: Results on Read and Spontaneous Narratives

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Share this chapter

Publish with us

Search

Navigation