Abstract
The acoustic measurement of the severity of the symptoms present in pathological voice is an active research area, for being inexpensive and non invasive. Periodicity perturbations are among the most frequently used, requiring the previous extraction of the individual glottal pulse boundaries. In this paper we explore the performances of methods for detecting glottal pulse boundaries as implemented by freely available software (Praat, intended for phonetic studies) vs. a research-grade pulse cycle detector (reported as a super-resolution method). We compare the sequences of pulse markers as obtained by two of Praat’s internal implementations and the super-resolution method against the hand-marked reference sequence in a dataset of pathological sustained vowels from a well-known database. A group of performance measures is extracted from this comparison, using a Dynamic-Time Warping alignment procedure. The measures obtained show the pros and cons of each alternative. Researchers and clinicians must be aware of the benefits of selecting either approach.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Baken, R.J., Orlikoff, R.F.: Clinical Measurement of Speech and Voice, 2nd edn. Cengage Learning (2000)
Kreiman, J., Gerratt, B.R.: Measuring vocal quality. In: Kent, R.D., Ball, M.J. (eds.) Voice Quality Measurement, pp. 73–112. Singular, San Diego (2000)
Dejonckere, P.H., et al.: A basic protocol for functional assessment of voice pathology, especially for investigating the efficacy of (phonosurgical) treatments and evaluating new assessment techniques: Guideline elaborated by the Committee on Phoniatrics of the European Laryngolo. Eur. Arch. Oto-Rhino-Laryngology 258(2), 77–82 (2001)
Dejonckere, P.H.: Assessment of voice and respiratory function. In: Remacle, M., Eckel, E.H. (eds.) Surgery of Larynx and Trachea, pp. 11–26. Springer, Heidelberg (2010)
Buder, E.H.: Acoustic analysis of voice quality: a tabulation of algorithms 1902–1990. In: Kent, R.D., Ball, M.J. (eds.) Voice Quality Measurement, pp. 119–244. Singular, San Diego (2000)
Ng, M.L., Wong, J.: Voice onset time characteristics of esophageal, tracheoesophageal, and laryngeal speech of cantonese. J. Speech Lang. Hear. Res. 52, 780–789 (2009)
Karlsson, F., Olofsson, K., Blomstedt, P., Linder, J., Nordh, E., van Doorn, J.: Articulatory closure proficiency in patients with Parkinson’s disease following deep brain stimulation of the subthalamic nucleus and caudal zona incerta. J. Speech. Lang. Hear. Res. 57, 1178–1190 (2014)
Pouplier, M., Marin, S., Waltl, S.: Voice onset time in consonant cluster errors: can phonetic accommodation differentiate cognitive from motor errors? J. Speech Lang. Hear. Res. 57, 1577–1588 (2014)
Borrie, S.A., Liss, J.M.: Rhythm as a coordinating device: entrainment with disordered speech. J. Speech Lang. Hear. Res. 57, 815–824 (2014)
Skodda, S., Grönheit, W., Schlegel, U.: Intonation and speech rate in Parkinson’s disease: general and dynamic aspects and responsiveness to levodopa admission, J. Voice 25(4) (2011)
Ní Chasaide, A., Yanushevskaya, I., Gobl, C.: Voice source dynamics in intonation. In: Proceedings of 17th International Congress of Phonetic Sciences (ICPhS 2011), pp. 1470–1473 (2011)
Hess, W.J.: Pitch Determination of Speech Signals. Springer, Berlin Heidelberg (1983)
Hess, W.J.: Pitch and voicing determination of speech with an extension toward music signals. In: Benesty, J., Sondhi, M.M., Huang, Y.A. (eds.) Springer Handbook of Speech Processing. Springer. Heidelberg, pp. 181–212 (2008)
Boersma, P.: PRAAT, a system for doing phonetics by computer. Glot Int. 5(9/10), 5 (2002)
Barsties, B., Latoszek, V., Maryn, Y., Gerrits, E., De Bodt, M.S.: “The acoustic breathiness index (ABI): a multivariate acoustic model for breathiness, J. Voice (2017, in press)
Uloza, V., Latoszek, B.B.V., Ulozaite-Staniene, N., Petrauskas, T., Maryn, Y.: A comparison of Dysphonia severity index and acoustic voice quality index measures in differentiating normal and dysphonic voices. Eur. Arch. Oto-Rhino-Laryngology 275(4), 949–958 (2018)
Latoszek, B.B.V., Ulozaitė-Stanienė, N., Petrauskas, T., Uloza, V., Maryn, Y.: Diagnostic accuracy of dysphonia classification of DSI and AVQI. Laryngoscope (2018)
Latoszek, B.B.V., De Bodt, M., Gerrits, E., Maryn, Y.: The exploration of an objective model for roughness with several acoustic markers. J. Voice 32(2), 149–161 (2018)
Boersma, P.: Should jitter be measured by peak picking or by waveform matching? Folia Phoniatr. Logop. 61(5), 305–308 (2009)
Medan, Y., Yair, E., Chazan, D.: Super resolution pitch determination of speech signals. IEEE Trans. Signal Process. 39(1), 40–48 (1991)
Bagshaw, P.C., Miller, S.M., Jack, M.A.: Enhanced pitch tracking and the processing of F0 contours for computer aided intonation teaching. In: 3rd European Conference on Speech Communication and Technology EUROSPEECH 1993, pp. 1003–1006 (1993)
Parsa, V., Jamieson, D.G.: A comparison of high precision F0 extraction algorithms for sustained vowels. J. Speech. Lang. Hear. Res. 42(1), 112–126 (1999)
Ferrer, C., Torres, D., Hernández-Díaz, M.E.: Using dynamic time warping of T0 contours in the evaluation of cycle-to-cycle pitch detection algorithms. Pattern Recognit. Lett. 31(6), 517–522 (2010)
Disordered Voice Database v1.03. Kay Elemetrics Corp (1994)
Hosokawa, K., Ogawa, M., Hashimoto, M., Inohara, H.: Statistical analysis of the reliability of acoustic and electroglottographic perturbation parameters for the detection of vocal roughness. J. Voice 28(2), 263.e9–263.e16 (2014)
Manfredi, C., Giordano, A., Schoentgen, J., Fraj, S., Bocchi, L., Dejonckere, P.H.: Validity of jitter measures in non-quasi-periodic voices. part II: the effect of noise. Logop. Phoniatr. Vocology 36(2), 78–89 (2011)
Acknowledgements
This work was partially supported by an Alexander von Humboldt Foundation Fellowship granted to one of the authors (Ref 3.2-1164728-CUB-GF-E).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Rodríguez-Guillén, R., Ferrer-Riesgo, C.A. (2020). Measuring Periodicity Perturbations in Pathological Voice: General-Purpose Software vs. Custom-Tailored Methods. In: González Díaz, C., et al. VIII Latin American Conference on Biomedical Engineering and XLII National Conference on Biomedical Engineering. CLAIB 2019. IFMBE Proceedings, vol 75. Springer, Cham. https://doi.org/10.1007/978-3-030-30648-9_8
Download citation
DOI: https://doi.org/10.1007/978-3-030-30648-9_8
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-30647-2
Online ISBN: 978-3-030-30648-9
eBook Packages: EngineeringEngineering (R0)