Vocal dysperiodicities estimation by means of adaptive long-term prediction

Kacha, Abdellah; Bettens, Frédéric; Grenez, Francis

doi:10.1007/s11517-005-0003-3

ORIGINAL ARTICLE
Published: 26 January 2006

Volume 44, pages 61–68, (2006)
Medical and Biological Engineering and Computing

Abstract

An adaptive formulation of the long-term bi-directional linear predictive analysis is proposed in the context of the acoustic assessment of disordered speech. Vocal dysperiodicities are summarized by means of a signal-to-dysperiodicity ratio (SDR) marker. It is shown that performing an adaptive forward and backward long-term linear prediction of each speech sample and retaining the minimal prediction error energy as a cue of vocal dysperiodicity results in an SDR that correlates with the perceived degree of hoarseness. The coefficients of the time-varying long-term linear predictive model are estimated by means of the recursive least squares algorithm. The corpora comprise sustained vowels and French sentences produced by male and female normophonic and dysphonic speakers. A perceptual assessment of speech samples, which rests on comparative judgments, is used to evaluate the ability of the acoustic marker to predict subjective measures of voice quality. Experimental results show that the adaptive approach gives rise to high correlations for sustained vowels as well as for sentences.
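
The processing chain summarized in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the three-tap predictor, the fixed and known lag, the forgetting factor `lam`, the initialisation constant `delta`, and all function names are assumptions made for illustration. Backward prediction is approximated here by running the same forward predictor on the time-reversed signal.

```python
import numpy as np


def rls_long_term_error(x, lag, taps=3, lam=0.99, delta=100.0):
    """A-priori error of a long-term predictor adapted by RLS.

    Sample x[n] is predicted from `taps` samples centred on x[n - lag].
    Samples that cannot be predicted keep an error equal to the signal.
    (Illustrative assumptions: odd `taps`, fixed `lag`, exponential forgetting.)
    """
    x = np.asarray(x, dtype=float)
    half = taps // 2
    if taps % 2 == 0 or lag <= half:
        raise ValueError("taps must be odd and lag must exceed taps // 2")
    w = np.zeros(taps)            # time-varying predictor coefficients
    P = delta * np.eye(taps)      # inverse correlation matrix estimate
    err = x.copy()                # zero prediction where no past is available
    for n in range(lag + half, len(x)):
        u = x[n - lag - half: n - lag + half + 1]   # distant past samples
        e = x[n] - w @ u                            # a-priori prediction error
        Pu = P @ u
        k = Pu / (lam + u @ Pu)                     # RLS gain vector
        w = w + k * e                               # coefficient update
        P = (P - np.outer(k, Pu)) / lam             # inverse correlation update
        err[n] = e
    return err


def dysperiodicity_trace(x, lag, **rls_kwargs):
    """Per sample, keep the forward or backward error with the smaller energy."""
    x = np.asarray(x, dtype=float)
    fwd = rls_long_term_error(x, lag, **rls_kwargs)
    bwd = rls_long_term_error(x[::-1], lag, **rls_kwargs)[::-1]
    return np.where(fwd ** 2 <= bwd ** 2, fwd, bwd)


def sdr_db(x, e, eps=1e-12):
    """Signal-to-dysperiodicity ratio in dB (eps guards against division by zero)."""
    x = np.asarray(x, dtype=float)
    return 10.0 * np.log10(np.sum(x ** 2) / (np.sum(np.asarray(e) ** 2) + eps))


# Synthetic demonstration: a periodic signal with and without additive noise.
period = 80
n = np.arange(800)
clean = sum(np.sin(2 * np.pi * h * n / period) / h for h in (1, 2, 3))
noisy = clean + 0.1 * np.random.default_rng(0).standard_normal(len(n))

sdr_clean = sdr_db(clean, dysperiodicity_trace(clean, period))
sdr_noisy = sdr_db(noisy, dysperiodicity_trace(noisy, period))
print(f"SDR clean: {sdr_clean:.1f} dB, SDR noisy: {sdr_noisy:.1f} dB")
```

On real speech the lag would have to be estimated or searched over a plausible range of glottal cycle lengths rather than fixed in advance; the sketch only shows how the minimum of the two error energies feeds the SDR.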

Acknowledgements

The authors would like to thank Prof. J. Schoentgen, National Fund for Scientific Research, Belgium, for useful comments and discussions, and the anonymous reviewers for their helpful advice.

Author information

Authors and Affiliations

Service Ondes et Signaux, Faculté des Sciences Appliquées, Université Libre de Bruxelles, Av. F. D. Roosevelt 50, CP 165/51, 1050, Bruxelles, Belgium
Abdellah Kacha, Frédéric Bettens & Francis Grenez

Corresponding author

Correspondence to Abdellah Kacha.

About this article

Cite this article

Kacha, A., Bettens, F. & Grenez, F. Vocal dysperiodicities estimation by means of adaptive long-term prediction. Med Bio Eng Comput 44, 61–68 (2006). https://doi.org/10.1007/s11517-005-0003-3

Received: 29 July 2005
Accepted: 16 November 2005
Published: 26 January 2006
Issue Date: March 2006
DOI: https://doi.org/10.1007/s11517-005-0003-3
