Skip to main content

Advertisement

Log in

Vocal dysperiodicities estimation by means of adaptive long-term prediction

  • ORIGINAL ARTICLE
  • Published:
Medical and Biological Engineering and Computing Aims and scope Submit manuscript

Abstract

An adaptive formulation of the long-term bi-directional linear predictive analysis is proposed in the context of the acoustic assessment of disordered speech. Vocal dysperiodicities are summarized by means of a signal-to-dysperiodicity ratio (SDR) marker. It is shown that performing an adaptive forward and backward long-term linear prediction of each speech sample and retaining the minimal prediction error energy as a cue of vocal dysperiodicity results in an SDR that correlates with the perceived degree of hoarseness. The coefficients of the time-varying long-term linear predictive model are estimated by means of the recursive least squares algorithm. The corpora comprise sustained vowels and French sentences produced by male and female normophonic and dysphonic speakers. A perceptual assessment of speech samples, which rests on comparative judgments, is used to evaluate the ability of the acoustic marker to predict subjective measures of voice quality. Experimental results show that the adaptive approach gives rise to high correlations for sustained vowels as well as for sentences.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1

Similar content being viewed by others

References

  1. Bettens F, Grenez F, Schoengen J (2005) Estimation of vocal dysperiodicities in connected speech by means of distant-sample bi-directional linear predictive analysis. J Acoust Soc Am 117:328–337

    Article  PubMed  Google Scholar 

  2. Dejonckere PH, Remacle M, Fresnel-elbaz E, Woisard V, Crevier-buchman L, Millet B (1996) Differentiated perceptual evaluation of pathological voice quality: reliability and correlations with acoustic measurements. Rev Laryngol Otol Rhinol 117:219–224

    Google Scholar 

  3. De Krom G (1993) A cepstrum-based technique for determining a harmonics-to-noise-ratio in speech signals. J Speech Hear Res 36:254–265

    PubMed  Google Scholar 

  4. De Oliveira RM, Pareira JC, Grellet M (2000) Adaptive estimation of residue signal for voice pathology diagnosis. IEEE Trans Biomed Eng 47:96–103

    Article  PubMed  Google Scholar 

  5. Haykin S (1991) Adaptive filter theory. Prentice Hall, Englewood Cliffs

    MATH  Google Scholar 

  6. Hillenbrand J, Houde RA (1996) Acoustic correlates of breathy vocal quality: dysphonic voices and continuous speech. J Speech Hear Res 39:311–321

    PubMed  Google Scholar 

  7. Kacha A, Grenez F, Schoentgen J (2005) Voice quality assessment by means of comparative judgments of speech tokens. In: International conference on spoken language processing, September 4–8, 2005, Lisboa, Portugal, pp 1733–1736

  8. Kahn M, Garst P (1983) The effects of five voice characteristics on LPC quality. In: International conference on acoustics, speech, and signal processing, Boston, pp 531–534

  9. Klingholtz F (1987) The measurement of the signal-to-noise ratio (SNR) in continuous speech. Speech Commun 6:1–12

    Article  Google Scholar 

  10. Klingholtz F (1990) Acoustic recognition of voice disorders: a comparative study of running speech versus sustained vowels. J Acoust Soc Am 87:2218–2224

    Article  PubMed  Google Scholar 

  11. Kreiman J, Gerrat BR (1998) Validity of rating scale measures of voice quality. J Acoust Soc Am 104:1598–1608

    Article  PubMed  Google Scholar 

  12. Lieberman P (1963) Some acoustic measures of the fundamental periodicity of normal and pathologic larynges. J Acoust Soc Am 35:344–353

    Article  Google Scholar 

  13. Makhoul J (1975) Linear prediction: a tutorial review. Proc IEEE 63:561–580

    Article  Google Scholar 

  14. Moore D, Mccabe G (1999) Introduction to the practice of statistics. Freeman, New York

    Google Scholar 

  15. Murphy P (2000) Spectral characterization of jitter, shimmer and additive noise in synthetically generated voice signals. J Acoust Soc Am 107:978–988

    Article  PubMed  Google Scholar 

  16. Muta H, Baer T, Wagatsuma K, Muraoka T, Fukuda H (1988) A pitch-synchronous analysis of hoarseness in running speech. J Acoust Soc Am 84:1292–1301

    Article  PubMed  Google Scholar 

  17. Parsa J, Jamieson JDG (2001) Acoustic discrimination of pathological voice: sustained vowels versus continuous speech. J Speech Hear Res 44:327–339

    Article  Google Scholar 

  18. Qi Y (1999) The estimation of signal-to-noise ratio in continuous speech for disordered voices. J Acoust Soc Am 105:3532–2535

    Google Scholar 

  19. Qi Y, Hillman R (1997) Temporal and spectral estimation of harmonics-to-noise ratio in human voice signals. J Acoust Soc Am 102:537–543

    Article  PubMed  Google Scholar 

  20. Ramachandran RP, Kabal P (1989) Pitch prediction filters in speech coding. IEEE Trans Acoust Speech Signal Proc 37:467–478

    Article  Google Scholar 

  21. Schoentgen J (1982) Quantitative evaluation of the discrimination performance of acoustic features in detecting laryngeal pathology. Speech Commun 1:269–282

    Article  Google Scholar 

  22. Schoentgen J (2003) Spectral models of additive and modulation noise in speech and phonatory excitation signals. J Acoust Soc Am 113:553–562

    Article  PubMed  Google Scholar 

  23. Schoentgen J, Bensaid M, Bucella F (2000) Multivariate statistical analysis of flat vowel spectra models with a view to characterizing dysphonic voices. J Speech Lang Hear Res 43:1493–1508

    PubMed  Google Scholar 

  24. Yumoto E, Gould WJ (1982) The estimation of signal-to-noise ratio in continuous speech of disordered voices. J Acoust Soc Am 71:1544–1549

    Article  PubMed  Google Scholar 

Download references

Acknowledgements

The authors would like to thank Prof. J. Schoentgen, National Fund for Scientific Research, Belgium for useful comments and discussions and the anonymous reviewers for their useful advices.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Abdellah Kacha.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Kacha, A., Bettens, F. & Grenez, F. Vocal dysperiodicities estimation by means of adaptive long-term prediction. Med Bio Eng Comput 44, 61–68 (2006). https://doi.org/10.1007/s11517-005-0003-3

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11517-005-0003-3

Keywords

Navigation