Abstract
Cepstral analysis is used to estimate the harmonics-to-noise ratio (HNR) in speech signals. The inverse Fourier transformed liftered cepstrum approximates a noise baseline from which the harmonics-to-noise ratio is estimated. The present study highlights the cepstrum-based noise baseline estimation process; it is shown to analogous to the action of a moving average filter applied to the power spectrum of voiced speech. The noise baseline, which is taken to approximate the noise excited vocal tract is influenced by the window length and the shape of the glottal source spectrum. Two existing estimation techniques are tested systematically using synthetically generated glottal flow and voiced speech signals with a priori knowledge of the HNR. The source influence is removed using a novel harmonic pre-emphasis technique. The results indicate accurate HNR estimation using the present approach. A preliminary investigation of the method with a set of normal/ pathological data is investigated.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
de Krom, G.: A cepstrum based technique for determining a harmonics-to-noise ratio in speech signals. J. Speech Hear. Res. 36(2), 254–266 (1993)
Qi, Y., Hillman, R.E.: Temporal and spectral estimations of harmonics-to-noise ratio in human voice signals. J. Acoust. Soc. Amer. 102(1), 537–543 (1997)
Murphy, P.J.: A cepstrum-based harmonics-to-noise ratio in voice signals. In: Proceedings International Conference on Spoken Language Processing, Beijing, China, pp. 672–675 (2000)
Schafer, R.W., Rabiner, L.R.: System for automatic formant analysis of voiced speech. J. Acoust. Soc. Amer. 47, 634–648 (1970)
Murphy, P.J.: Averaged modified periodogram analysis of aperiodic voice signals. In: Proceedings Irish Signals and Systems Conference, Dublin, pp. 266–271 (June 2000)
Fant, G., Liljencrants, J., Lin, Q.G.: A four parameter model of glottal flow. STL-QPSR 4, 1–12 (1985)
Murphy, P.J.: Perturbation-free measurement of the harmonics-to-noise ratio in speech signals using pitch-synchronous harmonic analysis. J. Acoust. Soc. Amer. 105(5), 2866–2881 (1999)
Childers, D.G.: Speech processing and synthesis toolboxes. John Wiley & Sons, Inc., New York (1999)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Murphy, P.J., Akande, O.O. (2006). Cepstrum-Based Estimation of the Harmonics-to-Noise Ratio for Synthesized and Human Voice Signals. In: Faundez-Zanuy, M., Janer, L., Esposito, A., Satue-Villar, A., Roure, J., Espinosa-Duro, V. (eds) Nonlinear Analyses and Algorithms for Speech Processing. NOLISP 2005. Lecture Notes in Computer Science(), vol 3817. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11613107_13
Download citation
DOI: https://doi.org/10.1007/11613107_13
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-31257-4
Online ISBN: 978-3-540-32586-4
eBook Packages: Computer ScienceComputer Science (R0)