Abstract
In this paper, we present a low-complexity algorithm that uses linear prediction analysis (covariance method) in a new way to retrieve the maximum-phase component of speech signals. First, we study the mixed-phase model of speech through a new representation in the z-plane, the Zeros of Z-Transform (ZZT), which is an all-zero representation of the z-transform of a discrete-time signal. Then, based on the properties of the mixed-phase model, we introduce an algorithm to estimate the anticausal glottal flow component from speech signals. LP-covariance analysis is used to estimate a pole pair outside the unit circle, corresponding to the anticausal poles of the source component in the mixed-phase speech model. Given this pair of anticausal poles, we propose a procedure to resynthesize the anticausal part of the glottal flow, followed by an open quotient estimation method. Evaluations show that the method performs well on synthetic speech but lacks robustness on natural speech.
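The central idea, that covariance-method linear prediction can return poles outside the unit circle, can be illustrated with a minimal sketch. This is not the authors' implementation: the toy signal (a time-reversed damped cosine standing in for an anticausal resonance), the parameter values r, theta and N, and the helper names lp_covariance and lp_poles are all assumptions made for illustration. The covariance method is written here as a plain least-squares fit over an unwindowed frame, which, unlike the autocorrelation method, does not force the predictor to be minimum-phase.

```python
import numpy as np


def lp_covariance(x, order):
    """Unwindowed least-squares (covariance-method) linear prediction.

    Returns coefficients a such that x[n] ~= sum_k a[k] * x[n - 1 - k].
    Hypothetical helper, written for illustration only.
    """
    x = np.asarray(x, dtype=float)
    n = np.arange(order, len(x))
    # Each row holds the `order` samples preceding x[n], most recent first.
    past = np.column_stack([x[n - k] for k in range(1, order + 1)])
    a, *_ = np.linalg.lstsq(past, x[n], rcond=None)
    return a


def lp_poles(a):
    """Poles of the predictor: roots of z^p - a[0] z^(p-1) - ... - a[p-1]."""
    return np.roots(np.concatenate(([1.0], -a)))


# Toy anticausal resonance (assumed values): a damped cosine with pole
# radius 1/r, reversed in time so that it grows toward the end of the frame.
r, theta, N = 1.1, 0.3, 64
t = np.arange(N)
causal = (1.0 / r) ** t * np.cos(theta * t)
x = causal[::-1]

poles = lp_poles(lp_covariance(x, order=2))
print(np.abs(poles), np.angle(poles))
```

On this noise-free toy frame the second-order fit is exact, so the estimated pole pair lies at radius r and angles of plus and minus theta, that is, outside the unit circle. Real speech frames contain vocal tract and noise contributions as well, which is consistent with the robustness limitation reported in the abstract.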
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Bozkurt, B., Severin, F., Dutoit, T. (2005). An Algorithm to Estimate Anticausal Glottal Flow Component from Speech Signals. In: Chollet, G., Esposito, A., Faundez-Zanuy, M., Marinaro, M. (eds) Nonlinear Speech Modeling and Applications. NN 2004. Lecture Notes in Computer Science, vol 3445. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11520153_15
DOI: https://doi.org/10.1007/11520153_15
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-27441-4
Online ISBN: 978-3-540-31886-6
eBook Packages: Computer Science, Computer Science (R0)