An Algorithm to Estimate Anticausal Glottal Flow Component from Speech Signals

Bozkurt, Baris; Severin, François; Dutoit, Thierry

doi:10.1007/11520153_15

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 3445)

Included in the following conference series:

International School on Neural Networks, Initiated by IIASS and EMFCSC

Abstract

In this paper, we define a low-complexity algorithm that makes novel use of linear prediction analysis (covariance method) to retrieve the maximum-phase component of speech signals. First, we study the mixed-phase model of speech through a new representation, the Zeros of Z-Transform (ZZT), which is an all-zero representation of the z-transform of a discrete-time signal in the z-plane. Then, based on the properties of the mixed-phase model, we introduce an algorithm to estimate the anticausal glottal flow component from speech signals. LP-covariance analysis is used to estimate a pole pair outside the unit circle, corresponding to the anticausal poles of the source component in the mixed-phase speech model. Given this pair of anticausal poles, we propose a procedure to resynthesize the anticausal part of the glottal flow, followed by an open quotient estimation method. Evaluations show that the method performs well on synthetic speech but lacks robustness on natural speech.
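The two building blocks named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `zzt`, `lp_covariance`, and `lp_poles`, the prediction order of 2, and the synthetic test frame (a growing damped cosine standing in for a truncated anticausal response) are all assumptions made for illustration. The sketch only shows that the ZZT of a finite frame is the root set of a polynomial built from its samples, and that a covariance-style least-squares predictor, unlike the autocorrelation method, is free to place poles outside the unit circle.

```python
import numpy as np


def zzt(frame):
    """Zeros of the z-transform of a finite frame x[0..N-1].

    X(z) = sum_n x[n] z^(-n) = z^(-(N-1)) * sum_n x[n] z^(N-1-n),
    so the zeros are the roots of the polynomial whose coefficients
    (highest power first) are the samples themselves.
    """
    return np.roots(np.asarray(frame, dtype=float))


def lp_covariance(frame, order):
    """Covariance-method linear prediction solved as least squares.

    Minimises sum_{n=order}^{N-1} (x[n] - sum_k a_k x[n-k])^2 with no
    windowing, so the resulting filter is not constrained to be stable.
    """
    x = np.asarray(frame, dtype=float)
    # Column k-1 holds the delayed samples x[n-k] for n = order..N-1.
    past = np.column_stack(
        [x[order - k : len(x) - k] for k in range(1, order + 1)]
    )
    coeffs, *_ = np.linalg.lstsq(past, x[order:], rcond=None)
    return coeffs


def lp_poles(coeffs):
    """Poles of 1 / (1 - sum_k a_k z^-k): roots of z^p - a_1 z^(p-1) - ... - a_p."""
    return np.roots(np.concatenate(([1.0], -np.asarray(coeffs))))


# Hypothetical test frame: a cosine with a growing envelope, i.e. a
# two-pole resonance whose poles sit at radius r > 1 (assumed values).
r, w = 1.05, 0.3
n = np.arange(100)
frame = r**n * np.cos(w * n)

poles = lp_poles(lp_covariance(frame, order=2))
print("pole radii:", np.abs(poles))
print("pole angles:", np.angle(poles))
```

On this noise-free frame the estimated pole pair should land at radius 1.05 and angles ±0.3 rad, because the frame satisfies a second-order recurrence exactly. Real speech frames would need the framing and synchronisation choices described in the paper itself, which this sketch does not attempt to reproduce.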

Author information

Authors and Affiliations

TCTS Lab. Faculté Polytechnique de Mons, Initialis Sci. Park, B-7000, Mons, Belgium
Baris Bozkurt, François Severin & Thierry Dutoit

Editor information

Editors and Affiliations

CNRS LTCI/TSI Paris, 46 rue Barrault, 75634, Paris Cedex 13, France
Gérard Chollet
Department of Psychology, Second University of Naples, and IIASS, Via Pellegrino 19, 84019, Vietri sul Mare, SA, Italy
Anna Esposito
Escola Universitària Politècnica de Mataró, Universitat Politècnica de Catalunya, Barcelona, Spain
Marcos Faundez-Zanuy
Dipartimento di Fisica “E.R. Caianiello”, Università degli Studi di Salerno, Via S. Allende, 84081, Baronissi, SA, Italy
Maria Marinaro

About this paper

Cite this paper

Bozkurt, B., Severin, F., Dutoit, T. (2005). An Algorithm to Estimate Anticausal Glottal Flow Component from Speech Signals. In: Chollet, G., Esposito, A., Faundez-Zanuy, M., Marinaro, M. (eds) Nonlinear Speech Modeling and Applications. NN 2004. Lecture Notes in Computer Science, vol 3445. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11520153_15

DOI: https://doi.org/10.1007/11520153_15
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-27441-4
Online ISBN: 978-3-540-31886-6
eBook Packages: Computer Science (R0)