Abstract
In this paper an approach to speaker identification based on an estimation of parameters of a linear speech-production model is presented. The estimation is based on the discrete Kalman estimator. It is generally supposed that the vocal tract can be modelled by a system with constant parameters over short intervals. Taking this assumption into account, we can derive a special form of the discrete Kalman estimator for the model of speech production. The parameters of the vocal tract model obtained by the above mentioned Kalman estimation are then used to compute a new type of cepstral coefficients which we call Kalman cepstral coefficients (KCCs). These coefficients were used in text-independent speaker identification experiments based on discrete vector quantisation. Achieved results were then compared with results obtained by using the LPC-derived cepstral coefficients (LPCCs). The experiments were performed in a closed group of 591 speakers (312 male, 279 female).
The work was supported by the Ministry of Education of the Czech Republic, project no. MSM235200004, and by the Grant Agency of the Czech Republic, project no. 102/96/K087.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Mack G. A., Jain V. K.: A Compensated-Kalman Speech Parameter Estimator. IEEE Signal Processing Magazine (1985).
Mammone R. J., Zhang X., Ramachandran R. P.: Robust Speaker Recognition. IEEE Signal Processing Magazine (1996).
Psutka J.: Communication with Computer by Speech. Academia, Prague (1995) (in Czech).
Radová V., Švenda Z.: An Approach to Speaker Recognition Based on Vector Quantization. First International Conference on Advanced Engineering Design, Prague (1999).
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2000 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Švenda, Z., Radovă, V. (2000). Speaker Identification Using Kalman Cepstral Coefficients. In: Sojka, P., Kopeček, I., Pala, K. (eds) Text, Speech and Dialogue. TSD 2000. Lecture Notes in Computer Science(), vol 1902. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45323-7_50
Download citation
DOI: https://doi.org/10.1007/3-540-45323-7_50
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-41042-3
Online ISBN: 978-3-540-45323-9
eBook Packages: Springer Book Archive