Abstract
In this paper, the Individual Dimension Gaussian Mixture Model (IDGMM) is proposed for speaker identification. For the training feature-vector sequence of a registered speaker, the joint probability distribution function (PDF) is modeled as the product of the PDFs of the individual dimensions (the marginal PDFs), with a scalar Gaussian Mixture Model (GMM) serving as each marginal PDF. To preserve discriminative capability, decorrelation by Schmidt orthogonalization and a Mixture Component Number (MCN) decision are applied during training. A closed-set, text-independent speaker identification experiment is also reported. The simulation results show that the IDGMM remarkably accelerates training while maintaining discriminative capability in testing.
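The structure described in the abstract can be illustrated with a minimal sketch; it is not the authors' implementation. It assumes NumPy, fits each scalar GMM with plain EM, and uses a fixed number of components `k` per dimension in place of the MCN decision, which the abstract does not specify. All function names (`gram_schmidt`, `fit_scalar_gmm`, `train_idgmm`, `score`, `identify`) are hypothetical.

```python
import numpy as np

def gram_schmidt(Xc):
    """Return T such that the columns of Xc @ T are mutually orthogonal.
    Xc must be centered, so orthogonal columns are also uncorrelated."""
    d = Xc.shape[1]
    T = np.eye(d)
    Y = Xc.astype(float).copy()
    for j in range(d):
        for i in range(j):
            coef = (Y[:, i] @ Y[:, j]) / (Y[:, i] @ Y[:, i])
            Y[:, j] -= coef * Y[:, i]   # remove the component along dimension i
            T[:, j] -= coef * T[:, i]   # keep Y == Xc @ T
    return T  # unit upper triangular, so det(T) == 1

def _component_logp(x, w, mu, var):
    # log(w_m * N(x | mu_m, var_m)) for every sample and component
    return np.log(w) - 0.5 * (np.log(2 * np.pi * var) + (x[:, None] - mu) ** 2 / var)

def fit_scalar_gmm(x, k, iters=50, seed=0):
    """Fit a one-dimensional GMM with k components by EM (assumed training method)."""
    rng = np.random.default_rng(seed)
    mu = rng.choice(x, size=k, replace=False)
    var = np.full(k, x.var() + 1e-6)
    w = np.full(k, 1.0 / k)
    for _ in range(iters):
        logp = _component_logp(x, w, mu, var)
        r = np.exp(logp - logp.max(axis=1, keepdims=True))
        r /= r.sum(axis=1, keepdims=True)            # E-step: responsibilities
        nk = r.sum(axis=0) + 1e-12
        w = nk / nk.sum()                            # M-step: weights, means, variances
        mu = (r * x[:, None]).sum(axis=0) / nk
        var = (r * (x[:, None] - mu) ** 2).sum(axis=0) / nk + 1e-6
    return w, mu, var

def scalar_gmm_logpdf(x, w, mu, var):
    logp = _component_logp(x, w, mu, var)
    m = logp.max(axis=1)
    return m + np.log(np.exp(logp - m[:, None]).sum(axis=1))

def train_idgmm(X, k=4):
    """Decorrelate the dimensions, then fit one scalar GMM per dimension."""
    mean = X.mean(axis=0)
    T = gram_schmidt(X - mean)
    Y = (X - mean) @ T
    gmms = [fit_scalar_gmm(Y[:, j], k, seed=j) for j in range(Y.shape[1])]
    return mean, T, gmms

def score(model, X):
    """Average log-likelihood: the joint log-PDF is the sum of marginal log-PDFs."""
    mean, T, gmms = model
    Y = (X - mean) @ T
    total = sum(scalar_gmm_logpdf(Y[:, j], *g).sum() for j, g in enumerate(gmms))
    return total / len(X)

def identify(models, X):
    """Closed-set decision: pick the registered speaker with the highest score."""
    return max(models, key=lambda name: score(models[name], X))
```

Here `models` would be a dictionary mapping each registered speaker to the result of `train_idgmm` on that speaker's training features, and `identify(models, X_test)` returns the best-scoring speaker. Because the transform `T` in this sketch has unit determinant, likelihoods computed in the decorrelated space remain comparable across speakers.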
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Wang, C., Hou, L.M., Fang, Y. (2005). Individual Dimension Gaussian Mixture Model for Speaker Identification. In: Li, S.Z., Sun, Z., Tan, T., Pankanti, S., Chollet, G., Zhang, D. (eds) Advances in Biometric Person Authentication. IWBRS 2005. Lecture Notes in Computer Science, vol 3781. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11569947_22
DOI: https://doi.org/10.1007/11569947_22
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-29431-3
Online ISBN: 978-3-540-32248-1
eBook Packages: Computer Science, Computer Science (R0)