Individual Dimension Gaussian Mixture Model for Speaker Identification

Wang, Chao; Hou, Li Ming; Fang, Yong

doi:10.1007/11569947_22

Chao Wang²²,
Li Ming Hou²² &
Yong Fang²²

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 3781))

Included in the following conference series:

International Workshop on Biometric Person Authentication

919 Accesses

Abstract

In this paper, Individual Dimension Gaussian Mixture Model (IDGMM) is proposed for speaker identification. As to the training-purpose feature vector series of a certain register, its joint probability distribution function (PDF) of is modeled by the product of the PDF of each dimension (marginal PDF), the scalar-based Gaussian Mixture Model (GMM) serving as the marginal PDF. For a good discriminative capability, the decorrelation by Schmidt orthogonalization and the Mixture Component Number (MCN) decision are adopted during the train. A close-set text-independent speaker identification experiment is also given. The simulation result shows that the IDGMM accelerates the training process remarkably and maintains the discriminative capability in testing process.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Ramachandran, R.P., Farrell, K.R., Ramachandran, R., Mammone, R.J.: Speaker recognition – general classifier approaches and data fusion methods. Patter Recognition, 35 801–2821 (2002)
Google Scholar
Reynolds, D.A., Rose, R.C.: Robust text-independent speaker identification using Gaussian Mixture Speaker Models. IEEE Transactions on Speech And Audio Processing 3, 72–83 (1995)
Article Google Scholar
Jauquet, F., Verlinde, P., Vloeberghs, C.: Histogram classifiers using vocal tract and pitch information for text independent speaker identification. In: Proceedings of the ProRISC Workshop on Circuits, Systems and Signal Processing, pp. 213–217 (1997)
Google Scholar
Chu, C.-H.H., Delp, E.J.: Impulsive noise suppression and background normalization of electrocardiogram signals using morphological operators. IEEE Transactions on Biomedical Engineering 36, 262–273 (1989)
Article Google Scholar
Gil, J.Y., Kimmel, R.: Efficient dilation, erosion, opening, and closing algorithms. IEEE Transactions on Pattern Analysis and Machine Intelligence 24, 1606–1617 (2002)
Article Google Scholar

Download references

Author information

Authors and Affiliations

School of Communication and Information Engineering, Shanghai, China
Chao Wang, Li Ming Hou & Yong Fang

Authors

Chao Wang
View author publications
You can also search for this author in PubMed Google Scholar
Li Ming Hou
View author publications
You can also search for this author in PubMed Google Scholar
Yong Fang
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Center for Biometrics and Security Research & National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences,
Stan Z. Li
National Laboratory of Pattern Recognition, Institute of Automation, CAS,
Zhenan Sun
National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing, China
Tieniu Tan
Exploratory Computer Vision Group, IBM T.J. Watson Research Center, 10598, Yorktown Heights, NY
Sharath Pankanti
CNRS LTCI/TSI Paris, 46 rue Barrault, 75634, Paris Cedex 13, France
Gérard Chollet
Biometrics Research Centre, Department of Computing, The Hong Kong Polytechnic University, Kowloon, Hong Kong
David Zhang

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Wang, C., Hou, L.M., Fang, Y. (2005). Individual Dimension Gaussian Mixture Model for Speaker Identification. In: Li, S.Z., Sun, Z., Tan, T., Pankanti, S., Chollet, G., Zhang, D. (eds) Advances in Biometric Person Authentication. IWBRS 2005. Lecture Notes in Computer Science, vol 3781. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11569947_22

Download citation

DOI: https://doi.org/10.1007/11569947_22
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-29431-3
Online ISBN: 978-3-540-32248-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics