Abstract
To apply a joint factor analysis (JFA) in a multiple channel circumstance, this paper proposes an orthogonal subspace combination method for a text-independent speaker recognition system. On the condition of multiple channels, the subspace loading matrix estimated by a mixed data corpus suffers from the data masking effects. And the subspace loading matrix estimated by a simple combination method has a drawback of subspace overlapping. To overcome these problems, this paper presents an orthogonal subspace combination method. The proposed method is based on a proper approximation of the core computation of the JFA and makes use of the Gram-Schmidt orthogonalization. On the NIST SRE 2008 core tasks corpus, the proposed method has a better performance.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Kenny, P., Ouellet, P., Dehak, N., Gupta, V., Dumouchel, P.: A Study of Interspeaker Variability in Speaker Verification. IEEE Transactions on Audio, Speech, and Language Processing 16, 980–988 (2008)
Kenny, P., Boulianne, G., Ouellet, P., Dumouchel, P.: Joint Factor Analysis Versus Eigenchannels in Speaker Recognition. IEEE Transactions on Audio, Speech, and Language Processing 15, 1435–1447 (2007)
National Institute of Standards and Technology. NIST speaker recognition evaluation, http://www.itl.nist.gov/iad/mig/tests/spk/2008/index.html
Guo, W., Li, Y.J., Dai, L.R., Wang, R.H.: Factor Analysis and Space Assembling in Speaker Recognition. Acta Automatica Sinica 35(9), 1193–1198 (2009)
Vogt, R., Sridharan, S.: Experiments in Session Variability Modelling for Speaker Verification Acoustics. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. 897–900. IEEE Press, New York (2006)
Glembek, O., Burget, L., Dehak, N., Brummer, N., Kenny, P.: Comparison of scoring methods used in speaker recognition with Joint Factor Analysis. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. 4057–4060. IEEE Press, New York (2009)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
He, L., Liu, J. (2012). Orthogonal Subspace Combination Based on the Joint Factor Analysis for Text-Independent Speaker Recognition. In: Zheng, WS., Sun, Z., Wang, Y., Chen, X., Yuen, P.C., Lai, J. (eds) Biometric Recognition. CCBR 2012. Lecture Notes in Computer Science, vol 7701. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-35136-5_30
Download citation
DOI: https://doi.org/10.1007/978-3-642-35136-5_30
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-35135-8
Online ISBN: 978-3-642-35136-5
eBook Packages: Computer ScienceComputer Science (R0)