Advertisement

Orthogonal Subspace Combination Based on the Joint Factor Analysis for Text-Independent Speaker Recognition

  • Liang He
  • Jia Liu
Part of the Lecture Notes in Computer Science book series (LNCS, volume 7701)

Abstract

To apply a joint factor analysis (JFA) in a multiple channel circumstance, this paper proposes an orthogonal subspace combination method for a text-independent speaker recognition system. On the condition of multiple channels, the subspace loading matrix estimated by a mixed data corpus suffers from the data masking effects. And the subspace loading matrix estimated by a simple combination method has a drawback of subspace overlapping. To overcome these problems, this paper presents an orthogonal subspace combination method. The proposed method is based on a proper approximation of the core computation of the JFA and makes use of the Gram-Schmidt orthogonalization. On the NIST SRE 2008 core tasks corpus, the proposed method has a better performance.

Keywords

Gaussian mixture models joint factor analysis subspace combination text-independent speaker recognition 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Kenny, P., Ouellet, P., Dehak, N., Gupta, V., Dumouchel, P.: A Study of Interspeaker Variability in Speaker Verification. IEEE Transactions on Audio, Speech, and Language Processing 16, 980–988 (2008)CrossRefGoogle Scholar
  2. 2.
    Kenny, P., Boulianne, G., Ouellet, P., Dumouchel, P.: Joint Factor Analysis Versus Eigenchannels in Speaker Recognition. IEEE Transactions on Audio, Speech, and Language Processing 15, 1435–1447 (2007)CrossRefGoogle Scholar
  3. 3.
    National Institute of Standards and Technology. NIST speaker recognition evaluation, http://www.itl.nist.gov/iad/mig/tests/spk/2008/index.html
  4. 4.
    Guo, W., Li, Y.J., Dai, L.R., Wang, R.H.: Factor Analysis and Space Assembling in Speaker Recognition. Acta Automatica Sinica 35(9), 1193–1198 (2009)CrossRefGoogle Scholar
  5. 5.
    Vogt, R., Sridharan, S.: Experiments in Session Variability Modelling for Speaker Verification Acoustics. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. 897–900. IEEE Press, New York (2006)Google Scholar
  6. 6.
    Glembek, O., Burget, L., Dehak, N., Brummer, N., Kenny, P.: Comparison of scoring methods used in speaker recognition with Joint Factor Analysis. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. 4057–4060. IEEE Press, New York (2009)CrossRefGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2012

Authors and Affiliations

  • Liang He
    • 1
  • Jia Liu
    • 1
  1. 1.Tsinghua National Laboratory for Information Science and Technology, Department of Electronic EngineeringTsinghua UniversityBeijingChina

Personalised recommendations