Orthogonal Subspace Combination Based on the Joint Factor Analysis for Text-Independent Speaker Recognition

He, Liang; Liu, Jia

doi:10.1007/978-3-642-35136-5_30

Part of the book series: Lecture Notes in Computer Science (LNIP, volume 7701)

Included in the following conference series:

Chinese Conference on Biometric Recognition

Abstract

To apply joint factor analysis (JFA) under multiple channel conditions, this paper proposes an orthogonal subspace combination method for text-independent speaker recognition. When several channels are present, a subspace loading matrix estimated from a mixed data corpus suffers from data masking effects, while a subspace loading matrix estimated by a simple combination method suffers from subspace overlap. The proposed method overcomes both problems: it is based on a suitable approximation of the core computation of JFA and uses Gram-Schmidt orthogonalization. On the NIST SRE 2008 core tasks corpus, the proposed method achieves better performance.
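The orthogonalization step named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not give the JFA approximation or the estimation procedure, so the code below only shows how Gram-Schmidt removes overlap when loading matrices estimated separately are concatenated. The function name `orthogonal_combine`, the matrices `U_a` and `U_b`, their sizes, and the tolerance are all hypothetical.

```python
import numpy as np

def orthogonal_combine(subspaces, tol=1e-10):
    """Concatenate loading matrices and orthonormalize their columns
    with modified Gram-Schmidt (illustrative, hypothetical helper)."""
    basis = []
    for U in subspaces:
        for col in U.T:
            v = np.asarray(col, dtype=float).copy()
            scale = np.linalg.norm(v)
            # Remove the components already spanned by earlier columns.
            for q in basis:
                v -= (q @ v) * q
            norm = np.linalg.norm(v)
            # Drop directions that add nothing new (overlapping subspace).
            if scale > 0 and norm > tol * scale:
                basis.append(v / norm)
    return np.column_stack(basis)

# Hypothetical loading matrices for two channel conditions (assumption:
# 12-dimensional supervectors, 3 factors each, one shared direction).
rng = np.random.default_rng(0)
U_a = rng.standard_normal((12, 3))
U_b = np.hstack([U_a[:, :1], rng.standard_normal((12, 2))])

U = orthogonal_combine([U_a, U_b])
print(U.shape)
```

Plain concatenation of `U_a` and `U_b` would give six columns, one of them redundant. After orthogonalization the shared direction is counted once and the remaining columns are mutually orthonormal, which is the property a simple combination lacks.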

Author information

Authors and Affiliations

Tsinghua National Laboratory for Information Science and Technology, Department of Electronic Engineering, Tsinghua University, Beijing, 100084, China
Liang He & Jia Liu

Editor information

Editors and Affiliations

School of Information Science and Technology, Sun Yat-Sen University, 510275, Guangzhou, P.R. China
Wei-Shi Zheng & Jianhuang Lai
Institute of Automation, National Laboratory of Pattern Recognition, Chinese Academy of Sciences, 100190, Beijing, P.R. China
Zhenan Sun
School of Computer Science and Engineering, Beihang University, Beijing University of Aeronautics and Astronautics, 100191, Beijing, P.R. China
Yunhong Wang
Institute of Computing Technology, Chinese Academy of Sciences, 100190, Beijing, P.R. China
Xilin Chen
Department of Computer Science, Hong Kong Baptist University, Kowloon Tong, Kowloon, Hong Kong, China
Pong C. Yuen

About this paper

Cite this paper

He, L., Liu, J. (2012). Orthogonal Subspace Combination Based on the Joint Factor Analysis for Text-Independent Speaker Recognition. In: Zheng, WS., Sun, Z., Wang, Y., Chen, X., Yuen, P.C., Lai, J. (eds) Biometric Recognition. CCBR 2012. Lecture Notes in Computer Science, vol 7701. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-35136-5_30

DOI: https://doi.org/10.1007/978-3-642-35136-5_30
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-35135-8
Online ISBN: 978-3-642-35136-5
eBook Packages: Computer Science, Computer Science (R0)
