Impact of Prior Channel Information for Speaker Identification

  • C. Vaquero
  • N. Scheffer
  • S. Karajekar
Part of the Lecture Notes in Computer Science book series (LNCS, volume 5558)

Abstract

Joint factor analysis (JFA) has been very successful in speaker recognition but its success depends on the choice of development data. In this work, we apply JFA to a very diverse set of recording conditions and conversation modes in NIST 2008 SRE, showing that having channel matched development data will give improvements of about 50% in terms of Equal Error Rate against a Maximum a Posteriori (MAP) system, while not having it will not give significant improvement. To provide robustness to the system, we estimate eigenchannels in two ways. First, we estimate the eigenchannels separately for each condition and stack them. Second, we pool all the relevant development data and obtain a single estimate. Both techniques show good performance, but the former leads to lower performance when working with low-dimension channel subspaces, due to the correlation between those subspaces.

Keywords

Gaussian Mixture Model Equal Error Rate Speaker Recognition Development Data Female Speaker 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

  1. 1.
    Kenny, P., Ouellet, P., Dehak, N., Gupta, V., Dumouchel, P.: A Study of Inter-Speaker Variability in Speaker Verification. IEEE Trans. Audio, Speech and Language Processing 16(5), 980–988 (2008)Google Scholar
  2. 2.
    Vogt, R., Baker, B., Sridharan, S.: Modelling session variability in text-independent speaker verification. In: Ninth European Conference on Speech Communication and Technology, ISCA (2005)Google Scholar
  3. 3.
    Brümmer, N.: SUN SDV system description for the NIST SRE 2008 evaluation, Montreal, Canada (2008)Google Scholar
  4. 4.
    JHU: Johns Hopkins University, Summer workshop, Robust Speaker ID, Fast scoring team, Baltimore, MD (2008)Google Scholar
  5. 5.
    NIST: The NIST year 2005 speaker recognition evaluation plan (April 2004), http://www.nist.gov/speech/tests/spk/2004/SRE-04_evalplan-v1a.pdf
  6. 6.
    Shriberg, E., Graciarena, M., Bratt, H., Kathol, A., Kajarekar, S., Jameel, H., Richey, C., Goodman, F.: Effects of Vocal Effort and Speaking Style on Text-Independent Speaker Verification. In: Proceedings of Interspeech, Brisbane, Australia (2008)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2009

Authors and Affiliations

  • C. Vaquero
    • 1
    • 2
  • N. Scheffer
    • 2
  • S. Karajekar
    • 2
  1. 1.University of ZaragozaZaragozaSpain
  2. 2.SRI InternationalUSA

Personalised recommendations