Chapter

Advances in Speech and Language Technologies for Iberian Languages

Volume 328 of the series Communications in Computer and Information Science pp 11-19

On the use of Total Variability and Probabilistic Linear Discriminant Analysis for Speaker Verification on Short Utterances

  • Javier González DomínguezAffiliated withBiometric Recognition Group (ATVS), Escuela Politecnica Superior, Universidad Autonoma de Madrid
  • , Rubén ZazoAffiliated withBiometric Recognition Group (ATVS), Escuela Politecnica Superior, Universidad Autonoma de Madrid
  • , Joaquin González-RodríguezAffiliated withBiometric Recognition Group (ATVS), Escuela Politecnica Superior, Universidad Autonoma de Madrid

* Final gross prices may vary according to local VAT.

Get Access

Abstract

This paper explores the use of state-of-the-art acoustic systems, namely Total Variability and Probabilistic Linear Discriminant Analysis for speaker verification on short utterances. While the recent advances in the field dealing with the session variability problem have proved to greatly outperform speaker verification systems on typical scenarios where a reasonable amount of speech is available, this performance rapidly degrades at the presence of limited data in both enrolment and verification stages. This paper studies the behaviour of TV and PLDA on those scenarios where a scarce amount of speech (~10s) is available to train and testing a speaker identity. The analysis has been carried out on the well defined and standard 10s-10s task belonging to the NIST Speaker Recognition Evaluation 2010 (NIST SRE10) and it explores the multiple parameters, which define TV and PLDA in order to give some insight about their relevance in this specific scenario.

Keywords

i-vectors Total variability PLDA short utterances