Weighting Scores to Improve Speaker-Dependent Threshold Estimation in Text-Dependent Speaker Verification

Saeta, Javier R.; Hernando, Javier

doi:10.1007/11613107_6

Javier R. Saeta²³ &
Javier Hernando²⁴

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3817))

Included in the following conference series:

International Conference on Nonlinear Analyses and Algorithms for Speech Processing

707 Accesses
2 Citations

Abstract

The difficulty of obtaining data from impostors and the scarcity of data are two factors that have a large influence in the estimation of speakerdependent thresholds in text-dependent speaker verification. Furthermore, the inclusion of low quality utterances (background noises, distortion...) makes the process even harder. In such cases, the comparison of these utterances against the model can generate non-representative scores that deteriorate the correct estimations of statistical data from client scores. To mitigate the problem, some methods propose the suppresion of those scores which are far from the estimated scores mean. The tecnique results in a ‘hard decision’ that can produce errors especially when the number of scores is low. We propose here to take a ‘softer decision’ and weight scores according to their distance to the estimated scores mean. The Polycost and the BioTech databases have been used to show the effectiveness of the proposed method.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Chen, K.: Towards Better Making a Decision in Speaker Verification. Pattern Recognition 36, 329–346 (2003)
Article Google Scholar
Saeta, J.R., Hernando, J.: Automatic Estimation of A Priori Speaker Dependent Thresholds in Speaker Verification. In: Kittler, J., Nixon, M.S. (eds.) AVBPA 2003. LNCS, vol. 2688, pp. 70–77. Springer, Heidelberg (2003)
Chapter Google Scholar
Saeta, J.R., Hernando, J.: On the Use of Score Pruning in Speaker Verification for Speaker Dependent Threshold Estimation. In: 2004: A Speaker Odyssey, The Speaker Recognition Workshop, pp. 215–218 (2004)
Google Scholar
Furui, S.: Cepstral Analysis for Automatic Speaker Verification. IEEE Trans. Speech and Audio Proc. 29(2), 254–272 (1981)
Article Google Scholar
Lindberg, J., Koolwaaij, J., Hutter, H.P., Genoud, D., Pierrot, J.B., Blomberg, M., Bimbot, F.: Techniques for A Priori Decision Threshold Estimation in Speaker Verification. In: Proceedings RLA2C, pp. 89–92 (1998)
Google Scholar
Pierrot, J.B., Lindberg, J., Koolwaaij, J., Hutter, H.P., Genoud, D., Blomberg, M., Bimbot, F.: A Comparison of A Priori Threshold Setting Procedures for Speaker Verification in the CAVE Project. In: Proceedings ICASSP, pp. 125–128 (1998)
Google Scholar
Zhang, W.D., Yiu, K.K., Mak, M.W., Li, C.K., He, M.X.: A Priori Threshold Determination for Phrase-Prompted Speaker Verification. In: Proceedings Eurospeech, pp. 1203–1206 (1999)
Google Scholar
Surendran, A.C., Lee, C.H.: A Priori Threshold Selection for Fixed Vocabulary Speaker Verification Systems. In: Proceedings ICSLP, vol. II, pp. 246–249 (2000)
Google Scholar
Bimbot, F., Genoud, D.: Likelihood Ratio Adjustment for the Compensation of Model Mismatch in Speaker Verification. In: Proceedings 2001: A Speaker Odyssey, The Speaker Recognition Workshop, pp. 73-76 (2001)
Google Scholar
Gravier, G., Chollet, G.: Comparison of Normalization Techniques for Speaker Verification. In: Proceedings RLA2C, pp. 97–100 (1998)
Google Scholar
Auckentaler, R., Carey, M., Lloyd-Thomas, H.: Score Normalization for Text-Independent Speaker Verification Systems. Digital Signal Processing 10, 42–54 (2000)
Article Google Scholar
Bimbot, F., Bonastre, F.J., Fredouille, C., Gravier, G., Magrin, I., Meignier, S., Merlin, T., Ortega-García, J., Petrovska, D., Reynolds, D.: A Tutorial on Text-Independent Speaker Verification. In: Proceedings Eusipco, pp. 430–451 (2004)
Google Scholar
Mirghafori, N., Heck, L.: An Adaptive Speaker Verification System with Speaker Dependent A Priori Decision Thresholds. In: Proceedings ICSLP, pp. 589–592 (2002)
Google Scholar
Navratil, J., Ramaswamy, G.N.: The Awe and Mystery of T-norm. In: Proceedings Eurospeech, pp. 2009–2012 (2003)
Google Scholar
Reynolds, D.: The Effect of Handset Variability on Speaker Recognition Performance: Experiments on the Switchboard Corpus. In: Proceedings ICASSP 1996, pp. 113–116 (1996)
Google Scholar
Reynolds, D.A.: Comparison of Background Normalization Methods for Text-Independent Speaker Verification. In: Proceedings Eurospeech, pp. 963–966 (1997)
Google Scholar
Heck, L.P., Weintraub, M.: Handset Dependent Background Models for Robust Text-Independent Speaker Recognition. In: Proceedings ICASSP, pp. 1071–1074 (1997)
Google Scholar
Saeta, J.R., Hernando, J.: New Speaker-Dependent Threshold Estimation Method in Speaker Verification based on Weighting Scores. In: Proceedings of the 3th Internacional Conference on Non-Linear Speech Processing (NoLisp), pp. 34–41 (2005)
Google Scholar
Li, Q., Juang, B.H., Zhou, Q., Lee, C.H.: Verbal Information Verification. In: Proceedings Eurospeech, pp. 839–842 (1997)
Google Scholar

Download references

Author information

Authors and Affiliations

Biometric Technologies, S.L., 08007, Barcelona, Spain
Javier R. Saeta
TALP Research Center, Universitat Politècnica de Catalunya (UPC), 08034, Barcelona, Spain
Javier Hernando

Authors

Javier R. Saeta
View author publications
You can also search for this author in PubMed Google Scholar
Javier Hernando
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Escola Universitària Politècnica de Mataró, UPC, Spain
Marcos Faundez-Zanuy
Escola Universitària Politècnica de Mataró, Spain
Léonard Janer & Antonio Satue-Villar &
Department of Psychology, Second University of Naples, and IIASS, Via Pellegrino 19, 84019, Vietri sul Mare, (SA), Italy
Anna Esposito
The Auton Lab, Carnegie Mellon University, Pittsburgh, PA, USA
Josep Roure
Escola Universitària Politècnica de Mataró (UPC), Barcelona, Spain
Virginia Espinosa-Duro

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Saeta, J.R., Hernando, J. (2006). Weighting Scores to Improve Speaker-Dependent Threshold Estimation in Text-Dependent Speaker Verification. In: Faundez-Zanuy, M., Janer, L., Esposito, A., Satue-Villar, A., Roure, J., Espinosa-Duro, V. (eds) Nonlinear Analyses and Algorithms for Speech Processing. NOLISP 2005. Lecture Notes in Computer Science(), vol 3817. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11613107_6

Download citation

DOI: https://doi.org/10.1007/11613107_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-31257-4
Online ISBN: 978-3-540-32586-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics