Voice Biometrics

González-Rodríguez, Joaquín; Toledano, Doroteo Torre; Ortega-García, Javier

doi:10.1007/978-0-387-71041-9_8

Voice Biometrics

Joaquín González-Rodríguez⁴,
Doroteo Torre Toledano⁴ &
Javier Ortega-García⁴

Chapter

3846 Accesses
12 Citations

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Hardcover Book: USD 249.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

A. G. Adami and H. Hermansky. Segmentation of speech for speaker and language recognition. In Proceedings of Interspeech, pages 841–844, 2003.
Google Scholar
C.G.G. Aitken and F. Taroni. Statistics and the Evaluation of Evidence for Forensic Scientists. John Wiley and Sons, 2 edition, 2004.
Google Scholar
N. Brummer and J. Preez. Application-independent evaluation of speaker detection. Computer, Speech and Language, 20:230–275, 2006.
Article Google Scholar
D. K. Burton. Text-dependent speaker verification using vector quantization source coding. IEEE Transactions on Acoustics, Speech and Signal Processing, 35:133–143, 1987.
Article Google Scholar
J. Campbell and A. Higgins. Yoho speaker verification (ldc94s16). http://www.ldc.upenn.edu.
Google Scholar
J. P. Campbell. Testing with the yoho cd-rom voice verification corpus. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, pages 341–344, 1995.
Google Scholar
W. M. Campbell, J. P. Campbell, D. A. Reynolds, D. A. Jones, and T. R. Leek. High-level speaker verification with support vector machines. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, pages 73–76, 2004.
Google Scholar
W.M. Campbell. Generalized linear discriminant sequence kernels for speaker recognition. In Proceedings of the International Conference on Acoustics, Speech and Signal Processing, pages 161–164, 2002.
Google Scholar
W.M. Campbell, D.E. Sturim, and D.A. Reynolds. Support vector machines using gmm supervectors for speaker verification. IEEE Signal Processing Letters, 13:308–311, 2006.
Article Google Scholar
P. Carr. English Phonetics and Phonology: An Introduction. Blackwell Publishing, Incorporated, 1999.
Google Scholar
Voice Biometrics Conference. http://www.voicebiocon.com.
Google Scholar
G. Doddington. Speaker recognition based on idiolectal difierences between speakers. In Proceedings of Interspeech, volume 4, pages 2517–2520, 2001.
Google Scholar
A. G. Adami et al. Modeling prosodic dynamics for speaker recognition. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, volume IV, pages 788–791, 2003.
Google Scholar
B. Yegnanarayana et. al. Combining evidence from source, suprasegmental and spectral features for a fixed-text speaker verification system. IEEE Transactions on Speech and Audio Processing, 13:575–582, 2005.
Article Google Scholar
D. Reynolds et al. Supersid project: Exploiting high-level information for high-accuracy speaker recognition. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, volume IV, pages 784–787, April 2003.
Google Scholar
M. Wagner et al. An evaluation of ’commercial ofi-the-shelf’ speaker verification systems. In Proceedings of IEEE Odyssey, 2006.
Google Scholar
V. Ramasubramanian et. al. Text-dependent speaker-recognition systems based on one-pass dynamic programming algorithm. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, pages 901–904, 2006.
Google Scholar
K. R. Farrell, R. J. Mammone, and K. T. Assaleh. Speaker recognition using neural networks and conventional classifiers. IEEE Transactions on Speech and Audio Processing, 2:194–205, 1994.
Article Google Scholar
J. Fierrez-Aguilar, J. Ortega-Garcia, D. T. Toledano, and J. Gonzalez-Rodriguez. Biosec baseline corpus: A multimodal biometric database. Pattern Recognition, 40:1389–1392, 2007.
Article Google Scholar
S. Furui. Cepstral analysis technique for automatic speaker verification. IEEE Transactions on Acoustics, Speech and Signal Processing, 29:254–272, 1981.
Article Google Scholar
J. S. Garofolo, L. F. Lamel, W. M. Fisher, J. G. Fiscus, D. S. Pallet, N. L. Dahlgren, and V. Zue. Timit acoustic-phonetic continuous speech corpus (ldc93s1). http://www.ldc.upenn.edu.
Google Scholar
J. Gonzalez-Rodriguez, A. Drygajlo, D. Ramos-Castro, M. Garcia-Gomar, and J. Ortega-Garcia. Robust estimation, interpretation and assessment of likelihood ratios in forensic speaker recognition. Computer, Speech and Language, 20:331–335, 2006.
Article Google Scholar
J. Gonzalez-Rodriguez, D. Ramos-Castro, D. T. Toledano, A. Montero-Asenjo, J. Gonzalez-Dominguez, I. Lopez-Moreno, J. Fierrez-Aguilar, D. Garcia-Romero, and J. Ortega-Garcia. Speaker recognition: the atvs-uam system at nist sre 05. IEEE AES Magazine, 22:15–21, 2007.
Google Scholar
A. O. Hatch, B. Peskin, and A. Stolcke. Improved phonetic speaker recognition using lattice decoding. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, pages 165–168, 2005.
Google Scholar
H. Hermansky, B. Hanson, and H. Wakita. Perceptually based linear predictive analysis of speech. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, volume 10, pages 509–512, 1985.
Google Scholar
H. Hermansky and N. Morgan. Rasta processing of speech. IEEE Transactions on Speech and Audio Processing, 2(4):578–589, October 1984.
Article Google Scholar
X. Huang, A. Acero, and H.-W. Hon. Spoken Language Processing: A Guide to Theory, Algorithm and System Development. Prentice Hall PTR, 2001.
Google Scholar
F. Itakura. Line spectrum representation of linear predictive coeficients of speech signals. Journal of the Acoustical Society of America, 57:S35, 1975.
Article Google Scholar
Sachin Kajarekar, Luciana Ferrer, Kemal Sonmez, Jing Zheng, Elizabeth Shriberg, and Andreas Stolcke. Modeling NERFs for speaker recognition. In Proceedings of IEEE Odyssey, pages 51–56, Toledo, Spain, June 2004.
Google Scholar
P. Kenny, G. Boulianne, and P. Dumouchel. Eigenvoice modeling with sparse training data. IEEE Transactions on Speech and Audio Processing, 13:345–354, 2005.
Article Google Scholar
P. Kenny and P. Dumouchel. Disentangling speaker and channel efiects in speaker verification. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, pages 37–40, 2004.
Google Scholar
C. J. Leggetter and P. C. Woodland. Maximum likelihood linear regression for speaker adaptation of continuous density hidden markov models. Computer, Speech and Language, 9:171–185, 1995.
Article Google Scholar
R. G. Leonard and G. Doddington. Tidigits (ldc93s10). http://www.ldc.upenn.edu.
Google Scholar
T. Matsui and S. Furui. Comparison of text-independent speaker recognition methods using vq-distortion and discrete/continuous hmms. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, pages 157–160, 1992.
Google Scholar
Nist speaker recognition evaluation. http://www.nist.gov/speech/tests/spk/.
Google Scholar
J. Ortega-Garcia, J. Bigun, D. Reynolds, and J. Gonzalez-Rodriguez. Authentication gets personal with biometrics. IEEE Signal Processing Magazine, 21:50–62, 2004.
Article Google Scholar
Matejka Pavel, Schwarz Petr, Cernock Jan, and Chytil Pavel. Phonotactic language identification using high quality phoneme recognition. In Proceedings of InterSpeech, pages 2237–2240, 2005.
Google Scholar
CAVE Project. Cave-the european caller verification project. http://www.ptttelecom.nl/cave/.
Google Scholar
M. A. Przybocki, A. F. Martin, and A. N. Le. Nist speaker recognition evaluation chronicles part 2. In Proceedings of IEEE Odyssey, 2006.
Google Scholar
L. R. Rabiner. A tutorial on hidden markov models and selected applications in speech recognition. Proceedings of the IEEE, 77:257–286, 1989.
Google Scholar
L. R. Rabiner and R. W. Schafer. Digital Processing of Speech Signals. Prentice Hall, 1978.
Google Scholar
D. Ramos-Castro, J. Gonzalez-Rodriguez, and J. Ortega-Garcia. Likelihood ratio calibration in a transparent and testable forensic speaker recognition framework. In Proceedings of IEEE Odyssey, 2006.
Google Scholar
D. Reynolds, T. Quatieri, and R. Dunn. Speaker verification using adapted gaussian mixture models. Digital Signal Processing, 10:19–41, 2000.
Article Google Scholar
D. A. Reynolds. Channel robust speaker verification via feature mapping. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, volume 2, pages 53–56, 2003.
Google Scholar
P. Rose. Forensic Speaker Identification. CRC, 1 edition, 2002.
Google Scholar
M. J. Saks and J. J. Koehler. The coming paradigm shift in forensic identification science. Science, 309:892–895, 2005.
Article Google Scholar
B. Scholkopf, S. Kah-Kay, C.J.C. Burges, F. Girosi, P. Niyogi, T. Poggio, and V. Vapnik. Comparing support vector machines with gaussian kernels to radial basis function classifiers. IEEE Transactions on Signal Processing, 45:2758–2765, 1997.
Article Google Scholar
Elizabeth Shriberg, Luciana Ferrer, Anand Venkataraman, and Sachin Kajarekar. SVM modeling of “SNERF-Grams” for speaker recognition. In Proc. Intl. Conf. Spoken Language Systems, pages 1409–1412, Jeju, Korea, October 2004.
Google Scholar
C. Soutar, D. Roberge, A. Stoianov, R. Gilroy, and B.V.K. Vijaya Kumar. Biometric encryption. (Online) http://www.bio-scrypt.com.
Google Scholar
K. N. Stevens. Acoustic Phonetics (Current Studies in Linguistics). The MIT Press, 2000.
Google Scholar
D. T. Toledano, R. Fernandez-Pozo, A. Hernandez-Trapote, and L. Hernandez-Gomez. Usability evaluation of multi-modal biometric verification systems. Interacting With Computers, 18:1101–1122, 2006.
Article Google Scholar
D. T. Toledano, C. Fombella, J. Gonzalez-Rodriguez, and L. Hernandez-Gomez. On the relationship between phonetic modeling precision and phonetic speaker recognition accuracy. In Proceedings of InterSpeech, pages 1993–1996, 2005.
Google Scholar
D. T. Toledano, L. Hernandez-Gomez, and L. Villarrubia-Grande. Automatic phonetic segmentation. IEEE Transactions on Speech and Audio Processing, 11:617–625, 2003.
Article Google Scholar
V. Wan and W. Campbell. Support vector machines for speaker verification and identification. In Proceedings of the IEEE Workshop on Neural Networks for Signal Processing, volume 2, pages 775–784, 2000.
Google Scholar
R. Woo, A. Park, and T. J. Hazen. The mit mobile device speaker verification corpus: data collection and preliminary experiments. In Proceedings of IEEE Odyssey, 2006.
Google Scholar

Download references

Author information

Authors and Affiliations

ATVS – UAM, Escuela Politécnica Superior,Universidad Autónoma de Madrid, Madrid, Spain
Joaquín González-Rodríguez, Doroteo Torre Toledano & Javier Ortega-García

Authors

Joaquín González-Rodríguez
View author publications
You can also search for this author in PubMed Google Scholar
Doroteo Torre Toledano
View author publications
You can also search for this author in PubMed Google Scholar
Javier Ortega-García
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Dept. of Computer Science & Engineering, Michigan State University, 3115 Engineering Building, East Lansing, MI 48824, USA
Anil K. Jain
Dept. of Computer Science & Engineering, University of Notre Dame, 384 Fitzpatrick Hall, Notre Dame, IN 46556-5637, USA
Patrick Flynn
Dept. of Computer Science & Electrical Engineering, West Virginia University, Morgantown, WV 26506-6109, USA
Arun A. Ross

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

González-Rodríguez, J., Toledano, D.T., Ortega-García, J. (2008). Voice Biometrics. In: Jain, A.K., Flynn, P., Ross, A.A. (eds) Handbook of Biometrics. Springer, Boston, MA. https://doi.org/10.1007/978-0-387-71041-9_8

Download citation

DOI: https://doi.org/10.1007/978-0-387-71041-9_8
Publisher Name: Springer, Boston, MA
Print ISBN: 978-0-387-71040-2
Online ISBN: 978-0-387-71041-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Buying options