Voice Biometrics

  • Chapter
Handbook of Biometrics

Copyright information

© 2008 Springer Science+Business Media, LLC

About this chapter

Cite this chapter

González-Rodríguez, J., Toledano, D.T., Ortega-García, J. (2008). Voice Biometrics. In: Jain, A.K., Flynn, P., Ross, A.A. (eds) Handbook of Biometrics. Springer, Boston, MA. https://doi.org/10.1007/978-0-387-71041-9_8

  • DOI: https://doi.org/10.1007/978-0-387-71041-9_8

  • Publisher Name: Springer, Boston, MA

  • Print ISBN: 978-0-387-71040-2

  • Online ISBN: 978-0-387-71041-9

  • eBook Packages: Computer Science (R0)
