Robust Speaker Verification with Stochastic Matching

  • Qi (Peter) LiEmail author
Part of the Signals and Communication Technology book series (SCT)


In today’s telecommunications environment, which includes wireless, landline, VoIP, and computer networks, the mismatch between training and testing environments poses a big challenge to speaker authentication systems. In Chapter 8, we addressed the mismatch problem from a feature extraction point of view. In this chapter, we address the problem from an acoustic modeling point of view. These two approaches can be used independently or jointly.


Hide Markov Model Equal Error Rate Speaker Verification Test Utterance Feature Extraction Point 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Atal, B. S.: “Automatic recognition of speakers from their voices”. Proceeding of the IEEE 64, 460–475 (1976)CrossRefGoogle Scholar
  2. 2.
    Atal, B. S.: “Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification”. Journal of the Acoustical Society of America 55, 1304–1312 (1974)CrossRefGoogle Scholar
  3. 3.
    Furui, S.: “Cepstral analysis techniques for automatic speaker verification”. IEEE Trans. Acoust., Speech, Signal Processing 27, 254–277 (1981)CrossRefGoogle Scholar
  4. 4.
    Li, Q., Parthasarathy, S., Rosenberg, A. E.: “A fast algorithm for stochastic matching with application to robust speaker verification,” in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (Munich), pp. 1543–1547, April 1997Google Scholar
  5. 5.
    Mammone, R. J., Zhang, X., Pamachandran, R. P.: “Robust speaker recognition”. IEEE Signal Processing Magazine 13, 58–71 (1996)CrossRefGoogle Scholar
  6. 6.
    Mansour, D., Juang, B.-H.: “A family of distortion measures based upon projection operation for robust speech recognition”. IEEE Trans. Acoust., Speech, Signal Processing 37, 1659–1671 (1989)CrossRefGoogle Scholar
  7. 7.
    Parthasarathy, S., Rosenberg, A. E.: “General phrase speaker verification using sub-word background models and likelihood-ratio scoring,” in Proceedings of ICSLP-96 (Philadelphia), October 1996Google Scholar
  8. 8.
    Rahim, M. G., Juang, B.-H.: “Signal bias removal by maximum likelihood estimation for robust telephone speech recognition”. IEEE Transactions on Speech and Audio Processing 4, 19–30 (1996)CrossRefGoogle Scholar
  9. 9.
    Rosenberg, A. E., Lee, C.-H., Soong, F. K.: “Cepstral channel normalization techniques for HMM-based speaker verification,” in Proceedings of Int. Conf. on Spoken Language Processing (Yokohama, Japan), pp. 1835–1838, 1994Google Scholar
  10. 10.
    Sankar, A., Lee, C.-H.: “A maximum-likelihood approach to stochastic matching for robust speech recognition” IEEE Transactions on Speech and Audio Processing 4, 190–202 (1996)Google Scholar
  11. 11.
    Surendran, A. C.: Maximum-likelihood stochastic matching approach to non-linear equalization for robust speech recognition. PhD thesis, Rutgers University, Busch, NJ, May 1996Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg  2012

Authors and Affiliations

  1. 1.Li Creative Technologies (LcT), IncFlorham ParkUSA

Personalised recommendations