This chapter overviews techniques for evaluating speech and speaker recognition systems. The chapter first describes principles of recognition methods, and specifies types of systems as well as their applications. The evaluation methods can be classified into subjective and objective methods, among which the chapter focuses on the latter methods. In order to compare/normalize performances of different speech recognition systems, test set perplexity is introduced as a measure of the difficulty of each task. Objective evaluation methods of spoken dialogue and transcription systems are respectively described. Speaker recognition can be classified into speaker identification and verification, and most of the application systems fall into the speaker verification category. Since variation of speech features over time is a serious problem in speaker recognition, normalization and adaptation techniques are also described. Speaker verification performance is typically measured by equal error rate, detection error trade-off (DET) curves, and a weighted cost value. The chapter concludes by summarizing various issues for future research.
This is a preview of subscription content, log in via an institution.
Preview
Unable to display preview. Download preview PDF.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2007 Springer
About this chapter
Cite this chapter
Furui, S. (2007). Speech and Speaker Recognition Evaluation. In: Dybkjær, L., Hemsen, H., Minker, W. (eds) Evaluation of Text and Speech Systems. Text, Speech and Language Technology, vol 37. Springer, Dordrecht. https://doi.org/10.1007/978-1-4020-5817-2_1
Download citation
DOI: https://doi.org/10.1007/978-1-4020-5817-2_1
Publisher Name: Springer, Dordrecht
Print ISBN: 978-1-4020-5816-5
Online ISBN: 978-1-4020-5817-2
eBook Packages: Humanities, Social Sciences and LawSocial Sciences (R0)