Speech and Speaker Recognition Evaluation

Furui, Sadaoki

doi:10.1007/978-1-4020-5817-2_1

Speech and Speaker Recognition Evaluation

Sadaoki Furui⁶

Chapter

669 Accesses
7 Citations

Part of the book series: Text, Speech and Language Technology ((TLTB,volume 37))

This chapter overviews techniques for evaluating speech and speaker recognition systems. The chapter first describes principles of recognition methods, and specifies types of systems as well as their applications. The evaluation methods can be classified into subjective and objective methods, among which the chapter focuses on the latter methods. In order to compare/normalize performances of different speech recognition systems, test set perplexity is introduced as a measure of the difficulty of each task. Objective evaluation methods of spoken dialogue and transcription systems are respectively described. Speaker recognition can be classified into speaker identification and verification, and most of the application systems fall into the speaker verification category. Since variation of speech features over time is a serious problem in speaker recognition, normalization and adaptation techniques are also described. Speaker verification performance is typically measured by equal error rate, detection error trade-off (DET) curves, and a weighted cost value. The chapter concludes by summarizing various issues for future research.

This is a preview of subscription content, log in via an institution.

Preview

Unable to display preview. Download preview PDF.

Author information

Authors and Affiliations

Department of Computer Science, Tokyo Institute of Technology Tokyo, Japan
Sadaoki Furui

Authors

Sadaoki Furui
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Prolog Development Center A/S, Odense, Denmark
Laila Dybkjær
DFKI Language Technology Lab, Berlin, Germany
Holmer Hemsen
University of Ulm, Germany
Wolfgang Minker

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Furui, S. (2007). Speech and Speaker Recognition Evaluation. In: Dybkjær, L., Hemsen, H., Minker, W. (eds) Evaluation of Text and Speech Systems. Text, Speech and Language Technology, vol 37. Springer, Dordrecht. https://doi.org/10.1007/978-1-4020-5817-2_1

Download citation

DOI: https://doi.org/10.1007/978-1-4020-5817-2_1
Publisher Name: Springer, Dordrecht
Print ISBN: 978-1-4020-5816-5
Online ISBN: 978-1-4020-5817-2
eBook Packages: Humanities, Social Sciences and LawSocial Sciences (R0)

Publish with us

Policies and ethics