Multimedia Systems

, Volume 18, Issue 6, pp 499–518

Speech information retrieval: a review

Regular Paper

DOI: 10.1007/s00530-012-0266-0

Cite this article as:
Hafen, R.P. & Henry, M.J. Multimedia Systems (2012) 18: 499. doi:10.1007/s00530-012-0266-0

Abstract

Speech is an information-rich component of multimedia. Information can be extracted from a speech signal in a number of different ways, and thus there are several well-established speech signal analysis research fields. These fields include speech recognition, speaker recognition, event detection, and fingerprinting. The information that can be extracted from tools and methods developed in these fields can greatly enhance multimedia systems. In this paper, we present the current state of research in each of the major speech analysis fields. The goal is to introduce enough background for someone new in the field to quickly gain high-level understanding and to provide direction for further study.

Keywords

Speech signal processingSpeech event detectionSpeech classificationSpeech segmentationSpeech analysis featuresSpeech recognitionSpeaker recognitionIndexing and retrievalMultilingual analysisAcoustic fingerprinting

Copyright information

© Springer-Verlag 2012

Authors and Affiliations

  1. 1.Pacific Northwest National LaboratoryRichlandUSA