Multimedia Systems, Volume 18, Issue 6, pp 499–518

Speech information retrieval: a review

Regular Paper

DOI: 10.1007/s00530-012-0266-0

Cite this article as:
Hafen, R.P. & Henry, M.J. Multimedia Systems (2012) 18: 499. doi:10.1007/s00530-012-0266-0


Speech is an information-rich component of multimedia. Information can be extracted from a speech signal in a number of different ways, and thus there are several well-established speech signal analysis research fields. These fields include speech recognition, speaker recognition, event detection, and fingerprinting. The information that can be extracted with the tools and methods developed in these fields can greatly enhance multimedia systems. In this paper, we present the current state of research in each of the major speech analysis fields. The goal is to introduce enough background for someone new to the field to quickly gain a high-level understanding, and to provide direction for further study.


Speech signal processing; Speech event detection; Speech classification; Speech segmentation; Speech analysis features; Speech recognition; Speaker recognition; Indexing and retrieval; Multilingual analysis; Acoustic fingerprinting

Copyright information

© Springer-Verlag 2012

Authors and Affiliations

  1. Pacific Northwest National Laboratory, Richland, USA
