Speech information retrieval: a review
- Cite this article as:
- Hafen, R.P. & Henry, M.J. Multimedia Systems (2012) 18: 499. doi:10.1007/s00530-012-0266-0
Speech is an information-rich component of multimedia. Information can be extracted from a speech signal in a number of different ways, and thus there are several well-established speech signal analysis research fields. These fields include speech recognition, speaker recognition, event detection, and fingerprinting. The information that can be extracted with the tools and methods developed in these fields can greatly enhance multimedia systems. In this paper, we present the current state of research in each of the major speech analysis fields. The goal is to introduce enough background for someone new to the field to quickly gain a high-level understanding and to provide direction for further study.