Multimedia Systems, Volume 7, Issue 1, pp 2–10

An overview of audio information retrieval

  • Jonathan Foote


The problem of audio information retrieval is familiar to anyone who has returned from vacation to find an answering machine full of messages. While there is not yet an “AltaVista” for the audio data type, many workers are finding ways to automatically locate, index, and browse audio using recent advances in speech recognition and machine listening. This paper reviews the state of the art in audio information retrieval, and presents recent advances in automatic speech recognition, word spotting, speaker and music identification, and audio similarity with a view towards making audio less “opaque”. A special section addresses intelligent interfaces for navigating and browsing audio and multimedia documents, using automatically derived information to go beyond the tape recorder metaphor.


Information Retrieval · Data Type · Speech Recognition · Special Section · Tape Record
These keywords were added by machine and not by the authors.


Copyright information

© Springer-Verlag Berlin Heidelberg 1999

Authors and Affiliations

  • Jonathan Foote (1)
  1. Institute of Systems Science, National University of Singapore, Heng Mui Keng Terrace, Singapore 119597
