Skip to main content
Log in

Concept framework for audio information retrieval: ARF

  • Published:
Journal of Computer Science and Technology Aims and scope Submit manuscript

Abstract

The majority of researches on content-based retrieval focused on visual media. However audio is also an important medium and information carrier from the viewpoint of human auditory perception, so it is needed to retrieve for audio collection. Audio is handled by conventional methods as an opaque stream medium, which is not suitable for information retrieval by its content. In fact, audio carries rich aural information with the form of speech, musical, and sound effects, so it could be retrieved based on its aural content, such as acoustic features, musical melodies and associated semantics. In this paper, a concept framework (ARF) for content-based audio retrieval is proposed from systematic perspectives, which describes audio content model, audio retrieval architecture and audio query schemes. Audio contents are represented by a hierarchical model and a set of formal descriptions from physical to acoustic to semantic level, which depict acoustic features, logical structure and semantics of audio and audio objects. The architecture consisting of audio meta-database, populating and accessing modules presents a system structure view of audio information retrieval. The query schemes give generalized approaches and modes concerning how users deliver audio information needs to audio collections. Finally, an audio retrieval example implemented is used to explain and specify the application of the components in the proposed ARF.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. ISO/IEC JTC1/SC29/WG11. MPEG-7 applications. N4676, Mar. 2002, Jeju, Korea.

  2. Ricardo Baeza-Yates, Berthier Ribeiro-Neto. Modern Information Retrieval. Addison-Wesley Longman Limited. 1999.

  3. Erling Woodet al. Content based classification, search, and retrieval of audic.IEEE Multimedia, 1996.

  4. Wactlar H, Hauptmann A, Witbrock M. Informedia: News-on-demand experiments in speech recognition. InProc. ARPA Speech Recognition Workshop, Arden House, Harriman, NY, Feb. 18–21, 1996.

  5. Brown M G, Foote J T, Jones G J Fet al. Video mail retrieval by voice: An overview of the Cambridge/ Olivetti retrieval system. InProc. 2nd ACM Int. Conf. Multimedia Workshop on Multimedia Data-Base Management, San Francisco USA, October, 1994, pp.47–55.

  6. Laura Slaughteret al. A graphical interface for speech-based retrieval. InProc. the Third ACM Digital Library Conference, Pittsburgh, PA, June, 1998.

  7. McNab R J, Smith L A, Witten I H. Signal processing for melody transcription. InProc. Australian Computer Science Conference, Melbourne, Australia, January, 1996, pp.301–307.

  8. Ruben Gonzalez, Kathy Melin. Content-based retrieval of audio. InProc. Australian Telecommunication Networks & Applications Conference, 1996, pp.357–362.

  9. Jonathan Foote. Content-based retrieval of music and audio. InProc. SPIE, Multimedia Storage and Archiving Systems II, 1997, 3229: 138–147.

  10. Jonathan Foote. An overview of audio information retrieval.Multimedia Systems 1999, 7(1): 2–11.

    Article  Google Scholar 

  11. Guohui Li, Ashfaq A. Khokhar. Content-based indexing and retrieval of audio data using wavelet. InProc. IEEE International Conference on Multimedia and Expro (ICME'2000), August 2000, New York, pp.885–888.

  12. ISO/IEC JTC1/SC29/WG11. MPEG-7 Overview. Doc. N4980, Klangenfurt, July 2002.

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Li GuoHui.

Additional information

This research was sponsored by the National Natural Science Foundation of China (NSFC) under Grant No. 60273066

Rights and permissions

Reprints and permissions

About this article

Cite this article

Li, G., Wu, D. & Zhang, J. Concept framework for audio information retrieval: ARF. J. Comput. Sci. & Technol. 18, 667–673 (2003). https://doi.org/10.1007/BF02947127

Download citation

  • Received:

  • Revised:

  • Issue Date:

  • DOI: https://doi.org/10.1007/BF02947127

Keywords

Navigation