Definition
An audio signal is a signal that contains information in the audible frequency range. Audio content analysis refers to a set of theories, algorithms and systems that aim at extracting descriptors or metadata related to audio content and allowing search, retrieval and other user actions performed on audio signals.
Historical Background
Multimedia content analysis has been one of the most booming research directions in the past years. With the objective of providing fast, natural, intuitive and personalized content-based access to vast multimedia data collections, and building on the synergy of many scientific disciplines, such as signal processing, pattern recognition, machine learning, information retrieval, information theory, natural language processing and psychology, the research initiative born around the end of the 1980s has succeeded in inspiring and mobilizing enormous number of researchers worldwide....
Recommended Reading
Cai R, Lu L, Hanjalic A. Unsupervised content discovery in composite audio. In: Proceedings of the IEEE International Conference on Multimedia and Expo; 2005. p. 628–37.
Cai R, Lu L, Hanjalic A, Zhang H-J, Cai L-H. A flexible framework for key audio effects detection and auditory context inference. IEEE Trans Audio Speech Lang Process. 2006;14(3):1026–39.
Casey M, et al. Content-based music information retrieval: current directions and future challenges. In: Proceedings of the IEEE, Special Issue on Advances in Multimedia Information Retrieval. 2008;96(4):668–96.
Cheng W-H, Chu W-T, Wu J-L. Semantic context detection based on hierarchical audio models. In: Proceedings of the 5th ACM SIGMM International Workshop on Multimedia Information Retrieval; 2003. p. 109–15.
Hanjalic A. Content-based analysis of digital video. Norwell: Kluwer; 2004.
Huang X, Acero A, Hon HW. Spoken language processing: a guide to theory, algorithm, and system development. Upper Saddle River: Prentice; 2001.
Lu L, Cai R, Hanjalic A. Audio elements based auditory scene segmentation. Proc IEEE Int Conf Acoust Speech Signal Process. 2006;5:17–20.
Lu L, Zhang H-J, Jiang H. Content analysis for audio classification and segmentation. IEEE Trans Speech Audio Process. 2002;10(7):504–16.
Radhakrishnan R, Divakaran A, Xiong Z. A time series clustering based framework for multimedia mining and summarization using audio features. In: Proceedings of the 6th ACM SIGMM International Workshop on Multimedia Information Retrieval; 2004. p. 157–64.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Section Editor information
Rights and permissions
Copyright information
© 2016 Springer Science+Business Media LLC
About this entry
Cite this entry
Lu, L., Hanjalic, A. (2016). Audio Content Analysis. In: Liu, L., Özsu, M. (eds) Encyclopedia of Database Systems. Springer, New York, NY. https://doi.org/10.1007/978-1-4899-7993-3_1528-2
Download citation
DOI: https://doi.org/10.1007/978-1-4899-7993-3_1528-2
Received:
Accepted:
Published:
Publisher Name: Springer, New York, NY
Online ISBN: 978-1-4899-7993-3
eBook Packages: Springer Reference Computer SciencesReference Module Computer Science and Engineering