Abstract
In this paper, we present a new technique for indexing and search in a database that stores songs. A song is represented by a high dimensional binary vector using the audio fingerprinting technique. Audio fingerprinting extracts from a song a fingerprint which is a content-based compact signature that summarizes an audio recording. A song can be recognized by matching an extracted fingerprint to a database of known audio fingerprints. In this paper, we are given a high dimensional binary fingerprint database of songs and focus our attention on the problem of effective and efficient database search. However, the nature of high dimensionality and binary space makes many modern search algorithms inapplicable. The high dimensionality of fingerprints suffers from the curse of dimensionality, i.e., as the dimension increases, the search performance decreases exponentially. In order to tackle this problem, we propose a new search algorithm based on inverted indexing, the multiple sub-fingerprint match principle, the offset match principle, and the early termination strategy. We evaluate our technique using a database of 2,000 songs containing approximately 4,000,000 sub-fingerprints and the experimental result shows encouraging performance.
This study was financially supported by Seoul National University of Science and Technology.
Download conference paper PDF
References
Seo, J.S., et al.: Audio Fingerprinting Based on Normalized Spectral Subband Moments. IEEE Signal Processing Letters 13(4), 209–212 (2006)
Cha, G.-H., Zhu, X., Petkovic, D., Chung, C.-W.: An efficient indexing method for nearest neighbor searches in high-dirnensional image databases. IEEE Tr. on Multimedia 4(1), 76–87 (2002)
Gionis, A., Indyk, P., Motwani, R.: Similarity Search in High Dimensions Via Hashing. In: Proc. VLDB Conf., pp. 518–529 (1999)
Zezula, P., Amato, G., Dohnal, V., Batko, M.: Similarity Search: The Metric Space Approach. Springer (2006)
Aggarwal, C.C., Yu, P.S.: On Indexing High Dimensional Data with Uncertainty. In: Proc. SIAM Data Mining Conference, pp. 621–631 (2008)
Haitsma, J., Kalker, T.: A Highly Robust Audio Fingerprinting System With an Efficient Search Strategy. J. New Music Research 32(2), 211–221 (2003)
Haitsma, J., Kalker, T.: Highly Robust Audio Fingerprinting System. In: Proc. Int. Symp. on Music Information Retrieval, pp. 107–115 (2002)
Oostveen, J., Kalker, T., Haitsma, J.: Feature Extraction and a Database Strategy for Video Fingerprinting. In: Chang, S.-K., Chen, Z., Lee, S.-Y. (eds.) VISUAL 2002. LNCS, vol. 2314, pp. 117–128. Springer, Heidelberg (2002)
Miller, M.L., Rodriguez, M.C., Cox, I.J.: Audio Fingerprinting: Nearest Neighbor Search in High Dimensional Binary Spaces. J. VLSI Signal Processing 41, 285–291 (2005)
Miller, M.L., Rodriguez, M.C., Cox, I.J.: Audio Fingerprinting: Nearest Neighbor Search in High Dimensional Binary Spaces. In: Proc. IEEE Multimedia Signal Processing Workshop, pp. 182–185 (2002)
Brin, S., Page, L.: The anatomy of a large-scale hypertextual Web search engine. Computer Networks and ISDN Systems 30, 107–117 (1998)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 IFIP International Federation for Information Processing
About this paper
Cite this paper
Cha, GH. (2012). Indexing and Search for Fast Music Identification. In: Quirchmayr, G., Basl, J., You, I., Xu, L., Weippl, E. (eds) Multidisciplinary Research and Practice for Information Systems. CD-ARES 2012. Lecture Notes in Computer Science, vol 7465. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-32498-7_20
Download citation
DOI: https://doi.org/10.1007/978-3-642-32498-7_20
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-32497-0
Online ISBN: 978-3-642-32498-7
eBook Packages: Computer ScienceComputer Science (R0)
