Abstract
An interesting problem in accessing music digital libraries is how to combine the information of different sources in order to improve the retrieval effectiveness. This paper introduces an approach to represent a collection of tagged songs through an hidden Markov model with the purpose to develop a system that merges in the same framework both acoustic similarity and semantic descriptions. The former provides content-based information on song similarity, the latter provides context-aware information about individual songs. Experimental results show how the proposed model leads to better performances than approaches that rank songs using both a single information source and a their linear combination.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Barrington, L., Oda, R., Lanckriet, G.: Smarter than genius? Human evaluation of music recommender systems. In: Proceedings of the International Conference on Music Information Retrieval, pp. 357–362 (2009)
Barrington, L., Lanckriet, G., Turnbull, D., Yazdani, M.: Combining audio content and social context for semantic music discovery. In: Proceedings of ACM SIGIR, pp. 387–394 (2009)
McFee, B., Lanckriet, G.: Heterogenous embedding for subjective artist similarity. In: Proceedings of the International Conference on Music Information Retrieval, pp. 513–518 (2009)
Slaney, M., Weinberger, K., White, W.: Learning a metric for music similarity. In: Proceedings of the International Conference on Music Information Retrieval, pp. 313–318 (2008)
Mandel, M., Ellis, D.P.W.: Song-level features and support vector machines for music classification. In: Proceedings of the International Conference on Music Information Retrieval, pp. 594–599 (2005)
Hoffman, M., Blei, D., Cook, P.: Content-based musical similarity computation using the hierarchical dirichlet process. In: Proceedings of the International Conference on Music Information Retrieval, pp. 349–354 (2008)
Turnbull, D., Barrington, L., Torres, D., Lanckriet, G.: Semantic annotation and retrieval of music and sound effects. IEEE Transactions on Audio, Speech, and Language Processing 16, 467–476 (2008)
Ness, S.R., Theocharis, A., Tzanetakis, G., Martins, L.G.: Improving automatic music tag annotation using stacked generalization of probabilistic svm outputs. In: Proceedings of ACM MULTIMEDIA, pp. 705–708 (2009)
Rabiner, L.: A tutorial on hidden Markov models and selected application. Proc. of the IEEE 77, 257–286 (1989)
Shifrin, J., Pardo, B., Meek, C., Birmingham, W.: HMM-based musical query retrieval. In: Proceedings of ACM/IEEE Joint Conference on Digital Libraries, pp. 295–300 (2002)
Miotto, R., Orio, N.: Automatic identification of music works through audio matching. In: Proceedings of the European Conference on Digital Libraries, pp. 124–135 (2007)
Montecchio, N., Orio, N.: A discrete filter bank approach to audio to score matching for polyphonic music. In: Proceedings of the International Conference on Music Information Retrieval, pp. 495–500 (2009)
Raphael, C.: Automatic segmentation of acoustic musical signals using hidden markov models. IEEE Transactions on Pattern Analysis and Machine Intelligence 21, 360–370 (1999)
Khadkevich, M., Omologo, M.: Use of hidden markov models and factored language models for automatic chord recognition. In: Proceedings of the International Conference on Music Information Retrieval, pp. 561–566 (2009)
Bishop, C.M.: Pattern Recognition and Machine Learning. Springer, Heidelberg (2006)
Kullback, S., Leibler, R.: On information and sufficiency. Annals of Mathematical Statistics 12, 79–86 (1951)
Turnbull, D., Barrington, L., Torres, D., Lanckriet, G.: Towards musical query-by-semantic description using the CAL500 data set. In: Proceedings of ACM SIGIR, pp. 439–446 (2007)
Manning, C., Raghavan, P., Schtze, H.: Introduction to Information Retrieval. Cambridge University Press (2008)
Turnbull, D., Barrington, L., Lanckriet, G.: Five approaches to collecting tags for music. In: Proceedings of the International Conference on Music Information Retrieval, pp. 225–230 (2008)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Miotto, R., Orio, N. (2011). Accessing Music Digital Libraries by Combining Semantic Tags and Audio Content. In: Agosti, M., Esposito, F., Meghini, C., Orio, N. (eds) Digital Libraries and Archives. IRCDL 2011. Communications in Computer and Information Science, vol 249. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-27302-5_3
Download citation
DOI: https://doi.org/10.1007/978-3-642-27302-5_3
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-27301-8
Online ISBN: 978-3-642-27302-5
eBook Packages: Computer ScienceComputer Science (R0)