Abstract
Music genres can be seen as categorical descriptions used to classify music basing on various characteristics such as instrumentation, pitch, rhythmic structure, and harmonic contents. Automatic music genre classification is important for music retrieval in large music collections on the web. We build a classifier that learns from very few labeled examples plus a large quantity of unlabeled data, and show that our methodology outperforms existing supervised and unsupervised approaches. We also identify salient features useful for music genre classification. We achieve 97.1% accuracy of 10-way classification on real-world audio collections.
Chapter PDF
Similar content being viewed by others
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Poria, S., Gelbukh, A., Cambria, E., Das, D., Bandyopadhyay, S.: Enriching SenticNet polarity scores through semi-supervised fuzzy clustering. IEEE ICDM, Brussels, 709–716 (2012)
Poria, S., Gelbukh, A., Hussain, A., Das, D., Bandyopadhyay, S.: Enhanced SenticNet with affective labels for concept-based opinion mining. IEEE Intelligent Systems (2013), doi:10.1109/MIS.2013.4
Lee, C., Shih, J., Yu, K., Lin, H.: Automatic music genre classification based on modulation spectral analysis of spectral and cepstral features. IEEE Transactions on Multimedia 11(4) (2009)
Scaringella, N., Zoia, G.: Automatic genre classification of music content: a survey. Signal Processing Magazine 23(2), 133–141 (2006)
Shao, X., Xu, C., Kankanhalli, M.: Unsupervised classification of musical genre using hidden Markov model. In: IEEE Int. Conf. of Multimedia Explore (ICME), Taiwan (2004)
Rauber, A., Pampalk, E., Merkl, D.: Using psycho-acoustic models and self-organizing maps to create a hierarchical structuring of music by sound similarity. In: 3rd Int. Conf. on Music Information Retrieval, France (2002)
Pampalk, E., Flexer, A., Widmer, G.: Improvements of audio based music similarity and genre classification? In: 6th Int. Symposium on Music Information Retrieval, UK, (2005)
Scaringella, N., Zoia, G.: On the modeling of time information for automatic genre recognition systems in audio signals. In: 6th Int. Symposium on Music Information Retrieval, UK (2005)
Soltau, H., Schultz, T., Westphal, M., Waibel, A.: Recognition of music types. In: IEEE Int. Conf. on Acoustics, Speech and Signal Processing (ICASSP), USA (1998)
Mandel, M., Ellis, D.: Song-level features and support vector machines for music classification. In: 6th Int. Symposium on Music Information Retrieval, UK (2005)
Lidy, T., Rauber, A.: Evaluation of feature extractors and psycho-acoustic transformationsfor music genre classification. In: 6th Int. Symposium on Music Information Retrieval, UK, (2005)
Scaringella, N., Mlynek, D.: A mixture of support vector machines for audio classification. Music Information Retrieval Evaluation exchange (MIREX) (2005), website, www.music-ir.org/evaluation/mirexresults/articles/audio_genre/scaringella.pdf
Berenzweig, A., Ellis, D., Lawrence, S.: Using voice segments to improve artist classification of music. In: AES 22nd International Conference on Virtual, Synthetic and Entertainment Audio (2002)
Peeters, G.: A large set of audio features for sound description (similarity and classification) in the CUIDADO project. CUIDADO I.S.T. Project Report (2004)
Saunders, J.: Real time discrimination of broadcast speech/music. In: Int. Conf. Acoustics, Speech, Signal Processing (ICASSP), pp. 993–996 (1996)
Wold, E., Blum, T., Keislar, D., Wheaton, J.: Content-based classification,search, and retrieval of audio. IEEE Multimedia 3(2) (1996)
MPEG-7, “Information Technology – Multimedia Content Description Interface – Part 4: Audio”, ISO/IEC JTC 1/SC29, ISO/IEC FDIS 15938-4:2002 (2002)
Meng, A., Ahrendt, P., Larsen, J.: Improving music genre classification by short-time feature integration. In: 6th Int. Symposium on Music Information Retrieval, UK (2005)
Tzanetakis, G.: Music Genre Classification of Audio Signal. IEEE Transactions on Speech and Audio Processing 10(5) (2002)
McEnnis, D., McKay, C., Fujinaga, I., Depalle, P.: Jaudio: A Feature Extraction Library. In: ISMIR 2005(2005)
Bezdek, J.C.: Pattern Recognition with Fuzzy Objective Function Algoritms. Plenum Press, New York (1981)
Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., Witten, I.H.: The WEKA data mining software: An update. SIGKDD Explorations 11(1) (2009)
Xu, Y., Zhang, C., Yang, C.: Semi-supervised classification of musical genre using multi-view features. In: International Computer Music Conference ICMC (2005)
Yaslan, Y., Zehra, C.: Audio genre classification with semi-supervised feature ensemble learning. In: 2nd International Workshop on Machine Learning and Music (2009)
Cambria, E., Song, Y., Wang, H., Howard, N.: Semantic multi-dimensional scaling for open-domain sentiment analysis. IEEE Intelligent Systems (2013), doi:10.1109/MIS.2012.118
Cambria, E., Hussain, A.: Sentic Computing: Techniques, Tools, and Applications, pp. 978–994. Springer, Dordrecht (2012) ISBN: 978-94-007-5069-2
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Poria, S., Gelbukh, A., Hussain, A., Bandyopadhyay, S., Howard, N. (2013). Music Genre Classification: A Semi-supervised Approach. In: Carrasco-Ochoa, J.A., Martínez-Trinidad, J.F., Rodríguez, J.S., di Baja, G.S. (eds) Pattern Recognition. MCPR 2013. Lecture Notes in Computer Science, vol 7914. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-38989-4_26
Download citation
DOI: https://doi.org/10.1007/978-3-642-38989-4_26
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-38988-7
Online ISBN: 978-3-642-38989-4
eBook Packages: Computer ScienceComputer Science (R0)