Abstract
Music Genre Classification is one of the fundamental tasks in the field of Music Information Retrieval (MIR). In this paper the performance of various music genre classification algorithms including Random Forests, Multi-class Support Vector Machines and Deep Belief Networks is being compared. The study is based on the “Million Song Dataset” a freely-available collection of audio features and metadata. The emphasis is put not only on classification accuracy but also on robustness and scalability of algorithms.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Bergstra, J., Bardenet R., Bengio Y., Kegl, B.: Algorithms for hyper-parameter optimization. In: Proceedings of the 24th Neural Information Processing Systems (NIPS 2011) (2011)
Bertin-Mahieux, T., Ellis, D., Whitman B., Lamere P.: The million song dataset. In: Proceedings of the 12th International Conference on Music Information Retrieval (2011)
Bishop, C.M.: Pattern Recognition and Machine Learning. Springer, Heidelberg (2007)
Dieleman, S., Brakel, P., Schrauwen, B.: Audio-based music classification with a pretrained convolutional network. In: Proceedings of the 12th International Society for Music Information Retrieval Conference (2011)
Hastie, T., Tibshirani, R., Friedman, J.H.: The Elements of Statistical Learning. Springer, New York (2001)
Hinton, G.E., Osindero, S., Teh, Y.: A fast learning algorithm for deep belief nets. Neural Comput. 18, 1527–1554 (2006)
Liang, D., Gu, H., O’Connor, B.: Music genre classication with the million song dataset. Machine Learning Department, CMU (2011). http://www.ee.columbia.edu/~dliang/files/FINAL.pdf
Platt, J.: Probabilistic outputs for support vector machines and comparisons to regularized likelihood methods. Advances in Large Margin Classifiers, pp. 61–74. MIT Press, Cambridge (1999)
Schindler, A., Rauber, A.: Capturing the temporal domain in Echonest Features for improved classification effectiveness. In: Proceedings of the 10th International Workshop on Adaptive Multimedia Retrieval (2012)
Schindler, A., Mayer, R., Rauber, A.: Facilitating comprehensive benchmarking experiments on the million song sataset. In: Proceedings of the 13th International Society for Music Information Retrieval Conference (2012)
Strum, B.L.: A survey of evaluation in music genre recognition. Adaptive multimedia retrieval: semantics, context, and adaptation. Lect. Notes Comput. Sci. 8382, 29–66 (2014)
Tzanetakis, G., Cook, P.: Musical genre classification of audiosignals. IEEE Trans. Audio Speech Process. 10(5), 293–302 (2002)
Wu, T.F., Lin, C.J., Weng, R.C.: Probability estimates for multi-class classification by pairwise coupling. JMLR 5, 975–100 (2004)
Yang, X., Chen, Q., Zhou, S., Wang, X.: Deep belief networks for automatic music genre classification, In: Proceedings of the 12th Annual Conference of the International Speech Communication Association (2011)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this chapter
Cite this chapter
Stokowiec, W. (2016). A Comparative Study on Music Genre Classification Algorithms. In: Ryżko, D., Gawrysiak, P., Kryszkiewicz, M., Rybiński, H. (eds) Machine Intelligence and Big Data in Industry. Studies in Big Data, vol 19. Springer, Cham. https://doi.org/10.1007/978-3-319-30315-4_11
Download citation
DOI: https://doi.org/10.1007/978-3-319-30315-4_11
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-30314-7
Online ISBN: 978-3-319-30315-4
eBook Packages: EngineeringEngineering (R0)