Skip to main content
Log in

Music genre classification and recognition using convolutional neural network

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

The music genre classification system is crucial to users in the digital music business since it allows them to be more effective. Music suggestion and availability to consumers is one of the most successful uses of genre classification. Songs may be easily accessible by users when the genre of the song is recognized, and music recommendations to users are made simple with an accurate categorization system in place. Furthermore, automated genre categorization is necessary to tackle difficulties such as finding similar songs, identifying cultures that would enjoy certain music, and conducting surveys. Machine learning approaches have recently been shown to be useful in a variety of classification tasks, including music genre categorization. As a result, this research investigates the use of Convolutional Neural Networks (CNN) for music genre categorization. For this study, a fresh dataset of 1000 traditional music from ten genres was employed. Content-based features, were retrieved from the songs in the dataset and used as input into the classifier, as feature extraction is critical to audio analysis. We got the results of the accuracy level of the system is 98.9% with a precision of 98.7%, recall of 98.5%, and f1 score of 97.5%.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9

Similar content being viewed by others

Data availability

Data sharing is not applicable to this article as no datasets were generated or analyzed during the current study.

References

  1. Tzanetakis G, Cook P (2002) Musical genre classification of audio signals. IEEE Trans Speech Audio Process 10(5):293–302

    Article  Google Scholar 

  2. Taylor J, Meng A (2005) An investigation of feature models for music genre classification using the support vector classifier. In: 6th International Conference on Music Information Retrieval (ISMIR 2005), London, UK, pp 604–609

  3. West K, Cox S (2005) Finding an optimal segmentation for audio genre classification. In: 6th International Conference on Music Information Retrieval (ISMIR 2005), London, UK, pp 680–685

  4. Duda RO, Hart PE, Stork DG (2000) Pattern Classification, 2nd edn. Wiley-Interscience

    Google Scholar 

  5. Fu Z, Lu G, Ting K, Zhang D (2011) A survey of audio-based music classification and annotation. IEEE Trans Multimedia 13(2):303–319

    Article  Google Scholar 

  6. Baniya B, Ghimire D, Lee J (2014) A novel approach of automatic music genre classification based on timbrai texture and rhythmic content features, in Advanced Communication Technology (ICACT), 2014 16th International Conference, pp 96–102, IEEE

  7. Huang G, Zhu Q, Siew C (2006) Extreme learning machine: theory and applications. Neurocomputing 70(1):489–501

    Article  Google Scholar 

  8. Breiman L (1996) Bagging predictors. Mach Learn 24(2):123–140

    Article  Google Scholar 

  9. Arabi A, Lu G (2009) Enhanced polyphonic music genre classification using high level features, In Signal and Image Processing Applications (ICSIPA), 2009 IEEE International Conference on, pp 101–106, IEEE

  10. Sarkar R, Saha S (2015) Music genre classification using emd and pitch based feature, in Advances in Pattern Recognition (ICAPR), 2015 Eighth International Conference on, pp 1–6, IEEE

  11. Wei Y, Xia W, Lin M, Huang J, Ni B, Dong J, Zhao Y, Yan S (2016) Hcp: A exible cnn framework for multi-label image classification. IEEE Trans Pattern Anal Mach Intell 38(9):1901–1907

    Article  Google Scholar 

  12. Ciresan D, Meier U, Masci J, Gambardella ML, Schmidhuber J (2011) Flexible, high performance convolutional neural networks for image classification, in IJCAI Proceedings-International Joint Conference on Artificial Intelligence, 22:1237, Barcelona, Spain

  13. Dieleman S, Schrauwen B (2014) End-to-end learning for music audio, in Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference, pp 6964–6968, IEEE

  14. Li T, Chan A, Chun A (2010) Automatic musical pattern feature extraction using convolutional neural network. In: International Multi Conference of Engineers and Computer Scientists (IMECS 2010), vol 1, pp 546–550

  15. Zhang W, Lei V, Xu X, Xing X (2016) Improved music genre classification with convolutional neural networks, in INTERSPEECH, pp 3304–3308

  16. Elman J (1990) Finding structure in time. Cogn Sci 14(2):179–211

    Article  Google Scholar 

  17. Pons J, Lidy T, Serra X (2016) Experimenting with musically motivated convolutional neural networks, in Content-Based Multimedia Indexing (CBMI), 2016 14th International Workshop on, pp 1–6, IEEE

  18. Lawrence S, Giles C, Tsoi A, Back A (1997) Face recognition: A convolutional neural-network approach. IEEE Trans Neural Networks 8(1):98–113

    Article  Google Scholar 

  19. He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition, in Proceedings of the IEEE conference on computer vision and pattern recognition, pp 770–778

  20. Choi K, Fazekas G, Sandler M, Cho K (2016) Convolutional recurrent neural networks for music classification, arXiv preprint arXiv:1609.04243

  21. Elbir A, Aydin N (2020) Music genre classification and music recommendation by using deep learning 2020. Electron Lett 56(12):627–629

    Article  Google Scholar 

  22. Pelchat N, Gelowitz C (2019) Neural network music genre classification. In: 2019 IEEE Canadian Conference of Electrical and Computer Engineering (CCECE), IEEE, pp 170–173

  23. Vishnupriya S, Meenakshi K (2018) Automatic music genre classification using convolution neural network. 2018 International Conference on Computer Communication and Informatics (ICCCI - 2017), Coimbatore, INDIA.IEEE

  24. Cano P, Gómez E, Gouyon F, Herrera P, Koppenberger M, Ong B, Serra X, Streich S, Wack N (2006) ISMIR 2004 audio description contest. Music Technology Group of the University at Pompeu Fabra, Technical Report

  25. Falola P, Alabi E, Ogunajo F, Fasae O (2022) Music genre classification using machine and deep learning techniques: A review. ResearchJet J Anal Inventions- RJAI 3(03):35–50

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Nandkishor Narkhede.

Ethics declarations

Competing interests

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Narkhede, N., Mathur, S., Bhaskar, A. et al. Music genre classification and recognition using convolutional neural network. Multimed Tools Appl (2024). https://doi.org/10.1007/s11042-024-19243-3

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s11042-024-19243-3

Keywords

Navigation