Skip to main content

Classification of Hindustani Musical Ragas Using One-Dimensional Convolutional Neural Networks

  • Conference paper
  • First Online:
Big Data, Machine Learning, and Applications (BigDML 2021)

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 1053))

  • 285 Accesses

Abstract

Ragas are a melodic progression of notes used in Indian classical music. They are believed to have mental and physiological enriching qualities and are used in Raga music therapy. Identification of Ragas necessitates a great deal of expertise since there are instances where two or more Ragas have very similar characteristics making them difficult to identify. An accurate classifier will be an indispensable tool for Indian classical music learners and enthusiasts alike. This paper proposes a One-Dimensional Convolutional Neural Network (1D-CNN) to classify Ragas in the Hindustani variant of the Indian classical music using raw audio waveform. We compare our model with an Artificial Neural Network (ANN) trained using audio features which were extracted using traditional signal processing techniques from the audio files. The original dataset generated and annotated by an expert consists of audio files for 12 Ragas played on the 4 instruments. An augmented dataset consisting of 12,000 samples was created from the original dataset using slight pitch variation. The ANN trained using audio features and the 1D-CNN trained using raw audio show an accuracy of 97.04% and 98.67%, respectively.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Bhat A, Vijaya Krishna A, Acharya S (2020) Analytical comparison of classification models for Raga identification in Carnatic classical instrumental polyphonic audio. SN Comput Sci 1(6):1–9

    Google Scholar 

  2. Balkwill L-L, Thompson WF (1999) A cross-cultural investigation of the perception of emotion in music: psychophysical and cultural cues. Music Percept 17(1):43–64

    Google Scholar 

  3. Valla JM, Alappatt JA, Mathur A, Singh NC (2017) Music and emotion—a case for north Indian classical music. Front Psychol 8:2115

    Article  Google Scholar 

  4. Anand A (2019) Raga identification using convolutional neural network. In: 2019 second international conference on advanced computational and communication paradigms (ICACCP). IEEE, pp 1–6

    Google Scholar 

  5. Computational models for the discovery of the world’s music. https://compmusic.upf.edu/datasets. Last accessed 8 July 2021

  6. Bidkar AA, Deshpande RS, Dandawate YH (2018) A novel approach for selection of features for North Indian classical raga recognition of instrumental music. In: 2018 international conference on advances in communication and computing technology (ICACCT). IEEE, pp 499–503

    Google Scholar 

  7. Muller M, Ellis DPW, Klapuri A, Richard G (2011) Signal processing for music analysis. IEEE J Sel Top Signal Process 5(6):1088–1110

    Google Scholar 

  8. Pandey G, Mishra C, Ipe P (2003) TANSEN: a system for automatic Raga identification. In: IICAI, pp 1350–1363

    Google Scholar 

  9. Sinith MS, Tripathi S, Murthy KVV (2020) Raga recognition using fibonacci series based pitch distribution in Indian Classical Music. Appl Acoust 167:107381

    Google Scholar 

  10. Shetty S, Achary KK (2009) Raga mining of Indian music by extracting arohana-avarohana pattern. Int J Recent Trends Eng 1(1):362

    Google Scholar 

  11. Samsekai Manjabhat S, Koolagudi SG, Rao KS, Ramteke PB (2017) Raga and tonic identification in Carnatic music. J New Music Res 46(3):229–245

    Google Scholar 

  12. Kumari P, Dandawate Y, Bidkar A (2018) Raga analysis and classification of instrumental music. In: International conference on advances in communication and computing technology (ICACCT)

    Google Scholar 

  13. Joshi D, Pareek J, Ambatkar P (2021) Indian classical Raga identification using machine learning

    Google Scholar 

  14. Bidkar AA, Deshpande RS, Dandawate YH (2021) A North Indian Raga recognition using ensemble classifier. Int J Electr Eng Technol (IJEET) 12(6):251–258

    Google Scholar 

  15. Kumar V, Pandya H, Jawahar CV (2014) Identifying Ragas in Indian music. In: 2014 22nd international conference on pattern recognition. IEEE, pp 767–772

    Google Scholar 

  16. Madhusudhan ST, Chowdhary G (2019) Deepsrgm-sequence classification and ranking in Indian classical music with deep learning. In: 20th international society for music information retrieval conference, ISMIR 2019. International Society for Music Information Retrieval, pp 533–540

    Google Scholar 

  17. Madhusdhan ST, Chowdhary G (2018) Tonic independent Raag classification in Indian classical music

    Google Scholar 

  18. McFee B, Raffel C, Liang D, Ellis DPW, McVicar M, Battenberg E, Nieto O (2015) librosa: audio and music signal analysis in python. In: Proceedings of the 14th python in science conference, vol 8, pp 18–25

    Google Scholar 

  19. Schlüter J, Grill T (2015) Exploring data augmentation for improved singing voice detection with neural networks. In: ISMIR, pp 121–126

    Google Scholar 

  20. Rajopddhye V (2002) Sangeet Shastra. Gandharv Mahavidyalaya Publication

    Google Scholar 

  21. Giannakopoulos T, Pikrakis A (2014) Introduction to audio analysis: a MATLAB® approach. Academic Press

    Google Scholar 

  22. Logan B (2000) Mel frequency cepstral coefficients for music modeling. In: ISMIR, vol 270, pp 1–11

    Google Scholar 

  23. Dai W, Dai C, Qu S, Li J, Das S (2017) Very deep convolutional neural networks for raw waveforms. In: 2017 IEEE international conference on acoustics, speech and signal processing (ICASSP). IEEE, pp 421–425

    Google Scholar 

  24. Wang Z, Yan W, Oates T (2017) Time series classification from scratch with deep neural networks: a strong baseline. In: 2017 international joint conference on neural networks (IJCNN). IEEE, pp 1578–1585

    Google Scholar 

  25. Tang W, Long G, Liu L, Zhou T, Jiang J, Blumenstein M (2020) Rethinking 1D-CNN for time series classification: a stronger baseline. arXiv preprint arXiv:2002.10061

  26. Kiranyaz S, Avci O, Abdeljaber O, Ince T, Gabbouj M, Inman DJ (2021) 1D convolutional neural networks and applications: a survey. Mech Syst Signal Process 151:107398

    Article  Google Scholar 

  27. Abadi M, Barham P, Chen J, Chen Z, Davis A, Dean J, Devin M et al (2016) Tensorflow: a system for large-scale machine learning. In: 12th {USENIX} symposium on operating systems design and implementation ({OSDI} 16), pp 265–283

    Google Scholar 

  28. Glorot X, Bengio Y (2010) Understanding the difficulty of training deep feedforward neural networks. In: Proceedings of the thirteenth international conference on artificial intelligence and statistics. JMLR Workshop and Conference Proceedings, pp 249–256

    Google Scholar 

  29. Ravanelli M, Bengio Y (2018) Speaker recognition from raw waveform with SincNet. In: 2018 IEEE spoken language technology workshop (SLT). IEEE, pp 1021–1028

    Google Scholar 

  30. Koduri GK, Gulati S, Rao P (2011) A survey of raaga recognition techniques and improvements to the state-of-the-art. Sound Music Comput 38:39–41

    Google Scholar 

Download references

Acknowledgements

We express our gratitude to Mr. Deepak Desai, a sitarist and music expert, for sharing his knowledge in music and his efforts in annotating the dataset.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Rutuparn Pawar .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Pawar, R., Gujar, S., Bidkar, A., Dandawate, Y. (2024). Classification of Hindustani Musical Ragas Using One-Dimensional Convolutional Neural Networks. In: Borah, M.D., Laiphrakpam, D.S., Auluck, N., Balas, V.E. (eds) Big Data, Machine Learning, and Applications. BigDML 2021. Lecture Notes in Electrical Engineering, vol 1053. Springer, Singapore. https://doi.org/10.1007/978-981-99-3481-2_23

Download citation

  • DOI: https://doi.org/10.1007/978-981-99-3481-2_23

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-99-3480-5

  • Online ISBN: 978-981-99-3481-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics