Classification of Hindustani Musical Ragas Using One-Dimensional Convolutional Neural Networks

Pawar, Rutuparn; Gujar, Shubham; Bidkar, Anagha; Dandawate, Yogesh

doi:10.1007/978-981-99-3481-2_23

Rutuparn Pawar⁴⁰,
Shubham Gujar⁴⁰,
Anagha Bidkar⁴⁰ &
…
Yogesh Dandawate⁴⁰

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 1053))

Included in the following conference series:

International Conference on Big Data, Machine Learning, and Applications

285 Accesses

Abstract

Ragas are a melodic progression of notes used in Indian classical music. They are believed to have mental and physiological enriching qualities and are used in Raga music therapy. Identification of Ragas necessitates a great deal of expertise since there are instances where two or more Ragas have very similar characteristics making them difficult to identify. An accurate classifier will be an indispensable tool for Indian classical music learners and enthusiasts alike. This paper proposes a One-Dimensional Convolutional Neural Network (1D-CNN) to classify Ragas in the Hindustani variant of the Indian classical music using raw audio waveform. We compare our model with an Artificial Neural Network (ANN) trained using audio features which were extracted using traditional signal processing techniques from the audio files. The original dataset generated and annotated by an expert consists of audio files for 12 Ragas played on the 4 instruments. An augmented dataset consisting of 12,000 samples was created from the original dataset using slight pitch variation. The ANN trained using audio features and the 1D-CNN trained using raw audio show an accuracy of 97.04% and 98.67%, respectively.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Hardcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Bhat A, Vijaya Krishna A, Acharya S (2020) Analytical comparison of classification models for Raga identification in Carnatic classical instrumental polyphonic audio. SN Comput Sci 1(6):1–9
Google Scholar
Balkwill L-L, Thompson WF (1999) A cross-cultural investigation of the perception of emotion in music: psychophysical and cultural cues. Music Percept 17(1):43–64
Google Scholar
Valla JM, Alappatt JA, Mathur A, Singh NC (2017) Music and emotion—a case for north Indian classical music. Front Psychol 8:2115
Article Google Scholar
Anand A (2019) Raga identification using convolutional neural network. In: 2019 second international conference on advanced computational and communication paradigms (ICACCP). IEEE, pp 1–6
Google Scholar
Computational models for the discovery of the world’s music. https://compmusic.upf.edu/datasets. Last accessed 8 July 2021
Bidkar AA, Deshpande RS, Dandawate YH (2018) A novel approach for selection of features for North Indian classical raga recognition of instrumental music. In: 2018 international conference on advances in communication and computing technology (ICACCT). IEEE, pp 499–503
Google Scholar
Muller M, Ellis DPW, Klapuri A, Richard G (2011) Signal processing for music analysis. IEEE J Sel Top Signal Process 5(6):1088–1110
Google Scholar
Pandey G, Mishra C, Ipe P (2003) TANSEN: a system for automatic Raga identification. In: IICAI, pp 1350–1363
Google Scholar
Sinith MS, Tripathi S, Murthy KVV (2020) Raga recognition using fibonacci series based pitch distribution in Indian Classical Music. Appl Acoust 167:107381
Google Scholar
Shetty S, Achary KK (2009) Raga mining of Indian music by extracting arohana-avarohana pattern. Int J Recent Trends Eng 1(1):362
Google Scholar
Samsekai Manjabhat S, Koolagudi SG, Rao KS, Ramteke PB (2017) Raga and tonic identification in Carnatic music. J New Music Res 46(3):229–245
Google Scholar
Kumari P, Dandawate Y, Bidkar A (2018) Raga analysis and classification of instrumental music. In: International conference on advances in communication and computing technology (ICACCT)
Google Scholar
Joshi D, Pareek J, Ambatkar P (2021) Indian classical Raga identification using machine learning
Google Scholar
Bidkar AA, Deshpande RS, Dandawate YH (2021) A North Indian Raga recognition using ensemble classifier. Int J Electr Eng Technol (IJEET) 12(6):251–258
Google Scholar
Kumar V, Pandya H, Jawahar CV (2014) Identifying Ragas in Indian music. In: 2014 22nd international conference on pattern recognition. IEEE, pp 767–772
Google Scholar
Madhusudhan ST, Chowdhary G (2019) Deepsrgm-sequence classification and ranking in Indian classical music with deep learning. In: 20th international society for music information retrieval conference, ISMIR 2019. International Society for Music Information Retrieval, pp 533–540
Google Scholar
Madhusdhan ST, Chowdhary G (2018) Tonic independent Raag classification in Indian classical music
Google Scholar
McFee B, Raffel C, Liang D, Ellis DPW, McVicar M, Battenberg E, Nieto O (2015) librosa: audio and music signal analysis in python. In: Proceedings of the 14th python in science conference, vol 8, pp 18–25
Google Scholar
Schlüter J, Grill T (2015) Exploring data augmentation for improved singing voice detection with neural networks. In: ISMIR, pp 121–126
Google Scholar
Rajopddhye V (2002) Sangeet Shastra. Gandharv Mahavidyalaya Publication
Google Scholar
Giannakopoulos T, Pikrakis A (2014) Introduction to audio analysis: a MATLAB® approach. Academic Press
Google Scholar
Logan B (2000) Mel frequency cepstral coefficients for music modeling. In: ISMIR, vol 270, pp 1–11
Google Scholar
Dai W, Dai C, Qu S, Li J, Das S (2017) Very deep convolutional neural networks for raw waveforms. In: 2017 IEEE international conference on acoustics, speech and signal processing (ICASSP). IEEE, pp 421–425
Google Scholar
Wang Z, Yan W, Oates T (2017) Time series classification from scratch with deep neural networks: a strong baseline. In: 2017 international joint conference on neural networks (IJCNN). IEEE, pp 1578–1585
Google Scholar
Tang W, Long G, Liu L, Zhou T, Jiang J, Blumenstein M (2020) Rethinking 1D-CNN for time series classification: a stronger baseline. arXiv preprint arXiv:2002.10061
Kiranyaz S, Avci O, Abdeljaber O, Ince T, Gabbouj M, Inman DJ (2021) 1D convolutional neural networks and applications: a survey. Mech Syst Signal Process 151:107398
Article Google Scholar
Abadi M, Barham P, Chen J, Chen Z, Davis A, Dean J, Devin M et al (2016) Tensorflow: a system for large-scale machine learning. In: 12th {USENIX} symposium on operating systems design and implementation ({OSDI} 16), pp 265–283
Google Scholar
Glorot X, Bengio Y (2010) Understanding the difficulty of training deep feedforward neural networks. In: Proceedings of the thirteenth international conference on artificial intelligence and statistics. JMLR Workshop and Conference Proceedings, pp 249–256
Google Scholar
Ravanelli M, Bengio Y (2018) Speaker recognition from raw waveform with SincNet. In: 2018 IEEE spoken language technology workshop (SLT). IEEE, pp 1021–1028
Google Scholar
Koduri GK, Gulati S, Rao P (2011) A survey of raaga recognition techniques and improvements to the state-of-the-art. Sound Music Comput 38:39–41
Google Scholar

Download references

Acknowledgements

We express our gratitude to Mr. Deepak Desai, a sitarist and music expert, for sharing his knowledge in music and his efforts in annotating the dataset.

Author information

Authors and Affiliations

Department of Electronics and Telecommunication, Vishwakarma Institute of Information Technology, Pune, Maharashtra, India
Rutuparn Pawar, Shubham Gujar, Anagha Bidkar & Yogesh Dandawate

Authors

Rutuparn Pawar
View author publications
You can also search for this author in PubMed Google Scholar
Shubham Gujar
View author publications
You can also search for this author in PubMed Google Scholar
Anagha Bidkar
View author publications
You can also search for this author in PubMed Google Scholar
Yogesh Dandawate
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Rutuparn Pawar .

Editor information

Editors and Affiliations

Computer Science & Engineering, National Institute of Technology Silchar, Silchar, Assam, India
Malaya Dutta Borah
Computer Science & Engineering, National Institute of Technology Silchar, Silchar, Assam, India
Dolendro Singh Laiphrakpam
Computer Science & Engineering, Indian Institute of Technology Ropar, Rupnagar, Punjab, India
Nitin Auluck
Department of Automation and Applied Informatics, Aurel Vlaicu University of Arad, Arad, Romania
Valentina Emilia Balas

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Pawar, R., Gujar, S., Bidkar, A., Dandawate, Y. (2024). Classification of Hindustani Musical Ragas Using One-Dimensional Convolutional Neural Networks. In: Borah, M.D., Laiphrakpam, D.S., Auluck, N., Balas, V.E. (eds) Big Data, Machine Learning, and Applications. BigDML 2021. Lecture Notes in Electrical Engineering, vol 1053. Springer, Singapore. https://doi.org/10.1007/978-981-99-3481-2_23

Download citation

DOI: https://doi.org/10.1007/978-981-99-3481-2_23
Published: 30 November 2023
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-3480-5
Online ISBN: 978-981-99-3481-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics