Abstract
Indian classical music is broadly divided into two classifications Carnatic and Hindustani music. The concept of raga and shruthi is fundamental in both Hindustani and Carnatic styles of music. Hence, when analyzing them, the identification of shruthi and raga is of prime importance. However, identifying the pitch, raga, and instrument from an Indian classical instrumental polyphonic audio for analysis proves to be quite a tough challenge. This chapter proposes a comprehensive comparison among convolution neural networks (CNN), recurrent neural network (RNN), and XGboost. There are three distinguishing feature sets created for each task. The first feature set comprises 26 defining features extracted from the songs, while the second feature set consists of 10 of the most significant features among the 26 defining features, and the third feature set is made up of 26 features extracted from source-separated audio files. The three models were created for individual tasks, and their performance is evaluated for the three feature sets. The implemented CNN and XGboost models have had an accuracy of around 70% and 90%, respectively. The RNN model, on the other hand, showed approximately 98% percent accuracy due to its internal memory and unique features.
The three individual tasks are combined to make a single model with RNN architecture. This combined model then demonstrated an approximate accuracy of 97.2% which is close to the accuracy obtained for the individual tasks.
For the three tasks, CNN, RNN, and XGboost had favorable accuracy.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Ashwini, Krishna, A.V., Mahesh, V. and Karrthik, G.K., 2020. Tone detection for Indian classical polyphonic instrumental audio using DNN model. International Journal of Forensic Engineering, 4(4), pp. 310–322.
Pawar, M.Y. and Mahajan, S., 2019. Automatic Tonic (Shruti) Identification System for Indian Classical Music. In Soft Computing and Signal Processing (pp. 733–742). Springer, Singapore.
Salamon, Justin & Gulati, Sankalp & Serra, Xavier. (2012). A MultiPitch Approach to Tonic Identification in Indian Classical Music. Proceedings - 13th International Society for Music Information Retrieval Conference (ISMIR 2012).
Gulati, Sankalp & Bellur, Ashwin & Salamon, Justin & H.G., Ranjani & Ishwar, Vignesh & Murthy, Hema & Serra, Xavier. (2014). Automatic Tonic Identification in Indian Art Music: Approaches and Evaluation. Journal of New Music Research. 43. 10.1080/09298215.2013.875042.
S. Samsekai Manjabhat, Shashidhar G. Koolagudi, K. S. Rao & Pravin Bhaskar Ramteke (2017) Raga and Tonic Identification in Carnatic Music, Journal of New Music Research, 46:3, 229–245, DOI: 10.1080/09298215.2017.1330351
D. Poojary, A. Pinto, A. Zahied, Anusha, and A. Shetty, “Automatic Tonic Identification in Indian Art Music”, IJRESM, vol. 3, no. 7, pp. 210–212, Jul. 2020.
Chordia, Parag, and Sertan Sentürk. “Joint Recognition of Raag and Tonic in North Indian Music.” <i>Computer Music Journal</i> 37, no. 3 (2013): 82–98. Accessed August 19, 2021. http://www.jstor.org/stable/24265515.
Madhusdhan, Sathwik Tejaswi and G. Chowdhary. “Tonic Independent Raag Classification in Indian Classical Music.” (2018).
S. Gaikwad, A. V. Chitre and Y. H. Dandawate, “Classification of Indian Classical Instruments Using Spectral and Principal Component Analysis Based Cepstrum Features,” 2014 International Conference on Electronic Systems, Signal Processing and Computing Technologies, 2014, pp. 276–279, doi: 10.1109/ICESC.2014.52.
Raga and Tonic Identification in Carnatic Music https://idr.nitk.ac.in/jspui/handle/123456789/12733
Gulati, S., Serrà, J., Ganguli, K. K., Şentürk, S., & Serra, X. (2016). Time-delayed melody surfaces for Raga recognition. In Proceedings of the 17th International Society for Music Information Retrieval Conference (ISMIR), pp. 751–757. New York, USA.
Alekh, Sanchit. (2017). Automatic Raga Recognition in Hindustani Classical Music.
Bhat, A., Krishna, A.V. and Acharya, S., 2020. Analytical Comparison of Classification Models for Raga Identification in Carnatic Classical Instrumental Polyphonic Audio. SN Computer Science, 1(6), pp. 1–9.
McFee, B. et al., 2015. librosa: Audio and music signal analysis in python. In Proceedings of the 14th python in science conference.
Ashwini, B., Ramesh, N., Naik, S.M. and Krishna, A.V., 2017, November. Lead source separation for Indian instrumental audio. In TENCON 2017–2017 IEEE Region 10 Conference (pp. 1121–1126). IEEE.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this chapter
Cite this chapter
Bhat, A., Krishnan, K.G., Mahesh, V., Ananthapadmanabha, V.K. (2023). Deep Learning Approach to Joint Identification of Instrument Pitch and Raga for Indian Classical Music. In: Biswas, A., Wennekes, E., Wieczorkowska, A., Laskar, R.H. (eds) Advances in Speech and Music Technology. Signals and Communication Technology. Springer, Cham. https://doi.org/10.1007/978-3-031-18444-4_8
Download citation
DOI: https://doi.org/10.1007/978-3-031-18444-4_8
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-18443-7
Online ISBN: 978-3-031-18444-4
eBook Packages: EngineeringEngineering (R0)