Skip to main content

Deep Learning Approach to Joint Identification of Instrument Pitch and Raga for Indian Classical Music

  • Chapter
  • First Online:
Advances in Speech and Music Technology

Abstract

Indian classical music is broadly divided into two classifications Carnatic and Hindustani music. The concept of raga and shruthi is fundamental in both Hindustani and Carnatic styles of music. Hence, when analyzing them, the identification of shruthi and raga is of prime importance. However, identifying the pitch, raga, and instrument from an Indian classical instrumental polyphonic audio for analysis proves to be quite a tough challenge. This chapter proposes a comprehensive comparison among convolution neural networks (CNN), recurrent neural network (RNN), and XGboost. There are three distinguishing feature sets created for each task. The first feature set comprises 26 defining features extracted from the songs, while the second feature set consists of 10 of the most significant features among the 26 defining features, and the third feature set is made up of 26 features extracted from source-separated audio files. The three models were created for individual tasks, and their performance is evaluated for the three feature sets. The implemented CNN and XGboost models have had an accuracy of around 70% and 90%, respectively. The RNN model, on the other hand, showed approximately 98% percent accuracy due to its internal memory and unique features.

The three individual tasks are combined to make a single model with RNN architecture. This combined model then demonstrated an approximate accuracy of 97.2% which is close to the accuracy obtained for the individual tasks.

For the three tasks, CNN, RNN, and XGboost had favorable accuracy.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 139.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 179.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 179.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Ashwini, Krishna, A.V., Mahesh, V. and Karrthik, G.K., 2020. Tone detection for Indian classical polyphonic instrumental audio using DNN model. International Journal of Forensic Engineering, 4(4), pp. 310–322.

    Google Scholar 

  2. Pawar, M.Y. and Mahajan, S., 2019. Automatic Tonic (Shruti) Identification System for Indian Classical Music. In Soft Computing and Signal Processing (pp. 733–742). Springer, Singapore.

    Google Scholar 

  3. Salamon, Justin & Gulati, Sankalp & Serra, Xavier. (2012). A MultiPitch Approach to Tonic Identification in Indian Classical Music. Proceedings - 13th International Society for Music Information Retrieval Conference (ISMIR 2012).

    Google Scholar 

  4. Gulati, Sankalp & Bellur, Ashwin & Salamon, Justin & H.G., Ranjani & Ishwar, Vignesh & Murthy, Hema & Serra, Xavier. (2014). Automatic Tonic Identification in Indian Art Music: Approaches and Evaluation. Journal of New Music Research. 43. 10.1080/09298215.2013.875042.

    Google Scholar 

  5. S. Samsekai Manjabhat, Shashidhar G. Koolagudi, K. S. Rao & Pravin Bhaskar Ramteke (2017) Raga and Tonic Identification in Carnatic Music, Journal of New Music Research, 46:3, 229–245, DOI: 10.1080/09298215.2017.1330351

    Google Scholar 

  6. D. Poojary, A. Pinto, A. Zahied, Anusha, and A. Shetty, “Automatic Tonic Identification in Indian Art Music”, IJRESM, vol. 3, no. 7, pp. 210–212, Jul. 2020.

    Google Scholar 

  7. Chordia, Parag, and Sertan Sentürk. “Joint Recognition of Raag and Tonic in North Indian Music.” <i>Computer Music Journal</i> 37, no. 3 (2013): 82–98. Accessed August 19, 2021. http://www.jstor.org/stable/24265515.

    Article  Google Scholar 

  8. Madhusdhan, Sathwik Tejaswi and G. Chowdhary. “Tonic Independent Raag Classification in Indian Classical Music.” (2018).

    Google Scholar 

  9. S. Gaikwad, A. V. Chitre and Y. H. Dandawate, “Classification of Indian Classical Instruments Using Spectral and Principal Component Analysis Based Cepstrum Features,” 2014 International Conference on Electronic Systems, Signal Processing and Computing Technologies, 2014, pp. 276–279, doi: 10.1109/ICESC.2014.52.

    Google Scholar 

  10. Raga and Tonic Identification in Carnatic Music https://idr.nitk.ac.in/jspui/handle/123456789/12733

  11. Gulati, S., Serrà, J., Ganguli, K. K., Şentürk, S., & Serra, X. (2016). Time-delayed melody surfaces for Raga recognition. In Proceedings of the 17th International Society for Music Information Retrieval Conference (ISMIR), pp. 751–757. New York, USA.

    Google Scholar 

  12. Alekh, Sanchit. (2017). Automatic Raga Recognition in Hindustani Classical Music.

    Google Scholar 

  13. Bhat, A., Krishna, A.V. and Acharya, S., 2020. Analytical Comparison of Classification Models for Raga Identification in Carnatic Classical Instrumental Polyphonic Audio. SN Computer Science, 1(6), pp. 1–9.

    Article  Google Scholar 

  14. McFee, B. et al., 2015. librosa: Audio and music signal analysis in python. In Proceedings of the 14th python in science conference.

    Google Scholar 

  15. Ashwini, B., Ramesh, N., Naik, S.M. and Krishna, A.V., 2017, November. Lead source separation for Indian instrumental audio. In TENCON 2017–2017 IEEE Region 10 Conference (pp. 1121–1126). IEEE.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ashwini Bhat .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Bhat, A., Krishnan, K.G., Mahesh, V., Ananthapadmanabha, V.K. (2023). Deep Learning Approach to Joint Identification of Instrument Pitch and Raga for Indian Classical Music. In: Biswas, A., Wennekes, E., Wieczorkowska, A., Laskar, R.H. (eds) Advances in Speech and Music Technology. Signals and Communication Technology. Springer, Cham. https://doi.org/10.1007/978-3-031-18444-4_8

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-18444-4_8

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-18443-7

  • Online ISBN: 978-3-031-18444-4

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics