Skip to main content
Log in

A review on tonic estimation algorithms in indian art music

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

Tonic is one of the basic concepts in music. The tonic pitch is normally entrenched by the main performers, depending on their vocal range and the range of the accompanying instruments. Hence, it varies among artists as well as performances. Tonic identification is a key task to be performed since it plays an important role in solving other problems like raga recognition, tuning analysis, etc. In this paper, we review some of the latest tonic identification approaches in Indian art music. We study the performance of each method in the context of music tradition using the Indian art music dataset. We analyzed the performance of the cepstral-based method of tonic pitch estimation with and without optimal selection of frames, non-negative matrix factorization (NMF), frequency-ratio method, and group delay-based algorithm. Furthermore, we show that the modified group delay-based methods outperform the conventional tonic estimation methods. We also present a detailed error analysis of each method which will be helpful for future research.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11

Similar content being viewed by others

Availability of data and materials

The datasets analyzed in this manuscript are available from CompMusic project group on request.

Notes

  1. alapana is a form of manodharmam, or improvisation, that introduces and develops a raga (musical scale)

  2. http://www.shivkumar.org/music/manodharma/index.html

  3. https://en.wikipedia.org/wiki/Gamaka(music)

References

  1. Sentürk S, Gulati S, Serra X (2013) Score informed tonic identification for makam music of Turkey. In Proceedings of 14th international society for music information retrieval conference. Curitiba, Brazil 4–8 novembre de 2013. International society for music information retrieval (ISMIR)

  2. Atli HS, Bozkurt B, Sentürk S (2015) A method for tonic frequency identification of Turkish makam music recordings. In 5th international workshop on folk music analysis 2015 Jun 10–12 Paris. France Association Dirac 2015, pp 119–22

  3. Tzanetakis G, Kapur A, Schloss WA, Wright M (2007) Computational ethnomusicology. J Interdiscip Music Studies 1(2):1–24

    Google Scholar 

  4. Alison A (1998) The garland encyclopedia of world music: South Asia: the Indian subcontinent, (vol 1). Taylor & Francis

    Google Scholar 

  5. Bor J (ed) (2010) Hindustani music: thirteenth to twentieth centuries, Manohar Publishers & Distributors Codarts

  6. Viswanathan T, Allen MH (2004) Music in South India: the Karn\(\dagger \)atak concert tradition and beyond: experiencing music, expressing culture, (No Sirsi) i9780195145908

  7. Madhusudhan ST, Chowdhary G (2019) Deepsrgm-sequence classification and ranking in Indian classical music with deep learning. In Proceedings of the 20th international society for music information retrieval conference, pp 533–540

  8. Wade BC (1998) Imaging sound: An ethnomusicological study of music, art, and culture in Mughal India. University of Chicago Press

    Google Scholar 

  9. Caudhurī VR (2000) The dictionary of Hindustani classical music, vol 8, Motilal Banarsidass Publ

  10. Unnikrishnan G (2018) An efficient method for tonic detection from south Indian classical music. Int J Res Eng Innov 2(3):293–298

    Google Scholar 

  11. Koduri GK, Gulati S, Rao P, Serra X (2012) Rāga recognition based on pitch distribution methods. J New Music Res 41(4):337–350

    Article  Google Scholar 

  12. Deva BC (1980) The music of India: A scientific study, Humanities Press

  13. Bellur A, Murthy HA (2013) A cepstrum based approach for identifying tonic pitch in Indian classical music. Journal of new music research, IEEE, pp 1–5

  14. Gulati S, Salamon J, Serra X (2012) A two-stage approach for tonic identification in Indian art music. In Proceedings of the 2nd CompMusic workshop, Jul 12-13; Istanbul, Turkey. Barcelona: Universitat Pompeu Fabra; 2012, p 119–127

  15. Ranjani HG, Arthi S, Sreenivas TV (2011) Carnatic music analysis: Shadja, swara identification and raga verification in alapana using stochastic models. In 2011 IEEE workshop on applications of signal processing to audio and acoustics (WASPAA), pp 29–32. IEEE

  16. Salamon J, Gulati S, Serra X (2012) A multipitch approach to tonic identification in indian classical music. In Gouyon F, Herrera P, Martins LG, Müller M (eds) ISMIR 2012: Proceedings of the 13th international society for music information retrieval conference; 2012 Oct 8-12; Porto, Portugal. Porto: FEUP Ediçoes; 2012. International society for music information retrieval (ISMIR)

  17. Samsekai MS, Koolagudi SG, Rao KS, Ramteke PB (2017) Raga and tonic identification in carnatic music. J New Music Res 46(3):229–245. https://doi.org/10.1080/09298215.2017.1330351

    Article  Google Scholar 

  18. Pawar MY, Mahajan S, (2019) Automatic tonic (shruti) identification system for indian classical music. In Soft computing and signal processing: proceedings of ICSCSP, (2018) vol 1, pp 733–742. Springer Singapore. https://doi.org/10.1007/978-981-13-3600-370

  19. Bogert BP (1963) The quefrency alanysis of time series for echoes; Cepstrum, pseudo-autocovariance, cross-cepstrum and saphe cracking. Time series analysis pp 209–243

  20. Rabiner LR (1978) Digital processing of speech signals. Pearson Education India

  21. Smaragdis P, Brown JC (2003) Non-negative matrix factorization for polyphonic music transcription. In 2003 IEEE workshop on applications of signal processing to audio and acoustics, (IEEE Cat No 03TH8684) (pp 177–180). IEEE

  22. Anantapadmanabhan A, Bellur A, Murthy HA (2013) Modal analysis and transcription of strokes of the mridangam using non-negative matrix factorization. In 2013 IEEE international conference on acoustics, speech and signal processing (pp 181–185). IEEE

  23. Schmidt MN, Olsson RK (2006) Single-channel speech separation using sparse non-negative matrix factorization. Interspeech 2:2–5

    Google Scholar 

  24. Aiswarya MA, Sinith MS, Rajan R (2023) Automatic tonic pitch estimation in south indian classical music using frequency-ratio method. In 2023 international conference on intelligent systems for communication, IoT and security (ICISCoIS) IEEE, pp 527–532

  25. Yegnanarayana B, Saikia D, Krishnan T (1984) Significance of group delay functions in signal reconstruction from spectral magnitude or phase. IEEE Trans Acoustics Speech Signal Process 32(3):610–623

    Article  Google Scholar 

  26. Oppenheim AV (1999) Discrete-time signal processing. Pearson Education India

  27. Murthy HA, Yegnanarayana B (2011) Group delay functions and its applications in speech technology. Sadhana 36(5):745–782

    Article  Google Scholar 

  28. Rajan R, Murthy HA (2013) Group delay based melody monopitch extraction from music. In 2013 IEEE international conference on acoustics, speech and signal processing IEEE, pp 186–190

  29. Rajan R, Murthy HA (2017) Two-pitch tracking in co-channel speech using modified group delay functions. Speech Commun 89:37–46

    Article  Google Scholar 

  30. Aiswarya MA (2022) Tonic estimation in classical music. MTech thesis, APJ Abdul Kalam Technological University, Kerala, India

  31. Gulati S, Bellur A, Salamon J, Ranjani HG, Ishwar V, Murthy HA, Serra X (2014) Automatic tonic identification in Indian art music: approaches and evaluation. J New Music Res 43(1):53–71

    Article  Google Scholar 

  32. De Cheveigné A, Kawahara H (2002) YIN, a fundamental frequency estimator for speech and music. J Acoust Soc Amer 111(4):1917–1930

    Article  Google Scholar 

  33. Seung D, Lee L (2001) Algorithms for non-negative matrix factorization. Adv Neural Inf Process Syst 13:556–562

    Google Scholar 

  34. Camacho A, Harris JG (2008) A sawtooth waveform inspired pitch estimator for speech and music. J Acoust Soc Amer 124(3):1638–1652

    Article  Google Scholar 

  35. Salamon J, Gómez E (2012) Melody extraction from polyphonic music signals using pitch contour characteristics. IEEE Trans Audio Speech Language Process 20(6):1759–1770

    Article  Google Scholar 

Download references

Acknowledgements

We sincerely thank the CompMusic project group for sharing the dataset for the experiments.

Funding

This research received no specific grant from any funding agency in the public, commercial, or not-for-profit sectors

Author information

Authors and Affiliations

Authors

Contributions

Equal contributions from all authors

Corresponding author

Correspondence to Aiswarya M.A..

Ethics declarations

Consent for publication

Not Applicable

Ethics approval

Not Applicable

Competing interests

The authors declare that there is no competing interest related to this manuscript.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

M.A., A., Rajan, R. A review on tonic estimation algorithms in indian art music. Multimed Tools Appl 83, 38443–38463 (2024). https://doi.org/10.1007/s11042-023-17161-4

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-023-17161-4

Keywords

Navigation