Abstract
Tonic is one of the basic concepts in music. The tonic pitch is normally established by the main performers according to their vocal range and the range of the accompanying instruments. Hence, it varies among artists as well as across performances. Tonic identification is a key task because it plays an important role in solving other problems such as raga recognition and tuning analysis. In this paper, we review some of the latest tonic identification approaches in Indian art music. We study the performance of each method in the context of the music tradition using the Indian art music dataset. We analyze the performance of the cepstrum-based method of tonic pitch estimation with and without optimal selection of frames, non-negative matrix factorization (NMF), the frequency-ratio method, and the group delay-based algorithm. Furthermore, we show that the modified group delay-based methods outperform the conventional tonic estimation methods. We also present a detailed error analysis of each method, which will be helpful for future research.
Availability of data and materials
The datasets analyzed in this manuscript are available from the CompMusic project group on request.
Notes
Alapana is a form of manodharmam, or improvisation, that introduces and develops a raga (musical scale).
Acknowledgements
We sincerely thank the CompMusic project group for sharing the dataset for the experiments.
Funding
This research received no specific grant from any funding agency in the public, commercial, or not-for-profit sectors.
Author information
Contributions
Equal contributions from all authors
Ethics declarations
Consent for publication
Not Applicable
Ethics approval
Not Applicable
Competing interests
The authors declare that there is no competing interest related to this manuscript.
About this article
Cite this article
M.A., A., Rajan, R. A review on tonic estimation algorithms in indian art music. Multimed Tools Appl 83, 38443–38463 (2024). https://doi.org/10.1007/s11042-023-17161-4
DOI: https://doi.org/10.1007/s11042-023-17161-4