Exploring Perceptual Based Timbre Feature for Singer Identification

Kalayar Khine, Swe Zin; Nwe, Tin Lay; Li, Haizhou

doi:10.1007/978-3-540-85035-9_10

Swe Zin Kalayar Khine¹,
Tin Lay Nwe¹ &
Haizhou Li¹

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 4969))

Included in the following conference series:

International Symposium on Computer Music Modeling and Retrieval

1960 Accesses
7 Citations

Abstract

Timbre can be defined as feature of an auditory stimulus that allows us to distinguish the sounds which have the same pitch and loudness. In this paper, we explore timbre based perceptual feature for singer identification. We start with a vocal detection process to extract the vocal segments from the sound. The cepstral coefficients, which reflect timbre characteristics, are then computed from the vocal segments. The cepstral coefficients of timbre are formulated by combining information of harmonic and the dynamic characteristics of the sound such as vibrato and the attack-decay envelope of the songs. Bandpass filters that spread according to the octave frequency scale are used to extract vibrato and harmonic information of sounds. The experiments are conducted on a database of 84 popular songs. The results show that the proposed timbre based perceptual feature is robust and effective. We achieve an average error rate of 12.2% in segment level singer identification.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 109.00; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Bartsch, M.A., Wakefield, G.H.: Singing Voice Identification Using Spectral Envelope Estimation. IEEE Transactions, Speech and Audio Processing 12, 100–109 (2004)
Article Google Scholar
Bretos, J., Sundberg, J.: Measurements of Vibrato Parameters in Long Sustained Crescendo Notes As Sung by Ten Sopranos. Journal of Voice 17, 343–352 (2003)
Article Google Scholar
Cleveland, T.F.: Acoustic Properties of Voice Timbre Types and Their Influence on Voice Classification. Journal of Acoustical Society of America 61, 1622–1629 (1977)
Article Google Scholar
Dejonckere, P.H., Hirano, M., Sundberg, J.: Vibrato, ch. 2. Singular Pub., San Diego (1995)
Google Scholar
Dromey, C., Carter, N., Hopkin, A.: Vibrato Rate Adjustment. Journal of Voice 17, 168–178 (2003)
Article Google Scholar
Erickson, M., Perry, S., Handel, S.: Discrimination Functions: Can They Be Used to Classify Singing Voices? Journal of Voice 15, 492–502 (2001)
Article Google Scholar
Everest, F.A.: Master Handbook of Acoustics. McGraw-Hill Professional, New York (2000)
Google Scholar
Joliveau, E., Smith, J., Wolfe, J.: Vocal Tract Resonances in Singing: The Soprano Voice. Journal of Acoustical Society of America 116, 2434–2439 (2004)
Article Google Scholar
Poli, G.D., Prandoni, P.: Sonological Models for Timber Characterization. Journal of New Music Research 26, 170–197
Google Scholar
Nwe, T.L., Foo, S.W., De Silva, L.C.: Stress classification using subband based features. IEICE Trans. Information and Systems, Special Issue on Speech Information Processing E86-D(3), 565–573 (2003)
Google Scholar
Nwe, T.L., Li, H.: Exploring Vibrato-Motivated Acoustic Features for Singer Identification. IEEE Transactions, Audio, Speech and Language Processing 15(2) (2007)
Google Scholar
Sukkar, R.A., Lee, C.H.: Vocabulary independent discriminative utterance verification for nonkeyword rejection in subword based speech recognition. IEEE Trans. Speech and Audio Processing 4, 420–429 (1996)
Article Google Scholar
Sundberg, J.: The Science of Singing Voice. Northern Illinois University Press (1987)
Google Scholar
Timmers, R., Desain, P.: Vibrato: Questions and Answers from Musicians and Science. In: Proc. Int. Conf. On Music Perception And Cognition, England (2000)
Google Scholar
Winckell, F.: Music, Sound and Sensation. Dover, NY (1967)
Google Scholar
Zhang, T.: System and method for automatic singer identification. In: Proceedings IEEE International Conference Multimedia and Expo., Baltimore, MD (2003)
Google Scholar
Zhang, T., Kuo, C.C.J.: Content-Based Audio Classification and Retrieval for Data Parsing. Kluwer Academic Publishers, USA (2001)
MATH Google Scholar
Helmholtz, H.: On the Sensation of Tone. Dover Publication, New York (1954)
Google Scholar
Fredouille, C., Bonastre, J.-F., Merlin, T.: Bayesian approach based-decision in speaker verification, A Speaker Odyssey, Crete, Greece (2001)
Google Scholar

Download references

Author information

Authors and Affiliations

Institute for Infocomm Research, , 21 Heng Mui Keng Terrace, Singapore, 119613
Swe Zin Kalayar Khine, Tin Lay Nwe & Haizhou Li

Authors

Swe Zin Kalayar Khine
View author publications
You can also search for this author in PubMed Google Scholar
Tin Lay Nwe
View author publications
You can also search for this author in PubMed Google Scholar
Haizhou Li
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Richard Kronland-Martinet Sølvi Ystad Kristoffer Jensen

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Kalayar Khine, S.Z., Nwe, T.L., Li, H. (2008). Exploring Perceptual Based Timbre Feature for Singer Identification. In: Kronland-Martinet, R., Ystad, S., Jensen, K. (eds) Computer Music Modeling and Retrieval. Sense of Sounds. CMMR 2007. Lecture Notes in Computer Science, vol 4969. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-85035-9_10

Download citation

DOI: https://doi.org/10.1007/978-3-540-85035-9_10
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-85034-2
Online ISBN: 978-3-540-85035-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics