Abstract
A model is presented for the analysis and synthesis of low frequency human-like pitch deviation, as a replacement for existing modulation techniques in singing voice synthesis systems. Fundamental frequency (f 0) measurements are taken from vocalists producing a selected range of utterances without vibrato and trends in the data are observed. A probabilistic function that provides natural sounding low frequency f 0 modulation to synthesized singing voices is presented and the perceptual relevance is evaluated with subjective listening tests.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Bonada, J., Serra, X.: Synthesis of the singing voice by performance sampling and spectral models. IEEE Signal Processing Magazine 24(2), 67–79 (2007)
Burnett, T., Senner, J., Larson, C.: Voice F0 responses to pitch-shifted auditory feedback: a preliminary study. Journal of Voice 11(2), 202–211 (1997)
Cook, P.R.: Identification of control parameters in an articulatory vocal tract model, with applications to the synthesis of singing. Ph.D. thesis, Stanford University, Stanford, CA, USA (1991), uMI Order No. GAX91-15756
Fabig, L., Janer, J.: Transforming singing voice expression - the sweetness effect. In: Proc. of the 7th Int. Conference on Digital Audio Effects (DAFX 2004), Naples, Italy (2004)
Goto, M., Nakano, T.: Vocalistner: a singing-to-singing synthesis system based on iterative parameter estimation. In: Sound and Music Computing Conference (2010)
Kenmochi, H., Ohshita, H.: Vocaloid-commercial singing synthesizer based on sample concatenation. In: Eighth Annual Conference of the International Speech Communication Association (2007)
Klatt, D., Klatt, L.: Analysis, synthesis, and perception of voice quality variations among female and male talkers. The Journal of the Acoustical Society of America 87, 820 (1990)
Kreiman, J., Gerratt, B.: Perception of aperiodicity in pathological voice. The Journal of the Acoustical Society of America 117, 2201 (2005)
Lai, W.: F0 control model for mandarin singing voice synthesis. In: International Conference on Digital Telecommunications, p. 12 (2007)
Larson, C., Burnett, T., Bauer, J., Kiran, S., Hain, T.: Comparison of voice f responses to pitch-shift onset and offset conditions. The Journal of the Acoustical Society of America 110, 2845 (2001)
Leydon, C., Bauer, J., Larson, C.: The role of auditory feedback in sustaining vocal vibrato. The Journal of the Acoustical Society of America 114, 1575 (2003)
Loscos, A.: Spectral Processing of the singing voice. Ph.D. thesis, Universitat Pompeu Fabra, Barcelona (2007)
Macon, M., Jensen-Link, L., Oliverio, J., Clements, M., George, E.: Concatenation-based MIDI-to-singing voice synthesis. Audio Engineering Society Journal (1997)
Orlikoff, R., Baken, R.: Fundamental frequency modulation of the human voice by the heartbeat: preliminary results and possible mechanisms. The Journal of the Acoustical Society of America 85, 888 (1989)
Rodet, X.: Synthesis and Processing of the Singing Voice. Ph.D. thesis, IRCAM, Paris (2002)
Ruinskiy, D., Lavner, Y.: Stochastic models of pitch jitter and amplitude shimmer for voice modification. In: IEEE 25th Convention of Electrical and Electronics Engineers in Israel, IEEEI 2008, pp. 489–493. IEEE (2009)
Saitou, T., Unoki, M., Akagi, M.: Development of an F0 control model based on F0 dynamic characteristics for singing-voice synthesis. Speech Communication 46(3-4), 405–417 (2005)
Schoentgen, J.: Estimation of the modulation frequency and modulation depth of the fundamental frequency owing to vocal micro-tremor of the voice source signal. In: Eurospeech, p. 1499 (2001)
Schröder, M.: Emotional speech synthesis: A review. In: Proceedings of EUROSPEECH, vol. 1, pp. 561–564 (2001)
Shipp, T., Sundberg, J., Doherty, E.: The effect of delayed auditory feedback on vocal vibrato. Journal of Voice 2(3), 195–199 (1988)
Shipp, T., Doherty, E.T., Haglund, S.: Physiologic factors in vocal vibrato production. Journal of Voice 4(4), 300–304 (1990)
Stables, R., Bullock, J., Athwal, C.: The humanisation of stochastic processes for the modelling of f 0 drift in singing. In: CMMR-FRSM 2011 Joint International Symposium on Computer Music Modelling and Retrieval with Frontiers of Research on Speech and Music (2011)
Stables, R., Bullock, J., Athwal, C.: Towards a model for the humanisation of pitch drift in singing voice synthesis. In: International Computer Music Conference (ICMC 2011), Huddersfield, UK (2011)
Stables, R., Bullock, J., Williams, I.: Perceptually relevant models for drum pattern humanisation. In: Proceedings of 131st Audio Engineering Society Convention. AES, New York (2011)
Sun, X.: Pitch determination and voice quality analysis using subharmonic-to-harmonic ratio. In: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, (ICASSP 2002), vol. 1. IEEE (2005)
Sundberg, J.: Science of the Singing Voice. Northern Illinois University Press (1987)
Ternström, S., Friberg, A.: Analysis and simulation of small variations in the fundamental frequency of sustained vowels. STL-QPSR 30(3), 1–14 (1989)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Stables, R., Athwal, C., Bullock, J. (2012). Fundamental Frequency Modulation in Singing Voice Synthesis. In: Ystad, S., Aramaki, M., Kronland-Martinet, R., Jensen, K., Mohanty, S. (eds) Speech, Sound and Music Processing: Embracing Research in India. CMMR FRSM 2011 2011. Lecture Notes in Computer Science, vol 7172. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-31980-8_10
Download citation
DOI: https://doi.org/10.1007/978-3-642-31980-8_10
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-31979-2
Online ISBN: 978-3-642-31980-8
eBook Packages: Computer ScienceComputer Science (R0)