Transferring Vocal Expression of F0 Contour Using Singing Voice Synthesizer

Ikemiya, Yukara; Itoyama, Katsutoshi; Okuno, Hiroshi G.

doi:10.1007/978-3-319-07467-2_27

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 8482)

Included in the following conference series:

International Conference on Industrial, Engineering and Other Applications of Applied Intelligent Systems

Abstract

A system is described that separately transfers vocal expressions from singing voices with accompaniment to singing voice synthesizers. The expressions appear as fluctuations in the fundamental frequency (F0) contour of the singing voice, such as vibrato, glissando, and kobushi. The F0 contour of the singing voice is estimated using subharmonic summation in a limited frequency range and is temporally aligned to a chromatic pitch sequence. Each expression is then transcribed and parameterized in accordance with designed rules. Finally, the expressions are transferred to given scores on the singing voice synthesizer. Experiments demonstrated that the proposed system can transfer the vocal expressions while retaining the singer's individuality on two singing voice synthesizers: Vocaloid and CeVIO.
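The F0 estimation step mentioned in the abstract can be illustrated with a minimal sketch of subharmonic summation restricted to a frequency range, followed by snapping the estimate to the nearest chromatic pitch. This is not the authors' implementation: the function names `shs_f0` and `nearest_chromatic_pitch`, the 80–800 Hz search range, the eight harmonics, the 0.84 harmonic decay, the 10-cent candidate grid, and the A4 = 440 Hz tuning are all assumptions chosen for illustration, and the sketch works on a clean synthetic tone rather than on a voice with accompaniment.

```python
import numpy as np


def shs_f0(frame, sr, fmin=80.0, fmax=800.0, n_harm=8, decay=0.84, step_cents=10.0):
    """Estimate the F0 (Hz) of one frame by subharmonic summation.

    Only candidates in [fmin, fmax] are scored (the "limited frequency range").
    All default parameter values are illustrative assumptions.
    """
    windowed = frame * np.hanning(len(frame))
    n_fft = 4 * len(frame)  # zero-padding gives a finer frequency grid
    spectrum = np.abs(np.fft.rfft(windowed, n_fft))
    freqs = np.fft.rfftfreq(n_fft, d=1.0 / sr)

    # Candidate F0 values on a log-frequency grid spaced step_cents apart.
    n_cand = int(np.floor(1200.0 * np.log2(fmax / fmin) / step_cents)) + 1
    candidates = fmin * 2.0 ** (np.arange(n_cand) * step_cents / 1200.0)

    # Salience of a candidate = decaying weighted sum of the spectrum
    # sampled at its integer multiples.
    salience = np.zeros(n_cand)
    for n in range(1, n_harm + 1):
        weight = decay ** (n - 1)
        salience += weight * np.interp(n * candidates, freqs, spectrum, right=0.0)

    return candidates[np.argmax(salience)]


def nearest_chromatic_pitch(f0, a4=440.0):
    """Map a frequency in Hz to the nearest MIDI note number (A4 = 69)."""
    return int(round(69 + 12 * np.log2(f0 / a4)))


if __name__ == "__main__":
    # Synthetic harmonic tone standing in for an isolated sung note.
    sr = 16000
    t = np.arange(2048) / sr
    tone = sum(np.sin(2 * np.pi * 220.0 * k * t) / k for k in range(1, 7))
    f0 = shs_f0(tone, sr)
    print(f0, nearest_chromatic_pitch(f0))
```

Running `shs_f0` frame by frame over a recording would give an F0 contour; the deviation of that contour from the snapped chromatic pitch is where fluctuations such as vibrato or glissando would show up. How the paper parameterizes those fluctuations is not stated in the abstract, so it is not sketched here.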

Author information

Authors and Affiliations

Graduate School of Informatics, Kyoto University, 606-8501, Sakyo, Kyoto, Japan
Yukara Ikemiya, Katsutoshi Itoyama & Hiroshi G. Okuno

Editor information

Editors and Affiliations

Department of Computer Science, Texas State University, 78666, San Marcos, TX, USA
Moonis Ali
Department of Electronic Engineering, National Kaohsiung University of Applied Sciences, No. 415, Chien Kung Road, 80778, Kaohsiung, Taiwan
Jeng-Shyang Pan
Department of Computer Science and Information Engineering, National Taiwan University of Science and Technology, Taipei, Taiwan
Shyi-Ming Chen
Department of Electronics Engineering, National Kaohsiung University of Applied Sciences, Kaohsiung, Taiwan (ROC)
Mong-Fong Horng

About this paper

Cite this paper

Ikemiya, Y., Itoyama, K., Okuno, H.G. (2014). Transferring Vocal Expression of F0 Contour Using Singing Voice Synthesizer. In: Ali, M., Pan, JS., Chen, SM., Horng, MF. (eds) Modern Advances in Applied Intelligence. IEA/AIE 2014. Lecture Notes in Computer Science, vol 8482. Springer, Cham. https://doi.org/10.1007/978-3-319-07467-2_27

DOI: https://doi.org/10.1007/978-3-319-07467-2_27
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-07466-5
Online ISBN: 978-3-319-07467-2
eBook Packages: Computer Science, Computer Science (R0)
