HMM Based Duration Control for Singing TTS

Khan, Najeeb Ullah; Lee, Jung Chul

doi:10.1007/978-981-10-0281-6_20

Najeeb Ullah Khan⁵ &
Jung Chul Lee⁵

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 373))

1396 Accesses

Abstract

In order to develop a HMM based singing TTS system, we need a huge singing voice database to train HMM model parameters. However there is no singing voice database publically available and the construction of it is much more difficult than that of speech database. In this paper we propose a new method to improve the naturalness of singing TTS system using HMM models from speech database. Duration control model based on the syllabic analysis is applied to adapt speech duration model to singing duration model. The proposed method results in better singing voice quality compared to the maximum likelihood generation of durations using the speech database.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Hardcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Adding Singing Capabilities to Unit Selection TTS Through HNM-Based Conversion

Prosody Control and Variation Enhancement Techniques for HMM-Based Expressive Speech Synthesis

Synthesizing Asli Malay Song: Transforming Spoken Voices into Singing Voices

References

Kenmochi, H., Ohshita, H.: VOCALOID-commercial singing synthesizer based on sample concatenation. In: INTERSPEECH, Antwerp, Belgium, pp. 4009–4010 (2007)
Google Scholar
Kenmochi, H.: Singing synthesis as a new musical instrument. In: ICASSP, Kyoto, Japan, pp. 5385–5388 (2012)
Google Scholar
Saino, K., Zen, H., Nankaku, Y., Lee, A., Tokuda, K.: An HMM-based singing voice synthesis system. In: INTERSPEECH, Pittsburgh, Pennsylvania, pp. 2274–2277 (2006)
Google Scholar
Oura, K., Mase, A., Yamada, T., Muto, S., Nankaku, Y., Tokuda, K.: Recent development of the HMM-based singing voice synthesis system-Sinsy. In: SSW, pp. 211–216 (2010)
Google Scholar
Nakamura, K., Oura, K., Nankaku, Y., Tokuda, K.: HMM-Based singing voice synthesis and its application to Japanese and English. In: ICASSP, Florence, Italy, pp. 265-269 (2014)
Google Scholar
Shirota, K., Nakamura, K., Hashimoto, K., Oura, K., Nankaku, Y., Tokuda, K.: Integration of speaker and pitch adaptive training for HMM-based singing voice synthesis. In: ICASSP, Florence, Italy, pp. 2559–2563 (2014)
Google Scholar
Zen, H., Nose, T., Yamagishi, J., Sako, S., Masuko, T., Black, A.W., et al.: The HMM-based speech synthesis system (HTS) version 2.0. In: SSW, pp. 294–299 (2007)
Google Scholar
Tokuda, K., Nankaku, Y., Toda, T., Zen, H., Yamagishi, J., Oura, K.: Speech synthesis based on hidden markov models. Proceedings of the IEEE 101, 1234–1252 (2013)
Article Google Scholar
CMU_ARCTIC speech synthesis databases. http://festvox.org/cmu_arctic/

Download references

Author information

Authors and Affiliations

School of Electrical Engineering, University of Ulsan, Ulsan, South Korea
Najeeb Ullah Khan & Jung Chul Lee

Authors

Najeeb Ullah Khan
View author publications
You can also search for this author in PubMed Google Scholar
Jung Chul Lee
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Jung Chul Lee .

Editor information

Editors and Affiliations

Department of Computer Software Engg..., Soonchunhyang University, Chungnam, Korea (Republic of)
Doo-Soon Park
I-Lan, Taiwan
Han-Chieh Chao
Dept. of Multimedia Eng., Dongguk University, Seoul, Korea (Republic of)
Young-Sik Jeong
Department of Computer Science and Engg., Seoul University of Science & Technology, Seoul, Korea (Republic of)
James J. (Jong Hyuk) Park

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Khan, N.U., Lee, J.C. (2015). HMM Based Duration Control for Singing TTS. In: Park, DS., Chao, HC., Jeong, YS., Park, J. (eds) Advances in Computer Science and Ubiquitous Computing. Lecture Notes in Electrical Engineering, vol 373. Springer, Singapore. https://doi.org/10.1007/978-981-10-0281-6_20

Download citation

DOI: https://doi.org/10.1007/978-981-10-0281-6_20
Published: 18 December 2015
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-0280-9
Online ISBN: 978-981-10-0281-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

HMM Based Duration Control for Singing TTS

Abstract

Access this chapter

Preview

Similar content being viewed by others

Adding Singing Capabilities to Unit Selection TTS Through HNM-Based Conversion

Prosody Control and Variation Enhancement Techniques for HMM-Based Expressive Speech Synthesis

Synthesizing Asli Malay Song: Transforming Spoken Voices into Singing Voices

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

HMM Based Duration Control for Singing TTS

Abstract

Access this chapter

Preview

Similar content being viewed by others

Adding Singing Capabilities to Unit Selection TTS Through HNM-Based Conversion

Prosody Control and Variation Enhancement Techniques for HMM-Based Expressive Speech Synthesis

Synthesizing Asli Malay Song: Transforming Spoken Voices into Singing Voices

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation