Advertisement

Abstract

In expressive speech synthesis, a key challenge is the generation of flexibly varying expressive tone while maintaining the high quality achieved with unit selection speech synthesis methods. Existing approaches have either concentrated on achieving high synthesis quality with no flexibility, or they have aimed at parametric models, requiring the use of parametric synthesis technologies such as diphone, formant or HMM-based synthesis.

Keywords

Speech Synthesis Unit Selection Target Frame Linear Predictive Code Voice Conversion 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Turk, O., Schröder, M., Bozkurt, B., Arslan, L.: Voice quality interpolation for emotional text-to-speech synthesis. In: Proc. Interspeech 2005, Lisbon, Portugal, pp. 797–800 (2005)Google Scholar
  2. 2.
    Paliwal, K.K.: Interpolation properties of linear prediction parametric representations. In: Proc. Eurospeech 1995, Madrid, Spain, pp. 1029–1032 (1995)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2007

Authors and Affiliations

  • Marc Schröder
    • 1
  1. 1.DFKI GmbH, SaarbrückenGermany

Personalised recommendations