Interpolating Expressions in Unit Selection
In expressive speech synthesis, a key challenge is the generation of flexibly varying expressive tone while maintaining the high quality achieved with unit selection speech synthesis methods. Existing approaches have either concentrated on achieving high synthesis quality with no flexibility, or they have aimed at parametric models, requiring the use of parametric synthesis technologies such as diphone, formant or HMM-based synthesis.
KeywordsSpeech Synthesis Unit Selection Target Frame Linear Predictive Code Voice Conversion
Unable to display preview. Download preview PDF.
- 1.Turk, O., Schröder, M., Bozkurt, B., Arslan, L.: Voice quality interpolation for emotional text-to-speech synthesis. In: Proc. Interspeech 2005, Lisbon, Portugal, pp. 797–800 (2005)Google Scholar
- 2.Paliwal, K.K.: Interpolation properties of linear prediction parametric representations. In: Proc. Eurospeech 1995, Madrid, Spain, pp. 1029–1032 (1995)Google Scholar