Abstract
Recent research in speech synthesis has focused mainly on naturalness, and emotional speech synthesis has become one of the prominent research topics. Although quite a few studies have addressed emotional speech in English or Japanese, studies on Korean are rare. This paper presents an analysis of emotional speech in Korean. Emotional speech features related to human speech prosody, such as F0, duration, and amplitude, together with their variations, are exploited. An attempt is made to quantify and model their contribution to three typical types of human speech. Using the analysis results, emotional voice conversion from neutral speech to emotional speech is also performed and tested.
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kim, SJ., Kim, KK., Han, H.B., Hahn, M. (2005). Study on Emotional Speech Features in Korean with Its Application to Voice Conversion. In: Tao, J., Tan, T., Picard, R.W. (eds) Affective Computing and Intelligent Interaction. ACII 2005. Lecture Notes in Computer Science, vol 3784. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11573548_44
DOI: https://doi.org/10.1007/11573548_44
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-29621-8
Online ISBN: 978-3-540-32273-3
eBook Packages: Computer Science (R0)