Abstract
This paper describes our work on the design and development of a spoken dialog system, which uses synthesized speech of various different Viennese varieties. In a previous study we investigated the usefulness of synthesis of varieties. The developed spoken dialog system was especially designed for the different personas that can be realized with multiple varieties. This brings more realistic and fun-to-use spoken dialog systems to the end user and can serve as speech-based user interface for blind users and users with visual impairment. The benefits for this group of users are the increased acceptability and also comprehensibility that comes about when the synthesized speech reflects the user’s linguistic and/or social identity.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Viennese Sociolect and Dialect Synthesis, http://dialect-tts.ftw.at
Cassell, J.: Social Practice: Becoming Enculturated in Human-Computer Interaction. In: Stephanidis, C. (ed.) Universal Access in HCI (UAHCI), HCI 2009. LNCS, vol. 5616, pp. 303–313. Springer, Heidelberg (2009)
Cohen, M.H., Giangola, J.P., Balogh, J.: Voice user interface design. Addison-Wesley, Reading (2004)
Pucher, M., Schuchmann, G., Fröhlich, P.: Regionalized Text-to-Speech Systems: Persona Design and Application Scenarios. In: Esposito, A., Hussain, A., Marinaro, M., Martone, R. (eds.) Multimodal Signals: Cognitive and Algorithmic Issues. LNCS (LNAI), vol. 5398, pp. 216–222. Springer, Heidelberg (2009)
Pucher, M., Schabus, D., Yamagishi, J., Neubarth, F., Strom, V.: Modeling and Interpolation of Austrian German and Viennese Dialect in HMM-based Speech Synthesis. Speech Communication 52(2), 164–179 (2010)
Moosmüller, S.: Soziophonologische Variation im gegenwärtigen Wiener Deutsch. Franz Steiner Verlag, Stuttgart (1987)
VoiceXML 2.0 recommendation, http://www.w3.org/TR/voicexml20/
Dahlbäck, N., Wang, Q., Nass, C., Alwin, J.: Similarity is more important than expertise: Accent effects in speech interfaces. In: Proc. SIGCHI conference on human factors in computing systems, pp. 1553–1556 (2007)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Pucher, M., Neubarth, F., Schabus, D. (2010). Design and Development of Spoken Dialog Systems Incorporating Speech Synthesis of Viennese Varieties. In: Miesenberger, K., Klaus, J., Zagler, W., Karshmer, A. (eds) Computers Helping People with Special Needs. ICCHP 2010. Lecture Notes in Computer Science, vol 6180. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-14100-3_53
Download citation
DOI: https://doi.org/10.1007/978-3-642-14100-3_53
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-14099-0
Online ISBN: 978-3-642-14100-3
eBook Packages: Computer ScienceComputer Science (R0)