Design and Development of Spoken Dialog Systems Incorporating Speech Synthesis of Viennese Varieties

  • Michael Pucher
  • Friedrich Neubarth
  • Dietmar Schabus
Part of the Lecture Notes in Computer Science book series (LNCS, volume 6180)


This paper describes our work on the design and development of a spoken dialog system that uses synthesized speech of several Viennese varieties. In a previous study we investigated the usefulness of synthesizing varieties. The spoken dialog system was designed specifically for the different personas that can be realized with multiple varieties. This brings more realistic and fun-to-use spoken dialog systems to the end user and can serve as a speech-based user interface for blind users and users with visual impairment. For this group of users, the benefits are increased acceptability and comprehensibility, which result when the synthesized speech reflects the user’s linguistic and/or social identity.


Spoken dialog system, speech synthesis, dialect





Copyright information

© Springer-Verlag Berlin Heidelberg 2010

Authors and Affiliations

  • Michael Pucher (1)
  • Friedrich Neubarth (2)
  • Dietmar Schabus (1)
  1. Telecommunications Research Center Vienna (FTW), Vienna, Austria
  2. Austrian Research Institute for Artificial Intelligence (OFAI), Vienna, Austria
