WeldVUI: Establishing Speech-Based Interfaces in Industrial Applications

  • Mirjam AugsteinEmail author
  • Thomas Neumayr
  • Sebastian Pimminger
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 11748)


Voice User Interfaces (VUIs) and speech-based applications have recently gained increasing popularity. During the past years, they have been included in a wide range of mass-market devices (smart phones or technology installed in common car cockpits) and are thus available for many everyday interaction scenarios (e.g., making phone calls or switching the lights on and off). This popularity also led to a number of guidelines for VUI design, software libraries and devices for speech recognition becoming available for interface designers and developers. Although generally helpful, these resources are often broad and do not fully satisfy the specific requirements of certain industrial applications. First, grammar and vocabulary in such settings usually differ drastically from everyday scenarios. Second, common software libraries and devices are often not able to comply with the conditions in industrial environments (e.g. involving high levels of noise). This paper describes the iterative, user-centered design process for VUIs and functional speech-based interaction prototypes for the domain of industrial welding, including a two-stage Wizard of Oz procedure, rapid prototyping, speech recognition improvement and thorough user involvement. Our experiences throughout this process generalize to other industrial applications and so-called “niche applications” where grammar and vocabulary usually have to be established from scratch. They are intended to guide other researchers setting up a similar process for designing and prototyping domain-specific VUIs.


Voice user interface design User-centered design Interaction design Speech-based interfaces Industrial applications 



The work described in this paper has been conducted within the scope of the project Welding Interaction in Future Industry funded through the BRIDGE 1 program, managed by the Austrian Research Promotion Agency (FFG). Project partners are the University of Applied Sciences Upper Austria, LIFEtool gemeinnützige GmbH and Fronius International GmbH.


  1. 1.
    Augstein, M., Neumayr, T., Pimminger, S., Ebner, C., Altmann, J., Kurschl, W.: Contextual design in industrial settings: experiences and recommendations. In: Proceedings of the 20th International Conference on Enterprise Information Systems, Funchal, Madeira, Portugal (2018)Google Scholar
  2. 2.
    Beyer, H., Holtzblatt, K.: Contextual design. Interactions, pp. 32–42 (1999)Google Scholar
  3. 3.
    Brooke, J.: SUS - a quick and dirty usability scale. Usability Eval. Ind. 189(194), 4–7 (1996)Google Scholar
  4. 4.
    Cohen, M., Giangola, J., Balogh, J.: Voice User Interface Design. Addison-Wesley, Boston (2004)Google Scholar
  5. 5.
    Cordasco, G., et al.: Assessing voice user interfaces: the vassist system prototype. In: Proceedings of the 5th IEEE Conference on Cognitive Infocommunications, Vietri sul Mare, Italy (2014)Google Scholar
  6. 6.
    Dybkjaer, L., Bernsen, N.O.: Usability evaluation in spoken language dialogue systems. In: Proceedings of the ACL 2001 Workshop on Evaluation Methodologies for Language and Dialogue Systems (2001)Google Scholar
  7. 7.
    Farinazzo, V., Salvador, M., De Oliveira Neto, J.S., Kawamoto, A.S.: Requirement engineering contributions to voice user interface. In: Proceedings of the First International Conference on Advances in Human-Computer Interaction. Sainte-Luce, France (2008)Google Scholar
  8. 8.
    Farinazzo, V., Salvador, M., Kawamoto, A.L., De Oliveira Neto, J.S.: An empirical approach for the evaluation of voice user interfaces. In: Matrai, R. (ed.) User Interfaces. InTech (2010)Google Scholar
  9. 9.
    Hart, S.G., Staveland, L.E.: Development of NASA-TLX (task load index): results of empirical and theoretical research. Adv. Psychol. 52, 139–183 (1988)CrossRefGoogle Scholar
  10. 10.
    Hassenzahl, M.: Hedonic, emotional and experiential perspectives on product quality. In: Ghaoui, C. (ed.) Encyclopedia of Human Computer Interaction. Idea Group Reference (2006)Google Scholar
  11. 11.
    Hirschman, L., Thompson, H.S.: Overview of evaluation in speech and natural language processing. In: Survey of the State of the Art in Human Language Technology. Oxford University Press, Oxford (1997)Google Scholar
  12. 12.
    Holtzblatt, K., Jones, S.: Contextual inquiry: a participatory technique for system design. In: Schuler, D., Namioka, A. (eds.) Participatory Design. Principles and Practices, vol. 9. Lawrence Erlbaum Associates, New York (1993)Google Scholar
  13. 13.
    Kelley, J.F.: An iterative design methodology for user-friendly natural language office information applications. ACM Trans. Inf. Syst. 2(1), 26–41 (1984). Scholar
  14. 14.
    Klemmer, S., et al.: Suede: a wizard of oz prototyping tool for speech user interfaces. In: Proceedings of the 13th Annual Symposium on User Interface Software and Technology, San Diego, California, USA, pp. 1–10 (2000)Google Scholar
  15. 15.
    Knight, S., Gorrell, G., Rayner, M., Milward, D., Koeling, R., Lewin, I.: Comparing grammar-based and robust approaches to speech understanding: a case study. In: Proceedings of Eurospeech 2001, Aalborg, Denmark (2001)Google Scholar
  16. 16.
    Laugwitz, B., Held, T., Schrepp, M.: Construction and evaluation of a user experience questionnaire. In: Holzinger, A. (ed.) USAB 2008. LNCS, vol. 5298, pp. 63–76. Springer, Heidelberg (2008). Scholar
  17. 17.
    Licklider, J.: Man-computer symbiosis. IRE Trans. Hum. Factors Electron. HFE-1(1), 4–11 (1960)CrossRefGoogle Scholar
  18. 18.
    Loizou, P.C.: Speech Enhancement: Theory and Practice, 2nd edn. CRC Press Inc., Boca Raton (2013)CrossRefGoogle Scholar
  19. 19.
    Möller, S.: A new taxonomy for the quality of telephone services based on spoken dialogue systems. In: Proceedings of the 3rd SIGdial Workshop on Discourse and Dialogue, Philadelphia, PA, USA (2002)Google Scholar
  20. 20.
    Möller, S.: Assessment and evaluation of speech-based interactive systems: from manual annotation to automatic usability evaluation. In: Chen, F., Jokinen, K. (eds.) Speech Technology. Theory and Applications. Springer, Boston (2010). Scholar
  21. 21.
    Noyes, J., Haas, E.: Military applications: human factors aspects of speech-based systems. In: Chen, F., Jokinen, K. (eds.) Speech Technology. Theory and Applications. Springer, Boston (2010). Scholar
  22. 22.
    Pearl, C.: Designing Voice User Interfaces: Principles of Conversational Experiences. O’Reilly Media Inc., Newton (2016)Google Scholar
  23. 23.
    Pieraccini, R., Suendermann, D., Dayanidhi, K., Liscombe, J.: Are we there yet? Research in commercial spoken dialog systems. In: Proceedings of the International Conference on Text, Speech and Dialogue, Pilsen, Czech Republic, pp. 3–13 (2009)CrossRefGoogle Scholar
  24. 24.
    Pires, N.: Robot-by-voice: experiments on commanding an industrial robot using the human voice. Ind. Robot: Int. J. Robot. Res. Appl. 32(6), 505–511 (2005)MathSciNetCrossRefGoogle Scholar
  25. 25.
    Rayner, M., Hockey, B.A., Renders, J.M., Chatzichrisafis, N., Farrell, K.: Spoken language processing in the clarissa procedure browser. Nat. Lang. Eng. 1(1), 1–28 (2005)Google Scholar
  26. 26.
    Rayner, M., Hockey, B.A., Renders, J.M., Chatzichrisafis, N., Farrell, K.: Spoken dialogue application in space: the clarissa procedure browser. In: Chen, F., Jokinen, K. (eds.) Speech Technology. Theory and Applications. Springer, Boston (2010). Scholar
  27. 27.
    Rogowski, A.: Industrially oriented voice control system. Robot. Comput.-Integr. Manuf. 28(3), 303–315 (2012)CrossRefGoogle Scholar
  28. 28.
    Sauro, J., Dumas, J.: Comparison of three one-question, post-task usability questionnaires. In: Proceedings of the 27th International Conference on Human Factors in Computing Systems, Boston, MA, USA (2009)Google Scholar
  29. 29.
    Schmandt, C.: Voice Communication with Computers: Conversational Systems. Van Nostrand Reinhold Co., New York (1994)zbMATHGoogle Scholar
  30. 30.
    Schnelle, D., Lyardet, F.: Voice user interface design patterns. In: Proceedings of the 11th European Conference on Pattern Languages of Programs. Irrsee, Germany (2006)Google Scholar
  31. 31.
    Turunen, M.: Jaspis - A spoken dialogue architecture and its applications. Ph.D. thesis, University of Tampere, Department of Information Studies, Tampere, Finland (2004)Google Scholar
  32. 32.
    Ullrich, D., Diefenbach, S.: Intui. exploring the facets of intuitive interaction. In: Tagungsband Mensch & Computer 2010: Interaktive Kulturen, Duisburg, Germany (2010)Google Scholar

Copyright information

© IFIP International Federation for Information Processing 2019

Authors and Affiliations

  1. 1.University of Applied Sciences Upper AustriaHagenbergAustria

Personalised recommendations