Abstract
The maturity of standards and the availability of open source components for all levels of the MRCP stack provide new opportunities for the development of spoken dialog technology. In this paper, a standards-based, modular architecture for interactive voice response (IVR) systems is presented together with its implementation, Zanzibar OpenIVR. The architecture, described in terms of its components and standards, is compared with other existing frameworks. The use of our framework is discussed with regard to several aspects of spoken dialog technology: speech recognition and synthesis, integration of the components, dialog management, and natural language understanding. The framework is designed to work over VoIP as well as over conventional telephony channels, and thus also supports web-based access. Zanzibar OpenIVR can serve as a starting point for building dialog systems and for research in voice-enabled technologies.
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Prylipko, D., Schnelle-Walka, D., Lord, S., Wendemuth, A. (2011). Zanzibar OpenIVR: An Open-Source Framework for Development of Spoken Dialog Systems. In: Habernal, I., Matoušek, V. (eds) Text, Speech and Dialogue. TSD 2011. Lecture Notes in Computer Science, vol 6836. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-23538-2_47
DOI: https://doi.org/10.1007/978-3-642-23538-2_47
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-23537-5
Online ISBN: 978-3-642-23538-2
eBook Packages: Computer Science (R0)