Zanzibar OpenIVR: An Open-Source Framework for Development of Spoken Dialog Systems

  • Dmytro Prylipko
  • Dirk Schnelle-Walka
  • Spencer Lord
  • Andreas Wendemuth
Part of the Lecture Notes in Computer Science book series (LNCS, volume 6836)


The maturity of standards and the availability of open source components for all levels of the MRCP stack provide us with new opportunities for the development of spoken dialog technology. In this paper a standard-based and modular architecture for interactive voice response (IVR) systems is presented together with its implementation – Zanzibar OpenIVR. The architecture, described in terms of components and standards, is compared to other existing frameworks. The usage of our framework is discussed regarding different aspects of spoken dialog technology such as speech recognition and synthesis, integration of the components, dialog management, natural language understanding. It is designed to work over VoIP as well as with usual telephony communication channels, thus provides an ability for web based access. Zanzibar OpenIVR is able to serve as a starting point for building dialog systems and research in voice-enabled technologies.


IVR VoiceXML VoIP Spoken dialog system 


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Bolt, R.A.: Put-that-there: Voice and gesture at the graphics interface. SIGGRAPH Comput. Graph. 14, 262–270 (1980)CrossRefGoogle Scholar
  2. 2.
    Veracode: State of software security report volume 2. Research report, Veracode (2010)Google Scholar
  3. 3.
    Hammond, J.S., Gerush, M., Sileikis, J.: Open source software goes mainstream. Research document, Forrester Research (2009)Google Scholar
  4. 4.
    Jackson, E.: Speaking up for cost savings in the call center: Vxml takes on the dinosaur of legacy ivr (2003), (last accessed 08/20/2010)
  5. 5.
    Walker, W., Lamere, P., Kwok, P., Raj, B., Singh, R., Gouvea, E., Wolf, P., Woelfel, J.: Sphinx-4: a flexible open source framework for speech recognition. Technical report, Mountain View, CA, USA (2004)Google Scholar
  6. 6.
    Schnelle, D.: Context Aware Voice User Interfaces for Workflow Support. PhD thesis, TU Darmstadt (2007)Google Scholar
  7. 7.
    Cohen, M.H., Giangola, J.P., Balogh, J.: Voice User Interface Design. Addison-Wesley, Boston (2004)Google Scholar
  8. 8.
    Kaitrungrit, D., Dailey, M.N.: Thai voice application gateway. In: Proceedings of ECTI-CON 2008, pp. 101–104. IEEE, Los Alamitos (2008)Google Scholar
  9. 9.
    Bohus, D., Raux, A., Harris, T.K., Eskenazi, M., Rudnicky, A.I.: Olympus: an open-source framework for conversational spoken language interface research. In: NAACL-HLT 2007: Proceedings of the Workshop on Bridging the Gap, pp. 32–39. Association for Computational Linguistics, Morristown (2007)Google Scholar
  10. 10.
    Turunen, M., Hakulinen, J.: Jaspis – a framework for multilingual adaptive speech applications. In: Proceedings of the 6th International Conference on Spoken Language Processing, Beijing (2000)Google Scholar
  11. 11.
    Nöth, E., Horndasch, A., Gallwitz, F., Haas, J.: Experiences with Commercial Telephone-based Dialogue Systems (Erfahrungen mit kommerziellen Telefon-Sprachdialogsystemen). It - Information Technology 46(6), 315–321 (2004)CrossRefGoogle Scholar
  12. 12.
    Nuno, J.N., Neto, J.P., Mamede, N.J., Cassaca, R., Oliveira, L.C.: The Development Of A Multi-Purpose Spoken Dialogue System. In: Proceedings of EUROSPEECH (2003)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2011

Authors and Affiliations

  • Dmytro Prylipko
    • 1
  • Dirk Schnelle-Walka
    • 2
  • Spencer Lord
    • 3
  • Andreas Wendemuth
    • 1
  1. 1.Chair of Cognitive SystemsOtto-von-Guericke University MagdeburgMagdeburgGermany
  2. 2.Telecooperation LabDarmstadt University of TechnologyDarmstadtGermany
  3. 3.Spokentech, Inc.San FranciscoUSA

Personalised recommendations