Personal and Ubiquitous Computing

, Volume 14, Issue 6, pp 527–539 | Cite as

Design and implementation of a VoiceXML-driven wiki application for assistive environments on the web

  • Constantinos Kolias
  • Vassilis Kolias
  • Ioannis Anagnostopoulos
  • Georgios Kambourakis
  • Eleftherios Kayafas
Original Article

Abstract

In this paper, we describe the design and implementation of an audio wiki application accessible via both the Public Switched Telephone Network and the Internet. The application exploits mature World Wide Web Consortium standards, such as VoiceXML, Speech Synthesis Markup Language, and Speech Recognition Grammar Specification toward achieving our goals. The purpose of such an application is to assist visually impaired, technologically uneducated, and underprivileged people in accessing information originally intended to be accessed visually via a personal computer (PC). Users may access wiki content via fixed or mobile phones, or via a PC using a Web Browser or a Voice over IP service. This feature promotes pervasiveness to collaboratively created content to an extremely large population, i.e., those who simply own a telephone line.

Keywords

VoiceXML Wiki applications Pervasive technologies Assistive environments Collaborative applications 

Abbreviations

ABNF

Augmented Backus–Naur Form

ASR

Automatic Speech Recognition

CGI

Common Gateway Interface

DTB

Digital Talking Book

DTMF

Dual-Tone Multi-Frequency

GrXML

Grammar XML

HTML

Hypertext Markup Language

IT

Information Technology

JSGF

Java Speech Grammar Format

JSML

Java Speech API Markup Language

PC

Personal Computer

PSTN

Public Switched Telephone Network

SIP

Session Initiation Protocol

SMS

Short Message Service

SRGS

Speech Recognition Grammar Specification

SSML

Speech Synthesis Markup Language

TTS

Text to Speech

URI

Uniform Resource Identifier

VoiceXML

Voice eXtensible Markup Language

VoIP

Voice over IP

WML

Wireless Markup Language

W3C

World Wide Web Consortium

XHTML

Extensible Hyper Text Markup Language

XML

eXtensible Markup Language

References

  1. 1.
  2. 2.
    Kolias C, Demertzis S, Kambourakis G (2008) Design and implementation of a secure mobile wiki system. In: Uskov V (ed) 7th IASTED international conference on web-based education (WBE 2008). March 2008, Innsbruck, Austria, pp 212–217Google Scholar
  3. 3.
  4. 4.
  5. 5.
    Voice Extensible Markup Language 2.0 Specification. http://www.w3.org/TR/voicexml20/. Retrieved on 30 Dec 2008
  6. 6.
    Leinonen T, Aucamp FN, Sari ER (2006) Audio Wiki for mobile communities: information system for the rest of Us. In: Workshop on speech in mobile and pervasive environments, Mobile HCI 06 conference, 12 Sept 2006, pp 3Google Scholar
  7. 7.
    MediaWiki. http://www.mediawiki.org/wiki/MediaWiki. Retrieved on 5 May 2008
  8. 8.
    Werner S, Wolff M, Eichner M, Hoffmann R (2004) Integrating speech enabled services in a web-based e-learning environment. In: Proceedings of international conference on information technology: coding and computing, vol 2. ITCC 2004, 5–7 April 2004, pp 303–307Google Scholar
  9. 9.
    Wang L, Roe P, Pham B, Tjondronegoro D (2008) An audio wiki supporting mobile collaboration. In: Proceedings of the 2008 ACM symposium on applied computing (Fortaleza, Ceara, Brazil, 16–20 March 2008). SAC ‘08. ACM, New York, NY, pp 1889–1896Google Scholar
  10. 10.
    Borodin Y, Mahmud J, Ramakrishman IV, Stent A (2007) The hearsay non-visual web browser. In: ACM international conference proceeding series, proceedings of the 2007 international cross-disciplinary conference on web accessibility (W4A), vol 225. Banff, Canada, pp 128–129Google Scholar
  11. 11.
    The CMU Sphinx Group Open Source Speech Recognition Engines. http://cmusphinx.sourceforge.net/html/cmusphinx.php. Retrieved on 30 Dec 2008
  12. 12.
    The DAISY Consortium. http://www.daisy.org/. Retrieved on 30 Dec 2008
  13. 13.
    Speech Recognition Grammar 1.0 Specification. http://www.w3.org/TR/speech-grammar/. Retrieved on 30 Dec 2008
  14. 14.
    Speech Synthesis Markup Language. http://www.w3.org/TR/speech-synthesis/. Retrieved on 30 Dec 2008
  15. 15.
    JSpeech Grammar Format. http://www.w3.org/TR/2000/NOTE-jsgf-20000605/. Retrieved on 30 Dec 2008
  16. 16.
    JSpeech Markup Language. http://www.w3.org/TR/jsml/. Retrieved on 30 Dec 2008
  17. 17.
    Session Initiation Protocol. http://www.cs.columbia.edu/sip/drafts.html. Retrieved on 30 Dec 2008
  18. 18.
    MS SQL Server 2005. http://www.microsoft.com/Sqlserver/2005/en/us/express.aspx. Retrieved on 30 Dec 2008
  19. 19.
    Vocalocity’s openVXI 3.0. http://www.speech.cs.cmu.edu/openvxi/. Retrieved on 30 Dec 2008
  20. 20.
    The Festival Speech Synthesis System. http://www.cstr.ed.ac.uk/projects/festival/. Retrieved on 30 Dec 2008
  21. 21.
    Ding L (2009) Learn about VoIP quality measurements. http://www.embeddeddesignindia.co.in/STATIC/PDF/200903. EE Times-India, white paper Retrieved on 4 May
  22. 22.
    ITU-T Rec G.107 (2005) The E-Model, a computational model for use in transmission planning. March 2005Google Scholar
  23. 23.
    Spirent Communications (2007) Measuring jitter accurately. http://www.spirent.com/documents/4814.pdf, white paper Retrieved on 4 May 2009
  24. 24.
    WikyBlog. http://www.wikyblog.com/. Retrieved on 20 July 2009
  25. 25.
    Twitter. http://twitter.com/. Retrieved on 20 July 2009

Copyright information

© Springer-Verlag London Limited 2010

Authors and Affiliations

  • Constantinos Kolias
    • 1
  • Vassilis Kolias
    • 2
  • Ioannis Anagnostopoulos
    • 1
  • Georgios Kambourakis
    • 1
  • Eleftherios Kayafas
    • 2
  1. 1.University of the AegeanSamosGreece
  2. 2.National Technical University of AthensZografouGreece

Personalised recommendations