Multimodal Web-based Dialogue: XHTML + Voice and SALT

  • Michael F. McTear


VoiceXML was designed with telephone-based interaction in mind. VoiceXML supports recognition of spoken and Dual Tone Multiple Frequency (DTMF) input, output of synthesised and recorded prompts, and interaction with back-end applications on the web. An extension to VoiceXML supports multimodal interaction with web-based applications on devices such as advanced mobile (cellular) phones and Personal Digital Assistants (PDAs). This extension,known as XHTML + Voice, allows developers to speech-enable visual web pages by incorporating and linking VoiceXML code with the visual markup code written in XHTML. An alternative to XHTML + Voice is SALT (Speech Application Language Tags). SALT is a set of extensions that add speech recognition, speech synthesis, and telephony capabilities to existing markup languages, such as HTML and XHTML, to enable multimodal and telephony access to web-based applications from PCs, telephones, tablet PCs and wireless PDAs.


Speech Recognition Multimodal Interface SALT Application Speech Application Speech Control 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 6.
    The assistance of Jim Larson in providing the correct code for namespace declarations and references to the Speech Add-in is gratefully acknowledged.Google Scholar

Copyright information

© Springer-Verlag London 2004

Authors and Affiliations

  • Michael F. McTear
    • 1
  1. 1.School of Computing and MathematicsUniversity of UlsterNewtownabbeyNorthern Ireland

Personalised recommendations