A Path to Multimodal Data Services for Telecommunications

  • Georg Niklfeld
  • Michael Pucher
  • Robert Finan
  • Wolfgang Eckhart
Part of the Text, Speech and Language Technology book series (TLTB, volume 28)

Abstract

This chapter investigates some issues faced in developing multimodal data services for public mobile telecommunications. It discusses applications, standards, mobile devices, and existing R&D efforts. Three demonstrators developed by the authors are presented, including QuickMap, a map finder based on GPRS and WAP-Push. Findings are summarised in the description of a path that will lead to successful multimodal data services in mobile telecommunications.

Keywords

Mobile data services Multimodality Telecommunications Standards Systems architecture 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Almeida, L., Amdal, I., Beires, N., Boualem, M., Boves, L., den Os, E., Filoche, P., Gomes, R., Knudsen, J. E., Kvale, K., Rugelbak, J., Tallec, C., and Warakagoda, N. (2002). The MUST guide to Paris; implementation and expert evaluation of a multimodal tourist guide to Paris. In Proceedings of ISCA Tutorial and Research Workshop Multimodal Dialogue in Mobile Environments (IDS02), Kloster Irsee, Germany.Google Scholar
  2. Azzini, I., Giorgino, T., Nardelli, L., Orlando, M., and Rognoni, C. (2002). An architecture for a multi-modal web browser. In Proceedings of ISCA Tutorial and Research Workshop Multimodal Dialogue in Mobile Environments (IDS02), Kloster Irsee, Germany.Google Scholar
  3. Bohus, D. and Rudnicky, A. (2004). LARRI: A language-based maintenance and repair assistant. In Minker, W., Bühler, D., and Dybkjær, L., editors, Spoken Multimodal Human-Computer Dialogue in Mobile Environments. Kluwer Academic Publishers, Dordrecht, The Netherlands. (this volume).Google Scholar
  4. Bühler, D., Minker, W., Häussler, J., and Krüger, S. (2002). The SmartKom Mobile multi-modal dialogue system. In Proceedings of ISCA Tutorial and A Path to Multimodal Data Services for Telecommunications Research Workshop Multimodal Dialogue in Mobile Environments (IDS02), pages 66–70, Kloster Irsee, Germany.Google Scholar
  5. Cheyer, A. and Martin, D. (2001). The Open Agent Architecture. Journal of Autonomous Agents and Multi-Agent Systems, 4(1/2):143–148.CrossRefGoogle Scholar
  6. Cohen, P., Johnston, M., McGee, D., Oviatt, S., Pittman, J., Smith, I., Chen, L., and Clow, J. (1997). QuickSet: Multimodal interaction for distributed applications. In Proceedings of ACM International Conference on Multimedia, pages 31–40, Seattle, Washington, USA.Google Scholar
  7. Cohen, P. and Oviatt, S. (1995). The role of voice input for human-machine communication. In Proceedings of National Academy of Sciences, number 22, pages 9921–9927.CrossRefGoogle Scholar
  8. Doherty, P., Granlund, G., Kuchcinski, K., Sandewall, E., Nordberg, K., Skarman, E., and Wiklund, J. (2000). The WITAS unmanned aerial vehicle project. In Proceedings of 14th European Conference on Artificial Intelligence (ECAI), pages 747–755, Berlin, Germany.Google Scholar
  9. Elting, C. and Michelitsch, G. (2001). A multimodal presentation planner for a home entertainment environment. In Proceedings of Workshop on Perceptive User Interfaces (PUI), Lake Buena Vista, Florida, USA.Google Scholar
  10. Goldschen, A. and Loehr, D. (1999). The role of the DARPA communicator architecture as a human computer interface for distributed simulations. In Proceedings of Spring Simulation Interoperability Workshop, Orlando, Florida, USA. Simulation Interoperability Standards Organization.Google Scholar
  11. Herfet, T. and Kirste, T. (2001). EMBASSI — multimodal assistance for infotainment & service infrastructures. In Proceedings of Statustagung der Leitprojekte Mensch-Technik-Interaktion, pages 35–44, Saarbrficken, Germany.Google Scholar
  12. Hoellerer, S. (2002). Challenges and important aspects in planning and performing evaluation studies for multimodal dialogue systems. In Proceedings of ELSNET Workshop Towards a Roadmap for Multimodal Language Resources and Evaluation at LREC 2002, Las Palmas, Gran Canaria, Spain.Google Scholar
  13. Huang, X., Acero, A., Chelba, C., Deng, L., Duchene, D., Goodman, J., Hon, H., Jacoby, D., Jiang, L., Loynd, R., Mahajan, M., Mau, P., Meredith, S., Mughal, S., Neto, S., Plumpe, M., Wang, K., and Wang, Y. (2000). MIPAD: A next generation PDA prototype. In Proceedings of International Conference on Spoken Language Processing (ICSLP), pages 33–36, Beijing, China.Google Scholar
  14. Ishii, H. (2002). Tangible bits: Designing the seamless interface between people, bits and atoms. Keynote speech at Fourth IEEE International Conference on Multimodal Interfaces (ICMI'02). Pittsburgh, Pennsylvania, USA.Google Scholar
  15. Johnston, M., Bangalore, S., Stent, A., Vasireddy, G., and Ehlen, P. (2002). Multimodal language processing for mobile information access. In Proceedings of International Conference on Spoken Language Processing (ICSLP), pages 2237–2241, Denver, Colorado, USA.Google Scholar
  16. Kleindienst, J., Seredi, L., Kapanen, P., and Bergman, J. (2002). CATCH-2004 multi-modal browser: Overview description with usability analysis. In Proceedings of IEEE International Conference on Multimodal Interfaces (ICMI), pages 442–447, Pittsburgh, Pennsylvania, USA.Google Scholar
  17. Kumar, S., Cohen, P., and Levesque, H. (2000). The Adaptive Agent Architecture: Achieving fault-tolerance using persistent broker teams. In Proceedings of International Conference on Multi-Agent Systems (ICMAS), pages 159–166, Boston, Massachusetts, USA.Google Scholar
  18. Lemon, O., Bracy, A., Gruenstein, A., and Peters, S. (2001). The WITAS multi-modal dialogue system I. In Proceedings of European Conference on Speech Communication and Technology (EUROSPEECH), pages 1559–1562, Aalborg, Denmark.Google Scholar
  19. Maybury, M. T. (2002). Multimodal systems, resources, and evaluation. In Proceedings of International Conference on Language Resources and Evaluation (LREC), pages g–n, Las Palmas, Gran Canaria, Spain.Google Scholar
  20. Nass, C. (2002). Integrating multiple modalities: Psychology and design of multimodal interfaces. Keynote speech at Fourth IEEE International Conference on Multimodal Interfaces (ICMI'02). Pittsburgh, Pennsylvania, USA.Google Scholar
  21. Niklfeld, G., Finan, R., and Pucher, M. (2001a). Architecture for adaptive multimodal dialog systems based on VoiceXML. In Proceedings of European Conference on Speech Communication and Technology (EUROSPEECH), pages 2341–2344, Aalborg, Denmark.Google Scholar
  22. Niklfeld, G., Finan, R., and Pucher, M. (2001b). Multimodal interface architecture for mobile data services. In Proceedings of TCMC Workshop on Wearable Computing, Graz, Austria.Google Scholar
  23. Oviatt, S. (2000). Multimodal system processing in mobile environments. In Proceedings of Annual ACM Symposium on User Interface Software and Technology, pages 21–30, San Diego, California, USA.Google Scholar
  24. Oviatt, S., Cohen, P., Wu, L., Vergo, J., Duncan, L., Suhm, B., Bers, J., Holzman, T., Winograd, T., Landay, J., Larson, J., and Ferro, D. (2000). Designing the user interface for multimodal speech and pen-based gesture applications: State-of-the-art systems and future research directions. Human Computer Interaction, 15:263–322.CrossRefGoogle Scholar
  25. Oviatt, S., Stevens, C., Coulston, R., Xiao, B., Wesson, M., Girand, C., and Mellander, E. (2002). Towards adaptive conversational interfaces: Modeling speech convergence with animated personas. In Proceedings of ISCA Tutorial and Research Workshop Multimodal Dialogue in Mobile Environments (IDS02), Kloster Irsee, Germany.Google Scholar
  26. Pearce, D. and Kopp, D. (2001). ETSI STQ Aurora presentation to 3GPP. Slide presentation.Google Scholar
  27. Pieraccini, R., Carpenter, B., Woudenberg, E., Caskey, S., Springer, S., Bloom, J., and Phillips, M. (2002). Multi-modal spoken dialog with wireless devices. In Proceedings of ISCA Tutorial and Research Workshop Multimodal Dialogue in Mobile Environments (IDS02), Kloster Irsee, Germany.Google Scholar
  28. Pospischil, G., Umlauft, M., and Michlmayr, E. (2002). Designing Lol@, a mobile tourist guide for UMTS. In Proceedings of International Symposium on Human Computer Interaction with Mobile Devices (Mobile HCI), pages 140–154, Pisa, Italy.Google Scholar
  29. Rössler, H., Sienel, J., Wajda, W., Hoffmann, J., and Kostrzewa, M. (2001). Multimodal interaction for mobile environments. In Proceedings of International Workshop on Information Presentation and Natural Multimodal Dialogue, pages 47–51, Verona, Italy.Google Scholar
  30. Seneff, S., Hurley, E., Lau, R., Pao, C., Schmid, P., and Zue, V. (1998). Galaxy-II: a reference architecture for conversational system development. In Proceedings of International Conference on Spoken Language Processing (IC-SLP), pages 931–934, Sydney, Australia.Google Scholar
  31. Sturm, J., Bakx, I., Cranen, B., Terken, J., and Wang, F. (2002a). Usability evaluation of a Dutch multimodal system for railway information. In Proceedings of International Conference on Language Resources and Evaluation (LREC), pages 255–261, Las Palmas, Gran Canaria, Spain.Google Scholar
  32. Sturm, J., Cranen, B., Wang, F., Terken, J., and Bakx, I. (2002b). The effect of user experience on interaction with multimodal systems. In Proceedings of ISCA Tutorial and Research Workshop Multimodal Dialogue in Mobile Environments (IDS02), Kloster Irsee, Germany.Google Scholar
  33. Wahlster, W., Reithinger, N., and Blocher, A. (2001). SmartKom: Multimodal communication with a life-like character. In Proceedings of European Conference on Speech Communication and Technology (EUROSPEECH), pages 1547–1550, Aalborg, Denmark.Google Scholar

Copyright information

© Springer 2005

Authors and Affiliations

  • Georg Niklfeld
    • 1
  • Michael Pucher
    • 1
  • Robert Finan
    • 2
  • Wolfgang Eckhart
    • 3
  1. 1.ftw. Forschungszentrum Telekommunikation WienViennaAustria
  2. 2.Mobilkom Austria AG & Co KGViennaAustria
  3. 3.Sonorys Technology GmbHKorneuburgAustria

Personalised recommendations