Model-Based, Multimodal Interaction in Document Browsing

Eslambolchilar, Parisa; Murray-Smith, Roderick

doi:10.1007/11965152_1

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 4299)

Included in the following conference series:

International Workshop on Machine Learning for Multimodal Interaction

Abstract

In this paper we introduce a dynamic systems approach to the design of multimodal interactive systems. As an example, we support human behavior in browsing a document by adapting the dynamics of navigation and the visual feedback, using a focus-in-context (F+C) method, to the currently inferred task. We also demonstrate non-speech audio feedback based on a language model. We argue that designing interaction requires models of key aspects of the process; here, for example, we need models for the dynamic system, the language model, and the sonification. We show how the user’s intention is coupled to the visualization technique via the dynamic model, and how the focus-in-context method couples details in context to audio samples via the language identification system. We present probabilistic audio feedback as an example of a multimodal approach to sensing different languages in a multilingual text. This general approach is well suited to mobile and wearable applications, and to shared displays.
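To make the idea of probabilistic language identification more concrete, the following is a minimal sketch, not the authors' implementation. It assumes a character-bigram frequency model with add-one smoothing and a uniform prior over languages, and it treats overlapping bigrams as independent. The function names, the tiny training strings, and the `alphabet_size` parameter are all hypothetical choices made for illustration; the abstract does not specify the language model used.

```python
import math
from collections import Counter


def train_bigram_counts(text):
    """Count character bigrams in a training string (illustrative model only)."""
    text = text.lower()
    return Counter(text[i:i + 2] for i in range(len(text) - 1))


def log_likelihood(window, counts, alphabet_size=256):
    """Add-one smoothed log-probability of the window's bigrams under one model.

    Assumption: bigrams are treated as independent draws, and smoothing is
    spread over alphabet_size ** 2 possible character pairs.
    """
    window = window.lower()
    denom = sum(counts.values()) + alphabet_size ** 2
    return sum(
        math.log((counts[window[i:i + 2]] + 1) / denom)
        for i in range(len(window) - 1)
    )


def language_posterior(window, models):
    """Posterior over languages for the text in focus, assuming a uniform prior."""
    logs = {lang: log_likelihood(window, c) for lang, c in models.items()}
    peak = max(logs.values())  # subtract the maximum for numerical stability
    weights = {lang: math.exp(v - peak) for lang, v in logs.items()}
    norm = sum(weights.values())
    return {lang: w / norm for lang, w in weights.items()}


# Hypothetical toy training data; a real system would use large corpora.
models = {
    "english": train_bigram_counts(
        "the quick brown fox jumps over the lazy dog and then the other thing"
    ),
    "french": train_bigram_counts(
        "le renard brun rapide saute par dessus le chien paresseux "
        "et puis les autres choses"
    ),
}

posterior = language_posterior("the other dog", models)
for lang, p in sorted(posterior.items()):
    print(f"{lang}: {p:.4f}")
```

In a setup like the one the abstract describes, such a posterior could be recomputed as the focus region moves through the document, with each language's probability weighting the audio feedback associated with it. That mapping from probabilities to sound is left out here because the abstract gives no details of it.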

Author information

Authors and Affiliations

Hamilton Institute, National University of Ireland, Maynooth, Co. Kildare, Ireland
Parisa Eslambolchilar & Roderick Murray-Smith
Department of Computing Science, Glasgow University, Glasgow, Scotland
Roderick Murray-Smith

Editor information

Editors and Affiliations

University of Edinburgh, Edinburgh, Scotland
Steve Renals
IDIAP Research Institute, Martigny, Switzerland
Samy Bengio
National Institute of Standards and Technology, 100 Bureau Drive Stop 8940, Gaithersburg, MD, 20899
Jonathan G. Fiscus

About this paper

Cite this paper

Eslambolchilar, P., Murray-Smith, R. (2006). Model-Based, Multimodal Interaction in Document Browsing. In: Renals, S., Bengio, S., Fiscus, J.G. (eds) Machine Learning for Multimodal Interaction. MLMI 2006. Lecture Notes in Computer Science, vol 4299. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11965152_1

DOI: https://doi.org/10.1007/11965152_1
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-69267-6
Online ISBN: 978-3-540-69268-3
eBook Packages: Computer Science, Computer Science (R0)