Abstract
Speech-based technology is a useful alternative to traditional input techniques such as the keyboard and mouse. For people with disabilities that hinder use of traditional input devices, a hands-free speechbased interaction solution is highly desirable. Various speech-based navigation techniques have been discussed in the literature and employed in commercial software applications. Among them, grid-based navigation has shown both potential and limitations. Grid-based solutions allow users to position the cursor using recursive grids to ‘drill down’ until the cursor is in the desired location. We report the results of an empirical study that assessed the efficacy of two enhancements to the grid-based navigation technique: magnification and fine-tuning. Both mechanisms were designed to facilitate the process of selecting small targets. The results suggest that both the magnification and the fine-tuning capabilities significantly improved the participants’ performance when selecting small targets and that fine-tuning also has benefits when selecting larger targets. Participants preferred the solution that provided both enhancements.
Keywords
References
VistaTM Speech Recognition, http://en.wikipedia.org/wiki/Windows_Speech_Recognition
Oviatt, S.L.: Multimodal interactive maps: Designing for human performance. In: Human-Computer Interaction, vol. 12, pp. 93–129. Springer, Heidelberg (1997)
Feng, J., Sears, A.: Using confidence scores to improve hands-free speech based navigation in continuous dictation systems. ACM Trans. Comput.-Hum. Interact. 11(4), 329–356 (2004)
Lai, J., Yankelovich, N.: Conversational speech interfaces. In: Jacko, J.A., Sears, A. (eds.) The Human-Computer interaction Handbook: Fundamentals, Evolving Technologies and Emerging Applications. Human Factors and Ergonomics, pp. 698–713. L. Erlbaum Associates, Hillsdale (2007)
Cohen, M.H., Giangola, J.P., Balogh, J.: Voice User Interface Design. Addison Wesley Longman Publishing Co., Inc., Redwood City (2004)
Oviatt, S.: Multimodal Interfaces. In: Jacko, J.A., Sears, A. (eds.) The Human-Computer interaction Handbook: Fundamentals, Evolving Technologies and Emerging Applications, pp. 413–432. L. Erlbaum Associates, Hillsdale (2007)
Sears, A., Young, M., Feng, J.: Physical Disabilities and Computing Technologies: An Analysis of Impairments. In: Jacko, J.A., Sears, A. (eds.) The Human-Computer interaction Handbook: Fundamentals, Evolving Technologies and Emerging Applications. Human Factors and Ergonomics, pp. 829–852. L. Erlbaum Associates, Hillsdale (2007)
Kamel, H., Landay, J.: The Integrated Communication 2 Draw (IC2D): A drawing program for the visually impaired. In: CHI 1999 (1999)
Dai, L., Goldman, R., Sears, A., Lozier, J.: Speech-based cursor control: a study of grid-based solutions. In: Assets 2004: Proceedings of the 6th international ACM SIGACCESS conference on Computers and accessibility, pp. 94–101. ACM, New York (2004)
Feng, J., Zhu, S., Hu, R., Sears, A.: Speech technology in real world environment: early results from a long term study. In: Assets 2008: Proceedings of the 10th international ACM SIGACCESS conference on Computers and accessibility, pp. 233–234. ACM, New York (2008)
Halverson, C., Horn, D., Karat, C., Karat, J.: The Beauty of Errors: Patterns of Error Correction in Desktop Speech systems. In: Proc. INTERACT 1999, pp. 1–9 (1999)
Karat, C., Vergo, J., Nahamoo, D.: Conversational Interface Technologies. In: Jacko, J., Sears, A. (eds.) The Human-Computer Interaction Handbook, pp. 169–186. LEA, NJ (2003)
Karimullah, A.S., Sears, A.: Speech-based cursor control. In: Proceedings of Assets 2002, pp. 178–185 (2002)
Sears, A., Feng, J., Oseitutu, K., Karat, C.: Hands-free, speech-based navigation during dictation: difficulties, consequences, and solutions. Hum.-Comput. Interact. 18(3), 229–257 (2003)
Kamel, H., Landay, J.: A study of blind drawing practice: Creating graphical information without the visual channel. In: ASSETS 2000 (2000)
Kamel, H., Landay, J.: Sketching images eyes-free: A grid-based dynamic drawing tool for the blind. In: ASSETS 2002 (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 IFIP International Federation for Information Processing
About this paper
Cite this paper
Zhu, S., Ma, Y., Feng, J., Sears, A. (2009). Speech-Based Navigation: Improving Grid-Based Solutions. In: Gross, T., et al. Human-Computer Interaction – INTERACT 2009. INTERACT 2009. Lecture Notes in Computer Science, vol 5726. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-03655-2_6
Download citation
DOI: https://doi.org/10.1007/978-3-642-03655-2_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-03654-5
Online ISBN: 978-3-642-03655-2
eBook Packages: Computer ScienceComputer Science (R0)