Visual Focus of Attention Recognition in the Ambient Kitchen
This paper presents a model for visual focus of attention recognition in the Ambient Kitchen, a pervasive computing prototyping environment. The kitchen is equipped with several blended displays on one wall, and users may use information presented on these displays from multiple locations. Our goal is to recognize which display the user is looking at so that the environment can adjust the display content accordingly. We propose a dynamic Bayesian network that infers the focus of attention by modeling the relation between multiple foci of attention, multiple user locations, and the faces captured by the multiple cameras in the environment. Head pose is not computed explicitly; instead, it is represented by a similarity vector that gives the likelihoods of multiple face clusters. Video data were collected in the Ambient Kitchen environment, and experimental results demonstrate the effectiveness of our model.
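To make the inference idea concrete, the following is a minimal sketch of forward filtering over focus targets. It is not the paper's model: the abstract does not specify the network structure or its parameters, so everything here is an assumption for illustration. It assumes a first-order chain over the focus target, a user location that is known at each frame, and an emission table indexed by location and focus that weights the face-cluster similarity vector. The names `forward_filter`, `transition`, `emission`, and `prior` are hypothetical.

```python
import numpy as np

def forward_filter(similarities, locations, transition, emission, prior):
    """Return the filtered posterior over focus targets at each frame.

    All parameters are illustrative assumptions, not taken from the paper.
    similarities: (T, K) per-frame likelihoods of K face clusters
    locations:    (T,) user location index per frame (assumed known)
    transition:   (F, F) transition[i, j] = P(focus_t = j | focus_{t-1} = i)
    emission:     (L, F, K) emission[l, f, k] = P(cluster k | location l, focus f)
    prior:        (F,) P(focus at the first frame)
    """
    num_frames = similarities.shape[0]
    posterior = np.zeros((num_frames, prior.shape[0]))
    belief = prior.astype(float)
    for t in range(num_frames):
        if t > 0:
            # Predict step: propagate the previous belief through the transitions.
            belief = belief @ transition
        # Score each focus target: sum over clusters of
        # P(cluster | location, focus) * similarity of the current face to that cluster.
        likelihood = emission[locations[t]] @ similarities[t]
        # Update step: weight by the observation score and renormalize.
        belief = belief * likelihood
        belief = belief / belief.sum()
        posterior[t] = belief
    return posterior
```

Taking the argmax of each row of the returned array gives a per-frame estimate of the attended display. In this sketch the similarity vector enters the update directly, so no head pose angle is ever estimated, which mirrors the abstract's description only at that level.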