Visual Focus of Attention Recognition in the Ambient Kitchen

  • Ligeng Dong
  • Huijun Di
  • Linmi Tao
  • Guangyou Xu
  • Patrick Oliver
Part of the Lecture Notes in Computer Science book series (LNCS, volume 5996)


This paper presents a model for visual focus of attention recognition in the Ambient Kitchen, a pervasive computing prototyping environment. The kitchen is equipped with several blended displays on one wall and users may use information presented on these displays from multiple locations. Our goal is to recognize which display the user is looking at so that the environment can adjust the display content accordingly. We propose a dynamic Bayesian network model to infer the focus of attention, which models the relation between multiple foci of attention, multiple user locations and faces captured by the multiple cameras in the environment. Head pose is not explicitly computed but measured by a similarity vector which represents the likelihoods of multiple face clusters. Video data are collected in the Ambient Kitchen environment and experimental results demonstrate the effectiveness of our model.


Face Image Video Data Hide Variable Pervasive Computing Dynamic Bayesian Network 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Roda, C., Thomas, J.: Attention aware systems: Theories, applications, and research agenda. Computers in Human Behavior 22, 557–587 (2006)CrossRefGoogle Scholar
  2. 2.
    Langton, S.R.H., Watt, R.J., Bruce, V.: Do the Eyes Have it? Cues to the Direction of Social Attention. Trends in Cognitive Sciences 4(2), 50–58 (2000)CrossRefGoogle Scholar
  3. 3.
    Stiefelhagen, R.: Tracking Focus of Attention in Meetings. In: Proc. Fourth IEEE Conf. Multimodal Interfaces (2002)Google Scholar
  4. 4.
    Voit, M., Stiefelhagen, R.: Deducing the Visual Focus of Attention from Head Pose Estimation in Dynamic Multi-view Meeting Scenarios. In: ACM and IEEE International Conference on Multimodal Interfaces (ICMI 2008), Chania, Crete, Greece, October 20-22 (2008)Google Scholar
  5. 5.
    Ba, S.O., Odobez, J.M.: A Study on Visual Focus of Attention Recognition from Head Pose in a Meeting Room. In: Renals, S., Bengio, S., Fiscus, J.G. (eds.) MLMI 2006. LNCS, vol. 4299, pp. 75–87. Springer, Heidelberg (2006)CrossRefGoogle Scholar
  6. 6.
    Otsuka, K., Sawada, H., Yamato, J.: Automatic Inference of Cross-modal Nonverbal Interactions in Multiparty Conversations. In: Proc. of ACM 9th Int. Conf. Multimodal Interfaces (ICMI 2007), Nagoya, Japan, November 2007, pp. 255–262 (2007)Google Scholar
  7. 7.
    Smith, K., Ba, S.O., Perez, D.G., Odobez, J.M.: Tracking the multi person wandering visual focus of attention. In: Proceedings of the 8th international conference on Multimodal interfaces, Banff, Alberta, Canada, November 2-4 (2006)Google Scholar
  8. 8.
    Zhang, H., Toth, L., Deng, W., Guo, J., Yang, J.: Monitoring Visual Focus of Attention via Local Discriminant Projection. In: Proceedings of ACM International Conference on Multimedia Information Retrieval (2008)Google Scholar
  9. 9.
    Viola, P., Jones, M.: Rapid Object Detection Using a Boosted Cascade of Simple Features. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, pp. 511–518 (2001)Google Scholar
  10. 10.
    Jones, M., Viola, P.: Fast multi-view face detection. Technical Report TR2003-96, MERL (June 2003)Google Scholar
  11. 11.
    Ross, D., Lim, J., Lin, R.-S., Yang, M.-H.: Incremental Learning for Robust Visual Tracking. International Journal of Computer Vision (2007)Google Scholar
  12. 12.
    Frey, B., Jojic, N.: A comparison of algorithms for inference and learning in probabilistic graphical models. IEEE Trans. Pattern Analysis and Machine Intelligence 27(9), 1–25 (2005)CrossRefGoogle Scholar
  13. 13.
    Olivier, P., Monk, A., Xu, G., Hoey, J.: Ambient Kitchen: designing situated services using a high fidelity prototyping environment. In: Proceedings of 2nd International Conference on Pervasive Technologies Related to Assistive Environments, Workshop on Affect and Behaviour Related Assistance in Support for the Elderly, Corfu, Greece (June 2009)Google Scholar
  14. 14.
    Pham, C., Olivier, P.: Slice and Dice: Recognizing food preparation activities using embedded accelerometers. In: Proceedings of the 3rd European Conference on Ambient Intelligence (AmI 2009), Salzburg, Austria (2009)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2010

Authors and Affiliations

  • Ligeng Dong
    • 1
  • Huijun Di
    • 1
  • Linmi Tao
    • 1
  • Guangyou Xu
    • 1
  • Patrick Oliver
    • 2
  1. 1.Key Lab on Pervasive Computing, Department of Computer Science and TechnologyTsinghua UniversityBeijingChina
  2. 2.Culture Lab, Computing ScienceNewcastle UniversityNewcastle Upon TyneUK

Personalised recommendations