Key Object Driven Multi-category Object Recognition, Localization and Tracking Using Spatio-temporal Context

  • Yuan Li
  • Ram Nevatia
Conference paper

DOI: 10.1007/978-3-540-88693-8_30

Volume 5305 of the book series Lecture Notes in Computer Science (LNCS)
Cite this paper as:
Li Y., Nevatia R. (2008) Key Object Driven Multi-category Object Recognition, Localization and Tracking Using Spatio-temporal Context. In: Forsyth D., Torr P., Zisserman A. (eds) Computer Vision – ECCV 2008. ECCV 2008. Lecture Notes in Computer Science, vol 5305. Springer, Berlin, Heidelberg

Abstract

In this paper we address the problem of recognizing, localizing and tracking multiple objects of different categories in meeting room videos. Difficulties such as lack of detail and multi-object co-occurrence make it hard to directly apply traditional object recognition methods. Under such circumstances, we show that incorporating object-level spatio-temporal relationships can lead to significant improvements in inference of object category and state. Contextual relationships are modeled by a dynamic Markov random field, in which recognition, localization and tracking are done simultaneously. Further, we define human as the key object of the scene, which can be detected relatively robustly and therefore is used to guide the inference of other objects. Experiments are done on the CHIL meeting video corpus. Performance is evaluated in terms of object detection and false alarm rates, object recognition confusion matrix and pixel-level accuracy of object segmentation.

Copyright information

© Springer-Verlag Berlin Heidelberg 2008

Authors and Affiliations

  • Yuan Li
    • 1
  • Ram Nevatia
    • 1
  1. 1.Institute for Robotics and Intelligent SystemsUniversity of Southern CaliforniaLos AngelesUSA