Wide Area Tracking in Single and Multiple Views

Abstract

Maintaining the stability of tracks on multiple targets in video over extended time periods and wide areas remains a challenging problem. Basic trackers like the Kalman filter or particle filter deteriorate in performance as the complexity of the scene increases. A few methods have recently shown encouraging results in these application domains. They rely on learning context models, the availability of training data, or modeling the inter-relationships between the tracks. In this chapter, we provide an overview of research in the area of long-term tracking in video. We review some of the methods in the literature and analyze the common sources of errors which cause trackers to fail. We also discuss the limits of performance of the trackers as multiple objects come together to form groups and crowds. On multiple real-life video sequences obtained for a single camera as well as a camera network, we compare the performance of some of the methods.

References

  1. 1.
    Ali, S., Shah, M.: Floor fields for tracking in high density crowd scenes. In: Euro. Conference on Computer Vision (2008) Google Scholar
  2. 2.
    Arulampalam, M.S., Maskell, S., Gordon, N.: A tutorial on particle filters for online nonlinear/non-Gaussian Bayesian tracking. IEEE Trans. Signal Process. 50, 174–188 (2002) CrossRefGoogle Scholar
  3. 3.
    Babenko, B., Yang, M., Belongie, S.: Visual tracking with online multiple instance learning. In: IEEE Conf. on Computer Vision and Pattern Recognition (2009) Google Scholar
  4. 4.
    Bar-Shalom, Y., Fortmann, T.: Tracking and Data Association. Academic Press, San Diego (1988) MATHGoogle Scholar
  5. 5.
    Bose, B., Wang, X., Grimson, E.: Multi-class object tracking algorithm that handles fragmentation and grouping. In: IEEE Conf. on Computer Vision and Pattern Recognition (2007) Google Scholar
  6. 6.
    Denina, G., Bhanu, B., Nguyen, H., Ding, C., Kamal, A., Ravishankar, C., Roy-Chowdhury, A., Ivers, A., Varda, B.: VideoWeb dataset for multi-camera activities and non-verbal communication. In: Bhanu, B., Ravishankar, C., Roy-Chowdhury, A., Aghajan, H., Terzopoulos, D. (eds.) Distributed Video Sensor Networks. Springer, London (2011) Google Scholar
  7. 7.
    Farenzena, M., Bazzani, L., Perina, A., Cristani, M., Murino, V.: Person re-identification by symmetry-driven accumulation of local features. In: IEEE Conf. on Computer Vision and Pattern Recognition (2010) Google Scholar
  8. 8.
    Ge, W., Collins, R.: Multi-target data association by tracklets with unsupervised parameter estimation. In: British Machine Vision Conference (2008) Google Scholar
  9. 9.
    Gheissari, N., Sebastian, T., Hartley, R.: Person re-identification using spatiotemporal appearance. In: IEEE Conf. on Computer Vision and Pattern Recognition (2006) Google Scholar
  10. 10.
    Gray, D., Tao, H.: Viewpoint invariant pedestrian recognition with an ensemble of localized features. In: Euro. Conference on Computer Vision (2008) Google Scholar
  11. 11.
    Hoiem, D., Efros, A., Hebert, M.: Geometric context from a single image. In: IEEE Intl. Conf. on Computer Vision (2005) Google Scholar
  12. 12.
    Hu, M., Ali, S., Shah, M.: Detecting global motion patterns in complex videos. In: Intl. Conf. on Pattern Recognition (2008) Google Scholar
  13. 13.
    Huang, T., Russel, S.: Object identification in a Bayesian context. In: International Joint Conference on Artificial Intelligence (1997) Google Scholar
  14. 14.
    Hue, C., Cadre, J.L., Prez, P.: Sequential Monte Carlo methods for multiple target tracking and data fusion. IEEE Trans. Signal Process. 50(2), 309–325 (2002) CrossRefGoogle Scholar
  15. 15.
    Isard, M., Blake, A.: Condensation – conditional density propagation for visual tracking. Int. J. Comput. Vis. 29(1), 5–28 (1998) CrossRefGoogle Scholar
  16. 16.
    Javed, O., Rasheed, Z., Shafique, K., Shah, M.: Tracking across multiple cameras with disjoint views. In: IEEE Intl. Conf. on Computer Vision (2003) Google Scholar
  17. 17.
    Jiang, H., Fels, S., Little, J.: A linear programming approach for multiple object tracking. In: IEEE Conf. on Computer Vision and Pattern Recognition (2007) Google Scholar
  18. 18.
    Junior, J.C.S.J., Musse, S.R., Jung, C.R.: Crowd analysis using computer vision techniques. IEEE Signal Process. Mag. 27(5), 66–77 (2010) Google Scholar
  19. 19.
    Kalman, R.: A new approach to linear filtering and prediction problems. Trans. ASME – J. Basic Eng. 82, 35–45 (1990) Google Scholar
  20. 20.
    Kang, J., Cohen, I., Medioni, G.: Continuous tracking within and across camera streams. In: IEEE Conf. on Computer Vision and Pattern Recognition (2004) Google Scholar
  21. 21.
    Kettnaker, V., Zabih, R.: Bayesian multi-camera surveillance. In: IEEE Conf. on Computer Vision and Pattern Recognition (1999) Google Scholar
  22. 22.
    Kuo, C.-H., Huang, C., Nevatia, R.: Inter-camera association of multi-target tracks by on-line learned appearance affinity models. In: Euro. Conference on Computer Vision (2010) Google Scholar
  23. 23.
    Leibe, B., Schindler, K., Gool, L.V.: Coupled detection and trajectory estimation for multi-object tracking. In: IEEE Intl. Conf. on Computer Vision (2007) Google Scholar
  24. 24.
    Leoputra, W., Tan, T., Lim, F.L.: Non-overlapping distributed tracking using particle filter. In: Intl. Conf. on Pattern Recognition (2006) Google Scholar
  25. 25.
    Li, Y., Huang, C., Nevatia, R.: Learning to associate: HybridBoosted multi-target tracker for crowded scene. In: IEEE Conf. on Computer Vision and Pattern Recognition (2009) Google Scholar
  26. 26.
    Loy, C., Xiang, T., Gong, S.: Multi-camera activity correlation analysis. In: IEEE Conf. on Computer Vision and Pattern Recognition (2009) Google Scholar
  27. 27.
    Perera, A., Srinivas, C., Hoogs, A., Brooksby, G., Hu, W.: Multi-object tracking through simultaneous long occlusions and split-merge conditions. In: IEEE Conf. on Computer Vision and Pattern Recognition (2006) Google Scholar
  28. 28.
    Prosser, B., Zheng, W., Gong, S., Xiang, T.: Person re-identification by support vector ranking. In: British Machine Vision Conference (2010) Google Scholar
  29. 29.
    Qureshi, F.Z., Terzopoulos, D.: Surveillance in virtual reality: System design and multi-camera control. In: IEEE Conf. on Computer Vision and Pattern Recognition (2007) Google Scholar
  30. 30.
    Rahimi, A., Darrell, T.: Simultaneous calibration and tracking with a network of non-overlapping sensors. In: IEEE Conf. on Computer Vision and Pattern Recognition (2004) Google Scholar
  31. 31.
    Reid, D.: An algorithm for tracking multiple targets. IEEE Trans. Autom. Control 24(6), 843–854 (1979) CrossRefGoogle Scholar
  32. 32.
    Rodriguez, M., Ali, S., Kanade, T.: Tracking in unstructured crowded scenes. In: IEEE Intl. Conf. on Computer Vision (2009) Google Scholar
  33. 33.
    Sethi, R., Roy-Chowdhury, A.: Modeling and recognition of complex multi-person interactions in video. In: ACM Intl. Workshop on Multimodal Pervasive Video Analysis (2010) Google Scholar
  34. 34.
    Shafique, K., Shah, M.: A non-iterative greedy algorithm for multi-frame point correspondence. IEEE Trans. Pattern Anal. Mach. Intell. 27(1), 51–65 (2005) CrossRefGoogle Scholar
  35. 35.
    Song, B., Jeng, T., Staudt, E., Roy-Chowdhury, A.: A stochastic graph evolution framework for robust multi-target tracking. In: Euro. Conference on Computer Vision (2010) Google Scholar
  36. 36.
    Song, B., Kamal, A., Soto, C., Ding, C., Farrell, J., Roy-Chowdhury, A.: Tracking and activity recognition through consensus in distributed camera networks. IEEE Trans. Image Process. 19(10), 2564–2579 (2010) MathSciNetCrossRefGoogle Scholar
  37. 37.
    Song, B., Roy-Chowdhury, A.: Stochastic adaptive tracking in a camera network. In: IEEE Intl. Conf. on Computer Vision (2007) Google Scholar
  38. 38.
    Song, B., Roy-Chowdhury, A.: Robust tracking in a camera network: A multi-objective optimization framework. IEEE J. Sel. Topics Signal Process. (Special Issue on Distributed Processing in Vision Networks) 2(4), 582–596 (2008) CrossRefGoogle Scholar
  39. 39.
    Soto, C., Song, B., Roy-Chowdhury, A.: Distributed multi-target tracking in a self-configuring camera network. In: IEEE Conf. on Computer Vision and Pattern Recognition (2009) Google Scholar
  40. 40.
    Welch, G., Bishop, G.: An Introduction to the Kalman Filter. Technical report, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA (1995) Google Scholar
  41. 41.
    Xiao, J., Cheng, H., Sawhney, H.S., Han, F.: Vehicle detection and tracking in wide field-of-view aerial video. In: IEEE Conf. on Computer Vision and Pattern Recognition (2010) Google Scholar
  42. 42.
    Xing, J., Ai, H., Lao, S.: Multi-object tracking through occlusion by local tracklets filtering and global tracklets association with detection responses. In: IEEE Conf. on Computer Vision and Pattern Recognition (2009) Google Scholar
  43. 43.
    Yang, M., Wu, Y., Hua, G.: Context-aware visual tracking. IEEE Trans. Pattern Anal. Mach. Intell. 31(7), 1195–1209 (2009) CrossRefGoogle Scholar
  44. 44.
    Yu, Q., Medioni, G., Cohen, I.: Multiple target tracking using spatio-temporal Markov chain Monte Carlo data association. In: IEEE Conf. on Computer Vision and Pattern Recognition (2007) Google Scholar
  45. 45.
    Zhang, L., Li, Y., Nevatia, R.: Global data association for multi-object tracking using network flows. In: IEEE Conf. on Computer Vision and Pattern Recognition (2008) Google Scholar

Copyright information

© Springer-Verlag London Limited 2011

Authors and Affiliations

  • Bi Song
    • 1
  • Ricky J. Sethi
    • 2
  • Amit K. Roy-Chowdhury
    • 1
  1. 1.University of CaliforniaRiversideUSA
  2. 2.University of CaliforniaLos AngelesUSA

Personalised recommendations