Toward Robust Online Visual Tracking

Yang, Ming-Hsuan; Ho, Jeffrey

doi:10.1007/978-0-85729-127-1_8

Toward Robust Online Visual Tracking

Ming-Hsuan Yang &
Jeffrey Ho

Chapter

1365 Accesses
5 Citations

Abstract

We pursue a research direction that will empower machines with simultaneous tracking and recognition capabilities similar to human cognition. Toward that, we develop algorithms that leverage prior knowledge/model obtained offline with information available online via novel learning algorithms. While humans can effortlessly locate moving objects in different environments, visual tracking remains one of the most important and challenging problems in computer vision. Robust cognitive visual tracking algorithms facilitate answering important questions regarding how objects move and interact in complex environments. They have broad applications including surveillance, navigation, human computer interfaces, object recognition, motion analysis and video indexing, to name a few.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Hardcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

Adam, A., Rivlin, E., Shimshoni, I.: Robust fragments-based tracking using the integral histogram. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, vol. 1, pp. 798–805 (2006)
Google Scholar
Adelson, E.H., Bergen, J.R.: The plenoptic function and the elements of early vision. In: Landy, M., Movshon, J.A. (eds.) Computational Models of Visual Processing, pp. 1–20. MIT Press, Cambridge (1991)
Google Scholar
Avidan, S.: Ensemble tracking. IEEE Trans. Pattern Anal. Mach. Intell. 29(2), 261–271 (2007)
Article Google Scholar
Babenko, B., Yang, M.-H., Belongie, S.: Visual tracking with online multiple instance learning. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, vol. 1, pp. 983–990 (2009)
Google Scholar
Balan, A.O., Black, M.J.: An adaptive appearance model approach for model-based articulated object tracking. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, vol. 1, pp. 758–765 (2006)
Google Scholar
Belhumeur, P., Kreigman, D.: What is the set of images of an object under all possible lighting conditions. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, pp. 270–277 (1997)
Google Scholar
Black, M.J., Jepson, A.D.: Eigentracking: Robust matching and tracking of articulated objects using a view-based representation. Int. J. Comput. Vis. 26(1), 63–84 (1998)
Article Google Scholar
Bregler, C., Malik, J.: Tracking people with twists and exponential map. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, pp. 8–15 (1998)
Google Scholar
Cham, T.J., Rehg, J.M.: A multiple hypothesis approach to figure tracking. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, vol. 2, pp. 239–245 (1998)
Google Scholar
Charikar, M., Chekuri, C., Feder, T., Motwani, R.: Incremental clustering and dynamic information retrieval. SIAM J. Comput. 33(6), 1417–1440 (2004)
Article MathSciNet MATH Google Scholar
Collins, R.T., Liu, Y., Leordeanu, M.: Online selection of discriminative tracking features. IEEE Trans. Pattern Anal. Mach. Intell. 27(10), 1631–1643 (2005).
Article Google Scholar
Comaniciu, D., Ramesh, V., Meer, P.: Kernel-based object tracking. IEEE Trans. Pattern Anal. Mach. Intell. 25(5), 564–577 (2003)
Article Google Scholar
Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, vol. 1, pp. 886–893 (2005)
Google Scholar
Deutscher, J., Blake, A., Reid, I.: Articulated body motion capture by annealed particle filtering. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, vol. 2, pp. 126–133 (2000)
Google Scholar
Dietterich, T.G., Lathrop, R.H., Perez, L.T.: Solving the multiple-instance problem with axis parallel rectangles. Artif. Intell. 89(1–2), 31–71 (1997)
Article MATH Google Scholar
Dollár, P., Tu, Z., Tao, H., Belongie, S.: Feature mining for image classification. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2007
Google Scholar
Forsyth, D., Arikan, O., Ikemoto, L., O’Brien, J., Ramanan, D.: Computational Studies of Human Motion: Part 1, Tracking and Motion Synthesis. Now publishers, Hanover (2006)
Google Scholar
Freund, Y., Schapire, R.E.: A decision-theoretic generalization of on-line learning and an application to boosting. J. Comput. Syst. Sci. 55, 119–139 (1997)
Article MathSciNet MATH Google Scholar
Friedman, J.H.: Greedy function approximation: A gradient boosting machine. Ann. Stat. 29(5), 1189–1232 (2001)
Article MATH Google Scholar
Ghahramani, Z., Beal, M.: Variational inference for Bayesian mixtures of factor analysers. In: Advances in Neural Information Processing Systems, pp. 449–455 (2000)
Google Scholar
Golub, G.H., Van Loan, C.F.: Matrix Computations. The Johns Hopkins University Press, Baltimore (1996)
MATH Google Scholar
Grabner, H., Grabner, M., Bischof, H.: Real-time tracking via on-line boosting. In: Proceedings of British Machine Vision Conference, pp. 47–56 (2006)
Google Scholar
Hager, G.D., Belhumeur, P.N.: Efficient region tracking with parametric models of geometry and illumination. IEEE Trans. Pattern Anal. Mach. Intell. 20(10), 1025–1039 (1998)
Article Google Scholar
Hall, P., Marshall, D., Martin, R.: Adding and subtracting eigenspaces with eigenvalue decomposition and singular value decomposition. Image Vis. Comput. 20(13–14), 1009–1016 (2002)
Article Google Scholar
Ho, J., Lee, K.-C., Yang, M.-H., Kriegman, D.: Visual tracking using learned linear subspaces. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, vol. 1, pp. 782–789 (2004)
Google Scholar
Humphreys, G., Bruce, V.: Visual Cognition: Computational, Experimental and Neuropsychological Perspectives. Psychology Press, London (1989)
MATH Google Scholar
Ioffe, S., Forsyth, D.: Probabilistic methods for finding people. Int. J. Comput. Vis. 43(1), 45–68 (2001)
Article MATH Google Scholar
Isard, M., Blake, A.: CONDENSATION—conditional density propagation for visual tracking. Int. J. Comput. Vis. 29(1), 5–28 (1998)
Article Google Scholar
Jain, A.K., Murty, M.N., Flynn, P.J.: Data clustering: A review. ACM Comput. Surv. 31(3), 264–323 (1999)
Article Google Scholar
Jepson, A.D., Fleet, D.J., El-Maraghi, T.F.: Robust online appearance models for visual tracking. IEEE Trans. Pattern Anal. Mach. Intell. 25(10), 1296–1311 (2003)
Article Google Scholar
Lee, K.-C., Ho, J., Yang, M.-H., Kriegman, D.: Visual tracking and recognition using probabilistic appearance manifolds. Comput. Vis. Image Underst. 99(3), 303–331 (2005)
Article Google Scholar
Levy, A., Lindenbaum, M.: Sequential Karhunen-Loeve basis extraction and its application to images. IEEE Trans. Image Process. 9(8), 1371–1374 (2000)
Article MATH Google Scholar
Li, R., Yang, M.-H., Sclaroff, S., Tian, T.-P.: Monocular tracking of 3D human motion with a coordinated mixture of factor analyzers. In: Proceedings of European Conference on Computer Vision, pp. 137–150 (2006)
Google Scholar
Li, R., Tian, T.-P., Sclaroff, S., Yang, M.-H.: 3D human motion tracking with a coordinated mixture of factor analyzers. Int. J. Comput. Vis. 87(1–2), 170–190 (2010)
Article Google Scholar
Lim, J., Ross, D., Lin, R.-S., Yang, M.-H.: Incremental learning for visual tracking. In: Advances in Neural Information Processing Systems, pp. 793–800. MIT Press, Cambridge (2005)
Google Scholar
Lin, R.-S., Liu, C.-B., Yang, M.-H., Ahuja, N., Levinson, S.: Learning nonlinear manifolds from time series. In: Proceedings of European Conference on Computer Vision, pp. 239–250 (2004)
Google Scholar
Lin, R.-S., Ross, D., Lim, J., Yang, M.-H.: Adaptive discriminative generative model and its applications. In: Advances in Neural Information Processing Systems, pp. 801–808. MIT Press, Cambridge (2005)
Google Scholar
Mohan, A., Papageorgiou, C., Poggio, T.: Example-based object detection in images by components. IEEE Trans. Pattern Anal. Mach. Intell. 23(4), 349–361 (2001)
Article Google Scholar
Moselund, T., Granum, E.: A survey of computer vision-based human motion capture. Comput. Vis. Image Underst. 81(3), 231–268 (2001)
Article Google Scholar
Murase, H., Nayar, S.: Visual learning and recognition of 3d objects from appearance. Int. J. Comput. Vis. 14(1), 5–24 (1995)
Article Google Scholar
Nejhum, S.M.S., Ho, J., Yang, M.-H.: Online articulate object tracking with appearance and shape. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2008
Google Scholar
Oza, N.C.: Online Ensemble Learning. Ph.D. Thesis, University of California, Berkeley (2001)
Google Scholar
Pentland, A., Moghaddam, B., Starner, T., Oligide, O., Turk, M.: View-based and modular eigenspaces for face recognition. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, pp. 84–91 (1994)
Chapter Google Scholar
Porikli, F.: Integral histogram: A fast way to extract histograms in Cartesian spaces. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, vol. 1, pp. 829–836 (2005)
Google Scholar
Ramanan, D., Forsyth, D.: Finding and tracking people from the bottom up. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, vol. 2, pp. 467–474 (2003)
Google Scholar
Ronfard, R., Schmid, C., Triggs, B.: Learning to parse pictures of people. In: Proceedings of the Seventh European Conference on Computer Vision, pp. 700–714 (2002)
Google Scholar
Ross, D., Lim, J., Yang, M.-H.: Adaptive probabilistic visual tracking with incremental subspace update. In: Proceedings of European Conference on Computer Vision, pp. 470–482 (2004)
Google Scholar
Ross, D., Lim, J., Lin, R.-S., Yang, M.-H.: Incremental learning for robust visual tracking. Int. J. Comput. Vis. 77(1–3), 125–141 (2008)
Article Google Scholar
Roweis, S., Saul, L., Hinton, G.E.: Global coordination of local linear models. In: Advances in Neural Information Processing Systems, pp. 889–896 (2001)
Google Scholar
Rowley, H., Baluja, S., Kanade, T.: Neural network-based face detection. IEEE Trans. Pattern Anal. Mach. Intell. 20(1), 23–38 (1998)
Article Google Scholar
Schneiderman, H., Kanade, T.: Object detection using the statistics of parts. Int. J. Comput. Vis. 56(3), 151–177 (2004)
Article Google Scholar
Sidenbladh, H., Black, M.: Learning image statistics for Bayesian tracking. In: Proceedings of IEEE International Conference on Computer Vision, pp. 709–716 (2001)
Google Scholar
Sigal, L., Bhatia, S., Roth, S., Black, M., Isard, M.: Tracking loose-limbed people. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, pp. 421–428 (2004)
Google Scholar
Sigal, L., Black, M.: HumanEva: synchronized video and motion capture dataset for evaluation of articulated human motion. Technical Report CS-06-08, Brown University (2006)
Google Scholar
Sminchisescu, C., Triggs, B.: Covariance scaled sampling for monocular 3D body tracking. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, pp. 447–454 (2001)
Google Scholar
Sung, K.-K., Poggio, T.: Example-based learning for view-based human face detection. IEEE Trans. Pattern Anal. Mach. Intell. 20(1), 39–51 (1998)
Article Google Scholar
Teh, Y.W., Roweis, S.: Automatic alignment of local representations. In: Advances in Neural Information Processing Systems, pp. 841–848 (2002)
Google Scholar
Tian, T.-P., Li, R., Sclaroff, S.: Tracking human body pose on a learned smooth space. Technical Report 2005-029, Boston University (2005)
Google Scholar
Toyama, K., Blake, A.: Probabilistic tracking with exemplars in a metric space. Int. J. Comput. Vis. 48(1), 9–19 (2002)
Article MATH Google Scholar
Urtasun, R., Fleet, D., Hertzmann, A., Fua, P.: Priors for people tracking from small training sets. In: Proceedings of IEEE International Conference on Computer Vision, pp. 403–410 (2005)
Google Scholar
Viola, P., Jones, M.: Rapid object detection using a boosted cascade of simple features. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, vol. 1, pp. 511–518 (2001)
Google Scholar
Viola, P., Platt, J.C., Zhang, C.: Multiple instance boosting for object detection. In: Advances in Neural Information Processing Systems, pp. 1417–1426. MIT Press, Cambridge (2005)
Google Scholar
Yilmaz, A., Javed, O., Shah, M.: Object tracking: A survey. ACM Comput. Surv. 38(4), 1–45 (2006)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Authors

Ming-Hsuan Yang
View author publications
You can also search for this author in PubMed Google Scholar
Jeffrey Ho
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Ming-Hsuan Yang .

Editor information

Editors and Affiliations

Center for Research, Intelligent Systems, University of California, Riverside, Riverside, 92521, California, USA
Bir Bhanu
Dept. Computer Science & Engineering, University of California, Riverside, Riverside, 92521, California, USA
Chinya V. Ravishankar
Dept. Electrical Engineering, University of California, Riverside, Riverside, 92521, California, USA
Amit K. Roy-Chowdhury
Dept. Electrical Engineering, Packard Bldg., Stanford University, Serra Mall 350, Stanford, 94305-9505, California, USA
Hamid Aghajan
Dept. Computer Science, University of California, Los Angeles, Boelter Hall 4731, Los Angeles, 90095-1596, California, USA
Demetri Terzopoulos

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Yang, MH., Ho, J. (2011). Toward Robust Online Visual Tracking. In: Bhanu, B., Ravishankar, C., Roy-Chowdhury, A., Aghajan, H., Terzopoulos, D. (eds) Distributed Video Sensor Networks. Springer, London. https://doi.org/10.1007/978-0-85729-127-1_8

Download citation

DOI: https://doi.org/10.1007/978-0-85729-127-1_8
Publisher Name: Springer, London
Print ISBN: 978-0-85729-126-4
Online ISBN: 978-0-85729-127-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics