Abstract
Direct visual tracking can be impaired by changes in illumination if the right choice of similarity function and photometric model is not made. Tracking using the sum of squared differences, for instance, often needs to be coupled with a photometric model to mitigate illumination changes. More sophisticated similarities, e.g. mutual information and cross cumulative residual entropy, however, can cope with complex illumination variations at the cost of a reduction of the convergence radius, and an increase of the computational effort. In this context, the normalized cross correlation (NCC) represents an interesting alternative. The NCC is intrinsically invariant to affine illumination changes, and also presents low computational cost. This article proposes a new direct visual tracking method based on the NCC. Two techniques have been developed to improve the robustness to complex illumination variations and partial occlusions. These techniques are based on subregion clusterization, and weighting by a residue invariant to affine illumination changes. The last contribution is an efficient Newton-style optimization procedure that does not require the explicit computation of the Hessian. The proposed method is compared against the state of the art using a benchmark database with ground-truth, as well as real-world sequences.
Keywords
- Mutual Information
- Augmented Reality
- Reference Image
- Visual Tracking
- Illumination Change
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
Download conference paper PDF
References
Szeliski, R.: Image alignment and stitching: a tutorial. Foundations and Trends in Computer Graphics and Vision 2 (2006)
Comport, A.I., Malis, E., Rives, P.: Real-time quadrifocal visual odometry. International Journal of Robotics Research 29 (2010)
Klein, G., Murray, D.: Parallel tracking and mapping for small AR workspaces. In: International Symposium on Mixed and Augmented Reality (2007)
Roth, P.M., Winter, M.: Survey of appearance-based methods for object recognition. Technical Report ICG-TR-01/08 (2008)
Baker, S., Matthews, I.: Equivalence and efficiency of image alignment algorithms. In: IEEE Conference on Computer Vision and Pattern Recognition (2001)
Benhimane, S., Malis, E.: Homography-based 2D visual tracking and servoing. International Journal of Robotics Research 26 (2007)
Silveira, G., Malis, E.: Unified direct visual tracking of rigid and deformable surfaces under generic illumination changes in grayscale and color images. International Journal of Computer Vision 89 (2010)
Hager, G., Belhumeur, P.: Efficient region tracking with parametric models of geometry and illumination. IEEE Transactions Pattern Analysis and Machine Intelligence 20 (1998)
Dame, A., Marchand, E.: Accurate real-time tracking using mutual information. In: International Symposium on Mixed and Augmented Reality (2010)
Wang, F., Vemuri, B.C.: Non-rigid multi-modal image registration using cross-cumulative residual entropy. International Journal of Computer Vision 74 (2007)
Richa, R., Hager, G.: Robust similarity measures for direct gradient-based visual tracking. Technical report (2012), http://www.cs.jhu.edu/~richa/robust.html
Irani, M., Anandan, P.: Robust multi-sensor image alignment. In: International Conference on Computer Vision (1998)
Brooks, R., Arbel, T.: Generalizing inverse compositional and ESM image alignment. International Journal of Computer Vision 87 (2010)
Lieberknecht, S., Benhimane, S., Meier, P., Navab, N.: A dataset and evaluation methodology for template-based tracking algorithms. In: International Symposium on Mixed and Augmented Reality (2009)
Nocedal, J., Wright, S.J.: Numerical Optimization. Springer (2000)
Lloyd, S.: Least squares quantization in PCM. IEEE Transactions on Information Theory 28 (1982)
Huber, P.J.: Robust Statistics. John Wiley and Sons, New York (1981)
Arya, K., Gupta, P., Kalra, P., Mitra, P.: Image registration using robust M-estimators. Pattern Recognition Letters 28 (2007)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Scandaroli, G.G., Meilland, M., Richa, R. (2012). Improving NCC-Based Direct Visual Tracking. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds) Computer Vision – ECCV 2012. ECCV 2012. Lecture Notes in Computer Science, vol 7577. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-33783-3_32
Download citation
DOI: https://doi.org/10.1007/978-3-642-33783-3_32
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-33782-6
Online ISBN: 978-3-642-33783-3
eBook Packages: Computer ScienceComputer Science (R0)