Robust visual tracking with contiguous occlusion constraint

Wang, Pengcheng; Qian, Weixian; Chen, Qian

doi:10.1007/s10043-015-0152-z

Regular Paper
Published: 23 November 2015

Volume 23, pages 40–52, (2016)
Optical Review

Pengcheng Wang¹,
Weixian Qian¹ &
Qian Chen¹

Abstract

Visual tracking plays a fundamental role in video surveillance, robot vision and many other computer vision applications. In this paper, a robust visual tracking method motivated by the regularized $\ell_1$ tracker is proposed. We focus on the case in which the target object is occluded. Generally, occlusion can be treated as a kind of contiguous outlier, with the target object as background. However, the penalty function of the $\ell_1$ tracker is not robust to relatively dense errors distributed in contiguous regions. Thus, we exploit a nonconvex penalty function and Markov random fields (MRFs) for outlier modeling, which are more likely to detect the contiguous occluded regions and recover the target appearance. For long-term tracking, a particle filter framework along with a dynamic model update mechanism is developed. Both qualitative and quantitative evaluations demonstrate robust and precise performance.

Acknowledgments

This research is supported by Jiangsu Province High-level Talents in Six Industries (2012-DZXX-037), Program for New Century Excellent Talents in University (NCET-12-0630), and the Natural Science Foundation of Jiangsu Province of China (BK20130769).

Author information

Authors and Affiliations

Jiangsu Key Laboratory of Spectral Imaging and Intelligent Sense, Nanjing University of Science and Technology, Nanjing, 210094, China
Pengcheng Wang, Weixian Qian & Qian Chen

Corresponding author

Correspondence to Pengcheng Wang.

Appendix

The APG algorithm is originally designed for solving the following unconstrained program

$$\min_{\mathbf{x}} F(\mathbf{x}) + G(\mathbf{x})$$

(16)

where F(x) is a differentiable convex function with a Lipschitz continuous gradient and G(x) is a nonsmooth but convex function. Equation (7) is a nonlinear problem with a nonnegativity constraint, which can be reformulated as

$$\min_{\mathbf{a}} \frac{1}{2}\left\| \mathcal{P}_{\hat{\mathbf{S}}^{\bot}} (\mathbf{y} - \mathbf{Ta}) \right\|_{2}^{2} + \frac{\lambda}{\alpha(\hat{\mathbf{S}})}\|\mathbf{a}\|_{1} + \mathbf{1}_{\mathbb{R}_{+}}(\mathbf{a})$$

(17)

where the additional term $\mathbf{1}_{\mathbb{R}_{+}}(\mathbf{a})$ is the indicator function of the nonnegative orthant, defined as

$$\mathbf{1}_{\mathbb{R}_{+}}(\mathbf{a}) = \begin{cases} 0 & \text{for}\ \mathbf{a} \ge 0 \\ +\infty & \text{otherwise} \end{cases}$$

(18)

Then, the objective of Eq. (17) can be decomposed as

$$F(\mathbf{a}) = \frac{1}{2}\left\| \mathcal{P}_{\hat{\mathbf{S}}^{\bot}} (\mathbf{y} - \mathbf{Ta}) \right\|_{2}^{2}, \quad G(\mathbf{a}) = \frac{\lambda}{\alpha(\hat{\mathbf{S}})}\|\mathbf{a}\|_{1} + \mathbf{1}_{\mathbb{R}_{+}}(\mathbf{a})$$

(19)
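As a minimal sketch of the decomposition in Eq. (19), the two terms can be evaluated separately. The helper names are hypothetical, and the projection $\mathcal{P}_{\hat{\mathbf{S}}^{\bot}}$ is assumed here to zero the entries flagged as occluded, since its definition lies outside this appendix.

```python
import math

def project_unoccluded(v, occluded):
    """Assumed form of the projection: zero the entries flagged as occluded."""
    return [0.0 if s else x for x, s in zip(v, occluded)]

def smooth_term(y, T, a, occluded):
    """F(a) = 0.5 * ||P(y - T a)||_2^2, with T stored as a list of rows."""
    residual = [yi - sum(tij * aj for tij, aj in zip(row, a))
                for yi, row in zip(y, T)]
    masked = project_unoccluded(residual, occluded)
    return 0.5 * sum(r * r for r in masked)

def nonsmooth_term(a, lam, alpha_s):
    """G(a) = (lam / alpha_s) * ||a||_1 plus the nonnegativity indicator."""
    if any(ai < 0 for ai in a):
        return math.inf  # indicator term: infeasible coefficients
    return (lam / alpha_s) * sum(abs(ai) for ai in a)

# Toy data: the third pixel is occluded and is ignored by F.
T = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
y = [1.0, 2.0, 10.0]
occluded = [False, False, True]
print(smooth_term(y, T, [1.0, 1.0], occluded))   # 0.5
print(nonsmooth_term([-1.0, 1.0], 0.2, 0.5))     # inf
```

The large residual at the occluded pixel does not contribute to F, while any negative coefficient makes G infinite, which is how the constraint enters an unconstrained program.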

The gradient of F(a) is Lipschitz continuous because, for any $\mathbf{x}_{1}, \mathbf{x}_{2} \in \mathbb{R}^{n}$,

$$\begin{aligned} &||\nabla F({\mathbf{x}}_{1} ) - \nabla F({\mathbf{x}}_{2} )|| \\ & = ||{\mathbf{A}}^{\text{T}} ({\mathbf{Ax}}_{1} - {\mathbf{b}}) - {\mathbf{A}}^{\text{T}} ({\mathbf{Ax}}_{2} - {\mathbf{b}})|| \\ & = ||{\mathbf{A}}^{\text{T}} {\mathbf{A}}({\mathbf{x}}_{1} - {\mathbf{x}}_{2} )|| \le ||{\mathbf{A}}^{\text{T}} {\mathbf{A}}||_{2} ||{\mathbf{x}}_{1} - {\mathbf{x}}_{2} || \\ \end{aligned}$$

(20)

where the matrix $\mathbf{A} = \mathcal{P}_{\hat{\mathbf{S}}^{\bot}}(\mathbf{T})$ and $\mathbf{b} = \mathcal{P}_{\hat{\mathbf{S}}^{\bot}}(\mathbf{y})$. Thus, the Lipschitz constant of the gradient is $L = \left\| \mathcal{P}_{\hat{\mathbf{S}}^{\bot}}(\mathbf{T})^{\text{T}} \mathcal{P}_{\hat{\mathbf{S}}^{\bot}}(\mathbf{T}) \right\|_{2}$.
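The constant L is the spectral norm of $\mathbf{A}^{\text{T}}\mathbf{A}$, which is its largest eigenvalue. One way to estimate it, sketched below, is power iteration. This is an illustrative choice rather than the procedure used in the paper; the function name is hypothetical, and the projection is again assumed to zero the occluded rows of T.

```python
import math

def lipschitz_constant(T, occluded, iters=200):
    """Estimate L = ||A^T A||_2 by power iteration, with A = masked T."""
    # Assumed projection: rows of T at occluded pixels are set to zero.
    A = [[0.0] * len(row) if s else list(row) for row, s in zip(T, occluded)]
    n = len(A[0])
    v = [1.0 / math.sqrt(n)] * n  # unit-norm starting vector
    L = 0.0
    for _ in range(iters):
        Av = [sum(aij * vj for aij, vj in zip(row, v)) for row in A]
        w = [sum(A[i][j] * Av[i] for i in range(len(A))) for j in range(n)]
        norm = math.sqrt(sum(x * x for x in w))
        if norm == 0.0:
            return 0.0  # every row is occluded, or v lies in the null space
        L = norm        # ||A^T A v|| with ||v|| = 1 approaches the top eigenvalue
        v = [x / norm for x in w]
    return L

T = [[2.0, 0.0], [0.0, 1.0], [5.0, 5.0]]
occluded = [False, False, True]
print(lipschitz_constant(T, occluded))  # close to 4.0
```

With the third row masked out, $\mathbf{A}^{\text{T}}\mathbf{A}$ is diagonal with entries 4 and 1, so the estimate converges to 4. Power iteration can stall if the starting vector happens to be orthogonal to the dominant eigenvector, so a random start may be preferable in practice.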

In the generic APG algorithm, we need to solve the following optimization problem at each iteration

$$\alpha_{k + 1} = \arg \min_{\mathbf{a}} \frac{L}{2}\left\| \mathbf{a} - \beta_{k + 1} + \frac{\nabla F(\beta_{k + 1})}{L} \right\|_{2}^{2} + G(\mathbf{a})$$

(21)

For the G(a) defined in Eq. (19), this problem is equivalent to

$$\alpha_{k + 1} = \arg \min_{\mathbf{a}} \frac{L}{2}\left\| \mathbf{a} - \beta_{k + 1} + \frac{\nabla F(\beta_{k + 1})}{L} \right\|_{2}^{2} + \frac{\lambda}{\alpha(\hat{\mathbf{S}})}\|\mathbf{a}\|_{1}, \quad \text{s.t.}\ \mathbf{a} \ge 0$$

(22)

The optimization is a truncated quadratic problem and has a closed-form solution, with the maximum taken elementwise

$$\alpha_{k + 1} = \max \left\{ 0,\ \beta_{k + 1} - \frac{\nabla F(\beta_{k + 1})}{L} - \frac{\lambda}{L\,\alpha(\hat{\mathbf{S}})} \right\}$$

(23)
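The closed-form update can be placed inside an APG loop, as in the sketch below. Several parts are assumptions: the momentum rule for the extrapolation point is the standard FISTA-style schedule, which this appendix does not spell out; the projection is taken to drop occluded rows; and the function and argument names are hypothetical.

```python
import math

def apg_nonneg_l1(y, T, occluded, lam, alpha_s, L, iters=500):
    """Minimise 0.5*||P(y - T a)||^2 + (lam/alpha_s)*||a||_1 with a >= 0."""
    # Assumed projection: keep only the rows that are not flagged as occluded.
    rows = [(row, yi) for row, yi, s in zip(T, y, occluded) if not s]
    n = len(T[0])
    thresh = lam / (L * alpha_s)  # shrinkage amount in the closed-form update
    a_prev = [0.0] * n
    a = [0.0] * n
    t_prev, t = 1.0, 1.0
    for _ in range(iters):
        # Extrapolation point (FISTA-style momentum, an assumption here).
        m = (t_prev - 1.0) / t
        beta = [ai + m * (ai - pi) for ai, pi in zip(a, a_prev)]
        # Gradient of F at beta: A^T (A beta - b).
        grad = [0.0] * n
        for row, yi in rows:
            r = sum(tij * bj for tij, bj in zip(row, beta)) - yi
            for j in range(n):
                grad[j] += row[j] * r
        # Gradient step, shrink by thresh, then clip at zero.
        a_prev, a = a, [max(0.0, bj - gj / L - thresh)
                        for bj, gj in zip(beta, grad)]
        t_prev, t = t, (1.0 + math.sqrt(1.0 + 4.0 * t * t)) / 2.0
    return a

T = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
y = [3.0, -1.0, 50.0]
occluded = [False, False, True]
print(apg_nonneg_l1(y, T, occluded, lam=0.5, alpha_s=1.0, L=1.0))  # [2.5, 0.0]
```

In this toy problem the unoccluded rows of T form an identity matrix, so L = 1 and the problem separates per coefficient: the first coefficient is shrunk from 3 to 2.5, and the second is clipped to zero because its unconstrained minimiser is negative.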

About this article

Cite this article

Wang, P., Qian, W. & Chen, Q. Robust visual tracking with contiguous occlusion constraint. Opt Rev 23, 40–52 (2016). https://doi.org/10.1007/s10043-015-0152-z

Received: 03 August 2015
Accepted: 31 October 2015
Published: 23 November 2015
Issue Date: February 2016
DOI: https://doi.org/10.1007/s10043-015-0152-z
