A robust visual tracking method via local feature extraction and saliency detection


Visual object tracking is a fundamental problem in computer vision, and it relies heavily on features that describe the object's appearance. In this paper, we present a robust tracking algorithm that exploits the locally adaptive regression kernel (LARK) feature. The proposed approach formulates the LARK feature in a tracking-by-detection framework. In addition, we compute a target-specific saliency map as a LARK feature, guided by the tracking framework. The tracking problem is solved by maximizing an object location likelihood function. We adopt the fast Fourier transform (FFT) for fast learning and detection. Extensive experimental results on challenging videos show that the proposed algorithm performs favorably against state-of-the-art methods in terms of accuracy and robustness.
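
The abstract does not spell out the filter formulation, so the following is only a minimal sketch of how FFT-based learning and detection typically work in a correlation filter tracker. It assumes a generic single-channel, ridge-regression filter trained on raw pixel intensities rather than the LARK features and saliency map used in the paper. The function names (`gaussian_response`, `train_filter`, `detect`) and the parameters `sigma` and `lam` are hypothetical and chosen for illustration.

```python
import numpy as np


def gaussian_response(shape, sigma=2.0):
    """Desired filter output: a Gaussian peak at the patch centre."""
    h, w = shape
    ys, xs = np.mgrid[0:h, 0:w]
    cy, cx = h // 2, w // 2
    return np.exp(-((ys - cy) ** 2 + (xs - cx) ** 2) / (2.0 * sigma ** 2))


def train_filter(patch, target, lam=1e-2):
    """Learn the filter in closed form, element-wise in the Fourier domain.

    Assumption: a generic regularised least-squares correlation filter,
    not the LARK-based formulation of the paper.
    """
    F = np.fft.fft2(patch)
    G = np.fft.fft2(target)
    # lam is a small regulariser that keeps the division well conditioned.
    return (G * np.conj(F)) / (F * np.conj(F) + lam)


def detect(filter_hat, patch):
    """Apply the learned filter to a new patch and locate the response peak."""
    response = np.real(np.fft.ifft2(filter_hat * np.fft.fft2(patch)))
    peak = np.unravel_index(np.argmax(response), response.shape)
    return peak, response
```

In this sketch, training on a patch and then calling `detect` on a circularly shifted copy of it (for example, one produced with `np.roll`) moves the response peak away from the patch centre by the same offset, which is how the new target location would be read off. Both learning and detection reduce to element-wise operations on FFTs, which is what makes this family of trackers fast.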



Xian Wei

The authors declare that they have no conflict of interest.

This work was jointly supported by the CAS Pioneer Hundred Talents Program (Type C) under Grant No. 2017-122, the National Science Fund for Young Scholars under Grant No. 61806186, and the National Natural Science Foundation of China (No. 61503173, No. 61873246).

Wang, Y., Wei, X., Ding, L. et al. A robust visual tracking method via local feature extraction and saliency detection. Vis Comput 36, 683–700 (2020). https://doi.org/10.1007/s00371-019-01646-1

  • Visual object tracking
  • Locally adaptive regression kernel
  • Correlation filter tracking
  • Saliency detection