Visual Object Tracking with Adaptive Template Update and Global Search Augmentation

Zeng, Lu; He, Wei; Zhang, Wenqiang

doi:10.1007/978-981-99-0301-6_3

Lu Zeng¹⁰,
Wei He¹⁰ &
Wenqiang Zhang¹⁰

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1770))

Included in the following conference series:

China Intelligent Robotics Annual Conference

432 Accesses

Abstract

The realization of human-machine-environment intimate interaction by intelligent robots is the research direction of cutting-edge exploration in the field of robotics. One of the important tasks is to realize active target tracking on the robot platform. Single-target tracking is subject to data changes such as target position and size in the video sequence, and is prone to target drift or loss when the environment changes drastically or is occluded. This paper aims at the application background of the intelligent foot robot platform, where deep learning technology is used to adopt adaptive multi-target tracking. The frame detection template updated Shuffle net V2–0.5 convolutional neural network builds a deep tracking model, which speeds up the model calculation. At the same time, the multi-template input ensures that the required target information can be located in a larger search image and a global search module is added. The target position re-detection is carried out, and the background enhancement training is integrated to significantly strengthen the discrimination ability of the global search network. The target tracking accuracy of the improved visual object tracking algorithm reaches 64.7%, and the accuracy of the target located at the center point of the marker frame reaches 86.8%, which is significantly improved compared with the traditional algorithm.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 79.99; Price excludes VAT (USA)

Softcover Book: USD 99.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Bao, H., Lu, Y., Wang, Q.:Single target tracking via correlation filter and context adaptively. Multimedia Tools and Appl. 79, 27465–27482 (2020). https://doi.org/10.1007/s11042-020-09309-3
Wang, D., et al.: Online single target tracking in WAMI: benchmark and evaluation. Multimedia Tools Appl. 77(9), 10939–10960 (2018)
Article Google Scholar
Xiao, J., et al.: Dynamic multi-level appearance models and adaptive clustered decision trees for single target tracking. Pattern Recognition 69.(2017). https://doi.org/10.1016/j.patcog.2017.04.001. Author, F.: Contribution title. In: 9th International Proceedings on Proceedings, pp. 1–2. Publisher, Location (2010)
Yanqing, W., Liang, Z., Cheng, X.: Fast target tracking based on improved deep sort and YOLOv3 fusion algorithm. Abstracts of the 7th International Conference of Pioneering Computer Scientists, Engineers and Educators (ICPCSEE 2021) Part I.Ed.. Springer, pp. 107–109 (2021). https://doi.org/10.1007/978-981-16-5940-9_27
Kwa, H.L., et al.: Optimal swarm strategy for dynamic target search and tracking. Autonomous Agents and MultiAgent Systems.Ed., pp. 672680 (2020)
Google Scholar
Yıldırım, S., Jiang, L., Singh, S.S., Dean, T.A.: Calibrating the Gaussian multi-target tracking model. Stat. Comput. 25(3), 595–608 (2014)
Article MathSciNet MATH Google Scholar
Nam, H., Han, B.: Learning multi-domain convolutional neural networks for visual tracking. Computer Vision and Pattern Recognition IEEE (2015)
Google Scholar
Tao, R., Gavves, E., Smeulders, A.: Siamese instance search for tracking. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1420–1429 (2016)
Google Scholar
Bertinetto, L., et al.: Fully-Convolutional Siamese Networks for Object Tracking. CoRR abs/1606.09549 (2016)
Google Scholar
Li, B., et al.: SiamRPN++: Evolution of siamese visual tracking with very deep networks. In: 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) IEEE (2020)
Google Scholar
Chen, Z.D., Zhong, B.N., Li, G.R., et al.: Siamese box adaptive network for visual tracking. In: Proceedings of 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition.Seattle: IEEE, pp. 6667–6676 (2020)
Google Scholar
Voigtlaender, P., Luiten, J., Torr, P.H.S., et al.: Siam R-CNN:Visual tracking by re-detection. In: Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE, pp. 6577–6587 (2020)
Google Scholar
Zhang, X., et al.: ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices. CoRR abs/1707.01083 (2017)
Google Scholar
Grimaldi, M., et al.: Dynamic ConvNets on Tiny Devices via Nested Sparsity. arXiv e-prints (2022)
Google Scholar
Sharma, S.: Ermenegildo Zegna OTB Process Analysis. (2015)
Google Scholar
Bo, L., et al.: High performance visual tracking with siamese region proposal network. In: 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) IEEE (2018)
Google Scholar
Folberth, J., Becker, S.: Efficient Adjoint Computation for Wavelet and Convolution Operators (2017)
Google Scholar

Download references

Author information

Authors and Affiliations

Fudan University, Shanghai, China
Lu Zeng, Wei He & Wenqiang Zhang

Authors

Lu Zeng
View author publications
You can also search for this author in PubMed Google Scholar
Wei He
View author publications
You can also search for this author in PubMed Google Scholar
Wenqiang Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Lu Zeng .

Editor information

Editors and Affiliations

Harbin Engineering University, Harbin, China
Zhiwen Yu
Xi’an University of Technology, Xi’an, China
Xinhong Hei
Beijing University of Posts and Telecommunications, Beijing, China
Duanling Li
Harbin University of Science and Technology, Harbin, China
Xianhua Song
National Academy of Guo Ding Institute of Data Science, Beijing, China
Zeguang Lu

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Zeng, L., He, W., Zhang, W. (2023). Visual Object Tracking with Adaptive Template Update and Global Search Augmentation. In: Yu, Z., Hei, X., Li, D., Song, X., Lu, Z. (eds) Intelligent Robotics. CIRAC 2022. Communications in Computer and Information Science, vol 1770. Springer, Singapore. https://doi.org/10.1007/978-981-99-0301-6_3

Download citation

DOI: https://doi.org/10.1007/978-981-99-0301-6_3
Published: 18 February 2023
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-0300-9
Online ISBN: 978-981-99-0301-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics