Abstract
In the visual tracking problem, fusion of visible and infrared sensors provides complementarily useful features and can consistently help distinguish the target from the background efficiently. Recently, multi-view learning has received growing attention due to its enormous potential in combining diverse view features containing consistent and complementary characteristics. Therefore, in this paper, a visible and infrared fusion tracking algorithm based on multi-view multi-kernel fusion (MVMKF) model is presented. The proposed MVMKF model considers the diversities of visible and infrared views and embeds complementary information from them. Furthermore, the multi-kernel framework is used to learn the importance of view features so that an integrated appearance representation is made with regard to the respective performance. Besides, the tracking task is completed with naive Bayes classifier in sophisticated compressive feature domain, considering the high performances of classifier-level and sophisticated feature-level learning for multiple views. The experimental results demonstrate that the MVMKF tracking algorithm performs well in terms of accuracy and robustness.
Similar content being viewed by others
References
Achlioptas, D.: Database-friendly random projections: Johnson-lindenstrauss with binary coins. J. Comput. Syst. Sci. 66(4), 671–687 (2003)
Babenko, B., Yang, M.H., Belongie, S.: Robust object tracking with online multiple instance learning. IEEE Trans. Pattern Anal. Mach. Intell. 33(8), 1619–1632 (2011)
Blum, A., Mitchell, T.: Combining labeled and unlabeled data with co-training. In: Proceedings of the eleventh annual conference on Computational learning theory, pp. 92–100. ACM (1998)
Conaire, C.Ó., OConnor, N.E., Smeaton, A.: Thermo-visual feature fusion for object tracking using multiple spatiogram trackers. Mach. Vis. Appl. 19(5–6), 483–494 (2008)
Deza, M.M., Deza, E.: Encyclopedia of distances. Springer (2009)
Guo, D., Zhang, J., Liu, X., Cui, Y., Zhao, C.: Multiple kernel learning based multi-view spectral clustering. In: Pattern recognition (ICPR), 2014 22nd international conference on, pp. 3774–3779. IEEE (2014)
He, H., Kong, F., Tan, J.: Dietcam: Multi-view food recognition using a multi-kernel svm. IEEE J. Biomed. Health Inf. 1(99), 1–8 (2015)
Hong, Z., Mei, X., Prokhorov, D., Tao, D.: Tracking via robust multi-task multi-view joint sparse representation. In: Computer vision (ICCV), 2013 IEEE international conference on, pp. 649–656. IEEE (2013)
Jiang, N., Liu, W., Wu, Y.: Learning adaptive metric for robust visual tracking. IEEE Trans. Image Process. 20(8), 2288–2300 (2011)
Johnson, R., Zhang, T.: Semi-supervised learning with multi-view embedding: theory and application with convolutional neural networks. (2015). arXiv:1504.01255
Jordan, A.: On discriminative vs. generative classifiers: a comparison of logistic regression and naive bayes. Adv. Neural Inf. Process. Syst. 14, 841 (2002)
Klausner, A., Tengg, A., Rinner, B.: Vehicle classification on multi-sensor smart cameras using feature-and decision-fusion. In: Distributed smart cameras, 2007. ICDSC’07. First ACM/IEEE international conference on, pp. 67–74. IEEE (2007)
Kludas, J., Bruno, E., Marchand-Maillet, S.: Information fusion in multimedia information retrieval. In: Adaptive multimedia retrieval: retrieval, user, and semantics, pp. 147–159. Springer (2008)
Koishi, T., Sasaki, M., Nakaguchi, T., Tsumura, N., Miyake, Y.: Endoscopy system for length measurement by manual pointing with an electromagnetic tracking sensor. Opt. Rev. 17(2), 54–60 (2010)
Luo, Y., Liu, T., Tao, D., Xu, C.: Multiview matrix completion for multilabel image classification. IEEE Trans. Image Process. 24(8), 2355–2368 (2015)
Luo, Y., Tang, J., Yan, J., Xu, C., Chen, Z.: Pre-trained multi-view word embedding using two-side neural network. In: Twenty-eighth AAAI conference on artificial intelligence, pp. 1982–1988 (2014)
Meltzer, J., Yang, M.H., Gupta, R., Soatto, S.: Multiple view feature descriptors from image sequences via kernel principal component analysis. In: Computer vision–ECCV 2004, pp. 215–227. Springer (2004)
Mihaylova, L., Loza, A., Nikolov, S.G., Lewis, J.J., Canga, E.F., Li, J., Dixon, T.D., Canagarajah, C.N., Bull, D.R.: The influence of multi-sensor video fusion on object tracking using a particle filter. GI Jahrestagung 1, 354–358 (2006)
Mittal, A., Davis, L.S.: M2tracker: a multi-view approach to segmenting and tracking people in a cluttered scene. Int. J. Comput. Vis. 51(3), 189–203 (2003)
Parzen, E.: On estimation of a probability density function and mode. Ann. Math. Stat., pp. 1065–1076 (1962)
Peng, C., Chen, Q., Qian, W.X.: Eigenspace-based tracking for feature points. Opt. Rev. 21(3), 304–312 (2014)
Sindhwani, V., Niyogi, P., Belkin, M.: A co-regularization approach to semi-supervised learning with multiple views. In: Proceedings of ICML workshop on learning with multiple views, pp. 74–79. Citeseer (2005)
Snoek, C.G., Worring, M., Smeulders, A.W.: Early versus late fusion in semantic video analysis. In: Proceedings of the 13th annual ACM international conference on multimedia, pp. 399–402. ACM (2005)
Timofte, R., Zimmermann, K., Van Gool, L.: Multi-view traffic sign detection, recognition, and 3d localisation. Mach. Vis. Appl. 25(3), 633–647 (2014)
Wang, P., Ji, Q.: Multi-view face tracking with factorial and switching hmm. In: Application of computer vision, 2005. WACV/MOTIONS’05 Volume 1. Seventh IEEE Workshops on, vol. 1, pp. 401–406. IEEE (2005)
Wang, P., Ji, Q.: Multi-view face and eye detection using discriminant features. Comput. Vis. Image Underst. 105(2), 99–111 (2007)
White, M., Zhang, X., Schuurmans, D., Yu, Y.l.: Convex multi-view subspace learning. In: Advances in neural information processing systems, pp. 1673–1681 (2012)
Xiao, G., Yun, X., Wu, J.: A multi-cue mean-shift target tracking approach based on fuzzified region dynamic image fusion. Sci. China Inf. Sci. 55(3), 577–589 (2012)
Xing, J., Ai, H., Lao, S.: Multi-object tracking through occlusions by local tracklets filtering and global tracklets association with detection responses. In: Computer vision and pattern recognition, 2009. CVPR 2009. IEEE conference on, pp. 1200–1207. IEEE (2009)
Xu, C., Tao, D., Xu, C.: A survey on multi-view learning, pp. 1–59 (2013). arXiv:1304.5634
Zhang, K., Zhang, L., Liu, Q., Zhang, D., Yang, M.H.: Fast visual tracking via dense spatio-temporal context learning. In: Computer vision–ECCV 2014, pp. 127–141. Springer (2014)
Zhang, K., Zhang, L., Yang, M.H.: Real-time compressive tracking. In: Computer vision–ECCV 2012, pp. 864–877. Springer (2012)
Zhang, K., Zhang, L., Yang, M.H.: Real-time object tracking via online discriminative feature selection. IEEE Trans. Image Process. 22(12), 4664–4677 (2013)
Zhang, K., Zhang, L., Yang, M.H.: Fast compressive tracking. IEEE Trans. Pattern Anal. Mach. Intell. 36(10), 2002–2015 (2014)
Zhou, P., Yao, J., Pei, J.: Implementation of an energy-efficient scheduling scheme based on pipeline flux leak monitoring networks. Sci. China Ser. F Inf. Sci. 52(9), 1632–1639 (2009)
Acknowledgments
This work is supported by the National Natural Science Foundation of China (Grant Nos. 61175028, 61365009) and the Ph.D. Programs Foundation of the Ministry of Education of China (Grant Nos. 20090073110045).
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Yun, X., Jing, Z. & Jin, B. Visible and infrared tracking based on multi-view multi-kernel fusion model. Opt Rev 23, 244–253 (2016). https://doi.org/10.1007/s10043-015-0175-5
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10043-015-0175-5