Abstract
Local feature extraction is an important solution for video analysis. The common framework of local feature extraction consists of a local keypoint detector and a keypoint descriptor. Existing keypoint detectors mainly focus on the spatial relationships among pixels, resulting in a large amount of redundant keypoints on background which are often temporally stationary. This paper proposes an object-aware local keypoint selection approach to keep the active keypoints on object and to reduce the redundant keypoints on background by exploring the temporal coherence among successive frames in video. The proposed approach is made up of three local temporal coherence criteria: (1) local temporal intensity coherence; (2) local temporal motion coherence; (3) local temporal orientation coherence. Experimental results on two publicly available datasets show that the proposed approach reduces more than 60% keypoints, which are redundant, and doubles the precision of keypoints.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Fan, B., Wang, Z., Wang, F.: Local Image Descriptor: Modern Approaches. Springer, Heidelberg (2015). https://doi.org/10.1007/978-3-662-49173-7
Awad, A.I., Hassaballah, M.: Image Feature Detectors and Descriptors. Springer, Heidelberg (2016). https://doi.org/10.1007/978-3-319-28854-3
Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60(2), 91–110 (2004)
Harris, C., Stephens, M.: A combined coer and edge detector. In: Alvey Vision Conference, pp. 147–151 (1988)
Beaudet, P.: Rotationally invariant image operators. In: International Conference on Pattern Recognition, pp. 579–583 (1978)
Smith, S.M., Brady, J.M.: SUSAN: a new approach to low level image processing. Int. J. Comput. Vis. 23(1), 45–78 (1997)
Rosten, E., Drummond, T.: Fusing points and lines for high performance tracking. In: International Conference on Computer Vision, pp. 1508–1515 (2005)
Rosten, E., Drummond, T.: Machine learning for high-speed corner detection. In: Leonardis, A., Bischof, H., Pinz, A. (eds.) ECCV 2006. LNCS, vol. 3951, pp. 430–443. Springer, Heidelberg (2006). https://doi.org/10.1007/11744023_34
Rosten, E., Porter, R., Drummond, T.: Faster and better: a machine learning approach to corner detection. IEEE Trans. Pattern Anal. Mach. Intell. 32(1), 105–119 (2010)
Zhang, H.J., Wu, J., Zhong, D., Smoliar, S.: An integrated system for content-based video retrieval and browsing. Pattern Recognit. 30(4), 643–658 (1997)
Laptev, I., Lindeberg, T.: Space-time interest points. In: International Conference on Computer Vision, pp. 432–439 (2003)
Noguchi, A., Yanai, K.: Extracting spatio-temporal local features considering consecutiveness of motions. In: Zha, H., Taniguchi, R., Maybank, S. (eds.) ACCV 2009. LNCS, vol. 5995, pp. 458–467. Springer, Heidelberg (2010). https://doi.org/10.1007/978-3-642-12304-7_43
Baroffio, L., Cesana, M., Redondi, A., Tagliasacchi, M., Tubaro, S.: Fast keypoint detection in video sequences. In: International Conference on Acoustics, Speech and Signal Processing, pp. 1342–1346 (2016)
Huong, V.T.L., Park, D.-C., Woo, D.M., Lee, Y.: Centroid neural network with Chi square distance measure for texture classification. In: International Joint Conference on Neural Networks, pp. 1310–1315 (2009)
Cuevas, C., Yáñez, E.M., García, N.: Labeled dataset for integral evaluation of moving object detection algorithms: LASIESTA. Comput. Vis. Image Underst. 152, 103–117 (2016)
Bay, H., Tuytelaars, T., Van Gool, L.: SURF: speeded up robust features. In: Leonardis, A., Bischof, H., Pinz, A. (eds.) ECCV 2006. LNCS, vol. 3951, pp. 404–417. Springer, Heidelberg (2006). https://doi.org/10.1007/11744023_32
Calonder, M., Lepetit, V., Strecha, C., Fua, P.: BRIEF: binary robust independent elementary features. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010. LNCS, vol. 6314, pp. 778–792. Springer, Heidelberg (2010). https://doi.org/10.1007/978-3-642-15561-1_56
Rublee, E., Rabaud, V., Konolige, K., Bradski, G.: ORB: an efficient alternative to SIFT or SURF. In: International Conference on Computer Vision, pp. 2564–2571 (2011)
Acknowledgments
This work was supported by KAKENHI (16K13006) and Waseda University Grant for Special Research Projects (2017K-263).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer International Publishing AG, part of Springer Nature
About this paper
Cite this paper
Du, S., Ikenaga, T. (2018). Local Temporal Coherence for Object-Aware Keypoint Selection in Video Sequences. In: Zeng, B., Huang, Q., El Saddik, A., Li, H., Jiang, S., Fan, X. (eds) Advances in Multimedia Information Processing – PCM 2017. PCM 2017. Lecture Notes in Computer Science(), vol 10736. Springer, Cham. https://doi.org/10.1007/978-3-319-77383-4_53
Download citation
DOI: https://doi.org/10.1007/978-3-319-77383-4_53
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-77382-7
Online ISBN: 978-3-319-77383-4
eBook Packages: Computer ScienceComputer Science (R0)