Keyframe Retrieval by Keypoints: Can Point-to-Point Matching Help?

Zhao, Wanlei; Jiang, Yu-Gang; Ngo, Chong-Wah

doi:10.1007/11788034_8

Wanlei Zhao²⁰,
Yu-Gang Jiang²⁰ &
Chong-Wah Ngo²⁰

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 4071))

Included in the following conference series:

International Conference on Image and Video Retrieval

847 Accesses
32 Citations

Abstract

Bag-of-words representation with visual keypoints has recently emerged as an attractive approach for video search. In this paper, we study the degree of improvement when point-to-point (P2P) constraint is imposed on the bag-of-words. We conduct investigation on two tasks: near-duplicate keyframe (NDK) retrieval, and high-level concept classification, covering parts of TRECVID 2003 and 2005 datasets. In P2P matching, we propose a one-to-one symmetric keypoint matching strategy to diminish the noise effect during keyframe comparison. In addition, a new multi-dimensional index structure is proposed to speed up the matching process with keypoint filtering. Through experiments, we demonstrate that P2P constraint can significantly boost the performance of NDK retrieval, while showing competitive accuracy in concept classification of broadcast domain.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Wu, X., Ngo, C.-W., Li, Q.: Threading and Autodocumenting News Videos. IEEE Signal Processing Magazine 23(2), 59–68 (2006)
Article Google Scholar
Chang, S.-F., et al.: Columbia University TRECVID-2005 Video Search and High-Level Feature Extraction. In: TRECVID Online Proceedings (2005)
Google Scholar
Zhang, D.-Q., Chang, S.-F.: Detecting Image Near-Duplicate by Stochastic Attributed Relational Graph Matching with Learning. In: ACM International Conference on Multimedia, pp. 877–884 (2004)
Google Scholar
TREC Video Retrieval Evaluation, http://www-nlpir.nist.gov/projects/trecvid/
Csurka, G., Dance, C., Fan, L., et al.: Visual Categorization with Bags of Keypoints. In: ECCV 2004 Workshop on Statistical Learning in Computer Vision, pp. 59–74 (2004)
Google Scholar
Sivic, J., Zisserman, A.: Video Google: A Text Retrieval Approach to Object Matching in Videos. In: International Conference on Computer Vision, pp. 1470–1477 (2003)
Google Scholar
Ke, Y., Suthankar, R., Huston, L.: Efficient Near-Duplicate Detection and Sub-image Retrieval. In: ACM International Conference on Multimedia, pp. 869–876 (2004)
Google Scholar
Grauman, K., Darrell, T.: Efficient Image Matching with Distributions of Local Invariant Features. Computer Vision and Pattern Recognition, 627–634 (2005)
Google Scholar
Rubner, Y., Tomasi, C., Guibas, L.J.: The Earth Mover’s Distance as a Metric for Image Retrieval. International Journal of Computer Vision 40, 99–121 (2000)
Article MATH Google Scholar
Mikolajczyk, K., Schmid, C.: Scale and Affine Invariant Interest Point Detectors. International Journal of Computer Vision 60, 63–86 (2004)
Article Google Scholar
Mikolajczyk, K., Tuytelaars, T., Schmid, C., et al.: A Comparison of Affine Region Detectors. International Journal on Computer Vision 65(1-2), 43–72 (2005)
Article Google Scholar
Lowe, D.: Distinctive Image Features from Scale-Invariant Key Points. International Journal of Computer Vision 60, 91–110 (2004)
Article Google Scholar
Matas, J., Chum, O., Urban, M., et al.: Robust Wide Baseline Stereo from Maximally Stable Extremal Regions. In: British Machine Vision Conference, pp. 384–393 (2002)
Google Scholar
Mikolajczyk, K., Schmid, C.: A Performance Evaluation of Local Descriptors. Computer Vision and Pattern Recognition, 257–263 (2003)
Google Scholar
Ke, Y., Sukthankar, R.: PCA-SIFT: A More Distinctive Representation for Local Image Descriptors. Computer Vision and Pattern Recognition 2, 506–513 (2004)
Google Scholar
Zhao, Y., Karypis, G.: Empirical and Theoretical Comparisons of Selected Criterion Functions for Document Clustering. Machine Learning 55, 311–331 (2004)
Article MATH Google Scholar
Quelhas, P., Monay, F., et al.: Modeling Scenes with Local Descriptors and Latent Aspects. In: International Conference on Computer Vision, pp. 883–890 (2005)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science, City University of Hong Kong, Kowloon, Hong Kong
Wanlei Zhao, Yu-Gang Jiang & Chong-Wah Ngo

Authors

Wanlei Zhao
View author publications
You can also search for this author in PubMed Google Scholar
Yu-Gang Jiang
View author publications
You can also search for this author in PubMed Google Scholar
Chong-Wah Ngo
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Arts, Media and Engineering Program, Arizona State University, 85281, Tempe, AZ,
Hari Sundaram
Intelligent Information Management Department, IBM T.J. Watson Research Center, 19 Skyline Drive, 10532, Hawthorne, NY, USA
Milind Naphade
Intelligent Information Management Department, IBM T. J. Watson Research Center, 19 Skyline Drive, 10532, Hawthorne, NY, USA
John R. Smith
Microsoft Corporation, Microsoft China R&D Group, 49 Zhichun Road, 100080, Beijing, China
Yong Rui

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Zhao, W., Jiang, YG., Ngo, CW. (2006). Keyframe Retrieval by Keypoints: Can Point-to-Point Matching Help?. In: Sundaram, H., Naphade, M., Smith, J.R., Rui, Y. (eds) Image and Video Retrieval. CIVR 2006. Lecture Notes in Computer Science, vol 4071. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11788034_8

Download citation

DOI: https://doi.org/10.1007/11788034_8
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-36018-6
Online ISBN: 978-3-540-36019-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics