Visual Codebooks Survey for Video On-Line Processing

Beran, Vítězslav; Zemčík, Pavel

doi:10.1007/978-3-642-15910-7_1

Vítězslav Beran²⁰ &
Pavel Zemčík²⁰

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 6374))

Included in the following conference series:

International Conference on Computer Vision and Graphics

1143 Accesses

Abstract

This paper explores techniques in the pipeline of image description based on visual codebooks suitable for video on-line processing. The pipeline components are (i) extraction and description of local image features, (ii) translation of each high-dimensional feature descriptor to several most appropriate visual words selected from the discrete codebook and (iii) combination of visual words into bag-of-words using hard or soft assignment weighting scheme. For each component, several state-of-the-art techniques are analyzed and discussed and their usability for video on-line processing is addressed. The experiments are evaluated on the standard Kentucky and Oxford building datasets using image retrieval framework. The results show the impact loosing the pipeline precision in the price of improving the time cost which is crucial for real-time video processing.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Sivic, J., Zisserman, A.: Video Google: Efficient visual search of videos. In: Ponce, J., Hebert, M., Schmid, C., Zisserman, A. (eds.) Toward Category-Level Object Recognition. LNCS, vol. 4170, pp. 127–144. Springer, Heidelberg (2006)
Chapter Google Scholar
Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vision 60(2), 91–110 (2004)
Article Google Scholar
Nister, D., Stewenius, H.: Scalable recognition with a vocabulary tree. In: CVPR 2006: Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Washington, DC, USA, pp. 2161–2168. IEEE Computer Society, Los Alamitos (2006)
Google Scholar
Philbin, J., Chum, O., Isard, M., Sivic, J., Zisserman, A.: Object retrieval with large vocabularies and fast spatial matching. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2007)
Google Scholar
Jiang, Y.G., Ngo, C.W., Yang, J.: Towards optimal bag-of-features for object categorization and semantic video retrieval. In: CIVR 2007: Proceedings of the 6th ACM International Conference on Image and Video Retrieval, pp. 494–501. ACM, New York (2007)
Google Scholar
Philbin, J., Chum, O., Isard, M., Sivic, J., Zisserman, A.: Lost in quantization: Improving particular object retrieval in large scale image databases. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2008)
Google Scholar
Uijlings, J.R.R., Smeulders, A.W.M., Scha, R.J.H.: Real-time bag of words, approximately. In: CIVR 2009: Proceeding of the ACM International Conference on Image and Video Retrieval, pp. 1–8. ACM, New York (2009)
Chapter Google Scholar
Mikolajczyk, K., Tuytelaars, T., Schmid, C., Zisserman, A., Matas, J., Schaffalitzky, F., Kadir, T., Gool, L.V.: A comparison of affine region detectors. Int. J. Comput. Vision 65(1-2), 43–72 (2005)
Article Google Scholar
Bay, H., Tuytelaars, T., Gool, L.V.: Surf: Speeded up robust features. In: Leonardis, A., Bischof, H., Pinz, A. (eds.) ECCV 2006. LNCS, vol. 3951, pp. 404–417. Springer, Heidelberg (2006)
Chapter Google Scholar
Rosten, E., Drummond, T.: Machine learning for high-speed corner detection. In: Leonardis, A., Bischof, H., Pinz, A. (eds.) ECCV 2006. LNCS, vol. 3951, pp. 430–443. Springer, Heidelberg (2006)
Chapter Google Scholar
Mikolajczyk, K., Schmid, C.: Scale & affine invariant interest point detectors. Int. J. Comput. Vision 60(1), 63–86 (2004)
Article Google Scholar
Dalal, N., Triggs, B., Schmid, C.: Human detection using oriented histograms of flow and appearance. In: Leonardis, A., Bischof, H., Pinz, A. (eds.) ECCV 2006. LNCS, vol. 3952, pp. 428–441. Springer, Heidelberg (2006)
Chapter Google Scholar
Mikolajczyk, K., Schmid, C.: A performance evaluation of local descriptors. IEEE Trans. Pattern Anal. Mach. Intell. 27(10), 1615–1630 (2005)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Graphics and Multimedia Faculty of Information Technology, Brno University of Technology, Božetěchova 2, 612 66, Brno, CZ
Vítězslav Beran & Pavel Zemčík

Authors

Vítězslav Beran
View author publications
You can also search for this author in PubMed Google Scholar
Pavel Zemčík
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Polish-Japanese Institute of Information Technology, 86 Koszykowa St, 02-008, Warsaw, Poland
Leonard Bolc
Institute of Automatics, AGH University of Science and Technology, Al. Mickiewicza 30, 30-059, Kraków, Poland
Ryszard Tadeusiewicz
Faculty of Applied Informatics and Mathematics, Department of Informatics, Warsaw University of Life Sciences, Nowoursynowska 159, 02-776, Warsaw, Poland
Leszek J. Chmielewski
Polish-Japanese Institute of Information Technology, 86 Koszykowa St, 02-008 Warsaw, konradw@pjwstk.edu.pl, The Silesian University of Technology, Institute of Computer Science, Akademicka 16, 44-100 Gliwice, Polan
Konrad Wojciechowski

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Beran, V., Zemčík, P. (2010). Visual Codebooks Survey for Video On-Line Processing. In: Bolc, L., Tadeusiewicz, R., Chmielewski, L.J., Wojciechowski, K. (eds) Computer Vision and Graphics. ICCVG 2010. Lecture Notes in Computer Science, vol 6374. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15910-7_1

Download citation

DOI: https://doi.org/10.1007/978-3-642-15910-7_1
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-15909-1
Online ISBN: 978-3-642-15910-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics