International Journal of Computer Vision, Volume 103, Issue 1, pp 163–175

Learning Vocabularies over a Fine Quantization

  • Andrej Mikulik
  • Michal Perdoch
  • Ondřej Chum
  • Jiří Matas

DOI: 10.1007/s11263-012-0600-1

Cite this article as:
Mikulik, A., Perdoch, M., Chum, O. et al. Int J Comput Vis (2013) 103: 163. doi:10.1007/s11263-012-0600-1

Abstract

A novel similarity measure for bag-of-words-type large-scale image retrieval is presented. The similarity function is learned in an unsupervised manner, requires no extra space over the standard bag-of-words method, and is more discriminative than both L2-based soft assignment and Hamming embedding. The novel similarity function achieves mean average precision that is superior to any result published in the literature on the standard Oxford 5k, Oxford 105k and Paris datasets/protocols. We study the effect of a fine quantization and very large vocabularies (up to 64 million words) and show that the performance of specific object retrieval increases with the size of the vocabulary. This observation contradicts previously published results. We further demonstrate that large vocabularies increase the speed of the tf-idf scoring step.
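
For readers unfamiliar with the baseline the abstract refers to, the following is a minimal sketch of standard tf-idf bag-of-words scoring over an inverted file. It is not the learned similarity presented in the article. The function names `build_index` and `score`, the image identifiers, and the toy visual-word lists are hypothetical and chosen only for illustration.

```python
import math
from collections import Counter, defaultdict


def build_index(docs):
    """Build an inverted file from {image_id: [visual_word_id, ...]}.

    Each posting stores the L2-normalised tf-idf weight of a word in an image.
    """
    n_docs = len(docs)
    # Document frequency: number of images containing each visual word.
    df = Counter()
    for words in docs.values():
        df.update(set(words))
    idf = {w: math.log(n_docs / count) for w, count in df.items()}

    inverted = defaultdict(list)
    for doc_id, words in docs.items():
        tf = Counter(words)
        weights = {w: c * idf[w] for w, c in tf.items()}
        # Guard against an all-zero vector (every word occurs in every image).
        norm = math.sqrt(sum(v * v for v in weights.values())) or 1.0
        for w, v in weights.items():
            inverted[w].append((doc_id, v / norm))
    return inverted, idf


def score(query_words, inverted, idf):
    """Rank images by cosine similarity of tf-idf vectors.

    Only the posting lists of the query's visual words are traversed.
    """
    tf = Counter(query_words)
    q = {w: c * idf[w] for w, c in tf.items() if w in idf}
    q_norm = math.sqrt(sum(v * v for v in q.values())) or 1.0
    scores = defaultdict(float)
    for w, v in q.items():
        for doc_id, d in inverted.get(w, ()):
            scores[doc_id] += (v / q_norm) * d
    return sorted(scores.items(), key=lambda kv: -kv[1])


# Toy database: three images described by visual-word ids.
docs = {"a": [1, 2, 3], "b": [3, 4], "c": [5, 6]}
inverted, idf = build_index(docs)
ranking = score([1, 2], inverted, idf)
```

In this sketch the scoring cost is proportional to the total length of the posting lists visited. With a finer quantization each visual word tends to occur in fewer images, so the lists are shorter, which is one plausible reading of the abstract's remark about tf-idf scoring speed.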

Keywords

Image retrieval, Vocabulary, Feature track

Copyright information

© Springer Science+Business Media New York 2012

Authors and Affiliations

  • Andrej Mikulik (1)
  • Michal Perdoch (1)
  • Ondřej Chum (1)
  • Jiří Matas (1)

  1. CMP, Department of Cybernetics, Faculty of Electrical Engineering, Czech Technical University in Prague, Prague, Czech Republic