Skip to main content

Fast Visual Vocabulary Construction for Image Retrieval Using Skewed-Split k-d Trees

  • Conference paper
  • First Online:
MultiMedia Modeling (MMM 2016)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 9516))

Included in the following conference series:

Abstract

Most of the image retrieval approaches nowadays are based on the Bag-of-Words (BoW) model, which allows for representing an image efficiently and quickly. The efficiency of the BoW model is related to the efficiency of the visual vocabulary. In general, visual vocabularies are created by clustering all available visual features, formulating specific patterns. Clustering techniques are k-means oriented and they are replaced by approximate k-means methods for very large datasets. In this work, we propose a faster construction of visual vocabularies compared to the existing method in the case of SIFT descriptors, based on our observation that the values of the 128-dimensional SIFT descriptors follow the exponential distribution. The application of our method to image retrieval in specific image datasets showed that the mean Average Precision is not reduced by our approximation, despite that the visual vocabulary has been constructed significantly faster compared to the state of the art methods.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

Notes

  1. 1.

    http://pami.xmu.edu.cn/~wlzhao/lip-vireo.htm.

  2. 2.

    http://host.robots.ox.ac.uk/pascal/VOC/voc2007/index.html.

  3. 3.

    http://www.multimedia-computing.de/flickrlogos/.

  4. 4.

    http://www.robots.ox.ac.uk/~vgg/data/oxbuildings/.

  5. 5.

    http://www.vision.caltech.edu/Image_Datasets/Caltech101/.

  6. 6.

    http://wang.ist.psu.edu/docs/related/.

  7. 7.

    https://www.r-project.org/.

References

  1. Bishop, C.M.: Pattern Recognition and Machine Learning. Springer, Berlin (2006)

    MATH  Google Scholar 

  2. Devroye, L.: Sample-based non-uniform random variate generation. In: Proceedings of the 18th Conference on Winter Simulation, pp. 260–265. ACM, December 1986

    Google Scholar 

  3. Jégou, H., Douze, M., Schmid, C., Pérez, P.: Aggregating local descriptors into a compact image representation. In: 2010 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 3304–3311. IEEE, June 2010

    Google Scholar 

  4. Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60(2), 91–110 (2004)

    Article  Google Scholar 

  5. Luo, Q., Zhang, S., Huang, T., Gao, W., Tian, Q.: Superimage: packing semantic-relevant images for indexing and retrieval. In: Proceedings of International Conference on Multimedia Retrieval, p. 41. ACM, April 2014

    Google Scholar 

  6. Mikolajczyk, K., Leibe, B., Schiele, B.: Multiple object class detection with a generative model. In: 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, vol. 1, pp. 26–36. IEEE, June 2006

    Google Scholar 

  7. Mikulik, A., Chum, O., Matas, J.: Image retrieval for online browsing in large image collections. In: Brisaboa, N., Pedreira, O., Zezula, P. (eds.) SISAP 2013. LNCS, vol. 8199, pp. 3–15. Springer, Heidelberg (2013)

    Chapter  Google Scholar 

  8. Moise, D., Shestakov, D., Gudmundsson, G., Amsaleg, L.: Indexing and searching 100 M images with map-reduce. In: Proceedings of the 3rd ACM Conference on International Conference on Multimedia Retrieval, pp. 17–24. ACM, April 2013

    Google Scholar 

  9. Philbin, J.: Scalable object retrieval in very large image collections. Doctoral dissertation, Oxford University (2010)

    Google Scholar 

  10. Philbin, J., Chum, O., Isard, M., Sivic, J., Zisserman, A.: Object retrieval with large vocabularies and fast spatial matching. In: IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2007, pp. 1–8. IEEE, June 2007

    Google Scholar 

  11. Rawlings, J.O., Pantula, S.G., Dickey, D.A.: Applied Regression Analysis: a Research Tool. Springer Science & Business Media, New York (1998)

    Book  MATH  Google Scholar 

  12. Sivic, J., Zisserman, A.: Video Google: a text retrieval approach to object matching in videos. In: Ninth IEEE International Conference on Computer Vision, Proceedings, pp. 1470–1477. IEEE, October 2003

    Google Scholar 

Download references

Acknowledgements

This work was supported by the projects MULTISENSOR (FP7-610411) and KRISTINA (H2020-645012), funded by the European Commission.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ilias Gialampoukidis .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Gialampoukidis, I., Vrochidis, S., Kompatsiaris, I. (2016). Fast Visual Vocabulary Construction for Image Retrieval Using Skewed-Split k-d Trees. In: Tian, Q., Sebe, N., Qi, GJ., Huet, B., Hong, R., Liu, X. (eds) MultiMedia Modeling. MMM 2016. Lecture Notes in Computer Science(), vol 9516. Springer, Cham. https://doi.org/10.1007/978-3-319-27671-7_39

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-27671-7_39

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-27670-0

  • Online ISBN: 978-3-319-27671-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics