East European Conference on Advances in Databases and Information Systems

ADBIS 2013: Advances in Databases and Information Systems pp 98-111

Compact and Distinctive Visual Vocabularies for Efficient Multimedia Data Indexing

  • Dimitris Kastrinakis
  • Symeon Papadopoulos
  • Athena Vakali
Conference paper

DOI: 10.1007/978-3-642-40683-6_8

Volume 8133 of the book series Lecture Notes in Computer Science (LNCS)
Cite this paper as:
Kastrinakis D., Papadopoulos S., Vakali A. (2013) Compact and Distinctive Visual Vocabularies for Efficient Multimedia Data Indexing. In: Catania B., Guerrini G., Pokorný J. (eds) Advances in Databases and Information Systems. ADBIS 2013. Lecture Notes in Computer Science, vol 8133. Springer, Berlin, Heidelberg

Abstract

Multimedia data indexing for content-based retrieval has attracted significant attention in recent years due to the commoditization of multimedia capturing equipment and the widespread adoption of social networking platforms as means for sharing media content online. Due to the very large amounts of multimedia content, notably images, produced and shared online by people, a very important requirement for multimedia indexing approaches pertains to their efficiency both in terms of computation and memory usage. A common approach to support query-by-example image search is based on the extraction of visual words from images and their indexing by means of inverted indices, a method proposed and popularized in the field of text retrieval.

The main challenge that visual word indexing systems currently face arises from the fact that it is necessary to build very large visual vocabularies (hundreds of thousands or even millions of words) to support sufficiently precise search. However, when the visual vocabulary is large, the image indexing process becomes computationally expensive due to the fact that the local image descriptors (e.g. SIFT) need to be quantized to the nearest visual words.

To this end, this paper proposes a novel method that significantly decreases the time required for the above quantization process. Instead of using hundreds of thousands of visual words for quantization, the proposed method manages to preserve retrieval quality by using a much smaller number of words for indexing. This is achieved by the concept of composite words, i.e. assigning multiple words to a local descriptor in ascending order of distance. We evaluate the proposed method in the Oxford and Paris buildings datasets to demonstrate the validity of the proposed approach.

Keywords

multimedia data indexing local descriptors visual word composite visual word 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Copyright information

© Springer-Verlag Berlin Heidelberg 2013

Authors and Affiliations

  • Dimitris Kastrinakis
    • 1
  • Symeon Papadopoulos
    • 2
  • Athena Vakali
    • 1
  1. 1.Department of InformaticsAristotle UniversityThessalonikiGreece
  2. 2.Information Technologies InstituteCERTH-ITIThessalonikiGreece