Multimedia Tools and Applications, Volume 72, Issue 2, pp 1483–1506

Aligning codebooks for near duplicate image detection

  • Sebastiano Battiato
  • Giovanni Maria Farinella
  • Giovanni Puglisi
  • Daniele Ravì

DOI: 10.1007/s11042-013-1470-4

Cite this article as:
Battiato, S., Farinella, G.M., Puglisi, G. et al. Multimed Tools Appl (2014) 72: 1483. doi:10.1007/s11042-013-1470-4

Abstract

The detection of near duplicate images in large databases, such as those of popular social networks, digital investigation archives, and surveillance systems, is an important task for a number of image forensics applications. In digital investigation, hashing techniques are commonly used to index large quantities of images so that copies belonging to different archives can be detected. In the last few years, several image hashing techniques based on the Bags of Visual Features paradigm have appeared in the literature. Recently, this paradigm has been augmented by using multiple descriptors (e.g., Bags of Visual Phrases) in order to exploit the coherence between different feature spaces. In this paper we propose to further improve the Bags of Visual Phrases approach by considering the coherence between feature spaces not only at the level of image representation, but also during the codebook generation phase. We also introduce a novel image database specifically designed for the development and benchmarking of near duplicate image retrieval techniques. The dataset consists of more than 3,300 images depicting more than 500 different scenes, each having at least three real near duplicates. The dataset exhibits high variability in terms of geometric and photometric transformations between scenes and their corresponding near duplicates. Finally, we suggest a method to compress the proposed image representation for storage purposes. Experiments show the effectiveness of the proposed near duplicate retrieval technique, which outperforms the original Bags of Visual Phrases approach.

Keywords

  • Image forensics
  • Near duplicate images
  • Image retrieval
  • Bags of visual words
  • Bags of visual phrases
  • Codebooks alignment

Copyright information

© Springer Science+Business Media New York 2013

Authors and Affiliations

  • Sebastiano Battiato (1)
  • Giovanni Maria Farinella (1)
  • Giovanni Puglisi (1)
  • Daniele Ravì (1)
  1. Department of Mathematics and Computer Science, Image Processing Laboratory, University of Catania, Catania, Italy
