An effective image retrieval mechanism using family-based spatial consistency filtration with object region

Article

Abstract

How to construct an appropriate spatial consistent measurement is the key to improving image retrieval performance. To address this problem, this paper introduces a novel image retrieval mechanism based on the family filtration in object region. First, we supply an object region by selecting a rectangle in a query image such that system returns a ranked list of images that contain the same object, retrieved from the corpus based on 100 images, as a result of the first rank. To further improve retrieval performance, we add an efficient spatial consistency stage, which is named family-based spatial consistency filtration, to re-rank the results returned by the first rank. We elaborate the performance of the retrieval system by some experiments on the dataset selected from the key frames of “TREC Video Retrieval Evaluation 2005 (TRECVID2005)”. The results of experiments show that the retrieval mechanism proposed by us has vast major effect on the retrieval quality. The paper also verifies the stability of the retrieval mechanism by increasing the number of images from 100 to 2000 and realizes generalized retrieval with the object outside the dataset.

Keywords

Content-based image retrieval object region family-based spatial consistency filtration local affine invariant feature spatial relationship 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. [1]
    C. Carson, M. Thomas, S. Belongie, J. M. Hellerstein, J. Malik. Blobworld: A system for region-based image indexing and retrieval. In Proceeding of the 3rd International Conference on Visual Information Systems, IEEE Computer Society, Amsterdam, Holand, vol. 2, pp. 509–516, 1999.Google Scholar
  2. [2]
    J. Sivic, A. Zisserman. Video google: A text retrieval approach to object matching in videos. In Proceedings of the 9th IEEE International Conference on Computer Vision, IEEE, Nice, France, vol. 2, pp. 1470–1477, 2003.CrossRefGoogle Scholar
  3. [3]
    J. Philbin, O. Chum, M. Isard, J. Sivic, A. Zisserman. Object retrieval with large vocabularies and fast spatial matching. In Proceedings of IEEE International Conference on Computer Vision and Pattern Recognition, IEEE, Minneapolis, USA, pp. 1–8, 2007.Google Scholar
  4. [4]
    K. Gao, S. X. Lin, J. B. Guo, D. M. Zhang, Y. D. Zhang, Y. F. Wu. Object retrieval based on spatially frequent items with informative patches. In Proceedings of IEEE International Conference on Multimedia and Expo, IEEE, Hanoverian, Germany, pp. 1305–1308, 2008.CrossRefGoogle Scholar
  5. [5]
    S. Lazebnik, C. Schmid, J. Ponce. Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories. In Proceedings of IEEE Computer Society Conference on Conference on Computer Vision and Pattern Recognition, IEEE, New York, USA, vol. 2, pp. 2169–2178, 2006.Google Scholar
  6. [6]
    Q. F. Zheng, W. Q. Wang, W. Gao. Effective and efficient object-based image retrieval using visual phrases. In Proceedings of the 14th Annual ACM International Conference on Multimedia, ACM, Santa Barbara, USA, pp. 77–80, 2006.Google Scholar
  7. [7]
    D. Nistér, H. Stewénius. Scalable recognition with a vocabulary tree. In Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition, IEEE, New York, USA, vol. 2, pp. 2161–2168, 2006.Google Scholar
  8. [8]
    C. Schmid, R. Mohr. Local grayvalue invariants for image retrieval. IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 19, no. 5, pp. 530–535, 1997.CrossRefGoogle Scholar
  9. [9]
    T. Tuytelaars, L. Van Gool. Content-based image retrieval based on local affinely invariant regions. Lecture Notes in Computer Science, Springer, vol. 1614, pp. 493–500, 1999.CrossRefGoogle Scholar
  10. [10]
    F. Schaffalitzky. A. Zisserman. Multi-view matching for unordered image sets. In Proceedings of the 7th European Conference on Computer Vision, Lecture Notes in Computer Science, Springer, Copenhagen, Denmark, vol. 2350, pp. 414–431, 2002.Google Scholar
  11. [11]
    A. Baumberg. Reliable feature matching across widely separated views. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, IEEE, Hilton Head Island, USA, vol. 1, pp. 774–781, 2000.Google Scholar
  12. [12]
    K. Mikolajczyk, C. Schmid. Scale and affine invariant interest point detectors. International Journal of Computer Vision, vol. 60, no. 1, pp. 63–86, 2004.CrossRefGoogle Scholar
  13. [13]
    T. Lindeberg. Feature detection with automatic scale selection. International Journal of Computer Vision, vol. 30, no. 2, pp. 79–116, 1998.CrossRefGoogle Scholar
  14. [14]
    D. G. Lowe. Distinctive image features from scale invariant keypoints. International Journal of Computer Vision, vol. 60, no. 2, pp. 91–110, 2004.CrossRefGoogle Scholar
  15. [15]
    K. Mikolajczyk, T. Tuytelaars, C. Schmid, A. Zisserman, J. Matas, F. Schaffalitzky, T. Kadir, L. Van Gool. A comparison of affine region detectors. International Journal of Computer Vision, vol. 65, no. 1–2, pp. 43–72, 2005.CrossRefGoogle Scholar
  16. [16]
    K. Yamamoto, R. Oi. Color correction for multi-view video using energy minimization of view networks. International Journal of Automation and Computing, vol. 5, no. 3, pp. 234–245, 2008.CrossRefGoogle Scholar
  17. [17]
    S. F. Liu, C. McMahon, M. Darlington, S. Culley, P. Wild. EDCMS: A content management system for engineering documents. International Journal of Automation and Computing, vol. 4, no. 1, pp. 56–70, 2007.CrossRefGoogle Scholar
  18. [18]
    R. Baeza-Yates, B. Ribeiro-Neto. Modern Information Retrieval, ACM Press, pp. 24–34, 1999.Google Scholar

Copyright information

© Institute of Automation, Chinese Academy of Sciences and Springer-Verlag Berlin Heidelberg 2010

Authors and Affiliations

  1. 1.School of Mechanical EngineeringDalian University of TechnologyDalianPRC

Personalised recommendations