Skip to main content

A Graph Community and Bag of Categorized Visual Words Based Image Retrieval

  • Conference paper
  • First Online:
Neural Information Processing (ICONIP 2015)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 9492))

Included in the following conference series:

Abstract

We propose a novel method for organizing image dataset with tags in which each image is regarded as a node of a complex network and the semantic information of the dataset can be grouped with community detection algorithm. The retrieval process are divided into two phases to improve accuracy by searching in a smaller sub-dataset. In the first phase, tags of a query image are taken as the priority matter to select target communities; in the second one, content based image retrieval is performed within the target communities. Bag of categorized visual words model is proposed as image content representation to improve description ability for objects compared with bag of visual words. Besides we also try to implement bag of visual words with SOM classifier.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    Note that the term seed has the same meaning as community in this algorithm.

  2. 2.

    Available at: http://www.cmlab.csie.ntu.edu.tw/%7Ekuonini/Flickr11K.

References

  1. Li, X., Snoek, C.G.M., Worring, M.: Learning social tag relevance by neighbor voting. IEEE Trans. Multimedia 11(7), 1310–1322 (2009)

    Article  Google Scholar 

  2. Victor, L., Manmatha, R., Jiwoon, J.: A model for learning the semantics of pictures. In: Advances in Neural Information Processing Systems Conference, vol. 16 (2003)

    Google Scholar 

  3. Metzler, D., Manmatha, R.: An inference network approach to image retrieval. In: Enser, P.G.B., Kompatsiaris, Y., O’Connor, N.E., Smeaton, A.F., Smeulders, A.W.M. (eds.) CIVR 2004. LNCS, vol. 3115, pp. 42–50. Springer, Heidelberg (2004)

    Chapter  Google Scholar 

  4. Carneiro, G., Chan, A.B., Moreno, P.J., Vasconcelos, N.: Supervised learning of semantic classes for image annotation and retrieval. IEEE Trans. Pattern Anal. Mach. Intell. 29(3), 394–410 (2007)

    Article  Google Scholar 

  5. Wu, L., Jin, R., Jain, A.K.: Tag completion for image retrieval. IEEE Trans. Pattern Anal. Mach. Intell. 35(3), 716–727 (2013)

    Article  Google Scholar 

  6. Kuo, Y.H., Cheng, W.H., HT, Lin, Hsu, W.H.: Unsupervised semantic feature discovery for image object retrieval and tag refinement. IEEE Trans. Multimedia 14(4), 1079–1090 (2012)

    Article  Google Scholar 

  7. Sivic, J., Zisserman, A.: Video Google: a text retrieval approach to object matching in videos. In: Ninth IEEE International Conference on Computer Vision, vol. 2, pp. 1470–1477. IEEE, Nice (2003)

    Google Scholar 

  8. Fei-Fei, L., Perona, P.: A Bayesian hierarchical model for learning natural scene categories. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, vol. 2, pp. 524–531. IEEE, San Diego (2005)

    Google Scholar 

  9. Philbin, J., Chum, O., Isard, M., Sivic, J., Zisserman, A.: Lost in quantization: improving particular object retrieval in large scale image databases. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 1–8. IEEE, Anchorage (2008)

    Google Scholar 

  10. Nister, D., Stewenius, H.: Scalable recognition with a vocabulary tree. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, vol. 2, pp. 2161–2168. IEEE, New York City (2006)

    Google Scholar 

  11. Lazebnik, S., Schmid, C., Ponce, J.: Beyond bags of features: spatial pyramid matching for recognizing natural scene categories. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, vol. 2, pp. 2169–2178. IEEE, Los Alamitos (2006)

    Google Scholar 

  12. Ali, A.H.Z., Amir, H.D.: Finding communities in linear time by developing the seeds. Phys. Rev. E 84(3), 1553–1563 (2011)

    Google Scholar 

Download references

Acknowledgements

This work was supported in part by National Natural Science Foundation of China under grant 61371148.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Xiaodong Gu .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Zhang, W., Lu, H., Sun, S., Gu, X. (2015). A Graph Community and Bag of Categorized Visual Words Based Image Retrieval. In: Arik, S., Huang, T., Lai, W., Liu, Q. (eds) Neural Information Processing. ICONIP 2015. Lecture Notes in Computer Science(), vol 9492. Springer, Cham. https://doi.org/10.1007/978-3-319-26561-2_56

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-26561-2_56

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-26560-5

  • Online ISBN: 978-3-319-26561-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics