A Graph Community and Bag of Categorized Visual Words Based Image Retrieval

Zhang, Wang; Lu, Haoyuan; Sun, Shangdi; Gu, Xiaodong

doi:10.1007/978-3-319-26561-2_56

Wang Zhang¹⁷,
Haoyuan Lu¹⁷,
Shangdi Sun¹⁷ &
…
Xiaodong Gu¹⁷

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 9492))

Included in the following conference series:

International Conference on Neural Information Processing

Abstract

We propose a novel method for organizing image dataset with tags in which each image is regarded as a node of a complex network and the semantic information of the dataset can be grouped with community detection algorithm. The retrieval process are divided into two phases to improve accuracy by searching in a smaller sub-dataset. In the first phase, tags of a query image are taken as the priority matter to select target communities; in the second one, content based image retrieval is performed within the target communities. Bag of categorized visual words model is proposed as image content representation to improve description ability for objects compared with bag of visual words. Besides we also try to implement bag of visual words with SOM classifier.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
Note that the term seed has the same meaning as community in this algorithm.
2.
Available at: http://www.cmlab.csie.ntu.edu.tw/%7Ekuonini/Flickr11K.

References

Li, X., Snoek, C.G.M., Worring, M.: Learning social tag relevance by neighbor voting. IEEE Trans. Multimedia 11(7), 1310–1322 (2009)
Article Google Scholar
Victor, L., Manmatha, R., Jiwoon, J.: A model for learning the semantics of pictures. In: Advances in Neural Information Processing Systems Conference, vol. 16 (2003)
Google Scholar
Metzler, D., Manmatha, R.: An inference network approach to image retrieval. In: Enser, P.G.B., Kompatsiaris, Y., O’Connor, N.E., Smeaton, A.F., Smeulders, A.W.M. (eds.) CIVR 2004. LNCS, vol. 3115, pp. 42–50. Springer, Heidelberg (2004)
Chapter Google Scholar
Carneiro, G., Chan, A.B., Moreno, P.J., Vasconcelos, N.: Supervised learning of semantic classes for image annotation and retrieval. IEEE Trans. Pattern Anal. Mach. Intell. 29(3), 394–410 (2007)
Article Google Scholar
Wu, L., Jin, R., Jain, A.K.: Tag completion for image retrieval. IEEE Trans. Pattern Anal. Mach. Intell. 35(3), 716–727 (2013)
Article Google Scholar
Kuo, Y.H., Cheng, W.H., HT, Lin, Hsu, W.H.: Unsupervised semantic feature discovery for image object retrieval and tag refinement. IEEE Trans. Multimedia 14(4), 1079–1090 (2012)
Article Google Scholar
Sivic, J., Zisserman, A.: Video Google: a text retrieval approach to object matching in videos. In: Ninth IEEE International Conference on Computer Vision, vol. 2, pp. 1470–1477. IEEE, Nice (2003)
Google Scholar
Fei-Fei, L., Perona, P.: A Bayesian hierarchical model for learning natural scene categories. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, vol. 2, pp. 524–531. IEEE, San Diego (2005)
Google Scholar
Philbin, J., Chum, O., Isard, M., Sivic, J., Zisserman, A.: Lost in quantization: improving particular object retrieval in large scale image databases. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 1–8. IEEE, Anchorage (2008)
Google Scholar
Nister, D., Stewenius, H.: Scalable recognition with a vocabulary tree. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, vol. 2, pp. 2161–2168. IEEE, New York City (2006)
Google Scholar
Lazebnik, S., Schmid, C., Ponce, J.: Beyond bags of features: spatial pyramid matching for recognizing natural scene categories. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, vol. 2, pp. 2169–2178. IEEE, Los Alamitos (2006)
Google Scholar
Ali, A.H.Z., Amir, H.D.: Finding communities in linear time by developing the seeds. Phys. Rev. E 84(3), 1553–1563 (2011)
Google Scholar

Download references

Acknowledgements

This work was supported in part by National Natural Science Foundation of China under grant 61371148.

Author information

Authors and Affiliations

Department of Electronic Engineering, Fudan University, Shanghai, 200433, China
Wang Zhang, Haoyuan Lu, Shangdi Sun & Xiaodong Gu

Authors

Wang Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Haoyuan Lu
View author publications
You can also search for this author in PubMed Google Scholar
Shangdi Sun
View author publications
You can also search for this author in PubMed Google Scholar
Xiaodong Gu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Xiaodong Gu .

Editor information

Editors and Affiliations

University of Istanbul, Istanbul, Turkey
Sabri Arik
University at Qatar, Doha, Qatar
Tingwen Huang
Tunku Abdul Rahman University College, Kuala Lumpur, Malaysia
Weng Kin Lai
University of Science Technology, Wuhan, China
Qingshan Liu

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Zhang, W., Lu, H., Sun, S., Gu, X. (2015). A Graph Community and Bag of Categorized Visual Words Based Image Retrieval. In: Arik, S., Huang, T., Lai, W., Liu, Q. (eds) Neural Information Processing. ICONIP 2015. Lecture Notes in Computer Science(), vol 9492. Springer, Cham. https://doi.org/10.1007/978-3-319-26561-2_56

Download citation

DOI: https://doi.org/10.1007/978-3-319-26561-2_56
Published: 18 November 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-26560-5
Online ISBN: 978-3-319-26561-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics