Skip to main content

Towards Translation of Tags in Large Annotated Image Collections

  • Conference paper
  • First Online:
Artificial Intelligence: Methodology, Systems, and Applications (AIMSA 2016)

Abstract

This paper presents an approach for translation of tags in professional and social image databases, using an original lexical resource extracted from Wikipedia. The translation integrates a tag sense disambiguation algorithm based on WordNet and Wikipedia (as external resources defining word senses). Our disambiguation technique uses the Lesk algorithm, extended gloss overlaps and similarity measures in order to achieve successful resolution of lexical ambiguity and accurate translation of tags. We show how to involve Wikipedia as a source of translation correspondences since open WordNets are not available for most languages. Experimental results and performance evaluation show 97 % accuracy for professional images and 86 % accuracy for social images from Flickr. This translation technique can be applied by auto-tagging programs and information retrieval systems.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

References

  1. Etzioni, O., Reiter, K., Soderland, S., Sammer, H.: Lexical translation with application to image search on the web. In: Macgoad, B. (ed.) Proceedings of Machine Translation Summit XI, pp. 175–182 (2007)

    Google Scholar 

  2. Reiter, K., Soderland, S., Etzioni, O.: Cross-lingual image search on the web. In: Proceedings of the IJCAI 2007 Workshop on Cross-Lingual Information Access (2007). http://search.iiit.ac.in/CLIA2007/papers/CLIS.pdf

  3. Ustalov, D.: TagBag: Annotating a foreign language lexical resource with pictures. In: Khachay, M.Y., Konstantinova, N., Panchenko, A., Ignatov, D.I., Labunets, V.G. (eds.) AIST 2015. CCIS, vol. 542, pp. 361–369. Springer, Heidelberg (2015). doi:10.1007/978-3-319-26123-2_35

    Chapter  Google Scholar 

  4. Mueller, V.K.: English-Russian Dictionary. http://mueller-dict.sourceforge.net/cgi-bin/dict.cgi?word=00-database-info;dict=mueller-dict;scroll=on#defs

  5. Stampouli, A., Giannakidou, E., Vakali, A.: Tag disambiguation through Flickr and Wikipedia. In: Yoshikawa, M., Meng, X., Yumoto, T., Ma, Q., Sun, L., Watanabe, C. (eds.) DASFAA 2010. LNCS, vol. 6193, pp. 252–263. Springer, Heidelberg (2010)

    Chapter  Google Scholar 

  6. Kangpyo, L., Hyunwoo, K., Hyopil, S., Hyoung-Joo, K.: Tag sense disambiguation for clarifying the vocabulary of social tags. In: Proceedings of the International Conference on Computational Science and Engineering, CSE 2009, vol. 4, pp. 729–734 (2009)

    Google Scholar 

  7. Kaipeng, L., Binxing, F., Weizhe, Z.: Unsupervised tag sense disambiguation in folksonomies. J. Comput. 5(11), 1715–1722 (2010)

    Google Scholar 

  8. Wu, Z., Palmer, M.: Verb semantics and lexical selection. In: Proceedings of the 32nd Annual Meeting of the Association for Computational Linguistics, pp. 133–138 (1994)

    Google Scholar 

  9. Lesk, M.: Automatic sense disambiguation using machine readable dictionaries: how to tell a pine cone from an ice cream cone. In: Proceedings of the 5th Annual International Conference on Systems Documentation, SIGDOC 1986, pp. 24–26. ACM, USA (1986)

    Google Scholar 

  10. Banerjee, S., Pedersen, T.: Extended gloss overlaps as a measure of semantic relatedness. In: Proceedings of the 18th International Joint Conference on AI (IJCAI), pp. 805–810 (2003)

    Google Scholar 

  11. Kanishcheva, O., Angelovaou, G.: About sense disambiguation of image tags in large annotated image collections. In: Margenov, S., Angelova, G., Agre, G. (eds.) Innovative Approaches and Solutions in Advanced Intelligent Systems. Studies in Computational Intelligence, vol. 648, pp. 133–149. Springer, Heidelberg (2016)

    Chapter  Google Scholar 

  12. Kanishcheva, O., Angelova, G.: A pipeline approach to image auto-tagging refinement. In: Bădică, et al. (eds.) Proceedings of the 7th Balkan Conference on Informatics. ACM, New York (2015)

    Google Scholar 

Download references

Acknowledgements

This research is partially supported by the EC FP7 grant 316087 AComIn “Advanced Computing for Innovation”, 2012–2016. It is also related to the COST Action IC1307 “Integrating Vision and Language (iV&L Net)”. The authors are thankful to Imagga’s technical team for their comments, recommendations and experimental datasets.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Olga Kanishcheva .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Kanishcheva, O., Angelova, G., Nikolov, S.G. (2016). Towards Translation of Tags in Large Annotated Image Collections. In: Dichev, C., Agre, G. (eds) Artificial Intelligence: Methodology, Systems, and Applications. AIMSA 2016. Lecture Notes in Computer Science(), vol 9883. Springer, Cham. https://doi.org/10.1007/978-3-319-44748-3_14

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-44748-3_14

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-44747-6

  • Online ISBN: 978-3-319-44748-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics