Web Image Annotation Using an Effective Term Weighting
The number of images on the World Wide Web has grown tremendously, and providing search services for these images is an active research area. Web images are usually accompanied by associated texts such as the ALT text, the surrounding text, the image filename, and the HTML page title. Many popular internet search engines use these associated texts when indexing images and give higher importance to terms present in the ALT text. However, a recent study has shown that around half of the images on the web have no ALT text, so predicting the ALT text of an image in a web page would be of great use in web image retrieval. We propose an approach to ALT text prediction that builds on the term co-occurrence approach proposed in the literature. Our results show that, for ALT text prediction, our approach and the simple term co-occurrence approach produce almost the same results. We analyze both methods and describe how each can be used in different situations. We then build an image annotation system on top of our proposed approach and compare its results with those of an image annotation system built on top of the term co-occurrence approach. Preliminary experiments on a set of 1000 images show that our proposed approach outperforms the simple term co-occurrence approach for web image annotation.
Keywords: Noun Phrase; Image Annotation; Term Weighting; Latent Dirichlet Allocation Model; Anchor Text