Imagination: Exploiting Link Analysis for Accurate Image Annotation

  • Ilaria Bartolini
  • Paolo Ciaccia
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4918)

Abstract

The advent of digital photography calls for effective techniques for managing growing amounts of color images. Systems that rely only on low-level image features are limited by the semantic gap problem, which leads to a mismatch between the user's subjective notion of similarity and the one adopted by the system. A possible way to reduce the semantic gap is to (semi-)automatically assign meaningful terms to images, so as to enable high-level, concept-based retrieval. In this paper we explore the opportunities offered by graph-based link analysis techniques in the development of a semi-automatic image captioning system. The approach we propose is appealing because the number of terms predicted for an image varies with the image content, the predicted terms are mutually correlated, and they can also describe abstract concepts. We present preliminary results on our prototype system and discuss possible extensions.

Copyright information

© Springer-Verlag Berlin Heidelberg 2008

Authors and Affiliations

  • Ilaria Bartolini (1)
  • Paolo Ciaccia (1)
  1. DEIS, University of Bologna, Italy