Skip to main content

Learning Distance Functions for Automatic Annotation of Images

  • Conference paper
Adaptive Multimedia Retrieval: Retrieval, User, and Semantics (AMR 2007)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 4918))

Included in the following conference series:

Abstract

This paper gives an overview of recent approaches towards image representation and image similarity computation for content-based image retrieval and automatic image annotation (category tagging). Additionaly, a new similarity function between an image and an object class is proposed. This similarity function combines various aspects of object class appearance through use of representative images of the class. Similarity to a representative image is determined by weighting local image similarities, where weights are learned from training image pairs, labeled “same” and “different”, using linear SVM. The proposed approach is validated on a challenging dataset where it performed favorably.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Belongie, S., Malik, J., Puzicha, J.: Shape matching and object recognition using shape contexts. IEEE Trans. Pattern Anal. Mach. Intell. 24(4), 509–522 (2002)

    Article  Google Scholar 

  2. Berg, A.C., Malik, J.: Geometric blur for template matching. In: CVPR, vol. 1, pp. 607–614 (2001)

    Google Scholar 

  3. Chang, C., Lin, C.: LIBSVM: A library for support vector machines (2001), http://www.csie.ntu.edu.tw/~cjlin/libsvm

  4. Duin, R.P.W.: The combining classifier: To train or not to train? In: ICPR (2002)

    Google Scholar 

  5. Fritz, G., Seifert, C., Paletta, L.: A mobile vision system for urban detection with informative local descriptors. In: Computer Vision Systems (2006)

    Google Scholar 

  6. Frome, A., Singer, Y., Malik, J.: Image retrieval and classification using local distance functions. In: NIPS, pp. 417–424. MIT Press, Cambridge, MA (2007)

    Google Scholar 

  7. Gudivada, V.N., Raghavan, V.V.: Content-based image retrieval-systems. Computer 28(9), 18–22 (1995)

    Article  Google Scholar 

  8. Harris, C., Stephens, M.: A combined corner and edge detector. In: Proc. of Fourth Alvey Vision Conf., pp. 147–151 (1988)

    Google Scholar 

  9. Jain, A.K., Vailaya, A.: Image retrieval using color and shape. Pattern Recognition (1996)

    Google Scholar 

  10. Johansson, B., Cipolla, R.: A system for automatic pose-estimation from a single image in a city scene. In: Int. Conf. Signal Proc. Pattern Rec. and Analysis (2002)

    Google Scholar 

  11. Jurie, F., Triggs, B.: Creating efficient codebooks for visual recognition. In: International Conference on Computer Vision (2005)

    Google Scholar 

  12. Kadir, T., Brady, M.: Saliency, scale and image description. International Journal of Computer Vision V45(2), 83–105 (2001)

    Article  Google Scholar 

  13. Ke, Y., Sukthankar, R.: Pca-sift: a more distinctive representation for local image descriptors. In: CVPR 2004, pp. II: 506–513 (2004)

    Google Scholar 

  14. Leung, T., Malik, J.: Recognizing surfaces using three-dimensional textons. In: ICCV, vol. 2, pp. 1010–1017. IEEE, Los Alamitos, CA (1999)

    Google Scholar 

  15. Lindeberg, T.: Feature detection with automatic scale selection. Int. J. Comput. Vision 30(2), 79–116 (1998)

    Article  Google Scholar 

  16. Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vision 60(2), 91–110 (2004)

    Article  Google Scholar 

  17. Manjunath, B.S., Ma, W.Y.: Texture features for browsing and retrieval of image data. IEEE Trans. Pattern Anal. Mach. Intell. 18(8), 837–842 (1996)

    Article  Google Scholar 

  18. Mikolajczyk, K., Leibe, B., Schiele, B.: Local features for object class recognition. In: ICCV, vol. 2, pp. 1792–1799. IEEE, Los Alamitos, CA (2005)

    Google Scholar 

  19. Mikolajczyk, K., Schmid, C.: Scale and affine invariant interest point detectors. International Journal of Computer Vision 60(1), 63–86 (2004)

    Article  Google Scholar 

  20. Moosmann, F., Triggs, B., Jurie, F.: Fast discriminative visual codebooks using randomized clustering forests. In: NIPS, pp. 985–992 (2007)

    Google Scholar 

  21. Niculescu-Mizil, A., Caruana, R.: Predicting good probabilities with supervised learning. In: ICML, pp. 625–632. ACM, New York, NY, USA (2005)

    Google Scholar 

  22. Nistér, D., Naroditsky, O., Bergen, J.: Visual odometry for ground vehicle applications. Journal of Field Robotics 23(1), 3–20 (2006)

    Article  MATH  Google Scholar 

  23. Nowak, E., Jurie, F.: Learning visual similarity measures for comparing never seen objects. In: CVPR. IEEE, Los Alamitos, CA (2007)

    Google Scholar 

  24. Nowak, E., Jurie, F., Triggs, B.: Sampling strategies for bag-of-features image classification. In: European Conference on Computer Vision. Springer, Heidelberg (2006)

    Google Scholar 

  25. Obdrzalek, S., Matas, J.: Object recognition using local affine frames on distinguished regions. In: BMVA 2002, vol. 1, pp. 113–122 (2002)

    Google Scholar 

  26. Opelt, A., Pinz, A., Fussenegger, M., Auer, P.: Generic object recognition with boosting. IEEE Trans. Pattern Anal. Mach. Intell. 28(3), 416–431 (2006)

    Article  Google Scholar 

  27. Platt, J.: Probabilistic outputs for support vector machines and comparison to regularized likelihood methods. Technical report, Microsoft Research (1999)

    Google Scholar 

  28. Schmid, C., Mohr, R.: Local grayvalue invariants for image retrieval. IEEE Transactions on Pattern Analysis and Machine Intelligence 19(5), 530–535 (1997)

    Article  Google Scholar 

  29. Sivic, J., Zisserman, A.: Video google: A text retrieval approach to object matching in videos. In: ICCV 2003, pp. 1470–1477 (2003)

    Google Scholar 

  30. Steger, C.: An unbiased detector of curvilinear structures. IEEE Trans. Pattern Anal. Mach. Intell. 20(2), 113–125 (1998)

    Article  MathSciNet  Google Scholar 

  31. van de Weijer, J., Schmid, C., Verbeek, J.: Learning color names from real-world images. In: CVPR (June 2007)

    Google Scholar 

  32. Winn, J., Criminisi, A., Minka, T.: Object categorization by learned universal visual dictionary. In: Computer Vision, 2005. ICCV 2005. Tenth IEEE International Conference, vol. 2, pp. 1800–1807 (2005)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2008 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Krapac, J., Jurie, F. (2008). Learning Distance Functions for Automatic Annotation of Images. In: Boujemaa, N., Detyniecki, M., Nürnberger, A. (eds) Adaptive Multimedia Retrieval: Retrieval, User, and Semantics. AMR 2007. Lecture Notes in Computer Science, vol 4918. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-79860-6_1

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-79860-6_1

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-79859-0

  • Online ISBN: 978-3-540-79860-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics