Learning Concepts by Modeling Relationships

  • Conference paper
Multimedia Content Analysis and Mining (MCAM 2007)

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 4577)

Abstract

Supporting multimedia search has emerged as an important research topic. Three paradigms span the research spectrum, which ranges from the least automatic to the most automatic. At the far left end is the purely manual labeling paradigm, which labels multimedia content, e.g., images and video clips, manually with text labels and then uses text search to search the multimedia content indirectly. At the far right end is the content-based search paradigm, which can be fully automatic because it uses low-level features from multimedia analysis. In recent years, a third paradigm has emerged in the middle: the annotation paradigm. Once the concept models are trained, this paradigm can automatically detect and annotate concepts in unseen multimedia content. This paper looks into the annotation paradigm. Specifically, it argues that, within the annotation paradigm, the relationship-based annotation approach outperforms other existing annotation approaches because individual concepts are considered jointly instead of independently. We use two examples to illustrate the argument: the first is on image annotation and the second is on video annotation. Experiments show that relationship-based annotation approaches deliver superior performance.

Editor information

Nicu Sebe, Yuncai Liu, Yueting Zhuang, Thomas S. Huang

Copyright information

© 2007 Springer Berlin Heidelberg

About this paper

Cite this paper

Rui, Y., Qi, GJ. (2007). Learning Concepts by Modeling Relationships. In: Sebe, N., Liu, Y., Zhuang, Y., Huang, T.S. (eds) Multimedia Content Analysis and Mining. MCAM 2007. Lecture Notes in Computer Science, vol 4577. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-73417-8_2

  • DOI: https://doi.org/10.1007/978-3-540-73417-8_2

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-73416-1

  • Online ISBN: 978-3-540-73417-8

  • eBook Packages: Computer Science, Computer Science (R0)
