Skip to main content

Detection of High-Level Concepts in Multimedia

  • Reference work entry

Synonyms

High-level concept detection; Semantic concept detection; Image classification; Object recognition

Definition

High-level concept detection is the process within which the high-level concepts or objects which are presented in a multimedia document are determined. For example, given an image, a detection scheme would reply that it contains concepts such as “sky,” “sand,” “sea,” the image depicts an “outdoor” and more specifically a “beach” scene. In some cases, the actual position of concepts within the image is also detected.

Introduction

The continuously growing volume of multimedia content has led many research efforts to high-level concept detection, since the semantics a document contains provide an effective and desirable annotation of its content. However, detecting the actual semantics within image and video documents remains still a challenging, yet unsolved problem. Its two main and most interesting aspects are the selection of the low-level features to be extracted...

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   449.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD   329.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

  1. 1.

    Image Segmentation is a process that divides images into regions using certain criteria of homogeneity such as color and/or texture.

  2. 2.

    Weak classifiers are those with a nearly random performance, i.e., for a binary problem a weak classifier would have a performance slightly over 50%. With many techniques such as boosting/AdaBoost, a combination of many weak classifiers leads to a strong classifier.

References

  1. A.W.M. Smeulders, M. Worring, S. Santini, A. Gupta, and R. Jain, “Content-based image retrieval at the end of the early years,” IEEE Transactions, PAMI 22 (2000).

    Google Scholar 

  2. D. Lowe, “Distinctive image features from scale-invariant keypoints,” International Journal of Computer Vision, Vol. 60, No. 2, 2004, pp. 91, 110.

    Article  Google Scholar 

  3. S. Haykin, “Neural Networks: A comprehensive foundation,” 2nd Edition, Prentice Hall, Englewood Cliffs, NJ, 1999.

    Google Scholar 

  4. G.J. Klir and B. Yuan, “Fuzzy Sets and Fuzzy Logic, Theory and Applications,” Prentice Hall, Englewood Cliffs, NJ, 1995.

    Google Scholar 

  5. V.N. Vapnik, “The nature of statistical learning theory,” Springer, New York, 1995.

    Google Scholar 

  6. C.-T. Lin and C.S. Lee, “Neural fuzzy Systems: A neuro-fuzzy synergism to intelligent systems,” Prentice-Hall, Englewood Cliffs, NJ, 1995.

    Google Scholar 

  7. M. Mitchell, “An introduction to Genetic Algorithms,” MIT Press, Cambridge, MA, 1996.

    Google Scholar 

  8. M. Szummer and R. Picard, “Indoor-outdoor image classification,” in IEEE international workshop on content-based access of images and video databases, 1998.

    Google Scholar 

  9. A. Vailaya, A. Jain, and H.J. Zhang, “On image classification: City images vs. landscapes,” Pattern Recognition, Vol. 31, 1998, pp. 1921–1936.

    Article  Google Scholar 

  10. E. Spyrou, H. Le Borgne, T. Mailis, E. Cooke, Y. Avrithis, and N. O'Connor, “Fusing MPEG-7 Visual Descriptors for Image Classification”, in W. Duch et al. (Eds.), ICANN 2005, LNCS 3697, 2005, pp. 847–852.

    Google Scholar 

  11. E. Spyrou and Y. Avrithis, “A Region Thesaurus Approach for High-Level Concept Detection in the Natural Disaster Domain,” Second International Conference on Semantics and Digital Media Technologies (SAMT), Genova, 2007.

    Google Scholar 

  12. N. Voisine, S. Dasiopoulou, V. Mezaris, E. Spyrou, T. Athanasiadis, I. Kompatsiaris, Y. Avrithis, and M.G. Strintzis, “Knowledge-assisted video analysis using a genetic algorithm,” Proceedings of Sixth International Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS), 2005.

    Google Scholar 

  13. D. Gokalp and S. Aksoy, “Scene Classiffcation Using Bag-of-Regions Representations,” Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR ‘07).

    Google Scholar 

  14. W.L. Zhao, Y.G. Jiang, and C.W. Ngo, “Keyframe retrieval by keypoints: Can point-to-point matching help?” Proceedings of the International Conference on Image and Video Retrieval, Tempe, AZ, USA, pp. 72–81, Springer, 2006.

    Chapter  Google Scholar 

  15. A. Opelt, A. Pinz, and A. Zisserman, “Incremental learning of object detectors using a visual shape alphabet,” in Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR ‘06), Washington, DC, USA.

    Google Scholar 

  16. P. Viola and M.J. Jones, “Robust Real-Time Face Detection,” International Journal of Computer Vision, Vol. 57, No. 2, 2004, pp. 137–154.

    Article  Google Scholar 

  17. P. Murphy, A. Torralba, and W.T. Freeman “Using the forest to see the trees: a graphical model relating features, objects and scenes,” Advances in Neural Information Processing Systems 16 (NIPS), Vancouver, BC, MIT Press, Cambridge, MA, 2003.

    Google Scholar 

  18. Ph. Mylonas, E. Spyrou, and Y. Avrithis, “Enriching a context ontology with mid-level features for semantic multimedia analysis,” First Workshop on Multimedia Annotation and Retrieval Enabled by Shared Ontologies, co-located with SAMT 2007.

    Google Scholar 

  19. A. Singhal, J. Luo, and W. Zhu, “Probabilistic Spatial Context Models for Scene Content Understanding,” in Proceedings of the 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR’03).

    Google Scholar 

  20. E. Spyrou, G. Tolias, Ph. Mylonas, and Y. Avrithis, “A Semantic Multimedia Analysis Approach Utilizing a Region Thesaurus and LSA,” International Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS), Klagenfurt, Austria, 2008.

    Google Scholar 

  21. “LSCOM Lexicon Definitions and Annotations Version 1.0”, DTO Challenge Workshop on Large Scale Concept Ontology for Multimedia, Columbia University ADVENT Technical Report #217–2006–3, March 2006.

    Google Scholar 

  22. A.F. Smeaton, P. Over, and W. Kraaij, October 26–27, 2006, “Evaluation campaigns and TRECVid,” in Proceedings of the Eighth ACM International Workshop on Multimedia Information Retrieval, Santa Barbara, California, USA, MIR ‘06.

    Google Scholar 

  23. S.F. Chang, T. Sikora, and A. Puri, “Overview of the MPEG – 7 standard”, IEEE Transactions on Circuits and Systems for Video Technology, Vol. 11, No. 6, June 2001.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2008 Springer-Verlag

About this entry

Cite this entry

Spyrou, E., Avrithis, Y. (2008). Detection of High-Level Concepts in Multimedia. In: Furht, B. (eds) Encyclopedia of Multimedia. Springer, Boston, MA. https://doi.org/10.1007/978-0-387-78414-4_16

Download citation

Publish with us

Policies and ethics