Detection of High-Level Concepts in Multimedia

Spyrou, Evaggelos; Avrithis, Yannis

doi:10.1007/978-0-387-78414-4_16

Detection of High-Level Concepts in Multimedia

Evaggelos Spyrou² &
Yannis Avrithis²

Reference work entry

154 Accesses
1 Citations

Synonyms

High-level concept detection; Semantic concept detection; Image classification; Object recognition

Definition

High-level concept detection is the process within which the high-level concepts or objects which are presented in a multimedia document are determined. For example, given an image, a detection scheme would reply that it contains concepts such as “sky,” “sand,” “sea,” the image depicts an “outdoor” and more specifically a “beach” scene. In some cases, the actual position of concepts within the image is also detected.

Introduction

The continuously growing volume of multimedia content has led many research efforts to high-level concept detection, since the semantics a document contains provide an effective and desirable annotation of its content. However, detecting the actual semantics within image and video documents remains still a challenging, yet unsolved problem. Its two main and most interesting aspects are the selection of the low-level features to be extracted...

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 449.00; Price excludes VAT (USA)

Hardcover Book: USD 329.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

1.
Image Segmentation is a process that divides images into regions using certain criteria of homogeneity such as color and/or texture.
2.
Weak classifiers are those with a nearly random performance, i.e., for a binary problem a weak classifier would have a performance slightly over 50%. With many techniques such as boosting/AdaBoost, a combination of many weak classifiers leads to a strong classifier.

References

A.W.M. Smeulders, M. Worring, S. Santini, A. Gupta, and R. Jain, “Content-based image retrieval at the end of the early years,” IEEE Transactions, PAMI 22 (2000).
Google Scholar
D. Lowe, “Distinctive image features from scale-invariant keypoints,” International Journal of Computer Vision, Vol. 60, No. 2, 2004, pp. 91, 110.
Article Google Scholar
S. Haykin, “Neural Networks: A comprehensive foundation,” 2nd Edition, Prentice Hall, Englewood Cliffs, NJ, 1999.
Google Scholar
G.J. Klir and B. Yuan, “Fuzzy Sets and Fuzzy Logic, Theory and Applications,” Prentice Hall, Englewood Cliffs, NJ, 1995.
Google Scholar
V.N. Vapnik, “The nature of statistical learning theory,” Springer, New York, 1995.
Google Scholar
C.-T. Lin and C.S. Lee, “Neural fuzzy Systems: A neuro-fuzzy synergism to intelligent systems,” Prentice-Hall, Englewood Cliffs, NJ, 1995.
Google Scholar
M. Mitchell, “An introduction to Genetic Algorithms,” MIT Press, Cambridge, MA, 1996.
Google Scholar
M. Szummer and R. Picard, “Indoor-outdoor image classification,” in IEEE international workshop on content-based access of images and video databases, 1998.
Google Scholar
A. Vailaya, A. Jain, and H.J. Zhang, “On image classification: City images vs. landscapes,” Pattern Recognition, Vol. 31, 1998, pp. 1921–1936.
Article Google Scholar
E. Spyrou, H. Le Borgne, T. Mailis, E. Cooke, Y. Avrithis, and N. O'Connor, “Fusing MPEG-7 Visual Descriptors for Image Classification”, in W. Duch et al. (Eds.), ICANN 2005, LNCS 3697, 2005, pp. 847–852.
Google Scholar
E. Spyrou and Y. Avrithis, “A Region Thesaurus Approach for High-Level Concept Detection in the Natural Disaster Domain,” Second International Conference on Semantics and Digital Media Technologies (SAMT), Genova, 2007.
Google Scholar
N. Voisine, S. Dasiopoulou, V. Mezaris, E. Spyrou, T. Athanasiadis, I. Kompatsiaris, Y. Avrithis, and M.G. Strintzis, “Knowledge-assisted video analysis using a genetic algorithm,” Proceedings of Sixth International Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS), 2005.
Google Scholar
D. Gokalp and S. Aksoy, “Scene Classiffcation Using Bag-of-Regions Representations,” Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR ‘07).
Google Scholar
W.L. Zhao, Y.G. Jiang, and C.W. Ngo, “Keyframe retrieval by keypoints: Can point-to-point matching help?” Proceedings of the International Conference on Image and Video Retrieval, Tempe, AZ, USA, pp. 72–81, Springer, 2006.
Chapter Google Scholar
A. Opelt, A. Pinz, and A. Zisserman, “Incremental learning of object detectors using a visual shape alphabet,” in Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR ‘06), Washington, DC, USA.
Google Scholar
P. Viola and M.J. Jones, “Robust Real-Time Face Detection,” International Journal of Computer Vision, Vol. 57, No. 2, 2004, pp. 137–154.
Article Google Scholar
P. Murphy, A. Torralba, and W.T. Freeman “Using the forest to see the trees: a graphical model relating features, objects and scenes,” Advances in Neural Information Processing Systems 16 (NIPS), Vancouver, BC, MIT Press, Cambridge, MA, 2003.
Google Scholar
Ph. Mylonas, E. Spyrou, and Y. Avrithis, “Enriching a context ontology with mid-level features for semantic multimedia analysis,” First Workshop on Multimedia Annotation and Retrieval Enabled by Shared Ontologies, co-located with SAMT 2007.
Google Scholar
A. Singhal, J. Luo, and W. Zhu, “Probabilistic Spatial Context Models for Scene Content Understanding,” in Proceedings of the 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR’03).
Google Scholar
E. Spyrou, G. Tolias, Ph. Mylonas, and Y. Avrithis, “A Semantic Multimedia Analysis Approach Utilizing a Region Thesaurus and LSA,” International Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS), Klagenfurt, Austria, 2008.
Google Scholar
“LSCOM Lexicon Definitions and Annotations Version 1.0”, DTO Challenge Workshop on Large Scale Concept Ontology for Multimedia, Columbia University ADVENT Technical Report #217–2006–3, March 2006.
Google Scholar
A.F. Smeaton, P. Over, and W. Kraaij, October 26–27, 2006, “Evaluation campaigns and TRECVid,” in Proceedings of the Eighth ACM International Workshop on Multimedia Information Retrieval, Santa Barbara, California, USA, MIR ‘06.
Google Scholar
S.F. Chang, T. Sikora, and A. Puri, “Overview of the MPEG – 7 standard”, IEEE Transactions on Circuits and Systems for Video Technology, Vol. 11, No. 6, June 2001.
Google Scholar

Download references

Author information

Authors and Affiliations

National Technical University of Athens, , Greece
Evaggelos Spyrou & Yannis Avrithis

Authors

Evaggelos Spyrou
View author publications
You can also search for this author in PubMed Google Scholar
Yannis Avrithis
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Science and Engineering, Florida Atlantic University (FAU), 33431, Boca Raton, FL, USA
Borko Furht (Department Chair) (Department Chair)

Rights and permissions

Reprints and permissions

Copyright information

About this entry

Cite this entry

Spyrou, E., Avrithis, Y. (2008). Detection of High-Level Concepts in Multimedia. In: Furht, B. (eds) Encyclopedia of Multimedia. Springer, Boston, MA. https://doi.org/10.1007/978-0-387-78414-4_16

Download citation

DOI: https://doi.org/10.1007/978-0-387-78414-4_16
Publisher Name: Springer, Boston, MA
Print ISBN: 978-0-387-74724-8
Online ISBN: 978-0-387-78414-4
eBook Packages: Computer ScienceReference Module Computer Science and Engineering

Publish with us

Policies and ethics