Fusing MPEG-7 Visual Descriptors for Image Classification

Spyrou, Evaggelos; Le Borgne, Hervé; Mailis, Theofilos; Cooke, Eddie; Avrithis, Yannis; O’Connor, Noel

doi:10.1007/11550907_134

Evaggelos Spyrou²⁰,
Hervé Le Borgne²¹,
Theofilos Mailis²⁰,
Eddie Cooke²¹,
Yannis Avrithis²⁰ &
…
Noel O’Connor²¹

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 3697))

Included in the following conference series:

International Conference on Artificial Neural Networks

3575 Accesses
24 Citations

Abstract

This paper proposes three content-based image classification techniques based on fusing various low-level MPEG-7 visual descriptors. Fusion is necessary as descriptors would be otherwise incompatible and inappropriate to directly include e.g. in a Euclidean distance. Three approaches are described: A “merging” fusion combined with an SVM classifier, a back-propagation fusion combined with a KNN classifier and a Fuzzy-ART neurofuzzy network. In the latter case, fuzzy rules can be extracted in an effort to bridge the “semantic gap” between the low-level descriptors and the high-level semantics of an image. All networks were evaluated using content from the repository of the aceMedia project and more specifically in a beach/urban scene classification problem.

An erratum to this chapter can be found at http://dx.doi.org/10.1007/11550907_163 .

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Smeulders, A.W.M., Worring, M., Santini, S., Gupta, A., Jain, R.: Content-based image retrieval at the end of the early years. IEEE t. PAMI 22, 1349–1380 (2000)
Google Scholar
Szummer, M., Picard, R.: Indoor-outdoor image classification. In: IEEE international workshop on content-based access of images and video databases (1998)
Google Scholar
Vailaya, A., Jain, A., Zhang, H.-J.: On image classification: City images vs. landscapes. Pattern Recognition 31, 1921–1936 (1998)
Article Google Scholar
Wang, D.H., Tian, Q., Gao, S., Sung, W.-K.: News sports video shot classification with sports play field and motion features. In: ICIP 2004, pp. 2247–2250 (2004)
Google Scholar
Mc Donald, K., Smeaton, A.: A comparison of score, rank and probability-based fusion methods for video shot retrieval. In: Leow, W.-K., Lew, M., Chua, T.-S., Ma, W.-Y., Chaisorn, L., Bakker, E.M. (eds.) CIVR 2005. LNCS, vol. 3568, pp. 61–70. Springer, Heidelberg (2005)
Chapter Google Scholar
Chang, S.-F., Sikora, T., Puri, A.: Overview of the mpeg-7 standard. IEEE trans. on Circuits and Systems for Video Technology 11, 688–695 (2001)
Article Google Scholar
Kompatsiaris, I., Avrithis, Y., Hobson, P., Strinzis, M.: Integrating knowledge, semantics and content for user-centred intelligent media services: the acemedia project. In: Proc. of WIAMIS 2004, Portugal, April 21-23 (2004)
Google Scholar
MPEG-7: Visual experimentation model (xm) version 10.0. ISO/IEC/ JTC1/SC29/WG11, Doc. N4062 (2001)
Google Scholar
Manjunath, B.S., Ohm, J.-R., Vasudevan, V.V., Yamada, A.: Color and texture descriptors. IEEE trans. on Circuits and Systems for Video Technology 11, 703–715 (2001)
Article Google Scholar
Vapnik, V.: The Nature of Statistical Learning Theory. Springer, New York (1995)
MATH Google Scholar
Lin, C.T., Lee, C.S.G.: Neural-network-based fuzzy logic control and decision system. IEEE trans. Comput. 40, 1320–1336 (1991)
Article MathSciNet Google Scholar

Download references

Author information

Authors and Affiliations

Image, Video and Multimedia Systems Laboratory, National Technical University of Athens, 9 Iroon Polytechniou Str, 157 73, Athens, Greece
Evaggelos Spyrou, Theofilos Mailis & Yannis Avrithis
Center for Digital Video Processing, Dublin City University, Collins Ave., Ireland
Hervé Le Borgne, Eddie Cooke & Noel O’Connor

Authors

Evaggelos Spyrou
View author publications
You can also search for this author in PubMed Google Scholar
Hervé Le Borgne
View author publications
You can also search for this author in PubMed Google Scholar
Theofilos Mailis
View author publications
You can also search for this author in PubMed Google Scholar
Eddie Cooke
View author publications
You can also search for this author in PubMed Google Scholar
Yannis Avrithis
View author publications
You can also search for this author in PubMed Google Scholar
Noel O’Connor
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Informatics, Nicolaus Copernicus University, Toruń, Poland
Włodzisław Duch
Systems Research Institute, Polish Academy of Sciences, ul. Newelska 6, 01–447, Warsaw, Poland
Janusz Kacprzyk
Adaptive Informatics Research Centre, Helsinki University of Technology, P.O. Box 5400, 02015, HUT, Finland
Erkki Oja
Systems Research Institute, Polish Academy of Sciences, ul. Newelska 6, 01-447, Warsaw, Poland
Sławomir Zadrożny

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Spyrou, E., Le Borgne, H., Mailis, T., Cooke, E., Avrithis, Y., O’Connor, N. (2005). Fusing MPEG-7 Visual Descriptors for Image Classification. In: Duch, W., Kacprzyk, J., Oja, E., Zadrożny, S. (eds) Artificial Neural Networks: Formal Models and Their Applications – ICANN 2005. ICANN 2005. Lecture Notes in Computer Science, vol 3697. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11550907_134

Download citation

DOI: https://doi.org/10.1007/11550907_134
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-28755-1
Online ISBN: 978-3-540-28756-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics