Survey on modeling and indexing events in multimedia

Scherp, Ansgar; Mezaris, Vasileios

doi:10.1007/s11042-013-1427-7

Survey on modeling and indexing events in multimedia

Published: 24 March 2013

Volume 70, pages 7–23, (2014)
Cite this article

Multimedia Tools and Applications Aims and scope Submit manuscript

Ansgar Scherp¹ &
Vasileios Mezaris²

733 Accesses
26 Citations
Explore all metrics

Abstract

Events have gained increasing interest in the area of multimedia in recent years. There have been many approaches published and research conducted on how to extract events from multimedia, represent it using appropriate models, and how to use events in end user applications. In this paper, we conduct an extensive analysis of existing event models along commonly identified aspects of events. In addition, we analyze how the different aspects of events relate to each other and how they can be applied together. Subsequently, we look into different approaches for how to index multimedia data. Finally, we elaborate on how to link the multimedia data with events in order to provide the basis for future event-based multimedia applications.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Notes

http://www.w3.org/2003/01/geo/, last visited: December 14, 2012.
http://www.w3.org/TR/owl-time/.
http://www.w3.org/TR/owl-features/.
http://www.w3.org/2004/02/skos/.
Large Scale Concept Ontology for Multimedia, http://www.lscom.org.

References

Allen JF (1983) Maintaining knowledge about temporal intervals. Commun ACM 26(11):832–843. ISSN 0001-0782. doi:10.1145/182.358434
Article MATH Google Scholar
Appan P, Sundaram H (2004) Networked multimedia event exploration. In: Proceedings of the 12th annual ACM international conference on multimedia, MULTIMEDIA ’04. ACM, New York, NY, pp 40–47. ISBN 1-58113-893-8. doi:10.1145/1027527.1027536
Chapter Google Scholar
Arndt R, Troncy R, Staab S, Hardman L, Vacura M (2007) COMM: designing a well-founded multimedia ontology for the web. In: The Semantic Web: ISWC 2007 + ASWC 2007, lecture notes in computer science, vol 4825. Springer, Berlin, pp 30–43
Chapter Google Scholar
Atrey PK, Saddik AE, Kankanhalli MS (2011) Effective multimedia surveillance using a human-centric approach. Multimed Tools Appl 51(2):697–721
Article Google Scholar
Ballan L, Bertini M, Bimbo AD, Seidenari L, Serra G (2011) Event detection and recognition for semantic annotation of video. Multimed Tools Appl 51(1):279–302
Article Google Scholar
Ballan L, Bertini M, Bimbo AD, Serra G (2010) Semantic annotation of soccer videos by visual instanc clustering and spatial/temporal reasoning in ontologies. Multimed Tools Appl 48(2):313–337
Article Google Scholar
Ballan L, Bertini M, Serra G (2010) Video annotation and retrieval using ontologies and rule learning. IEEE Multimed 17(4):80–88
Article Google Scholar
Baumgartner N, Retschitzegger W (2006) A survey of upper ontologies for situation awareness. In: Knowledge sharing and collaborative engineering. ACTA Press, St. Thomas, VI, pp 1–9
Google Scholar
Bay H, Ess A, Tuytelaars T, Gool LV (2008) Surf: speeded up robust features. Comput Vis Image Underst 110(3):346–359
Article Google Scholar
Bertini M, Bimbo AD, Serra G, Torniai C, Cucchiara R, Grana C, Vezzani R (2009) Dynamic pictorially enriched ontologies for digital video libraries. IEEE Multimed 16:42–51
Article Google Scholar
Cao L, Codella N, Gong L et al (2012) Ibm research and columbia university trecvid-2012 multimedia event detection (med), multimedia event recounting (mer), and semantic indexing (sin) systems. In: Proc. TRECVID 2012 workshop. Gaithersburg, MD, USA
Carbonaro A (2008) Ontology-based video retrieval in a semantic-based learning environment. J E-Learn Knowl Soc 4(3):203–212
MathSciNet Google Scholar
Casati R, Varzi A (2006) Events. Stanford Encyclopedia of Philosophy. http://plato.stanford.edu/entries/events
Cervesato I, Franceschet M, Montanari A (1999) A guided tour through some extensions of the event calculus. Comput Intell 16(2):307–347
Article MathSciNet Google Scholar
Chandy KM, Charpentier M, Capponi A (2007) Towards a theory of events. In: Proceedings of the 2007 inaugural international conference on distributed event-based systems, DEBS ’07. ACM, New York, NY, pp 180–187. ISBN 978-1-59593-665-3. doi:10.1145/1266894.1266929
Chapter Google Scholar
Chang, S-F, He J, Jiang Y-G, Khoury EE, Ngo C-W, Yanagawa A, Zavesky E (2008) Columbia University/VIREO-CityU/IRIT TRECVID2008 high-level feature extraction and interactive video search. In: Proc. TRECVID 2008 workshop. Gaithersburg, MD, USA
Chechik G, Ie E, Rehn M, Bengio S, Lyon D (2008) Large-scale content-based audio retrieval from tex queries. In: Proc. 1st ACM int. conf. on Multimedia Information Retrieval, (MIR ’08). Vancouver, BC, Canada, pp 105–112
Chen H, Finin TW, Joshi A (2003) Using OWL in a pervasive computing broker. In: Proceedings ontologies in agent systems CEUR workshop, CEUR-WS.org, vol 73. Melbourne, Australia, pp 9–16
Chen H, Joshi A (2004) The SOUPA ontology for pervasive computing. Birkhauser Publishing Ltd.
Cheng H, Liu J, Ali S et al (2012) Sri-sarnoff aurora system at TRECVID 2012 multimedia event detection and recounting. In: Proc. TRECVID 2012 workshop. Gaithersburg, MD, USA
Dasiopoulou S, Mezaris V, Kompatsiaris I, Papastathis V, Strintzis M (2005) Knowledge-assisted semantic video object detection. IEEE Trans Circuits Syst Video Technol 15(10):1210–1224
Article Google Scholar
Doerr M, Ore C-E, Stead S (2007) The CIDOC conceptual reference model: a new standard for knowledge sharing. In: Conceptual modeling. Australian Computer Society Inc., pp 51–56. ISBN 978-1-920682-64-4
Ekin A, Tekalp AM, Mehrotra R (2004) Integrated semantic-syntactic video modeling for search and browsing. IEEE Trans Multimedia 6(6):839–851
Article Google Scholar
Francois ARJ, Nevatia R, Hobbs J, Bolles RC (2005) VERL: an ontology framework for representing and annotating video events. IEEE Multimed 12(4):76–86
Article Google Scholar
Gangemi A, Guarino N, Masolo C, Oltramari A, Schneider L (2002) Sweetening ontologies with DOLCE. In: International conference on knowledge engineering and knowledge management. Springer, London, pp 166–181. ISBN 3-540-44268-5
Google Scholar
Gangemi A, Guarino N, Masolo C, Oltramari A, Schneider L (2002) Sweetening ontologies with DOLCE. In: Proc. of the 13th int. conf. on knowledge engineering and knowledge management. Ontologies and the semantic web, (EKAW ’02). London, UK, pp 166–181
Gangemi A, Presutti V (2009) Ontology design patterns. In: Staab S, Studer R (eds) Handbook of ontologies, 2nd edn. International handbooks on information systems. Springer
Gkalelis N, Mezaris V, Kompatsiaris I (2010) A joint content-event model for event-centric multimedia indexing. In: Proceedings of the 4th IEEE international conference on semantic computing, (ICSC 2010). Carnegie Mellon University, Pittsburgh. IEEE, PA, pp 79–84, 22–24 September 2010
Google Scholar
Gkalelis N, Mezaris V, Kompatsiaris I (2011) High-level event detection in video exploiting discriminant concepts. In: Proc. 9th International workshop on Content-Based Multimedia Indexing, (CBMI 2011). Madrid, Spain, pp 85–90
Gkalelis N, Mezaris V, Kompatsiaris I (2011) Mixture subclass discriminant analysis. IEEE Signal Process Lett 18(5):319–322
Article Google Scholar
Gkalelis N, Mezaris V, Kompatsiaris I, Stathaki T (2013) Mixture subclass discriminant analysis link to restricted Gaussian model and other generalizations. IEEE Transactions on Neural Networks and Learning Systems 24(1):8–21
Article Google Scholar
Gupta A, Jain R (2011) Managing event information: modeling, retrieval, and applications. Synthesis lectures on data management. Morgan & Claypool Publishers
Hakeem A, Sheikh Y, Shah M (2004) Casee: a hierarchical event representation for the analysis of videos. In: McGuinness DL, Ferguson G (eds) Proceedings of the 19th national conference on artificial intelligence, 16th conference on innovative applications of artificial intelligence. AAAI Press/The MIT Press, San Jose, CA, pp 263–268. ISBN 0-262-51183-5, 25–29 July 2004
Google Scholar
Hill M, Hua G, Natsev A et al (2010) IBM research TRECVID 2010 video copy detection and multimedia event detection system. In: Proc. TRECVID 2010 workshop. Gaithersburg, MD, USA
IPTC International Press Telecommunications Council, London, UK (2012) EventML. http://www.iptc.org/site/News_Exchange_Formats/EventsML-G2/Specification Last accessed 15 Mar 2013
IPTC International Press Telecommunications Council, London, UK (2012) NewsML. http://www.iptc.org/site/News_Exchange_Formats/NewsML-G2 Last accessed 15 Mar 2013
Itkonen E (1983) Causality in linguistic theory. Indiana Univ. Press, Bloomington, IN
Google Scholar
Jain R (2008) EventWeb: developing a human-centered computing system. Comput 41(2):42–50. ISSN 0018-9162. doi:10.1109/MC.2008.49
Article Google Scholar
Jiang Y, Zeng X, Ye G et al (2010) Columbia-UCF TRECVID 2010 multimedia event detection: combining multiple modalities, contextual concepts, and temporal matching. In: Proc. TRECVID 2010 workshop. Gaithersburg, MD, USA
Jiang Y-G, Bhattacharya S, Chang S-F, Shah M (2012) High-level event recognition in unconstrained videos. Int J Multimedia Infor Retr. doi:10.1007/s13735-012-0024-2
Kokar MM, Matheus CJ, Baclawski K (2009) Ontology-based situation awareness. Inf Fusion 10(1):83–98. ISSN 1566-2535. doi:10.1016/j.inffus.2007.01.004
Google Scholar
Kowalski R, Sergot M (1986) A logic-based calculus of events. New Gener Comput 4(1):67–95. ISSN 0288-3635. doi:10.1007/BF03037383
Article Google Scholar
Lin F (1996) Embracing causality in specifying the indeterminate effects of actions. In: AAAI/IAAI, vol 1, pp 670–676
Lin F (2008) Handbook of knowledge representation, chapter situtation calculus. Elsevier
Lombard L (1986) Events: a metaphysical study. Routledge & Kegan Paul
Lowe D (2004) Distinctive image features from scale-invariant keypoints. Int J Comput Vis 60(2):91–110
Article Google Scholar
Manjunath B, Ohm J-R, Vasudevan V, Yamada A (2001) Color and texture descriptors. IEEE Trans Circuits Syst Video Technol 11(6):703–715
Article Google Scholar
Matheus C, Kokar M, Baclawski K, Letkowski J, Call C, Hinman M, Salerno J, Boulware D (2005) Sawa: an assistant for higher-level fusion and situation awareness. In: Multisensor, multisource informatio fusion: architectures, algorithms, and applications. SPIE, Orlando, pp 75–85
Google Scholar
Matheus CJ, Baclawski K, Kokar MM, Letkowski J (2005) Using SWRL and OWL to capture domain knowledge for situation awareness application applied to a supply logistics scenario. In: Rules and rule markup languages for the semantic web, LNCS, vol 3791. Springer, pp 130–144
Matheus CJ, Kokar MM, Baclawski K (2003) A core ontology for situation awareness. In: Information fusion. Cairns, Australia, pp 545–552
Google Scholar
Matheus CJ, Kokar MM, Baclawski K, Letkowski J (2005) An application of semantic web technologies to situation awareness. In: International semantic web conference, LNCS, vol 3729. Springer, pp 944–958
Merler M, Huang B, Xie L, Hua G, Natsev A (2012) Semantic model vectors for complex video event recognition. IEEE Trans Multimedia 14(1):88–101
Article Google Scholar
Mezaris V, Dimou A, Kompatsiaris I (2010) On the use of feature tracks for dynamic concept detection in video. In: Proc. IEEE International Conference on Image Processing (ICIP 2010). Hong Kong, China pp 4697–4700
Mezaris V, Gidaros S, Papadopoulos G, Kasper W, Steffen J, Ordelman R, Huijbregts M, de Jong F, Kompatsiaris I, Strintzis M (2010) A system for the semantic multi-modal analysis of news audio-visual content. EURASIP J Adv Signal Process. doi:10.1155/2010/645052
Google Scholar
Moumtzidou A, Gkalelis N, Sidiropoulos P, Dimopoulos M, Nikolopoulos S, Vrochidis S, Mezaris V, Kompatsiaris I (2012) Iti-certh participation to trecvid 2012. In: Proc. TRECVID 2012 workshop. Gaithersburg, MD, USA
Mueller ET (2008) Handbook of knowledge representation, chapter event calculus. Elsevier
Nack F, Ossenbruggen J, Hardman L (2005) That obscure object of desire: multimedia metadata on the web, part 2. IEEE Multimed 12(1):54–63
Article Google Scholar
Nevatia R, Hobbs J, Bolles B (2004) An ontology for video event representation. In: Proceedings of the 2004 conference on Computer Vision and Pattern Recognition Workshop, CVPRW’04, vol 7. IEEE Computer Society, Washington, DC, p 119. ISBN 0-7695-2158-4. URL: http://dl.acm.org/citation.cfm?id=1032638.1033010
Chapter Google Scholar
OASIS Emergency Management TC (2010) Common alerting protocol version 1.2 (oasis standard). http://docs.oasis-open.org/emergency/cap/v1.2/CAP-v1.2.doc
Over P, Fiscus J, Sanders G, Shaw B, Awad G, Michel M, Smeaton A, Kraaij W, Quenot G (2012) Trecvid 2012—goals, tasks, data, evaluation mechanisms and metrics. In: Proc. TRECVID 2012 workshop. Gaithersburg, MD, USA
Papadopoulos G, Briassouli A, Mezaris V, Kompatsiaris I, Strintzis M (2009) Statistical motion information extraction and representation for semantic video analysis. IEEE Trans Circuits Syst Video Technol 19(10):1513–1528
Article Google Scholar
Quinton A (1979) Objects and events. Mind 88(350):197–214
Article Google Scholar
Raimond Y, Abdallah S (2007) The event ontology. http://motools.sf.net/event Last accessed 15 Mar 2013
Saathoff C, Scherp A (2010) Unlocking the semantics of multimedia presentations in the web with the multimedia metadata ontology. In: World Wide Web conference. ACM, Raleigh, NC, pp 831–840
Google Scholar
Scherp A, Agaram S, Jain R (2008) Event-centric media management. In: SPIE, vol 6820
Scherp A, Eißing D, Saathoff C (2012) A method for integrating multimedia metadata standards and metadata formats with the multimedia metadata ontology. Int J Semantic Computing 6(1):25–50
Article Google Scholar
Scherp A, Franz T, Saathoff C, Staab S (2009) F–a model of events based on the foundational ontology DOLCE+DnS Ultralight. In: Proceedings of the 5th International conference on knowledge capture (K-CAP 2009). ACM, Redondo Beach, CA, pp 137–144. ISBN 978-1-60558- 658-8, 1–4 September 2009
Scherp A, Franz T, Saathoff C, Staab S (2012) A core ontology on events for representing occurrences in the real world. Multimed Tools Appl 58(2):293–331
Article Google Scholar
Scherp A, Saathoff C, Franz T, Staab S (2011) Designing core ontologies. Appl Ontology 6(3):177–221
Google Scholar
Shadbolt N, Berners-Lee T, Hall W (2006) The semantic web revisited. IEEE Intell Syst 21(3):96–101
Article Google Scholar
Shaw R, Troncy R, Hardman L (2009) Lode: linking open descriptions of events. In: Gómez-Pérez A, Yu Y, Ding Y (eds) Proceedings the semantic web, 4th Asian conference, ASWC 2009, Shanghai, China, vol 5926. Lecture notes in computer science. Springer, pp 153–167. ISBN 978-3-642-10870-9, 6–9 December 2009
Shipley B (2002) Cause and correlation in biology. Cambridge Univ. Press
Sinclair P, Addis M, Choi F, Doerr M, Lewis P, Martinez K (2006) The use of CRM core in multimedia annotation. In: Semantic web annotations for multimedia
Smeaton AF, Over P, Kraaij W (2009) High-level feature detection from video in TRECV id: a 5-year retrospective of achievements. In: Divakaran A (ed) Multimedia content analysis, theory and applications. Springer-Verlag, Berlin, pp 151–174
Google Scholar
Snoek C, Worring M (2009) Concept-based video retrieval. Foundations and Trends in Information Retrieval 4(2):215–322
Google Scholar
Snoek C, Worring M, van Gemert J, Geusebroek J-M, Smeulders A (2006) The challenge problem for automate detection of 101 semantic concepts in multimedia. In: Proc. ACM Multimedia. Santa Barbara, USA, pp 421–430
Technical Standardization Committee on AV & IT Storage Systems and Equipment (2002) Exchangeable image file format for digital still cameras: exif version 2.2. Technical report
Tesic J (2005) Metadata practices for consumer photos. IEEE Multimed 12(3):86–92
Article Google Scholar
Tjondronegoro DW, Chen YP (2010) Knowledge-discounted event detection in sports video. IEEE Trans Syst Man Cybern Part A Syst Humans 40(5):1009–1024
Article Google Scholar
Troncy R, Celma O, Little S, Garcia R, Tsinaraki C (2007) MPEG-7 based multimedia ontologies: interoperability support or interoperability issue? In: Proc. 1st workshop on multimedia annotation and retrieval enabled by shared ontologies. Genova, Italy
van de Sande K, Gevers T, Snoek C (2010) Evaluating color descriptors for object and scene recognition. IEEE Trans Pattern Anal Mach Intell 32(9):1582–1596
Article Google Scholar
van Hage WR, Malaisé V, de Vries G, Schreiber G, van Someren M (2012) Abstracting and reasoning over ship trajectories and web data with the simple event model (sem). Multimed Tools Appl 57(1):175–197
Article Google Scholar
Wang F, Jiang Y-G, Ngo C-W (2008) Video event detection using motion relativity and visual relatedness. In: Proc. 16th ACM international conference on multimedia. Vancouver, BC, Canada, pp 239–248
Wang X, Mamadgi S, Thekdi A, Kelliher A, Sundaram H (2007) Eventory—an event based media repository. In: Semantic computing. IEEE, Washington, DC, pp 95–104. ISBN 0-7695-2997-6
Google Scholar
Wang XH, Zhang DQ, Gu T, Pung HK (2004) Ontology based context modeling and reasoning using OWL. In: Pervasive computing and communications workshops. IEEE, Washington, DC, p 18. ISBN 0-7695-2106-1
Google Scholar
Westermann U, Jain R (2006) E—a generic event model for event-centric multimedia data management in echronicle applications. In: Data engineering workshops. IEEE, Washington, DC, p 106. ISBN 0-7695-2571-7. doi:10.1109/ICDEW.2006.1
Google Scholar
Westermann U, Jain R (2007) Toward a common event model for multimedia applications. IEEE Multimed 14(1):19–29
Article Google Scholar
Xu D, Chang S-F (2008) Video event recognition using kernel methods with multilevel temporal alignment. IEEE Trans Pattern Anal Mach Intell 30(11):1985–1997
Article Google Scholar
Yan W, Kieran DF, Rafatirad S, Jain R (2011) A comprehensive study of visual event computing. Multimed Tools Appl 55(3):443–481
Article Google Scholar
Yau SS, Liu J (2006) Hierarchical situation modeling and reasoning for pervasive computing. In: Software technologies for future embedded and ubiquitous systems. IEEE, Washington, DC, pp 5–10. ISBN 0-7695-2560-1
Google Scholar

Download references

Author information

Authors and Affiliations

Institute of Computer Science and Business Informatics, University of Mannheim, B6, 26, 68131, Mannheim, Germany
Ansgar Scherp
Information Technologies Institute (ITI), Centre for Research and Technology Hellas (CERTH), 6th Km. Charilaou-Thermi Road, P.O. Box 60361, 57001, Thermi-Thessaloniki, Greece
Vasileios Mezaris

Authors

Ansgar Scherp
View author publications
You can also search for this author in PubMed Google Scholar
Vasileios Mezaris
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Ansgar Scherp.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Scherp, A., Mezaris, V. Survey on modeling and indexing events in multimedia. Multimed Tools Appl 70, 7–23 (2014). https://doi.org/10.1007/s11042-013-1427-7

Download citation

Published: 24 March 2013
Issue Date: May 2014
DOI: https://doi.org/10.1007/s11042-013-1427-7

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Survey on modeling and indexing events in multimedia

Abstract

Access this article

Similar content being viewed by others

Event analysis in social multimedia: a survey

5W1H Aware Framework for Representing and Detecting Real Events from Multimedia Digital Ecosystem

Multimedia Data Modeling and Management

Notes

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Survey on modeling and indexing events in multimedia

Abstract

Access this article

Similar content being viewed by others

Event analysis in social multimedia: a survey

5W1H Aware Framework for Representing and Detecting Real Events from Multimedia Digital Ecosystem

Multimedia Data Modeling and Management

Notes

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation