Abstract
This paper describes the role that natural language processing (NLP) can play in multimedia applications. As a first example, we present an approach to the conceptual indexing of soccer videos with the help of structured information that NLP tools automatically extract from multiple sources relating to the video content: a rich range of textual and transcribed sources covering soccer games. This work was investigated and developed in the EU-funded project MUMIS. As a second example, we briefly describe ongoing work in the Esperonto project, which deals with upgrading the current web towards the Semantic Web (SW), including the automatic semantic indexing of web pages that combine text and images.
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Declerck, T., Kuper, J., Saggion, H., Samiotou, A., Wittenburg, P., Contreras, J. (2004). Contribution of NLP to the Content Indexing of Multimedia Documents. In: Enser, P., Kompatsiaris, Y., O’Connor, N.E., Smeaton, A.F., Smeulders, A.W.M. (eds) Image and Video Retrieval. CIVR 2004. Lecture Notes in Computer Science, vol 3115. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-27814-6_71
DOI: https://doi.org/10.1007/978-3-540-27814-6_71
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-22539-3
Online ISBN: 978-3-540-27814-6
eBook Packages: Springer Book Archive