Access to Multimedia Information through Multisource and Multilanguage Information Extraction
We describe our work on information extraction from multiple sources for the Multimedia Indexing and Searching Environment, a project that aims to develop technology for producing formal annotations of essential events in multimedia programme material. A unique aspect of this project is the creation of a composite index from multiple, multilingual sources. Football is the domain chosen for tuning and testing the software components. Our information extraction system combines finite state machinery in a pipeline with full semantic analysis and discourse interpretation.