Multilingual Real-time Event Extraction for Border Security Intelligence Gathering
- First Online:
This chapter gives an overview of tools developed for Frontex, the European Agency for the Management of Operational Cooperation at the External Borders of the Member States of the European Union, to facilitate the process of extracting structured information on events related to border security from on-line news articles, with a particular focus on incidents and developments in the context of illegal migration, cross-border crime, and related crisis situations at the EU external borders and in third countries. A hybrid event extraction system has been constructed, which consists of two core event extraction engines, namely, NEXUS, developed at the Joint Research Centre (JRC) of the European Commission and PULS, developed at the University of Helsinki. These systems are applied to the stream of news articles continuously gathered and pre-processed by the Europe Media Monitor (EMM) – a large-scale multilingual news aggregation engine, developed at the JRC. In order to bridge the automated analysis phase with in-depth human analysis phase an event moderation tool has been developed, which allows the user to access the database of automatically extracted event descriptions and to clean, validate, group, enhance, and export them into other knowledge repositories.