Online News Event Extraction for Global Crisis Surveillance

  • Jakub Piskorski
  • Hristo Tanev
  • Martin Atkinson
  • Eric van der Goot
  • Vanni Zavarella

Abstract

This article presents a real-time and multilingual news event extraction system developed at the Joint Research Centre of the European Commission. It is capable of accurately and efficiently extracting violent and natural disaster events from online news. In particular, a linguistically relatively lightweight approach is deployed, in which clustered news are heavily exploited at all stages of processing. Furthermore, the technique applied for event extraction assumes the inverted-pyramid style of writing news articles, i.e., the most important parts of the story are placed in the beginning and the least important facts are left toward the end. The article focuses on the system’s architecture, real-time news clustering, geo-locating and geocoding clusters, event extraction grammar development, adapting the system to the processing of new languages, cluster-level information fusion, visual event tracking, event extraction accuracy evaluation, and detecting event reporting boundaries in news article streams. This article is an extended version of [20].

Keywords

event extraction global crisis monitoring shallow text processing information aggregation 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Copyright information

© Springer-Verlag Berlin Heidelberg 2011

Authors and Affiliations

  • Jakub Piskorski
    • 1
  • Hristo Tanev
    • 2
  • Martin Atkinson
    • 2
  • Eric van der Goot
    • 2
  • Vanni Zavarella
    • 2
  1. 1.Polish Academy of SciencesWarszawaPoland
  2. 2.Joint Research of the European CommissionIspraItaly

Personalised recommendations