Techniques for Multilingual Security-Related Event Extraction from Online News

  • Martin Atkinson
  • Mian Du
  • Jakub Piskorski
  • Hristo Tanev
  • Roman Yangarber
  • Vanni Zavarella
Part of the Studies in Computational Intelligence book series (SCI, volume 458)

Abstract

This chapter presents a number of techniques for multilingual event extraction, the main task is to accurately and efficiently detect key information about security-related events from electronic news media and summarize it in the form of database-like structures. Gathering such information over time is an important task for developing global news surveillance systems, particularly in the context of security threats and mass emergencies. In particular, this chapter describes novel techniques for dealing with specific extraction tasks, including: an event type classification method based on domain-specific inference rules, an approach to event geo-tagging based on utilisation of lexico-semantic patterns, a simple method for cross-lingual event information fusion, and techniques for scoring the relevance rank of automatically extracted facts.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Appelt, D.: Introduction to Information Extraction Technology. Tutorial Held at IJCAI 1999 (1999)Google Scholar
  2. 2.
    Ashish, N., Appelt, D., Freitag, D., Zelenko, D.: In: Proceedings of the Workshop on Event Extraction and Synthesis. Held in conjunction with the AAAI 2006 (2006)Google Scholar
  3. 3.
    Atkinson, M., van der Goot, E.: Near Real Time Information Mining in Multilingual News. In: Proceedings of WWW 2009 (2009)Google Scholar
  4. 4.
    Atkinson, M., Piskorski, J., Van der Goot, E., Yangarber, R.: Multilingual Real-Time Event Extraction for Border Security Intelligence Gathering. In: Counterterrorism and Open Source Intelligence Series. Lecture Notes in Social Networks, vol. 2 (2011)Google Scholar
  5. 5.
    Chen, Z., Ji, H.: Can one Language Bootstrap the Other: A Case Study on Event Extraction. In: Proceedings of the NAACL HLT 2009 Workshop on Semi-Supervised Learning for Natural Language Processing (2009)Google Scholar
  6. 6.
    Downey, D., Etzioni, O., Soderland, S.: A Probabilistic Model of Redundancy in Information Extraction. In: Proceedings of IJCAI 2005 (2005)Google Scholar
  7. 7.
    Hall, M.A.: Correlation-based Feature Selection for Discrete and Numeric Class Machine Learning. In: Proceedings of ICML (2000)Google Scholar
  8. 8.
    Huttunen, S., Vihavainen, A., von Etter, P., Yangarber, R.: Relevance Prediction in Information Extraction Using Discourse and Lexical Features. In: Proceedings of the 18th Nordic Conference on Computational Linguistics, NODALIDA (2011)Google Scholar
  9. 9.
    Grishman, R., Huttunen, S., Yangarber, R.: Real-time Event Extraction for Infectious Disease Outbreaks. In: Proceedings of HLT 2002 (2002)Google Scholar
  10. 10.
    Ji, H., Grishman, R.: Refining Event Extraction through Cross-Document Inference. In: Proceedings of ACL 2008, pp. 254–262 (2008)Google Scholar
  11. 11.
    Ji, H.: Challenges from Information Extraction to Information Fusion. In: Proceedings of ACL 2008, pp. 507–515 (2010)Google Scholar
  12. 12.
    King, G., Lowe, W.: An Automated Information Extraction Tool For International Conflict Data with Performance as Good as Human Coders. In: International Organization, vol. 57 (2003)Google Scholar
  13. 13.
    Kohavi, R., John, G.H.: Wrappers for Feature Subset Selection. In: Artificial Intelligence, vol. 57(1) (1997)Google Scholar
  14. 14.
    Lee, A., Passantino, M., Ji, H., Qi, G., Huang, T.: Enhancing Multi-lingual Information Extraction via Cross-Media Inference and Fusion. In: Proceedings of COLING 2010: Posters, pp. 630–638 (2010)Google Scholar
  15. 15.
    Li, J., Li, J., Tang, J.: A Flexible Topic-driven Framework for News Exploration. In: Proceedings of KDD 2007 (2007)Google Scholar
  16. 16.
    Liao, S., Grishman, R.: Using Document Level Cross-Event Inference to Improve Event Extraction. In: Proceedings of ACL 2010, pp. 789–797 (2010)Google Scholar
  17. 17.
    Mikheev, A., Moens, M., Grover, C.: Named Entity Recognition without Gazetteers. In: Proceedings of EACL 1999 (1999)Google Scholar
  18. 18.
    Naughton, M., Kushmerick, N., Carthy, J.: Event Extraction from Heterogeneous News Sources. In: Proceedings of the AAAI 2006 Workshop on Event Extraction and Synthesis (2006)Google Scholar
  19. 19.
    Patwardhan, S., Riloff, E.: Effective Information Extraction with Semantic Affinity Patterns and Relevant Regions. In: Proceedings of EMNLP-CONLL 2007 (2007)Google Scholar
  20. 20.
    Piskorski, J.: ExPRESS: Extraction Pattern Recognition Engine and Specification Suite. In: Proceedings of the International Workshop Finite-State Methods and Natural Language Processing (2007)Google Scholar
  21. 21.
    Piskorski, J., Tanev, H., Atkinson, M., van der Goot, E., Zavarella, V.: Online News Event Extraction for Global Crisis Surveillance. In: Nguyen, N.T. (ed.) Transactions on CCI V. LNCS, vol. 6910, pp. 182–212. Springer, Heidelberg (2011)Google Scholar
  22. 22.
    Pouliquen, B., Kimler, M., Steinberger, R., Ignat, C., Oellinger, T., Blackler, K., Fluart, F., Zaghouani, W., Widiger, A., Forslund, A.-C., Best, C.: Geocoding Multilingual Texts: Recognition, Disambiguation and Visualisation. In: Proceedings of LREC 2006, Genoa, Italy, pp. 24–26 (2006)Google Scholar
  23. 23.
    Snover, M., Li, X., Lin, W.-P., Chen, Z., Tamang, S., Ge, M., Lee, A., Li, Q., Li, H., Anzaroot, S., Ji, H.: Cross-lingual Slot Filling from Comparable Corpora. In: Proceedings of the 4th Workshop on Building and Using Comparable Corpora: Comparable Corpora and the Web, pp. 110–119 (2011)Google Scholar
  24. 24.
    Sudo, K., Sekine, S., Grishman, R.: Cross-lingual Information Extraction System Evaluation. In: Proceedings of COLING 2004 (2004)Google Scholar
  25. 25.
    Tanev, H., Piskorski, J., Atkinson, M.: Real-Time News Event Extraction for Global Crisis Monitoring. In: Proceedings of NLDB 2008 (2008)Google Scholar
  26. 26.
    Tanev, H., Zavarella, V., Linge, J., Kabadjov, M., Piskorski, J., Atkinson, M., Steinberger, R.: Exploiting Machine Learning Techniques to Build an Event Extraction System for Portuguese and Spanish. Linguamatica (NLP Journal for Iberian Languages) 2 (2009)Google Scholar
  27. 27.
    Thorsten, J.: Text Categorization with Support Vector Machines: Learning with Many Relevant Features. In: Nédellec, C., Rouveirol, C. (eds.) ECML 1998. LNCS, vol. 1398, Springer, Heidelberg (1998)Google Scholar
  28. 28.
    Tyler, A.R. (ed.): Expert Systems Research Trends. Nova Science Publishers, New York (2007)Google Scholar
  29. 29.
    Yangarber, R., Jokipii, L., Rauramo, A., Huttunen, S.: Extracting Information about Outbreaks of Infectious Epidemics. In: Proceedings of the HLT-EMNLP 2005 (2005)Google Scholar
  30. 30.
    Yangarber, R.: Verification of Facts across Document Boundaries. In: Proceedings of International Workshop on Intelligent Information Access (2006)Google Scholar
  31. 31.
    Zhang, N.N.: Movement within a Spatial Phrase. In: Cuyckens, H., Radden, G. (eds.) Perspectives on Prepositions. Linguistische Arbeiten. Band, vol. 454, pp. 47–63. Max Niemeyer, Tübingen (2002)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2013

Authors and Affiliations

  • Martin Atkinson
    • 1
  • Mian Du
    • 3
  • Jakub Piskorski
    • 2
  • Hristo Tanev
    • 1
  • Roman Yangarber
    • 3
  • Vanni Zavarella
    • 1
  1. 1.JRCIspraItaly
  2. 2.FrontexWarsawPoland
  3. 3.Department of Computer ScienceUniversity of HelsinkiHelsinkiFinland

Personalised recommendations