Identity and Granularity of Events in Text

  • Conference paper
Computational Linguistics and Intelligent Text Processing (CICLing 2016)

Abstract

In this paper we describe a method to detect event descriptions in different news articles and to model the semantics of events and their components using RDF representations. We compare these descriptions to solve a cross-document event coreference task. Our component approach to event semantics defines identity and granularity of events at different levels. It performs close to state-of-the-art approaches on the cross-document event coreference task, while outperforming other works when assuming similar quality of event detection. We demonstrate how granularity and identity are interconnected and we discuss how semantic anomaly could be used to define differences between coreference, subevent and topical relations.
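The component view of event identity described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the `Event` class, the `overlap` measure, the `corefer` function, the 0.5 threshold, and the sample values are all hypothetical, and plain Python sets stand in for the RDF representations. The sketch only shows how the choice of required components changes the granularity at which two descriptions count as the same event.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Event:
    """Hypothetical event description split into components."""
    action: frozenset                      # labels for the event mention
    participants: frozenset = frozenset()
    time: frozenset = frozenset()
    place: frozenset = frozenset()


def overlap(a, b):
    """Share of the smaller set that also occurs in the other set."""
    if not a or not b:
        return 0.0
    return len(a & b) / min(len(a), len(b))


def corefer(e1, e2, required=("action", "participants", "time"), threshold=0.5):
    """Assumed rule: events match if every required component overlaps enough."""
    return all(
        overlap(getattr(e1, name), getattr(e2, name)) >= threshold
        for name in required
    )


# Made-up descriptions from three imaginary articles.
quake_a = Event(
    action=frozenset({"strike"}),
    participants=frozenset({"earthquake", "region_x"}),
    time=frozenset({"2009-09"}),
)
quake_b = Event(
    action=frozenset({"strike", "hit"}),
    participants=frozenset({"earthquake"}),
    time=frozenset({"2009-09"}),
)
quake_c = Event(
    action=frozenset({"strike"}),
    participants=frozenset({"earthquake"}),
    time=frozenset({"2013-07"}),
)

same_fine = corefer(quake_a, quake_b)
diff_fine = corefer(quake_a, quake_c)
same_coarse = corefer(quake_a, quake_c, required=("action", "participants"))
```

Under these assumptions, `quake_a` and `quake_c` are kept apart when time is a required component, but merge once time is dropped from `required`. That is one simple way to see how identity and granularity depend on each other.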

Notes

  1. www.nltk.org/modules/nltk/stem/wordnet.html.

  2. http://nltk.org/_modules/nltk/corpus/reader/wordnet.html.

  3. www.newsreader-project.eu.

  4. We used FrameNet frame elements to decide on the relevance of a participant.

  5. WordNet synsets are represented here as Inter Lingual Index (ili) records: www.globalwordnet.org/ili [33]. This enables us to compare events across different languages [32].

  6. NWR-G is not 100% because the NewsReader system could not process one of the evaluation files due to formatting problems.

  7. We left out the document creation time as a baseline time-anchor because it may interfere with the task, since the articles on each seminal event were published on different dates.

References

  1. Ahn, D.: The stages of event extraction. In: Proceedings of the Workshop on Annotating and Reasoning About Time and Events (2006)

  2. Bagga, A., Baldwin, B.: Algorithms for scoring coreference chains. In: Proceedings of the International Conference on Language Resources and Evaluation (LREC) (1998)

  3. Bagga, A., Baldwin, B.: Cross-document event coreference: annotations, experiments, and observations. In: Proceedings of the ACL Workshop on Coreference and its Applications, p. 18 (1999)

  4. Baker, C.F., Fillmore, C.J., Lowe, J.B.: The Berkeley framenet project. In: Proceedings of the 17th International Conference on Computational Linguistics, vol. 1, pp. 86–90. Association for Computational Linguistics (1998)

  5. Bejan, C.A., Harabagiu, S.: Unsupervised event coreference resolution with rich linguistic features. In: Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden (2010)

  6. Bird, S., Klein, E., Loper, E.: Natural Language Processing with Python. O’Reilly Media Inc., Sebastopol (2009). http://nltk.org/book

  7. Björkelund, A., Bohnet, B., Hafdell, L., Nugues, P.: A high-performance syntactic and semantic dependency parser. In: Proceedings of the 23rd International Conference on Computational Linguistics: Demonstrations, COLING 2010, pp. 33–36. Association for Computational Linguistics, Stroudsburg, PA, USA (2010). http://dl.acm.org/citation.cfm?id=1944284.1944293

  8. Blei, D.M., Frazier, P.I.: Distance dependent Chinese restaurant processes. J. Mach. Learn. Res. 12, 2461–2488 (2011)

  9. Chen, B., Su, J., Pan, S.J., Tan, C.L.: A unified event coreference resolution by integrating multiple resolvers. In: Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 2011

  10. Chen, Z., Ji, H.: Event coreference resolution: feature impact and evaluation. In: Proceedings of Events in Emerging Text Types (eETTs) Workshop (2009)

  11. Chen, Z., Ji, H.: Graph-based event coreference resolution. In: TextGraphs-4 Proceedings of the 2009 Workshop on Graph-based Methods for Natural Language Processing, pp. 54–57 (2009)

  12. Cybulska, A., Vossen, P.: Semantic relations between events and their time, locations and participants for event coreference resolution. In: Angelova, G., Bontcheva, K., Mitkov, R. (eds.) Proceedings of Recent Advances in Natural Language Processing (RANLP-2013), INCOMA Ltd., Hissar, Bulgaria, 7–14 September 2013. No. ISSN 1313–8502. http://aclweb.org/anthology//R/R13/R13-1021.pdf

  13. Cybulska, A., Vossen, P.: Guidelines for ECB+ annotation of events and their coreference. Technical report NWR-2014-1, VU University Amsterdam (2014)

  14. Cybulska, A., Vossen, P.: Using a sledgehammer to crack a nut? Lexical diversity and event coreference resolution. In: Proceedings of the 9th Language Resources and Evaluation Conference (LREC2014), Reykjavik, Iceland, 26–31 May 2014

  15. Cybulska, A., Vossen, P.: “Bag of events” approach to event coreference resolution. Supervised classification of event templates. In: Proceedings of the 16th Cicling 2015 (Co-located: 1st International Arabic Computational Linguistics Conference), Cairo, Egypt, 14–20 April 2015

  16. Fellbaum, C. (ed.): WordNet. An Electronic Lexical Database. MIT Press, Cambridge (1998)

  17. Fokkens, A., Soroa, A., Beloki, Z., Ockeloen, N., Rigau, G., van Hage, W.R., Vossen, P.: NAF and GAF: linking linguistic annotations. In: Proceedings 10th Joint ISO-ACL SIGSEM Workshop on Interoperable Semantic Annotation, Reykjavik, Iceland, p. 9 (2014)

  18. van Hage, W.R., Malaisé, V., Segers, R., Hollink, L., Schreiber, G.: Design and use of the Simple Event Model (SEM). J. Web Sem. 9(2), 128–136 (2011)

  19. Humphreys, K., Gaizauskas, R., Azzam, S.: Event coreference for information extraction. In: ANARESOLUTION 1997 Proceedings of a Workshop on Operational Factors in Practical, Robust Anaphora Resolution for Unrestricted Texts (1997)

  20. Kingsbury, P., Palmer, M.: From treebank to propbank. In: LREC. Citeseer (2002)

  21. LDC: ACE (Automatic Content Extraction) English Annotation Guidelines for Events ver. 5.4.3 2005.07.01. In: Linguistic Data Consortium (2005)

  22. Leacock, C., Chodorow, M.: Combining local context with wordnet similarity for word sense identification (1998)

  23. Lee, H., Recasens, M., Chang, A., Surdeanu, M., Jurafsky, D.: Joint entity and event coreference resolution across documents. In: Proceedings of the 2012 Conference on Empirical Methods in Natural Language Processing and Natural Language Learning, EMNLP-CoNLL 2012 (2012)

  24. Liu, Z., Araki, J., Hovy, E., Mitamura, T.: Supervised within-document event coreference using information propagation. In: Proceedings of the International Conference on Language Resources and Evaluation, LREC 2014 (2014)

  25. Luo, X.: On coreference resolution performance metrics. In: Proceedings of the Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing, EMNLP-2005 (2005)

  26. Mendes, P.N., Jakob, M., García-Silva, A., Bizer, C.: Dbpedia spotlight: shedding light on the web of documents. In: Proceedings of the 7th International Conference on Semantic Systems, pp. 1–8. ACM (2011)

  27. Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., Blondel, M., Prettenhofer, P., Weiss, R., Dubourg, V., Vanderplas, J., Passos, A., Cournapeau, D., Brucher, M., Perrot, M., Duchesnay, E.: Scikit-learn: machine learning in Python. J. Mach. Learn. Res. 12, 2825–2830 (2011)

  28. Pradhan, S., Ramshaw, L., Marcus, M., Palmer, M., Weischedel, R., Xue, N.: Conll-2011 shared task: modeling unrestricted coreference in ontonotes. In: Proceedings of CoNLL 2011: Shared Task (2011)

  29. Recasens, M., Hovy, E.: Blanc: implementing the rand index for coreference evaluation. Nat. Lang. Eng. 17(4), 485–510 (2011)

  30. UzZaman, N., Llorens, H., Derczynski, L., Verhagen, M., Allen, J., Pustejovsky, J.: Semeval-2013 task 1: Tempeval-3: evaluating time expressions, events, and temporal relations (2013)

  31. Vilain, M., Burger, J., Aberdeen, J., Connolly, D., Hirschman, L.: A model theoretic coreference scoring scheme. In: Proceedings of MUC-6 (1995)

  32. Vossen, P., Agerri, R., Aldabe, I., Cybulska, A., van Erp, M., Fokkens, A., Laparra, E., Minard, A.L., Aprosio, A.P., Rigau, G., Rospocher, M., Segers, R.: Newsreader: how semantic web helps natural language processing helps semantic web. Special Issue Knowledge Based Systems, Elsevier (to appear)

  33. Vossen, P., Bond, F., McCrae, J.: Toward a truly multilingual global wordnet grid. In: Proceedings of the 8th Global Wordnet Conference (2016)

  34. Yang, B., Cardie, C., Frazier, P.I.: A hierarchical distance-dependent Bayesian model for event coreference resolution. CoRR abs/1504.05929 (2015). http://arxiv.org/abs/1504.05929

Acknowledgments

The NewsReader project was co-funded by the European Union as project number: 316404, FP7 Work Programme Call FP7-ICT-2011-8 Objective Cooperation Research theme “Information and Communication Technologies”, challenge 4.4 - Area Intelligent Information Management.

Author information

Correspondence to Piek Vossen.

Copyright information

© 2018 Springer International Publishing AG, part of Springer Nature

About this paper

Cite this paper

Vossen, P., Cybulska, A. (2018). Identity and Granularity of Events in Text. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2016. Lecture Notes in Computer Science, vol 9624. Springer, Cham. https://doi.org/10.1007/978-3-319-75487-1_39

  • DOI: https://doi.org/10.1007/978-3-319-75487-1_39

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-75486-4

  • Online ISBN: 978-3-319-75487-1

  • eBook Packages: Computer Science (R0)
