Abstract
We describe a method for detecting event descriptions in different news articles and for modeling the semantics of events and their components as RDF representations. We compare these representations to solve a cross-document event coreference task. Our component-based approach to event semantics defines the identity and granularity of events at different levels. It performs close to state-of-the-art approaches on the cross-document event coreference task and outperforms other work when similar quality of event detection is assumed. We demonstrate how granularity and identity are interconnected, and we discuss how semantic anomaly could be used to define the differences between coreference, subevent and topical relations.
Notes
- 1.
- 2.
- 3.
- 4. We used FrameNet frame elements to decide on the relevance of a participant.
- 5. WordNet synsets are represented here as Inter Lingual Index (ili) records: www.globalwordnet.org/ili [33]. This enables us to compare events across different languages [32].
- 6. NWR-G is not 100% because the NewsReader system could not process one of the evaluation files due to formatting problems.
- 7. We left out the document creation time as a baseline time-anchor because it may interfere with the task: the articles on each seminal event were published on different dates.
References
Ahn, D.: The stages of event extraction. In: Proceedings of the Workshop on Annotating and Reasoning About Time and Events (2006)
Bagga, A., Baldwin, B.: Algorithms for scoring coreference chains. In: Proceedings of the International Conference on Language Resources and Evaluation (LREC) (1998)
Bagga, A., Baldwin, B.: Cross-document event coreference: annotations, experiments, and observations. In: Proceedings of the ACL Workshop on Coreference and its Applications, p. 18 (1999)
Baker, C.F., Fillmore, C.J., Lowe, J.B.: The Berkeley FrameNet project. In: Proceedings of the 17th International Conference on Computational Linguistics, vol. 1, pp. 86–90. Association for Computational Linguistics (1998)
Bejan, C.A., Harabagiu, S.: Unsupervised event coreference resolution with rich linguistic features. In: Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden (2010)
Bird, S., Klein, E., Loper, E.: Natural Language Processing with Python. O’Reilly Media Inc., Sebastopol (2009). http://nltk.org/book
Björkelund, A., Bohnet, B., Hafdell, L., Nugues, P.: A high-performance syntactic and semantic dependency parser. In: Proceedings of the 23rd International Conference on Computational Linguistics: Demonstrations, COLING 2010, pp. 33–36. Association for Computational Linguistics, Stroudsburg, PA, USA (2010). http://dl.acm.org/citation.cfm?id=1944284.1944293
Blei, D.M., Frazier, P.I.: Distance dependent Chinese restaurant processes. J. Mach. Learn. Res. 12, 2461–2488 (2011)
Chen, B., Su, J., Pan, S.J., Tan, C.L.: A unified event coreference resolution by integrating multiple resolvers. In: Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 2011
Chen, Z., Ji, H.: Event coreference resolution: feature impact and evaluation. In: Proceedings of Events in Emerging Text Types (eETTs) Workshop (2009)
Chen, Z., Ji, H.: Graph-based event coreference resolution. In: TextGraphs-4 Proceedings of the 2009 Workshop on Graph-based Methods for Natural Language Processing, pp. 54–57 (2009)
Cybulska, A., Vossen, P.: Semantic relations between events and their time, locations and participants for event coreference resolution. In: Angelova, G., Bontcheva, K., Mitkov, R. (eds.) Proceedings of Recent Advances in Natural Language Processing (RANLP-2013), INCOMA Ltd., Hissar, Bulgaria, 7–14 September 2013. No. ISSN 1313–8502. http://aclweb.org/anthology//R/R13/R13-1021.pdf
Cybulska, A., Vossen, P.: Guidelines for ECB+ annotation of events and their coreference. Technical report NWR-2014-1, VU University Amsterdam (2014)
Cybulska, A., Vossen, P.: Using a sledgehammer to crack a nut? Lexical diversity and event coreference resolution. In: Proceedings of the 9th Language Resources and Evaluation Conference (LREC2014), Reykjavik, Iceland, 26–31 May 2014
Cybulska, A., Vossen, P.: “Bag of events” approach to event coreference resolution. Supervised classification of event templates. In: Proceedings of the 16th CICLing 2015 (Co-located: 1st International Arabic Computational Linguistics Conference), Cairo, Egypt, 14–20 April 2015
Fellbaum, C. (ed.): WordNet. An Electronic Lexical Database. MIT Press, Cambridge (1998)
Fokkens, A., Soroa, A., Beloki, Z., Ockeloen, N., Rigau, G., van Hage, W.R., Vossen, P.: NAF and GAF: linking linguistic annotations. In: Proceedings 10th Joint ISO-ACL SIGSEM Workshop on Interoperable Semantic Annotation, Reykjavik, Iceland, p. 9 (2014)
van Hage, W.R., Malaisé, V., Segers, R., Hollink, L., Schreiber, G.: Design and use of the Simple Event Model (SEM). J. Web Sem. 9(2), 128–136 (2011)
Humphreys, K., Gaizauskas, R., Azzam, S.: Event coreference for information extraction. In: ANARESOLUTION 1997 Proceedings of a Workshop on Operational Factors in Practical, Robust Anaphora Resolution for Unrestricted Texts (1997)
Kingsbury, P., Palmer, M.: From TreeBank to PropBank. In: LREC. Citeseer (2002)
LDC: ACE (Automatic Content Extraction) English Annotation Guidelines for Events ver. 5.4.3 2005.07.01. In: Linguistic Data Consortium (2005)
Leacock, C., Chodorow, M.: Combining local context with WordNet similarity for word sense identification (1998)
Lee, H., Recasens, M., Chang, A., Surdeanu, M., Jurafsky, D.: Joint entity and event coreference resolution across documents. In: Proceedings of the 2012 Conference on Empirical Methods in Natural Language Processing and Natural Language Learning, EMNLP-CoNLL 2012 (2012)
Liu, Z., Araki, J., Hovy, E., Mitamura, T.: Supervised within-document event coreference using information propagation. In: Proceedings of the International Conference on Language Resources and Evaluation, LREC 2014 (2014)
Luo, X.: On coreference resolution performance metrics. In: Proceedings of the Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing, EMNLP-2005 (2005)
Mendes, P.N., Jakob, M., García-Silva, A., Bizer, C.: DBpedia Spotlight: shedding light on the web of documents. In: Proceedings of the 7th International Conference on Semantic Systems, pp. 1–8. ACM (2011)
Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., Blondel, M., Prettenhofer, P., Weiss, R., Dubourg, V., Vanderplas, J., Passos, A., Cournapeau, D., Brucher, M., Perrot, M., Duchesnay, E.: Scikit-learn: machine learning in Python. J. Mach. Learn. Res. 12, 2825–2830 (2011)
Pradhan, S., Ramshaw, L., Marcus, M., Palmer, M., Weischedel, R., Xue, N.: CoNLL-2011 shared task: modeling unrestricted coreference in OntoNotes. In: Proceedings of CoNLL 2011: Shared Task (2011)
Recasens, M., Hovy, E.: BLANC: implementing the Rand index for coreference evaluation. Nat. Lang. Eng. 17(4), 485–510 (2011)
UzZaman, N., Llorens, H., Derczynski, L., Verhagen, M., Allen, J., Pustejovsky, J.: SemEval-2013 task 1: TempEval-3: evaluating time expressions, events, and temporal relations (2013)
Vilain, M., Burger, J., Aberdeen, J., Connolly, D., Hirschman, L.: A model theoretic coreference scoring scheme. In: Proceedings of MUC-6 (1995)
Vossen, P., Agerri, R., Aldabe, I., Cybulska, A., van Erp, M., Fokkens, A., Laparra, E., Minard, A.L., Aprosio, A.P., Rigau, G., Rospocher, M., Segers, R.: NewsReader: how semantic web helps natural language processing helps semantic web. Special Issue Knowledge Based Systems, Elsevier (to appear)
Vossen, P., Bond, F., McCrae, J.: Toward a truly multilingual global wordnet grid. In: Proceedings of the 8th Global Wordnet Conference (2016)
Yang, B., Cardie, C., Frazier, P.I.: A hierarchical distance-dependent Bayesian model for event coreference resolution. CoRR abs/1504.05929 (2015). http://arxiv.org/abs/1504.05929
Acknowledgments
The NewsReader project was co-funded by the European Union as project number 316404, FP7 Work Programme Call FP7-ICT-2011-8 Objective Cooperation Research theme “Information and Communication Technologies”, challenge 4.4 - Area Intelligent Information Management.
Copyright information
© 2018 Springer International Publishing AG, part of Springer Nature
About this paper
Cite this paper
Vossen, P., Cybulska, A. (2018). Identity and Granularity of Events in Text. In: Gelbukh, A. (ed.) Computational Linguistics and Intelligent Text Processing. CICLing 2016. Lecture Notes in Computer Science, vol 9624. Springer, Cham. https://doi.org/10.1007/978-3-319-75487-1_39
DOI: https://doi.org/10.1007/978-3-319-75487-1_39
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-75486-4
Online ISBN: 978-3-319-75487-1
eBook Packages: Computer Science, Computer Science (R0)