Abstract
This paper evaluates the impact of semantic features in coreference resolution for the Portuguese language. We show that the new proposed features obtained on the basis of currently available Portuguese semantic resources improve results in precision, recall and f-measure.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
Different from [17], we did not implement rules “location and numeric mismatches".
- 2.
References
Bick, E.: The parsing system “palavras”: automatic grammatical analysis of Portuguese in a constraint grammar framework. Ph.D. thesis, Aarhus University, Aarhus University Press, Denmark (2000)
Bruckschen, M., Muniz, F., Souza, J., Fuchs, J., Infante, K., Muniz, M., Gonçalves, P., Vieira, R., Aluísio, S.: Anotação linguística em xml do corpus pln-br. Série de relatórios do NILC (2008)
Cardoso, N.: Rembrandt - a named-entity recognition framework. In: Proceedings of the Eighth International Conference on Language Resources and Evaluation - LREC, Istanbul, Turkey, pp. 1240–1243 (2012)
Carreras, X., Màrquez, L., Padró, L.: A simple named entity extractor using adaboost. In: Proceedings of the Seventh Conference on Natural Language Learning at HLT-NAACL, vol. 4, pp. 152–155. Association for Computational Linguistics (2003)
Collovini, S., Carbonel, T.I., Fuchs, J.T., Coelho, J.C., Rino, L., Vieira, R.: Summ-it: Um corpus anotado com informações discursivas visando a sumarização automática. In: Proceedings of V Workshop em Tecnologia da Informação e da Linguagem Humana, Rio de Janeiro, RJ, Brasil, pp. 1605–1614 (2007)
Coreixas, T.: Resolução de correferência e categorias de entidades nomeadas. Pontifícia Universidade Católica Do Rio Grande Do Sul, Dissertação de Mestrado (2010)
da Silva, F.J.V., Carvalho, A.M.B.R., Roman, N.T.: A comparative analysis of centering-based algorithms for pronoun resolution in Portuguese. In: Kuri-Morales, A., Simari, G.R. (eds.) IBERAMIA 2010. LNCS, vol. 6433, pp. 336–345. Springer, Heidelberg (2010)
de Souza, J.G.C., Gonçalves, P.N., Vieira, R.: Learning coreference resolution for Portuguese texts. In: Teixeira, A., de Lima, V.L.S., de Oliveira, L.C., Quaresma, P. (eds.) PROPOR 2008. LNCS (LNAI), vol. 5190, pp. 153–162. Springer, Heidelberg (2008)
do Amaral, D.O.F.: O reconhecimento de entidades nomeadas por meio de conditional random fields para a língua portuguesa. Dissertação de Mestrado, Pontifícia Universidade Católica do Rio Grande do Sul (2013)
Fonseca, E.B., Vieira, R., Vanin, A.: Dealing with imbalanced datasets for coreference resolution. In: Proceedings of The Twenty-Eighth International Flairs Conference - FLAIRS (2015)
Fonseca, E.B., Vieira, R., Vanin, A.A.: Coreference resolution in portuguese: detecting person, location and organization. J. Braz. Comput. Intell. Soc. 12, 86–97 (2014)
Freitas, C., Mota, C., Santos, D., Oliveira, H.G., Carvalho, P.: Second HAREM: advancing the state of the art of named entity recognition in Portuguese. In: Proceedings of the International Conference on Language Resources and Evaluation, LREC, Valletta, Malta (2010)
Gabbard, R., Freedman, M., Weischedel, R.: Coreference for learning to extract relations: yes, virginia, coreference matters. In: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies: short papers, vol. 2, pp. 288–293. Association for Computational Linguistics (2011)
Garcia, M., Gamallo, P.: An entity-centric coreference resolution system for person entities with rich linguistic information. In: Proceedings of 25th International Conference on Computational Linguistics, COLING, Dublin, Ireland, pp. 741–752 (2014)
Garcia, M., Gamallo, P.: Multilingual corpora with coreferential annotation of person entities. In: Proceedings of the 9th edn. of the Language Resources and Evaluation Conference - LREC, pp. 3229–3233 (2014)
Hou, Y., Markert, K., Strube, M.: A rule-based system for unrestricted bridging resolution: recognizing bridging anaphora and finding links to antecedents. In: Proceedings of Conference on Empirical Methods in Natural Language Processing - EMNLP, Doha, Qatar, pp. 2082–2093 (2014)
Lee, H., Chang, A., Peirsman, Y., Chambers, N., Surdeanu, M., Jurafsky, D.: Deterministic coreference resolution based on entity-centric, precision-ranked rules. Comput. Linguist. 39, 885–916 (2013). MIT Press
Maziero, E.G., Pardo, T.A., Di Felippo, A., Dias da Silva, B.C.: A base de dados lexical e a interface web do tep 2.0: thesaurus eletrônico para o português do brasil. In: Proceedings of the XIV Brazilian Symposium on Multimedia and the Web, pp. 390–392. ACM (2008)
Miller, G.A.: WordNet: a lexical database for english. Commun. ACM 38(11), 39–41 (1995)
Oliveira, H.G., Gomes, P.: ECO and Onto-PT: a flexible approach for creating a portuguese wordnet automatically. Lang. Resour. Eval. 48(2), 373–393 (2014)
Pradhan, S., Ramshaw, L., Marcus, M., Palmer, M., Weischedel, R., Xue, N.: Conll-2011 shared task: Modeling unrestricted coreference in ontonotes. In: Proceedings of the Fifteenth Conference on Computational Natural Language Learning: Shared Task, pp. 1–27. Association for Computational Linguistics (2011)
Rahman, A., Ng, V.: Coreference resolution with world knowledge. In: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Portland, Oregon, USA, pp. 814–824 (2011)
Sarmento, L., Pinto, A.S., Cabral, L.M.: REPENTINO – A wide-scope gazetteer for entity recognition in Portuguese. In: Vieira, R., Quaresma, P., Nunes, M.G.V., Mamede, N.J., Oliveira, C., Dias, M.C. (eds.) PROPOR 2006. LNCS (LNAI), vol. 3960, pp. 31–40. Springer, Heidelberg (2006)
da Silva, J.F.: Resolução de correferência em múltiplos documentos utilizando aprendizado não supervisionado. Dissertação de Mestrado, Universidade de São Paulo (2011)
Soon, W.M., Ng, H.T., Lim, C.Y.: A machine learning approach to coreference resolution of noun phrases. Comput. Linguist. 27(4), 521–544 (2001)
Acknowledgments
The authors acknowledge the financial support of CNPq (Conselho Nacional de Desenvolvimento Científico e Tecnológico), CAPES (Coordenação de Aperfeiçoamento de Pessoal de Nível Superior) and FAPERGS (Fundação de Amparo à Pesquisa do Rio Grande do Sul).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Fonseca, E., Vieira, R., Vanin, A. (2016). Improving Coreference Resolution with Semantic Knowledge. In: Silva, J., Ribeiro, R., Quaresma, P., Adami, A., Branco, A. (eds) Computational Processing of the Portuguese Language. PROPOR 2016. Lecture Notes in Computer Science(), vol 9727. Springer, Cham. https://doi.org/10.1007/978-3-319-41552-9_21
Download citation
DOI: https://doi.org/10.1007/978-3-319-41552-9_21
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-41551-2
Online ISBN: 978-3-319-41552-9
eBook Packages: Computer ScienceComputer Science (R0)