LIARc: Labeling Implicit ARguments in Spanish Deverbal Nominalizations

  • Aina Peris
  • Mariona Taulé
  • Horacio Rodríguez
  • Manuel Bertran Ibarz
Part of the Lecture Notes in Computer Science book series (LNCS, volume 7816)

Abstract

This paper deals with the automatic identification and annotation of the implicit arguments of deverbal nominalizations in Spanish. We present the first version of the LIAR system focusing on its classifier component. We have built a supervised Machine Learning feature based model that uses a subset of AnCora-Es as a training corpus. We have built four different models and the overall F-Measure is 89.9%, which means an increase F-Measure performance approximately 35 points over the baseline (55%). However, a detailed analysis of the feature performance is still needed. Future work will focus on using LIAR to automatically annotate the implicit arguments in the whole AnCora-Es.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Fillmore, C.J.: Pragmatically Controlled Zero Anaphora. Technical report, Department of Linguistics. University of California (1986)Google Scholar
  2. 2.
    Fillmore, C.J., Baker, C.F.: Frame semantics for text understanding. In: Proceedings of the Workshop on WordNet and Other Lexical Resources, NAACL. Association for Computational Linguistics, Pittsburgh (2001)Google Scholar
  3. 3.
    Gerber, M., Chai, J.Y.: Beyond NomBank: a study of implicit arguments for nominal predicates. In: Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, ACL 2010, pp. 1583–1592. Association for Computational Linguistics, Stroudsburg (2010)Google Scholar
  4. 4.
    Gerber, M., Chai, J.Y.: Semantic Role Labeling of Implicit Arguments for Nominal Predicates. Computational Linguistics 38, 755–798 (2012)CrossRefGoogle Scholar
  5. 5.
    Peris, A., Taulé, M.: Annotating the argument structure of deverbal nominalizations in Spanish. In: Language Resources and Evaluation (2011), doi:10.1007/s10579-011-9172-xGoogle Scholar
  6. 6.
    Taulé, M., Martí, M., Recasens, M.: AnCora: Multilevel Annotated Corpora for Catalan and Spanish. In: Proceedings of the Sixth International Language Resources and Evaluation (LREC 2008), pp. 96–101. European Language Resources Association (ELRA), Marrakech (2008)Google Scholar
  7. 7.
    Palmer, M., Kingsbury, P., Gildea, D.: The Proposition Bank: An Annotated Corpus of Semantic Roles. Computational Linguistics 31, 76–105 (2005)CrossRefGoogle Scholar
  8. 8.
    Meyers, A.: Annotation Guidelines for NomBank Noun Argument Structure for PropBank. Technical report, University of New York (2007)Google Scholar
  9. 9.
    Kipper, K.: VerbNet: A broad-coverage, comprehensive verb lexicon. PhD thesis, Computer and Information Science Dept., University of Pennsylvania, PA (2005)Google Scholar
  10. 10.
    Palmer, M.S., Dahl, D.A., Schiffman, R.J., Hirschman, L., Linebarger, M., Dowding, J.: Recovering Implicit information. In: Proceedings of the 24th Annual Meeting of the Association for Computational Linguistics, pp. 10–19. Association for Computational Linguistics, New York (1986)CrossRefGoogle Scholar
  11. 11.
    Tetreaul, J.R.: Implicit Role Reference. In: International Symposium on Reference Resolution for Natural Language Processing, pp. 109–115 (2002)Google Scholar
  12. 12.
    Heeman, P., Allen, J.: Trains 93 dialogues. Technical report, University of Rochester (1995)Google Scholar
  13. 13.
    Ruppenhofer, J., Sporleder, C., Morante, R., Baker, C., Palmer, M.: SemEval-2010 Task 10: Linking Events and Their Participants in Discourse. In: Proceedings of the 5th International Workshop on Semantic Evaluation, pp. 296–299. Association for Computational Linguistics, Uppsala (2010)Google Scholar
  14. 14.
    Chen, D., Schneider, N., Das, D., Smith, N.A.: SEMAFOR: Frame argument resolution with log-linear models. In: Proceedings of the 5th International Workshop on Semantic Evaluation, SemEval 2010, pp. 264–267. Association for Computational Linguistics, Stroudsburg (2010)Google Scholar
  15. 15.
    Tonelli, S., Delmonte, R.: VENSES++: Adapting a deep semantic processing system to the identification of null instantiations. In: Proceedings of the 5th International Workshop on Semantic Evaluation, SemEval 2010, pp. 296–299. Association for Computational Linguistics, Stroudsburg (2010)Google Scholar
  16. 16.
    Silberer, C., Frank, A.: Casting Implicit Role Linking as an Anaphora Resolution Task. In: *SEM 2012: The First Joint Conference on Lexical and Computational Semantics – Volume 1: Proceedings of the Main Conference and the Shared Task, and Volume 2: Proceedings of the Sixth International Workshop on Semantic Evaluation (SemEval 2012), pp. 1–10. Association for Computational Linguistics, Montréal (2012)Google Scholar
  17. 17.
    Laparra, E., Rigau, G.: Exploiting Explicit Annotations and Semantic Types for Implicit Argument Resolution. In: ICSC, pp. 75–78 (2012)Google Scholar
  18. 18.
    Ruppenhofer, J., Gorinski, P., Sporleder, C.: In search of missing arguments: A linguistic approach. In: Proceedings of the International Conference Recent Advances in Natural Language Processing, RANLP 2011 Organising Committee, Hissar, Bulgaria, pp. 331–338 (2011)Google Scholar
  19. 19.
    Tonelli, S., Delmonte, R.: Desperately seeking implicit arguments in text. In: Proceedings of the ACL 2011 Workshop on Relational Models of Semantics, pp. 54–62. Association for Computational Linguistics, Stroudsburg (2011)Google Scholar
  20. 20.
    Baker, C.F., Fillmore, C.J., Lowe, J.B.: The Berkeley FrameNet Project. In: Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics, ACL 1998, vol. 1, pp. 86–90. Association for Computational Linguistics, Stroudsburg (1998)Google Scholar
  21. 21.
    Marcus, M.P., Santorini, B., Marcinkiewicz, M.A.: Building a large annotated corpus of english: The penn treebank. Computational Linguistics 19, 313–330 (1993)Google Scholar
  22. 22.
    Peris, A., Taulé, M.: AnCora-Nom: A Spanish Lexicon of Deverbal Nominalizations. Procesamiento del Lenguaje Natural 46, 11–19 (2011)Google Scholar
  23. 23.
    Alvez, J., Atserias, J., Carrera, J., Climent, S., Oliver, A., Rigau, G.: Consistent annotation of eurowordnet with the top concept ontology. In: Proceedings of Fourth International WordNet Conference (GWC 2008). Association for Computational Linguistics (2008)Google Scholar
  24. 24.
    Aparicio, J., Taulé, M., Martí, M.: AnCora-Verb: A Lexical Resource for the Semantic Annotation of Corpora. In: Proceedings of the Sixth International Language Resources and Evaluation (LREC 2008), pp. 797–802. European Language Resources Association (ELRA), Marrakech (2008)Google Scholar
  25. 25.
    Witten, I.H., Frank, E.: Data Mining: Practical Machine Learning Tools and Techniques, 2nd edn. Morgan Kaufmann, San Francisco (2005)MATHGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2013

Authors and Affiliations

  • Aina Peris
    • 1
  • Mariona Taulé
    • 1
  • Horacio Rodríguez
    • 2
  • Manuel Bertran Ibarz
    • 1
  1. 1.CLiC, Centre de Llenguatge i ComputacióUniversity of BarcelonaBarcelonaSpain
  2. 2.Technical University of Catalonia TALP Research CenterBarcelonaSpain

Personalised recommendations