Extracting Human Spanish Nouns

  • Sofia N. Galicia-Haro
  • Alexander F. Gelbukh
Part of the Lecture Notes in Computer Science book series (LNCS, volume 6231)

Abstract

In this article we present a simple method to extract Spanish nouns with the linguistic property of “human” animacy. We describe a non-supervised method based on lexical patterns and on a person name list enlarged from a collection of newspaper texts. Results were obtained from the Web filters and estimation methods are proposed to validate them.

Keywords

Animacy human mark Spanish nouns non supervised learning 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Aissen, J.: Differential Object Marking: Iconicity vs. Economy. Natural Language and Linguistic Theory 21(3), 435–483 (2003)CrossRefGoogle Scholar
  2. 2.
    Altmann, L.J.P., Kemper, S.: Effects of Age, Animacy, and Activation Order on Sentence Production. Language and Cognitive Processes 21(1), 322–354 (2006)CrossRefGoogle Scholar
  3. 3.
    Berenguer, C.R., Cruz Pastor Ferrán, M.: ¿Cuánto dura/tarda la clase de Español?: una reflexión sobre determinados usos verbales en Español. In: Lengua y cultura en la enseñanza del Español a extranjeros. Actas del VII Congreso de ASELE, pp. 397i–406i. Ediciones de la Universidad de Castilla la Mancha (1998)Google Scholar
  4. 4.
    Brants, T., Franz, A.: Web 1T 5-gram Version 1 Linguistic Data Consortium (2006)Google Scholar
  5. 5.
    Fleischman, M., Echihabi, A., Hovy, E.: Offline Strategies for Online Question Answering: Answering Questions before They are Asked. In: Proceedings of the ACL Conference, pp. 1–7 (2003)Google Scholar
  6. 6.
    Foundalis, H.E.: Evolution of Gender in Indo-European Languages. In: Proceedings of the 24th Annual Conference of the Cognitive Science Society, Fairfax, VA, pp. 304–309 (2002)Google Scholar
  7. 7.
    Galicia-Haro, S.N.: Using Electronic Texts for an Annotated Corpus Building. In: 4th Mexican International Conference on Computer Science, ENC, Mexico, pp. 26–33 (2003)Google Scholar
  8. 8.
    Heng, J., Lin, D.: Gender and Animacy Knowledge Discovery from Web-Scale N-Grams for Unsupervised Person Mention Detection. In: Proceedings of PACLIC (2009)Google Scholar
  9. 9.
    Lin, D.: Automatic Retrieval and Clustering of Similar Words. In: Proceedings of the 17th International Conference on Computational Linguistics, pp. 768–774 (1998)Google Scholar
  10. 10.
    Orăsan, C., Evans, R.: Learning to Identify Animate References. In: Proceedings of the Workshop on Computational Natural Language Learning, ACL (2001)Google Scholar
  11. 11.
    Orăsan, C., Evans, R.: NP Animacy Resolution for Anaphora Resolution. Journal of Artificial Intelligence Research 29, 79–103 (2007)MATHGoogle Scholar
  12. 12.
    Øvrelid, L.: Empirical Evaluations of Animacy Annotation. In: Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics (EACL), pp. 630–638 (2009)Google Scholar
  13. 13.
    Paşca, M., Van Durme, B.: What You Seek Is What You Get: Extraction of Class Attributes from Query Logs. In: Proceedings of the International Joint Conference on Artificial Intelligence 2007, pp. 2832–2837 (2007)Google Scholar
  14. 14.
    von Heusinger, K., Kaiser, G.A.: Differential Object Marking and the Lexical Semantics of Verbs in Spanish. In: Kaiser, G.A., Leonetti, M. (eds.) Proceedings of the Workshop Definiteness, Specificity and Animacy in Ibero-Romance Languages, pp. 85–110 (2007)Google Scholar
  15. 15.
    von Heusinger, K., Kaiser, G.A.: The Interaction of Animacy, Definiteness and Specificity in Spanish. In: von Heusinger, K., Kaiser, G.A. (eds.) Proceedings of the Workshop: Semantic and Syntactic Aspects of Specificity, Romance Languages, pp. 41–65. Universität Konstanz, Konstanz (2003)Google Scholar
  16. 16.
    Yamamoto, M.: Animacy and Reference: A Cognitive Approach to Corpus Linguistics. Studies in Language Companion Series, vol. 46. John Benjamins, Amsterdam (1999)Google Scholar
  17. 17.
    Zaenen, A., Carletta, J., Garretson, G., Bresnan, J., Koontz-Garboden, A., Nikitina, T., O’Connor, M.C., Wasow, T.: Animacy Encoding in English: Why and How. In: Proceedings of the 2004 ACL Workshop on Discourse Annotation, pp. 118–125 (2004)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2010

Authors and Affiliations

  • Sofia N. Galicia-Haro
    • 1
  • Alexander F. Gelbukh
    • 2
  1. 1.Facultad de CienciasUniversidad Nacional Autónoma de MéxicoMexico
  2. 2.Centro de Investigación en ComputaciónInstituto Politécnico NacionalMexico

Personalised recommendations