Supervised Machine Learning for Predicting the Meaning of Verb-Noun Combinations in Spanish

  • Olga Kolesnikova
  • Alexander Gelbukh
Part of the Lecture Notes in Computer Science book series (LNCS, volume 6438)

Abstract

The meaning of such verb-noun combinations as take care, undertake work, pay attention can be generalized as DO what is designated by the noun. Likewise, the meaning of make a decision, provide support, write a letter can be generalized as MAKE what is designated by the noun. These generalizations represent the meaning of certain groups of verb-noun combinations. We use supervised machine learning algorithms to predict the meanings DO, MAKE, BEGIN, and CONTINUE of previously unseen verb-noun pairs. We evaluate the performance of the applied algorithms on a training set using 10- fold cross-validation technique. The learnt models have also been evaluated on an independent test set and the predictions have been checked manually to determine the accuracy of the classifiers. The obtained results show that supervised machine learning methods achieve significant accuracy and can be used for semantic annotation of verb-noun combinations.

Keywords

lexical functions verb-noun combinations meaning representation by means of hypernyms supervised machine learning 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Civit, M., Martí, M.A.: Building Cast3LB: A Spanish Treebank. Research on Language and Computation 2(4), 549–574 (2004)CrossRefGoogle Scholar
  2. 2.
    Diccionario de la Lengua Española. Real Academia Española, Madrid (2001)Google Scholar
  3. 3.
    Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., Witten, I.H.: The WEKA Data Mining Software: An Update. SIGKDD Explorations 11(1) (2009)Google Scholar
  4. 4.
    Kilgarriff, A., Rychly, P., Smrz, P., Tugwell, D.: The Sketch Engine. In: Proceedings of EURALEX 2004, pp. 105–116 (2004)Google Scholar
  5. 5.
    Longman Dictionary of Contemporary English, 3rd edn. Longman Group Ltd., Essex (1995)Google Scholar
  6. 6.
    Mel’čuk, I.A.: A Theory of the Meaning-Text Type Linguistic Models. Nauka Publishers, Moscow (1974) (in Russian)Google Scholar
  7. 7.
    Mel’čuk, I.A.: Lexical Functions: A Tool for the Description of Lexical Relations in a Lexicon. In: Wanner, L. (ed.) Lexical Functions in Lexicography and Natural Language Processing, pp. 37–102. Benjamins Academic Publishers, Amsterdam (1996)Google Scholar
  8. 8.
    Nastase, V., Szpakowicz, S.: Exploring noun-modifier semantic relations. In: 5th International Workshop on Computational Semantics (IWCS-5), Tilburg, Netherlands, pp. 285–301 (2003)Google Scholar
  9. 9.
    Nastase, V., Sayyad-Shiarabad, J., Sokolova, M., Szpakowicz, S.: Learning noun-modifier semantic relations with corpus-based and wordnet-based features. In: Proceedings of the Twenty-First National Conference on Artificial Intelligence and the Eighteenth Innovative Applications of Artificial Intelligence Conference, AAAI Press, Menlo Park (2006)Google Scholar
  10. 10.
    Sidorov, G.: Lemmatization in automatized system for compilation of personal style dictionaries of literature writers. In: Word of Dostoyevsky, pp. 266–300. Russian Academy of Sciences, Moscow (1996)Google Scholar
  11. 11.
    Spanish Web Corpus, http://trac.sketchengine.co.uk/wiki/Corpora/SpanishWebCorpus/ (last viewed June 02, 2010)
  12. 12.
  13. 13.
    The University of Waikato Computer Science Department Machine Learning Group, WEKA download, http://www.cs.waikato.ac.nz/~ml/weka/index_downloading.html (last viewed June 02, 2010)
  14. 14.
    The University of Waikato Computer Science Department Machine Learning Group, Attribute-Relation File Form, http://www.cs.waikato.ac.nz/~ml/weka/arff.html (last viewed June 02, 2010)
  15. 15.
    Vossen, P. (ed.): EuroWordNet: A Multilingual Database with Lexical Semantic Networks. Kluwer Academic Publishers, Dordrecht (1998)MATHGoogle Scholar
  16. 16.
    Wanner, L.: Towards automatic fine-grained classification of verb-noun collocations. Natural Language Engineering 10(2), 95–143 (2004)CrossRefGoogle Scholar
  17. 17.
    Wanner, L., Bohnet, B., Giereth, M.: What is beyond Collocations? Insights from Machine Learning Experiments. In: EURALEX (2006) Google Scholar
  18. 18.
    Witten, I.H., Frank, E.: Data Mining: Practical machine learning tools and techniques, 2nd edn. Morgan Kaufmann, San Francisco (2005)MATHGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2010

Authors and Affiliations

  • Olga Kolesnikova
    • 1
  • Alexander Gelbukh
    • 1
  1. 1.Center for Computing ResearchNational Polytechnic InstituteMexico CityMexico

Personalised recommendations