Skip to main content

The EM-Based Wordnet Synsets Annotation of NP/PP Heads

  • Conference paper
  • 944 Accesses

Part of the Lecture Notes in Computer Science book series (LNAI,volume 6562)

Abstract

This paper contains an application of the EM selection algorithm to semantic annotation of NP/PP heads by means of wordnet synsets. Firstly presented are the preparation of a corpus to be semantically annotated and the wordnet on which the annotation is based. Next, the process of semantic annotation is discussed. Finally, its results are evaluated and compared with the well known solution proposed by Resnik.

Keywords

  • corpus linguistics
  • word sense disambiguation
  • wordnet
  • Polish language

This is a preview of subscription content, access via your institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • DOI: 10.1007/978-3-642-20095-3_39
  • Chapter length: 12 pages
  • Instant PDF download
  • Readable on all devices
  • Own it forever
  • Exclusive offer for individuals only
  • Tax calculation will be finalised during checkout
eBook
USD   99.00
Price excludes VAT (USA)
  • ISBN: 978-3-642-20095-3
  • Instant PDF download
  • Readable on all devices
  • Own it forever
  • Exclusive offer for individuals only
  • Tax calculation will be finalised during checkout
Softcover Book
USD   129.00
Price excludes VAT (USA)

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Hajnicz, E.: Semantic annotation of verb arguments in shallow parsed Polish sentences by means of EM selection algorithm. In: Marciniak, M., Mykowiecka, A. (eds.) Aspects of Natural Language Processing. LNCS, vol. 5070, pp. 211–240. Springer, Heidelberg (2009)

    CrossRef  Google Scholar 

  2. Agirre, E., Edmonds, P. (eds.): Word Sense Disambiguation. Algorithms and Applications. Text, Speech and Language Technology, vol. 33. Springer, Dordrecht (2006)

    Google Scholar 

  3. Przepiórkowski, A.: The IPI PAN corpus. Preliminary version. Institute of Computer Science, Polish Academy of Sciences, Warsaw (2004)

    Google Scholar 

  4. Woliński, M.: Komputerowa weryfikacja gramatyki Świdzińskiego. PhD thesis, Institute of Computer Science, Polish Academy of Sciences, Warsaw (2004)

    Google Scholar 

  5. Woliński, M.: An efficient implementation of a large grammar of Polish. In: Vetulani, Z. (ed.) Proceedings of the 2nd Language & Technology Conference, Poznań, Poland, pp. 343–347 (2005)

    Google Scholar 

  6. Świdziński, M.: Gramatyka formalna języka polskiego. Rozprawy Uniwersytetu Warszawskiego. Wydawnictwa Uniwersytetu Warszawskiego, Warsaw (1992)

    Google Scholar 

  7. Świdziński, M.: Syntactic Dictionary of Polish Verbs. Uniwersytet Warszawski / Universiteit van Amsterdam (1994)

    Google Scholar 

  8. Dębowski, Ł.: Valence extraction using the EM selection and co-occurrence matrices. Language Resources & Evaluation 43, 301–327 (2009)

    CrossRef  Google Scholar 

  9. Piasecki, M., Szpakowicz, S., Broda, B.: A Wordnet from the Ground Up. Oficyna Wydawnicza Politechniki Wrocławskiej, Wrocław (2009)

    Google Scholar 

  10. Derwojedowa, M., Piasecki, M., Szpakowicz, S., Zawisławska, M., Broda, B.: Words, concepts and relations in the construction of Polish WordNet. In: Tanacs, A., Csendes, D., Vincze, V., Fellbaum, C., Vossen, P. (eds.) Proceedings of the Global WordNet Conference, Seged, Hungary (2008)

    Google Scholar 

  11. Derwojedowa, M., Szpakowicz, S., Zawisławska, M., Piasecki, M.: Lexical units as the centrepiece of a wordnet. In: Kłopotek, M.A., Przepiórkowski, A., Wierzchoń, S.T. (eds.) Proceedings of the Intelligent Information Systems XVI (IIS 2008). Challenging Problems in Science: Computer Science. Academic Publishing House Exit, Zakopane (2008)

    Google Scholar 

  12. Fellbaum, C. (ed.): WordNet — An Electronic Lexical Database. MIT Press, Cambridge (1998)

    MATH  Google Scholar 

  13. Vossen, P. (ed.): EuroWordNet: a multilingual database with lexical semantic network. Kluwer Academic Publishers, Dordrecht (1998)

    MATH  Google Scholar 

  14. Vetulani, Z., Walkowska, J., Obrębski, T., Konieczka, P., Rzepecki, P., Marciniak, J.: PolNet — Polish WordNet project algorithm. In: Vetulani, Z. (ed.) Proceedings of the 3rd Language & Technology Conference, Poznań, Poland, pp. 172–176 (2007)

    Google Scholar 

  15. Resnik, P.: Selection and Information: A Class-Based Approach to Lexical Relationships. PhD thesis, University of Pennsylvania, Philadelphia, PA (1993)

    Google Scholar 

  16. Resnik, P.: Selectional preference and sense disambiguation. In: Proceedings of the ACL Workshop on Tagging Text with Lexical Semantics, Why, What and How?, Washington, DC, pp. 52–57 (1997)

    Google Scholar 

  17. McCarthy, D.: Lexical Acquisition at the Syntax-Semantics Interface: Diathesis Alternations, Subcategorization Frames and Selectional Preferences. PhD thesis, University of Sussex (2001)

    Google Scholar 

  18. Ribas, F.: On Acquiring Appropriate Selectional Restrictions from Corpora Using a Semantic Taxonomy. PhD thesis, University of Catalonia (1995)

    Google Scholar 

  19. Li, H., Abe, N.: Generalizing case frames using a thesaurus and the MDL principle. Computational Linguistics 24(2), 217–244 (1998)

    Google Scholar 

  20. Carroll, J., McCarthy, D.: Word sense disambiguation using automatically acquired verbal preferences. Computers and the Humanities. Senseval Special Issue 32(1-2), 109–114 (2000)

    CrossRef  Google Scholar 

  21. Hajnicz, E., Woliński, M.: How valence information influences parsing Polish with Świgra. In: Kłopotek, M.A., Przepiórkowski, A., Wierzchoń, S.T., Trojanowski, K. (eds.) Recent Advances in Intelligent Information Systems. Challenging Problems in Science: Computer Science, pp. 193–206. Academic Publishing House Exit, Warsaw (2009)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and Permissions

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Hajnicz, E. (2011). The EM-Based Wordnet Synsets Annotation of NP/PP Heads. In: Vetulani, Z. (eds) Human Language Technology. Challenges for Computer Science and Linguistics. LTC 2009. Lecture Notes in Computer Science(), vol 6562. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-20095-3_39

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-20095-3_39

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-20094-6

  • Online ISBN: 978-3-642-20095-3

  • eBook Packages: Computer ScienceComputer Science (R0)