Advertisement

Parsing with Agreement

  • Adam Radziszewski
Part of the Lecture Notes in Computer Science book series (LNCS, volume 5729)

Abstract

Shallow parsing has been proposed as a means of arriving at practically useful structures while avoiding the difficulties of full syntactic analysis. According to Abney’s principles, it is preferred to leave an ambiguity pending than to make a likely wrong decision. We show that continuous phrase chunking as well as shallow constituency parsing display evident drawbacks when faced with freer word order languages. Those drawbacks may lead to unnecessary data loss as a result of decisions forced by the formalism and therefore diminish practical value of shallow parsers for Slavic languages.

We present an alternate approach to shallow parsing of noun phrases for Slavic languages which follows the original Abney’s principles. The proposed approach to parsing is decomposed into several stages, some of which allow for marking discontinuous phrases.

Keywords

Noun Phrase Sentiment Analysis Prepositional Phrase Slavic Language Regular Grammar 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Abney, S.: Part-of-speech tagging and partial parsing. In: Corpus-Based Methods in Language and Speech, pp. 118–136. Kluwer Academic Publishers, Dordrecht (1997)CrossRefGoogle Scholar
  2. 2.
    Abney, S.: Partial parsing via finite-state cascades. In: Natural Language Engineering, pp. 8–15 (1996)Google Scholar
  3. 3.
    Przepiórkowski, A.: Slavic information extraction and partial parsing. In: Proceedings of the Workshop on Balto-Slavonic Natural Language Processing, Prague, Czech Republic, Association for Computational Linguistics, pp. 1–10 (June 2007)Google Scholar
  4. 4.
    Derwojedowa, M.: Porza̧dek linearny składników zdania elementarnego w jȩzyku polskim. Dom Wydawniczy Elipsa, Warsaw (2000)Google Scholar
  5. 5.
    Nenadić, G., Vitas, D.: Formal model of noun phrases in serbo-croatian. BULAG (23), 297–311 (1998)Google Scholar
  6. 6.
    Nenadić, G., Vitas, D.: Using local grammars for agreement modeling in highly inflective languages. In: Sojka, P., Kopeček, I., Pala, K. (eds.) Proceedings of TSD 1998, Brno, Czech Republic, pp. 91–96. Springer, Heidelberg (1998)Google Scholar
  7. 7.
    Nenadić, G.: Local grammars and parsing coordination of nouns in serbo-croatian. In: Sojka, P., Kopeček, I., Pala, K., Kopeček, I. (eds.) TSD 2000. LNCS (LNAI), vol. 1902, pp. 57–62. Springer, Heidelberg (2000)CrossRefGoogle Scholar
  8. 8.
    Nenadić, G., Vitas, D., Krstev, C.: Local grammars and compound verb lemmatization in serbo-croatian. Current Issues in Formal Slavic Linguistics, 469–477 (1999)Google Scholar
  9. 9.
    Kristina Vučković, M.T., Dovedan, Z.: Rule-based chunker for croatian. In: European Language Resources Association (ELRA) (ed.) Proceedings of the Sixth International Language Resources and Evaluation (LREC 2008), Marrakech, Morocco (May 2008)Google Scholar
  10. 10.
    Przepiórkowski, A.: A preliminary formalism for simultaneous rule-based tagging and partial parsing. In: Data Structures for Linguistic Resources and Applications: Proceedings of the Biennial GLDV Conference 2007, pp. 81–90. Gunter Narr Verlag, Tübingen (2007)Google Scholar
  11. 11.
    Przepiórkowski, A.: Powierzchniowe przetwarzanie jȩzyka polskiego. Akademicka Oficyna Wydawnicza EXIT, Warsaw (2008)Google Scholar
  12. 12.
    Przepiórkowski, A.: Towards a partial grammar of Polish for valence extraction. In: Proceedings of Grammar and Corpora 2007, Liblice, Czech Republic (2007)Google Scholar
  13. 13.
    Buczyński, A., Wawer, A.: Shallow parsing in sentiment analysis of product reviews. In: Proceedings of the Partial Parsing workshop at LREC 2008, pp. 14–18 (2008)Google Scholar
  14. 14.
    Ogrodniczuk, M.: Nowa edycja wzbogaconego korpusu słownika frekwencyjnego. Jȩzykoznawstwo w Polsce. Stan i perspektywy, 181–190 (2003)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2009

Authors and Affiliations

  • Adam Radziszewski
    • 1
  1. 1.Institute of InformaticsWrocław University of TechnologyWrocławPoland

Personalised recommendations