An Ambiguity Aware Treebank Search Tool

  • Marcin Woliński
  • Andrzej Zaborowski
Part of the Lecture Notes in Computer Science book series (LNCS, volume 7499)

Abstract

We present a search tool for constituency treebanks with some interesting new features. The tool has been designed for a treebank containing several alternative trees for any given sentence, with one tree marked as the correct one. The tool allows the selected tree to be compared with the other candidates.

The query language is modelled after TIGER Search, but we extend the use of the negation operator so that a class of universally quantified conditions can be expressed in queries.

The tool is built on top of an SQL engine, whose indexing facilities provide for efficient searches.
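
As a rough illustration of how negation can express a universally quantified condition on an SQL back end, the sketch below rewrites "every child of the NP is an N" as "no child of the NP is anything other than an N". Everything in it is an assumption made for the example: the `node` table, its columns, the category labels, and the use of SQLite through Python's `sqlite3` module. The source does not say which SQL engine or schema the tool uses, nor what its query syntax looks like.

```python
import sqlite3

# Hypothetical schema: one row per tree node, with a pointer to its parent.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE node (
    id      INTEGER PRIMARY KEY,
    tree_id INTEGER NOT NULL,
    parent  INTEGER REFERENCES node(id),  -- NULL for the root
    cat     TEXT NOT NULL
);
-- Indexes of the kind an SQL engine can use to speed up such lookups.
CREATE INDEX node_parent ON node(parent);
CREATE INDEX node_cat ON node(cat);
""")

rows = [
    # Tree 1: S -> NP(N, N), VP(V)
    (1, 1, None, "S"),
    (2, 1, 1, "NP"),
    (3, 1, 2, "N"),
    (4, 1, 2, "N"),
    (5, 1, 1, "VP"),
    (6, 1, 5, "V"),
    # Tree 2: S -> NP(Adj, N)
    (7, 2, None, "S"),
    (8, 2, 7, "NP"),
    (9, 2, 8, "Adj"),
    (10, 2, 8, "N"),
]
conn.executemany("INSERT INTO node VALUES (?, ?, ?, ?)", rows)

# "For all children c of np: c is an N" is expressed as
# "there is no child c of np that is not an N".
# The extra EXISTS clause excludes NP nodes that have no children at all.
query = """
SELECT np.id
FROM node AS np
WHERE np.cat = 'NP'
  AND EXISTS (SELECT 1 FROM node AS c WHERE c.parent = np.id)
  AND NOT EXISTS (
      SELECT 1 FROM node AS c
      WHERE c.parent = np.id AND c.cat <> 'N'
  )
ORDER BY np.id
"""
matches = [row[0] for row in conn.execute(query)]
print(matches)
```

Only the NP of the first tree satisfies the condition here, since the NP of the second tree has an `Adj` child. The subqueries filter on `parent`, which is the column the assumed `node_parent` index covers.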

Keywords

treebanks; query/search tools; constituency trees; syntactic ambiguity

Copyright information

© Springer-Verlag Berlin Heidelberg 2012

Authors and Affiliations

  • Marcin Woliński (1)
  • Andrzej Zaborowski (1)
  1. Institute of Computer Science, Polish Academy of Sciences, Warsaw, Poland
