Partial Grammar Checking for Czech Using the SET Parser

Kovář, Vojtěch

doi:10.1007/978-3-319-10816-2_38

Vojtěch Kovář²¹

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 8655))

Included in the following conference series:

International Conference on Text, Speech, and Dialogue

1512 Accesses
3 Citations

Abstract

Checking people’s writing for correctness is one of the prominent language technology applications. In the Czech language, punctuation errors and mistakes in subject-predicate agreement belong to the most severe and most frequent errors people make, as there are complex and non-intuitive rules for both of these phenomena. At the same time, they include numerous syntactic, semantic and pragmatic aspects which makes them very difficult to be formalized for automatic checking. In this paper, we present an automatic method for fixing errors in commas and subject-predicate agreement, using pattern-matching rule-based syntactic analysis provided by the SET parsing system. We explain the method and present first evaluation of the overall accuracy.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Holan, T., Kuboň, V., Plátek, M.: A prototype of a grammar checker for Czech. In: Proceedings of the 5th Conference on Applied Natural Language Processing, pp. 147–154. Association for Computational Linguistics (1997)
Google Scholar
Kovář, V., Horák, A., Jakubíček, M.: Syntactic analysis using finite patterns: A new parsing system for Czech. In: Vetulani, Z. (ed.) LTC 2009. LNCS, vol. 6562, pp. 161–171. Springer, Heidelberg (2011)
Chapter Google Scholar
Oliva, K., Petkevič, V.: Microsoft s.r.o.: Czech grammar checker (2005), http://office.microsoft.com/word
Lingea s.r.o.: Grammaticon (2003), http://www.lingea.cz/grammaticon.htm
Pala, K.: Pište dopisy konečně bez chyb – Česká gramatickÝ korektor pro Microsoft Office. Computer, 13–14 (2005)
Google Scholar
Behún, D.: Kontrola české gramatiky pro MS Office - konec korektorů v Čechách (2005), http://interval.cz/clanky/kontrola-ceske-gramatiky-pro-ms-office-konec-korektoru-v-cechach
Jakubíček, M., Horák, A.: Punctuation detection with full syntactic parsing. Research in Computing Science, Special issue: Natural Language Processing and its Applications 46, 335–343 (2010)
Google Scholar
Horák, A.: Computer Processing of Czech Syntax and Semantics. Librix.eu, Brno (2008)
Google Scholar
Martin, J.: Rapid application development. Macmillan (1991)
Google Scholar
Gabriel, R.P.: Lisp: Good news, bad news, how to win big. AI Expert 6, 30–39 (1991)
Google Scholar
Sedláček, R., Smrž, P.: A new Czech morphological analyser ajka. In: Matoušek, V., Mautner, P., Mouček, R., Tauser, K. (eds.) TSD 2001. LNCS (LNAI), vol. 2166, pp. 100–107. Springer, Heidelberg (2001)
Chapter Google Scholar
Pala, K., Rychlý, P., Smrž, P.: DESAM — annotated corpus for Czech. In: Jeffery, K. (ed.) SOFSEM 1997. LNCS, vol. 1338, pp. 523–530. Springer, Heidelberg (1997)
Google Scholar
Trifanová, B.: Analýza chyb v diktátech žáků po absolvování 1. stupně ZŠ. Bachelor thesis, Masaryk University (2014), http://is.muni.cz/th/382965/ff_b
Šmerk, P.: Unsupervised learning of rules for morphological disambiguation. In: Sojka, P., Kopeček, I., Pala, K. (eds.) TSD 2004. LNCS (LNAI), vol. 3206, pp. 211–216. Springer, Heidelberg (2004)
Chapter Google Scholar

Download references

Author information

Authors and Affiliations

NLP Centre, Faculty of Informatics, Masaryk University, Botanická 68a, 602 00, Brno, Czech Republic
Vojtěch Kovář

Authors

Vojtěch Kovář
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Faculty of Informatics, Masaryk University, Botanicá 6a, 60200, Brno, Czech Republic
Petr Sojka
Faculty of Informatics, Department of Information Technologies, Masaryk University, 602 00, Brno, Czech Republic
Aleš Horák , Ivan Kopeček & Karel Pala , &

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Kovář, V. (2014). Partial Grammar Checking for Czech Using the SET Parser. In: Sojka, P., Horák, A., Kopeček, I., Pala, K. (eds) Text, Speech and Dialogue. TSD 2014. Lecture Notes in Computer Science(), vol 8655. Springer, Cham. https://doi.org/10.1007/978-3-319-10816-2_38

Download citation

DOI: https://doi.org/10.1007/978-3-319-10816-2_38
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-10815-5
Online ISBN: 978-3-319-10816-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics