Decision Tree-Based Evaluation of Genitive Classification – An Empirical Study on CMC and Text Corpora
- Cite this paper as:
- Hansen S., Schneider R. (2013) Decision Tree-Based Evaluation of Genitive Classification – An Empirical Study on CMC and Text Corpora. In: Gurevych I., Biemann C., Zesch T. (eds) Language Processing and Knowledge in the Web. Lecture Notes in Computer Science, vol 8105. Springer, Berlin, Heidelberg
Contemporary studies on the characteristics of natural language benefit enormously from the increasing amount of linguistic corpora. Aside from text and speech corpora, corpora of computer-mediated communication (CMC) position themselves between orality and literacy, and beyond that provide insight into the impact of “new”, mainly internet-based media on language behaviour. In this paper, we present an empirical attempt to work with annotated CMC corpora for the explanation of linguistic phenomena. In concrete terms, we implement machine learning algorithms to produce decision trees that reveal rules and tendencies about the use of genitive markers in German.
KeywordsCorpus Linguistics Computer-Mediated Communication Machine Learning Decision Trees Grammar Genitive Classification
Unable to display preview. Download preview PDF.