Syntactic Dependency-Based N-grams: More Evidence of Usefulness in Classification
- Cite this paper as:
- Sidorov G., Velasquez F., Stamatatos E., Gelbukh A., Chanona-Hernández L. (2013) Syntactic Dependency-Based N-grams: More Evidence of Usefulness in Classification. In: Gelbukh A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2013. Lecture Notes in Computer Science, vol 7816. Springer, Berlin, Heidelberg
The paper introduces and discusses a concept of syntactic n-grams (sn-grams) that can be applied instead of traditional n-grams in many NLP tasks. Sn-grams are constructed by following paths in syntactic trees, so sn-grams allow bringing syntactic knowledge into machine learning methods. Still, previous parsing is necessary for their construction. We applied sn-grams in the task of authorship attribution for corpora of three and seven authors with very promising results.
KeywordsSyntactic n-grams sn-grams syntactic paths authorship attribution task SVM classifier
Unable to display preview. Download preview PDF.