
Automatic Analysis of Authorship Using Syntactic n-grams

  • Grigori Sidorov
Chapter
Part of the SpringerBriefs in Computer Science book series (BRIEFSCOMPUTER)

Abstract

We conducted several experiments [93] to test the usefulness of syntactic n-grams. The task is authorship attribution: given a set of texts whose authors are known, determine the author of a new text, choosing only among the authors under consideration. Our corpus consists of texts written by three different authors.
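
The closed-set task described above can be illustrated with a small sketch. It assumes that a syntactic n-gram is formed by following head-to-dependent arcs in a dependency tree, and it swaps in a simple nearest-profile rule based on cosine similarity, which is not necessarily the classifier used in the experiments. The parse encoding (`words`, `heads`), the function names, and the toy sentences are all hypothetical and chosen only for illustration.

```python
from collections import Counter
from math import sqrt


def syntactic_ngrams(words, heads, n=2):
    """Return n-grams obtained by following head -> dependent arcs.

    words: list of tokens.
    heads: heads[i] is the index of the head of words[i], -1 for the root.
    (A hypothetical toy encoding of a dependency parse.)
    """
    children = {i: [] for i in range(len(words))}
    for dep, head in enumerate(heads):
        if head >= 0:
            children[head].append(dep)

    def paths(node, length):
        # All downward paths containing `length` nodes, starting at `node`.
        if length == 1:
            return [[node]]
        return [[node] + rest
                for child in children[node]
                for rest in paths(child, length - 1)]

    return [tuple(words[i] for i in path)
            for start in range(len(words))
            for path in paths(start, n)]


def profile(parsed_sentences, n=2):
    """Count the syntactic n-grams over a list of (words, heads) pairs."""
    counts = Counter()
    for words, heads in parsed_sentences:
        counts.update(syntactic_ngrams(words, heads, n))
    return counts


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[k] * b[k] for k in a.keys() & b.keys())
    norm = (sqrt(sum(v * v for v in a.values()))
            * sqrt(sum(v * v for v in b.values())))
    return dot / norm if norm else 0.0


def attribute(unknown, known_by_author, n=2):
    """Pick, among the known authors only, the closest profile."""
    target = profile(unknown, n)
    return max(known_by_author,
               key=lambda author: cosine(target,
                                         profile(known_by_author[author], n)))


# Toy data: three authors, one hand-parsed sentence each (assumed parses).
HEADS = [1, -1, 3, 1]  # subject -> verb, verb is root, adjective -> noun, noun -> verb
known = {
    "A": [(["she", "reads", "old", "books"], HEADS)],
    "B": [(["he", "writes", "long", "letters"], HEADS)],
    "C": [(["they", "sing", "loud", "songs"], HEADS)],
}
unknown = [(["she", "reads", "new", "books"], HEADS)]

print(attribute(unknown, known))  # prints A
```

In practice the parses would come from a syntactic parser and the profiles would be built from whole texts. The point of the sketch is only that the candidate set is fixed in advance, so the answer is always one of the known authors.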

Bibliography

  [93] Sidorov, G., Velasquez, F., Stamatatos, E., Gelbukh, A., Chanona-Hernández, L.: Syntactic Dependency-based N-grams as Classification Features. LNAI, 7630, pp. 1–11 (2012)
  [4] Argamon, S., Juola, P.: Overview of the international authorship identification competition at PAN-2011. In: Proc. of 5th Int. Workshop on Uncovering Plagiarism, Authorship, and Social Software Misuse (2011)
  [52] Juola, P.: Authorship Attribution. Foundations and Trends in Information Retrieval, 1(3), pp. 233–334 (2006)
  [57] Koppel, M., Schler, J., Argamon, S.: Authorship attribution in the wild. Language Resources and Evaluation, 45(1), pp. 83–94 (2011)
  [102] Stamatatos, E.: A survey of modern authorship attribution methods. Journal of the American Society for Information Science and Technology, 60(3), pp. 538–556 (2009)
  [45] Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., Witten, I.H.: The WEKA Data Mining Software: An Update. SIGKDD Explorations, 11(1), pp. 10–18 (2009)
  [94] Sidorov, G., Velasquez, F., Stamatatos, E., Gelbukh, A., Chanona-Hernández, L.: Syntactic Dependency-Based N-grams: More Evidence of Usefulness in Classification. LNCS, 7816 (Proc. of CICLing), pp. 13–24 (2013)
  [95] Sidorov, G., Velasquez, F., Stamatatos, E., Gelbukh, A., Chanona-Hernández, L.: Syntactic N-grams as Machine Learning Features for Natural Language Processing. Expert Systems with Applications, 41(3), pp. 853–860 (2014)

Copyright information

© The Author(s), under exclusive licence to Springer Nature Switzerland AG 2019

Authors and Affiliations

  • Grigori Sidorov (1)
  1. Instituto Politécnico Nacional, Centro de Investigación en Computación, Mexico City, Mexico
