PPChecker: Plagiarism Pattern Checker in Document Copy Detection

  • NamOh Kang
  • Alexander Gelbukh
  • SangYong Han
Conference paper

DOI: 10.1007/11846406_83

Part of the Lecture Notes in Computer Science book series (LNCS, volume 4188)
Cite this paper as:
Kang N., Gelbukh A., Han S. (2006) PPChecker: Plagiarism Pattern Checker in Document Copy Detection. In: Sojka P., Kopeček I., Pala K. (eds) Text, Speech and Dialogue. TSD 2006. Lecture Notes in Computer Science, vol 4188. Springer, Berlin, Heidelberg

Abstract

Nowadays, most documents are produced in digital format, in which they can be easily accessed and copied. Document copy detection is a very important tool for protecting the author’s copyright. We present PPChecker, a document copy detection system based on plagiarism pattern checking. PPChecker calculates the amount of data copied from the original document to the query document, based on linguistically motivated plagiarism patterns. Experiments performed on the CISI document collection show that PPChecker produces better decision information for document copy detection than existing systems.

Keywords

Document Copy Detection Plagiarism Pattern 

Copyright information

© Springer-Verlag Berlin Heidelberg 2006

Authors and Affiliations

  • NamOh Kang (1)
  • Alexander Gelbukh (2)
  • SangYong Han (1)

  1. School of Computer Science & Engineering, Chung-Ang University, South Korea
  2. National Polytechnic Institute, Mexico
