PPChecker: Plagiarism Pattern Checker in Document Copy Detection

  • NamOh Kang
  • Alexander Gelbukh
  • SangYong Han
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4188)


Nowadays, most documents are produced in digital format, in which they can be easily accessed and copied. Document copy detection is a very important tool for protecting the author’s copyright. We present PPChecker, a document copy detection system based on plagiarism pattern checking. PPChecker calculates the amount of data copied from the original document to the query document, based on linguistically motivated plagiarism patterns. Experiments performed on the CISI document collection show that PPChecker produces better decision information for document copy detection than existing systems.


Keywords: Document Copy Detection, Plagiarism Pattern





Copyright information

© Springer-Verlag Berlin Heidelberg 2006

Authors and Affiliations

  • NamOh Kang (1)
  • Alexander Gelbukh (2)
  • SangYong Han (1)

  1. School of Computer Science & Engineering, Chung-Ang University, South Korea
  2. National Polytechnic Institute, Mexico
