Skip to main content

A Test Collection to Evaluate Plagiarism by Missing or Incorrect References

  • Conference paper

Part of the Lecture Notes in Computer Science book series (LNISA,volume 7488)

Abstract

In recent years, several methods and tools been developed together with test collections to aid in plagiarism detection. However, both methods and collections have focused on content analysis, overlooking citation analysis. In this paper, we aim at filling this gap and present a test collection with cases of plagiarism by missing and incorrect references. The collection contains automatically generated academic papers in which passages from other documents have been inserted. Such passages were either: adequately referenced (i.e., not plagiarized), not referenced, or incorrectly referenced. Annotation files identifying each passage enable the evaluation of plagiarism detection systems.

Keywords

  • Citation Analysis
  • Source Document
  • Test Collection
  • Bibliographic Reference
  • Academic Paper

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

This is a preview of subscription content, access via your institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (Canada)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   54.99
Price excludes VAT (Canada)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   72.00
Price excludes VAT (Canada)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Brin, S., Davis, J., Garcia-Molina, H.: Copy detection mechanisms for digital documents. In: SIGMOD, pp. 398–409 (1995)

    Google Scholar 

  2. PAN: Uncovering plagiarism, authorship, and social software misuse, http://pan.webis.de/ (accessed April 04, 2012)

  3. CL!TR: Cross-language !ndian text reuse, http://users.dsic.upv.es/grupos/nle/fire-workshop-clitr.html (accessed June 21, 2012)

  4. Corezola Pereira, R., Moreira, V.P., Galante, R.: A New Approach for Cross-Language Plagiarism Analysis. In: Agosti, M., Ferro, N., Peters, C., de Rijke, M., Smeaton, A. (eds.) CLEF 2010. LNCS, vol. 6360, pp. 15–26. Springer, Heidelberg (2010)

    CrossRef  Google Scholar 

  5. SCIgen: An automatic CS paper generator, http://pdos.csail.mit.edu/scigen/ (accessed April 04, 2012)

  6. DBLP, http://www.informatik.uni-trier.de/~ley/db/ (accessed April 04, 2012)

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and Permissions

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

de L. Pertile, S., Moreira, V.P. (2012). A Test Collection to Evaluate Plagiarism by Missing or Incorrect References. In: Catarci, T., Forner, P., Hiemstra, D., Peñas, A., Santucci, G. (eds) Information Access Evaluation. Multilinguality, Multimodality, and Visual Analytics. CLEF 2012. Lecture Notes in Computer Science, vol 7488. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-33247-0_17

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-33247-0_17

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-33246-3

  • Online ISBN: 978-3-642-33247-0

  • eBook Packages: Computer ScienceComputer Science (R0)