Skip to main content

Comparing Text-Matching Software Systems Using the Document Set in the Latvian Language

Abstract

There are many internationally developed text-matching software systems that help successfully identify potentially plagiarized content in English texts using both their internal databases and web resources. However, many other languages are not so widely spread but they are used daily to communicate, conduct research and acquire education. Each language has its peculiarities, so, in the context of finding content similarities, it is necessary to determine what systems are more suitable for a document set written in a specific language. The research focuses on testing the existent text-matching software systems on a set of documents prepared in the Latvian language. The corpus includes documents containing verbatim plagiarism, paraphrasing, translation plagiarism and original text to test both false positive and false negative cases. In total, 16 different text-matching software systems are compared on the plagiarism coverage using the prepared document corpus. The research presented is a part of an international initiative “Testing of Support Tools for Plagiarism Detection (TeSToP)” established under the European Network for Academic Integrity.

This is a preview of subscription content, access via your institution.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5

References

Download references

Acknowledgements

The authors of the paper express their gratitude to Gints Jēkabsons, who tested the VDPKS system developed in Latvia.

Comparing Text-Matching Software Systems Using the Document Set in the Latvian Language.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Laima Kamzola.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Kamzola, L., Anohina-Naumeca, A. Comparing Text-Matching Software Systems Using the Document Set in the Latvian Language. J Acad Ethics 18, 129–141 (2020). https://doi.org/10.1007/s10805-019-09355-z

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10805-019-09355-z

Keywords

  • Plagiarism detection
  • Academic integrity
  • Text-matching software
  • Plagiarism coverage