Analysing Author Self-citations in Computer Science Publications
In scientific papers, citations refer to relevant previous work in order to underline the current line of argumentation, compare to other work and/or avoid repetition in writing. Self-citations, e.g. authors citing own previous work might have the same motivation but have also gained negative attention w.r.t. unjustified improvement of scientific performance indicators. Previous studies on self-citations do not provide a detailed analysis in the domain of computer science. In this work, we analyse the prevalence of self-citations in the DBLP, a digital library for computer science. We find, that approx. 10% of all citations are self-citations, while the rates vary with year after publication and the position of the author in the list as well as with the gender of the lead author. Further, we find that C-ranked venues have the highest incoming self-citation rate, while the outgoing rate is stable across all ranks.
KeywordsCitations Self-citations Analysis DBLP CORE
We would like to thank Moritz Grünbauer for his preliminary analysis and help with constructing the queries for the graph database.
- 2.Alonso, S., Cabrerizo, F., Herrera-Viedma, E., Herrera, F.: h-index: a review focused in its variants, computation and standardization for different scientific fields. J. Inf. 3(4), 273–289 (2009)Google Scholar
- 8.Ghiasi, G., Larivère, V., Sugimoto, C.R.: Gender differences in synchronous and diachronous self-citations. In: Proceedings International Conference on Science and Technology Indicaors (2016)Google Scholar
- 12.King, M.M., Bergstrom, C.T., Correll, S.J., Jacquet, J., West, J.D.: Men set their own cites high: gender and self-citation across fields and over time. Socius 3 (2017)Google Scholar
- 16.Lee, D., On, B.W., Kang, J., Park, S.: Effective and scalable solutions for mixed and split citation problems in digital libraries. In: Proceedings of the International Workshop on Information Quality in Information Systems, pp. 69–76. ACM, New York (2005)Google Scholar
- 19.Müller, M.C., Reitz, F., Roy, N.: Data sets for author name disambiguation: an empirical analysis and a new resource. Scientometrics (2017). https://doi.org/10.1007/s11192-017-2363-5
- 22.Thijs, B., Glänzel, W.: The influence of author self-citations on bibliometric meso-indicators. The case of European universities. Scientometrics 66, 71–80 (2006)Google Scholar