Abstract
In bibliometric and scientometric research, the quantitative assessment of scientific impact has boomed over the past few decades. Citations, being playing a major role in enhancing the impact of researchers, have become a very significant part of a plethora of new techniques for measuring scientific impact. Self-citations, though can be used genuinely to credit someone’s own work, can play a significant role in artificial manipulation of scientific impact. In this research, we study the impact of self-citations on enhancing the scientific impact of an author using a dataset retrieved from AMiner ranging from 1936 to 2014 from the computer science domain. We investigated the relations among trends of self-citation and their influence on scientific impact. We also studied its influence on ranking metrics including author impact factor and H-Index. By analyzing self-citations over time, we discover five basic self-citation trends, which are early, middle, later, multi and none. Distinctly different patterns were observed in self-citations trends. The results show that self-citations, if totally removed from total received citations, negatively influence the AIF and H-Index values and hence can be used to artificially boost the scientific impact. We used regression-based prediction models to predict the influence of self-citations on future H-Index. Classifiers including Logistic Regression, Naïve Bayes and K-NN were used with an accuracy of 93%, 73% and 60% respectively.
Similar content being viewed by others
References
Aksnes, D. W. (2003). A macro study of self-citation. Scientometrics,56, 235–246.
Amjad, T., & Ali, A. (2019). Uncovering diffusion trends in computer science and physics publications. Library Hi Tech,37, 794–810.
Amjad, T., & Daud, A. (2017). Indexing of authors according to their domain of expertise. Malaysian Journal of Library and Information Science,22, 69–82.
Amjad, T., Daud, A., Akram, A., & Muhammed, F. (2016). Impact of mutual influence while ranking authors in a co-authorship network. Kuwait Journal of Science,43, 101–109.
Amjad, T., Daud, A., & Aljohani, N. R. (2018). Ranking authors in academic social networks: A survey. Library Hi Tech,36, 97–128.
Amjad, T., Daud, A., Che, D., & Akram, A. (2015). MuICE: Mutual influence and citation exclusivity author rank. Information Processing and Management,52, 374–386.
Bu, Y., Ding, Y., Xu, J., Liang, X., Gao, G., & Zhao, Y. (2018). Understanding success through the diversity of collaborators and the milestone of career. Journal of the Association for Information Science and Technology,69, 87–97.
Chen, P., Xie, H., Maslov, S., & Redner, S. (2007). Finding scientific gems with Google’s PageRank algorithm. Journal of Informetrics,1, 8–15.
Debackere, K., & Thijs, B. (2013). A concise review on the role of author self-citations in information science, bibliometrics and science policy. In Proceedings of the annual conference of CAIS/Actes Du Congrès Annuel de l’ACSI.
Ding, Y. (2011). Topic-based PageRank on author cocitation networks. Journal of the American Society for Information Science and Technology,62, 449–466.
Dunaiski, M., Geldenhuys, J., & Visser, W. (2018a). Author ranking evaluation at scale. Journal of Informetrics,12, 679–702.
Dunaiski, M., Geldenhuys, J., & Visser, W. (2018b). How to evaluate rankings of academic entities using test data. Journal of Informetrics,12, 631–655.
Ferrara, E., & Romero, A. E. (2013). Scientific impact evaluation and the effect of self-citations: Mitigating the bias by discounting the h-index. Journal of the American Society for Information Science and Technology,64, 2332–2339.
Fowler, J., & Aksnes, D. (2007). Does self-citation pay? Scientometrics,72, 427–437.
Friedman, N., Geiger, D., & Goldszmidt, M. (1997). Bayesian network classifiers. Machine Learning,29, 131–163.
Gami, A. S., Montori, V. M., Wilczynski, N. L., & Haynes, R. B. (2004). Author self-citation in the diabetes literature. Canadian Medical Association Journal,170, 1925–1927.
Glänzel, W., Debackere, K., Thijs, B., & Schubert, A. (2006). A concise review on the role of author self-citations in information science, bibliometrics and science policy. Scientometrics,67, 263–277.
Glänzel, W., & Thijs, B. (2004). Does co-authorship inflate the share of self-citations? Scientometrics,61, 395–404.
González-Sala, F., Osca-Lluch, J., & Haba-Osca, J. (2019). Are journal and author self-citations a visibility strategy? Scientometrics,119, 1345–1364.
Grégoire, G. (2014). Multiple linear regression. European Astronomical Society Publication Series,66, 45–72.
Hendrix, D. (2009). Institutional self-citation rates: A three year study of universities in the United States. Scientometrics,81, 321–331.
Hirsch, J. E. (2005). An index to quantify an individual’s scientific research output. Proceedings of the National academy of Sciences USA,102, 16569–16572.
Hosmer, D. W., & Lemeshow, S. (1998). Applied logistic regression. New York: Wiley.
Huang, M.-H., & Cathy Lin, W.-Y. (2012). The influence of journal self-citations on journal impact factor and immediacy index. Online Information Review,36, 639–654.
Keller, J. M., Gray, M. R., & Givens, J. A. (1985). A fuzzy k-nearest neighbor algorithm. IEEE Transactions on Systems, Man, and Cybernetics,4, 580–585.
Kim, M., Newth, D., & Christen, P. (2014). Uncovering diffusion in academic publications using model-driven and model-free approaches. In 2014 IEEE fourth international conference on big data and cloud computing (BdCloud) (pp. 564–571). IEEE.
Pan, R. K., & Fortunato, S. (2014). Author impact factor: Tracking the dynamics of individual scientific impact. Scientific Reports,4, 4880.
Pandita, R., & Singh, S. (2015). Impact of self-citations on impact factor: A study across disciplines, countries and continents. Journal of Information Science Theory and Practice,3, 42–57.
Pandita, R., & Singh, S. (2017). Self-citations, a trend prevalent across subject disciplines at the global level: An overview. Collection Building,36, 115–126.
Reza Davarpanah, M., & Amel, F. (2009). Author self-citation pattern in science. Library Review,58, 301–309.
Shah, T. A., Gul, S., & Gaur, R. C. (2015). Authors self-citation behaviour in the field of Library and Information Science. Aslib Journal of Information Management,67, 458–468.
Tang, J., Zhang, J., Yao, L., Li, J., Zhang, L., & Su, Z. (2008). Arnetminer: Extraction and mining of academic social networks. ACM (pp. 990–998).
Wolfgang, G., Bart, T., & Balázs, S. (2004). A bibliometric approach to the role of author self-citations in scientific communication. Scientometrics,59, 63–77.
Yan, R., Huang, C., Tang, J., Zhang, Y., & Li, X. (2012). To better stand on the shoulder of giants. In Proceedings of the 12th ACM/IEEE-CS joint conference on digital libraries (pp. 51–60). ACM.
Zhao, F., Zhang, Y., Lu, J., & Shai, O. (2019). Measuring academic influence using heterogeneous author-citation networks. Scientometrics,118, 1119–1140.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Amjad, T., Rehmat, Y., Daud, A. et al. Scientific impact of an author and role of self-citations. Scientometrics 122, 915–932 (2020). https://doi.org/10.1007/s11192-019-03334-2
Received:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11192-019-03334-2