Abstract
Citation score has become a very important metric to assess the quality of a publication in the current global ranking scenario. In this context, the study of citation networks gains importance as it helps in understanding the citation process as well as in analyzing citation trends in the research world. Citation networks are modeled as directed acyclic graphs in which publications of the authors are considered as nodes and citations between the papers form the links. In this paper, we propose an additive Time-Stamp based Network Evolution(TNE) model for citation networks, extending Price’s preferential attachment model by including the recency effect on the citation process without neglecting the impact of classical papers. We propose a more meaningful definition of clustering coefficient for citation networks in terms of ’citation triangles’. Further, the network simulated by the TNE model with best-fit parameters is compared with the real-world(DBLP) citation network. The results of various significance tests show that the simulated network matches very well with the DBLP citation network in terms of several network properties.
Similar content being viewed by others
References
Barabási, A.-L., & Albert, R. (1999). Emergence of scaling in random networks. Science, 286(5439), 509–512.
Barabâsi AL, Jeong H, Néda Z, Ravasz E, Schubert A, Vicsek T. (2002). Evolution of the social network of scientific collaborations. Physica A: Statistical Mechanics and its Applications, 311(3–4), 590–614.
Behfar, S. K., Turkina, E., Cohendet, P., & Burger-Helmchen, T. (2016). Directed networks’ different link formation mechanisms causing degree distribution distinction. Physica A: Statistical Mechanics and its Applications, 462, 479–491.
Chakraborty, T., Kumar, S., Goyal, P., Ganguly, N., & Mukherjee, A. (2015). On the categorization of scientific citation profiles in computer science. Communications of the ACM, 58(9), 82–90.
Clough, J. R., Gollings, J., Loach, T. V., & Evans, T. S. (2015). Transitive reduction of citation networks. Journal of Complex Networks, 3(2), 189–203.
Dorogovtsev, S. N., & Mendes, J. F. F. (2000). Evolution of networks with aging of sites. Physical Review E, 62(2), 1842.
Fagiolo, G. (2007). Clustering in complex directed networks. Physical Review E, 76(2), 026107.
Goldberg, S. R., Anthony, H., & Evans, T. S. (2015). Modelling citation networks. Scientometrics, 105(3), 1577–1604.
Hajra, K. B., & Sen, P. (2006). Modelling aging characteristics in citation networks. Physica A: Statistical Mechanics and its Applications, 368(2), 575–582.
Holme, P., & Kim, B. J. (2002). Growing scale-free networks with tunable clustering. Physical Review E, 65(2), 026107.
Hu, F., et al. (2021). The aging effect in evolving scientific citation networks. Scientometrics, 126(5), 4297–4309.
Jeong, H., Néda, Z., & Barabási, A.-L. (2003). Measuring preferential attachment in evolving networks. EPL (Europhysics Letters), 61(4), 567.
Karrer, B., & Newman, M. E. (2009). Random acyclic networks. Physical Review Letters, 102(12), 128701.
Keathley-Herring, H., et al. (2016). Assessing the maturity of a research area: Bibliometric review and proposed framework. Scientometrics, 109, 927–951.
Leskovec, J., Kleinberg, J., & Faloutsos, C. (2005). ACM (ed.) Graphs over time: densification laws, shrinking diameters and possible explanations. (ed. ACM) Proceedings of the eleventh ACM SIGKDD International Conference on Knowledge Discovery and Data mining, (pp. 177–187), ACM.
Mingers, J., & Leydesdorff, L. (2015). A review of theory and practice in scientometrics. European Journal of Operational Research, 246(1), 1–19.
Narbaev, T. (2022). A meta-analysis of the public-private partnership literature reviews: exploring the identity of the field. International Journal of Strategic Property Management, 26(4), 318–331.
Narbaev, T., & Amirbekova, D. (2021). Research productivity in emerging economies: Empirical evidence from kazakhstan. Publications, 9(4), 51.
Newman, M. E. (2001). Clustering and preferential attachment in growing networks. Physical Review E, 64(2), 025102.
Pi, X., Tang, L., & Chen, X. (2021). A directed weighted scale-free network model with an adaptive evolution mechanism. Physica A: Statistical Mechanics and its Applications, 572, 125897.
Price, D. J. D. S. (1965). Networks of scientific papers: The pattern of bibliographic references indicates the nature of the scientific research front. Science, 149(3683), 510–515.
Price, D. D. S. (1976). A general theory of bibliometric and other cumulative advantage processes. Journal of the American Society for Information Science, 27(5), 292–306.
Ren, F. X., Shen, H. W., & Cheng, X. Q. (2012). Modeling the clustering in citation networks. Physica A: Statistical Mechanics and its Applications, 391(12), 3533–3539.
Ren, Z.-M. (2019). Age preference of metrics for identifying significant nodes in growing citation networks. Physica A: Statistical Mechanics and its Applications, 513, 325–332.
Rossetto, D. E., Bernardes, R. C., Borini, F. M., & Gattaz, C. C. (2018). Structure and evolution of innovation research in the last 60 years: Review and future trends in the field of business through the citations and co-citations analysis. Scientometrics, 115(3), 1329–1363.
Scholz, F. W., & Stephens, M. A. (1987). K-sample anderson-darling tests. Journal of the American Statistical Association, 82(399), 918–924.
Tang, J. et al. (2008). ACM (ed.) Arnetminer: extraction and mining of academic social networks. (ed.ACM) Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data mining, (pp. 990–998), ACM.
Vasiliauskaite, V. & Evans, T. S. (2018). Diversity from the topology of citation networks. ArXiv Preprint arXiv:1802.06015.
Wang, M., Yu, G., & Yu, D. (2009). Effect of the age of papers on the preferential attachment in citation networks. Physica A: Statistical Mechanics and its Applications, 388(19), 4273–4276.
Wu, Z. X., & Holme, P. (2009). Modeling scientific-citation patterns and other triangle-rich acyclic networks. Physical Review E, 80(3), 037101.
Zhu, H., Wang, X., & Zhu, J.-Y. (2003). Effect of aging on network structure. Physical Review E, 68(5), 056121.
Acknowledgements
We are grateful to the anonymous reviewers for their valuable comments and suggestions which helped in improving the quality of this manuscript. The first author acknowledges the financial assistance received from the University Grants Commission (UGC), Government of India in the form of a Junior Research Fellowship (JRF).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
All authors certify that they have no affiliations with or involvement in any organization or entity with any financial interest or non-financial interest in the subject matter or materials discussed in this manuscript. The authors have no financial or proprietary interests in any material discussed in this article.
Appendix Time stamp effect: analytical proof
Appendix Time stamp effect: analytical proof
If the time stamp effect is removed from our model, it will be reduced to the Price’s preferential attachment model. This can be analytically shown as follows:
The master equation of our additive TNE model is given as follows:
Where \(\pi _d(v_i) = \frac{k_i}{\sum _j k_j}\) is preferential attachment based on in-degree and \(\pi _t(v_i) = \frac{t_i}{t+1}\) is preferential attachment based on time factor.
The resulting master equation after the removal of the timestamp from the TNE model is given below.
where \(\pi _d(v_i) = \frac{k_i}{\sum _j k_j}\)
Integrating both sides in the range \((t_i, t)\) where \(t_i\) is the time at which paper i gets the initial citation. Therefore \(k_i(t_i) = b\) where b is a constant.
Since average \(k_i = m\)
Highly connected nodes are the result of the phenomena i.e, older nodes with smaller \(t_i\) values increase their in-degree at expense of recent nodes with larger \(t_i\). This helps us calculate \(p(k_i(t) < k)\) The probability of selecting a node at time t whose degree is less than k.
Calculating in-degree distribution requires randomly selecting a node at time t with probability \(\frac{1}{N_0 + t}\).
In-degree distribution of nodes having degree k at time t is given by:
Hence
while \(t\rightarrow \infty\)
Since \(0<\mu <1\) and \(\frac{1}{2\mu }+1 >=1\) which means when the time stamp effect is removed from our TNE model, it reduces to Price’s classical PA model.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Kammari, M., S, D.B. Time-stamp based network evolution model for citation networks. Scientometrics 128, 3723–3741 (2023). https://doi.org/10.1007/s11192-023-04704-7
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11192-023-04704-7