Skip to main content
Log in

Time-stamp based network evolution model for citation networks

  • Published:
Scientometrics Aims and scope Submit manuscript

Abstract

Citation score has become a very important metric to assess the quality of a publication in the current global ranking scenario. In this context, the study of citation networks gains importance as it helps in understanding the citation process as well as in analyzing citation trends in the research world. Citation networks are modeled as directed acyclic graphs in which publications of the authors are considered as nodes and citations between the papers form the links. In this paper, we propose an additive Time-Stamp based Network Evolution(TNE) model for citation networks, extending Price’s preferential attachment model by including the recency effect on the citation process without neglecting the impact of classical papers. We propose a more meaningful definition of clustering coefficient for citation networks in terms of ’citation triangles’. Further, the network simulated by the TNE model with best-fit parameters is compared with the real-world(DBLP) citation network. The results of various significance tests show that the simulated network matches very well with the DBLP citation network in terms of several network properties.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10

Similar content being viewed by others

References

  • Barabási, A.-L., & Albert, R. (1999). Emergence of scaling in random networks. Science, 286(5439), 509–512.

    Article  MathSciNet  MATH  Google Scholar 

  • Barabâsi AL, Jeong H, Néda Z, Ravasz E, Schubert A, Vicsek T. (2002). Evolution of the social network of scientific collaborations. Physica A: Statistical Mechanics and its Applications, 311(3–4), 590–614.

    Article  MathSciNet  MATH  Google Scholar 

  • Behfar, S. K., Turkina, E., Cohendet, P., & Burger-Helmchen, T. (2016). Directed networks’ different link formation mechanisms causing degree distribution distinction. Physica A: Statistical Mechanics and its Applications, 462, 479–491.

    Article  Google Scholar 

  • Chakraborty, T., Kumar, S., Goyal, P., Ganguly, N., & Mukherjee, A. (2015). On the categorization of scientific citation profiles in computer science. Communications of the ACM, 58(9), 82–90.

    Article  Google Scholar 

  • Clough, J. R., Gollings, J., Loach, T. V., & Evans, T. S. (2015). Transitive reduction of citation networks. Journal of Complex Networks, 3(2), 189–203.

    Article  MathSciNet  Google Scholar 

  • Dorogovtsev, S. N., & Mendes, J. F. F. (2000). Evolution of networks with aging of sites. Physical Review E, 62(2), 1842.

    Article  Google Scholar 

  • Fagiolo, G. (2007). Clustering in complex directed networks. Physical Review E, 76(2), 026107.

    Article  MathSciNet  Google Scholar 

  • Goldberg, S. R., Anthony, H., & Evans, T. S. (2015). Modelling citation networks. Scientometrics, 105(3), 1577–1604.

    Article  Google Scholar 

  • Hajra, K. B., & Sen, P. (2006). Modelling aging characteristics in citation networks. Physica A: Statistical Mechanics and its Applications, 368(2), 575–582.

    Article  Google Scholar 

  • Holme, P., & Kim, B. J. (2002). Growing scale-free networks with tunable clustering. Physical Review E, 65(2), 026107.

    Article  Google Scholar 

  • Hu, F., et al. (2021). The aging effect in evolving scientific citation networks. Scientometrics, 126(5), 4297–4309.

    Article  Google Scholar 

  • Jeong, H., Néda, Z., & Barabási, A.-L. (2003). Measuring preferential attachment in evolving networks. EPL (Europhysics Letters), 61(4), 567.

    Article  Google Scholar 

  • Karrer, B., & Newman, M. E. (2009). Random acyclic networks. Physical Review Letters, 102(12), 128701.

    Article  Google Scholar 

  • Keathley-Herring, H., et al. (2016). Assessing the maturity of a research area: Bibliometric review and proposed framework. Scientometrics, 109, 927–951.

    Article  Google Scholar 

  • Leskovec, J., Kleinberg, J., & Faloutsos, C. (2005). ACM (ed.) Graphs over time: densification laws, shrinking diameters and possible explanations. (ed. ACM) Proceedings of the eleventh ACM SIGKDD International Conference on Knowledge Discovery and Data mining, (pp. 177–187), ACM.

  • Mingers, J., & Leydesdorff, L. (2015). A review of theory and practice in scientometrics. European Journal of Operational Research, 246(1), 1–19.

    Article  MATH  Google Scholar 

  • Narbaev, T. (2022). A meta-analysis of the public-private partnership literature reviews: exploring the identity of the field. International Journal of Strategic Property Management, 26(4), 318–331.

    Article  Google Scholar 

  • Narbaev, T., & Amirbekova, D. (2021). Research productivity in emerging economies: Empirical evidence from kazakhstan. Publications, 9(4), 51.

    Article  Google Scholar 

  • Newman, M. E. (2001). Clustering and preferential attachment in growing networks. Physical Review E, 64(2), 025102.

    Article  Google Scholar 

  • Pi, X., Tang, L., & Chen, X. (2021). A directed weighted scale-free network model with an adaptive evolution mechanism. Physica A: Statistical Mechanics and its Applications, 572, 125897.

    Article  MathSciNet  MATH  Google Scholar 

  • Price, D. J. D. S. (1965). Networks of scientific papers: The pattern of bibliographic references indicates the nature of the scientific research front. Science, 149(3683), 510–515.

    Article  Google Scholar 

  • Price, D. D. S. (1976). A general theory of bibliometric and other cumulative advantage processes. Journal of the American Society for Information Science, 27(5), 292–306.

    Article  Google Scholar 

  • Ren, F. X., Shen, H. W., & Cheng, X. Q. (2012). Modeling the clustering in citation networks. Physica A: Statistical Mechanics and its Applications, 391(12), 3533–3539.

    Article  Google Scholar 

  • Ren, Z.-M. (2019). Age preference of metrics for identifying significant nodes in growing citation networks. Physica A: Statistical Mechanics and its Applications, 513, 325–332.

    Article  Google Scholar 

  • Rossetto, D. E., Bernardes, R. C., Borini, F. M., & Gattaz, C. C. (2018). Structure and evolution of innovation research in the last 60 years: Review and future trends in the field of business through the citations and co-citations analysis. Scientometrics, 115(3), 1329–1363.

    Article  Google Scholar 

  • Scholz, F. W., & Stephens, M. A. (1987). K-sample anderson-darling tests. Journal of the American Statistical Association, 82(399), 918–924.

    MathSciNet  Google Scholar 

  • Tang, J. et al. (2008). ACM (ed.) Arnetminer: extraction and mining of academic social networks. (ed.ACM) Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data mining, (pp. 990–998), ACM.

  • Vasiliauskaite, V. & Evans, T. S. (2018). Diversity from the topology of citation networks. ArXiv Preprint arXiv:1802.06015.

  • Wang, M., Yu, G., & Yu, D. (2009). Effect of the age of papers on the preferential attachment in citation networks. Physica A: Statistical Mechanics and its Applications, 388(19), 4273–4276.

    Article  Google Scholar 

  • Wu, Z. X., & Holme, P. (2009). Modeling scientific-citation patterns and other triangle-rich acyclic networks. Physical Review E, 80(3), 037101.

    Article  Google Scholar 

  • Zhu, H., Wang, X., & Zhu, J.-Y. (2003). Effect of aging on network structure. Physical Review E, 68(5), 056121.

    Article  Google Scholar 

Download references

Acknowledgements

We are grateful to the anonymous reviewers for their valuable comments and suggestions which helped in improving the quality of this manuscript. The first author acknowledges the financial assistance received from the University Grants Commission (UGC), Government of India in the form of a Junior Research Fellowship (JRF).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Monachary Kammari.

Ethics declarations

Conflict of interest

All authors certify that they have no affiliations with or involvement in any organization or entity with any financial interest or non-financial interest in the subject matter or materials discussed in this manuscript. The authors have no financial or proprietary interests in any material discussed in this article.

Appendix Time stamp effect: analytical proof

Appendix Time stamp effect: analytical proof

If the time stamp effect is removed from our model, it will be reduced to the Price’s preferential attachment model. This can be analytically shown as follows:

The master equation of our additive TNE model is given as follows:

$$\begin{aligned} \frac{\partial k_i}{\partial t} = \mu \left[ m.\pi _d(v_i) + m.\pi _t(v_i) + m.\frac{1}{N_0 + t}\right] \end{aligned}$$

Where \(\pi _d(v_i) = \frac{k_i}{\sum _j k_j}\) is preferential attachment based on in-degree and \(\pi _t(v_i) = \frac{t_i}{t+1}\) is preferential attachment based on time factor.

The resulting master equation after the removal of the timestamp from the TNE model is given below.

$$\begin{aligned} \frac{\partial k_i}{\partial t}= & {} \mu \left[ m.\pi _d(v_i) + m.\frac{1}{N_0 + t}\right] \\ \frac{\partial k_i}{\partial t}= & {} \mu \left[ m.\frac{k_i}{\sum _j k_j} + m.\frac{1}{N_0 + t} \right] \end{aligned}$$

where \(\pi _d(v_i) = \frac{k_i}{\sum _j k_j}\)

$$\begin{aligned} \frac{\partial k_i}{\partial t}= & {} \mu \left[ m.\frac{k_i}{m_0+m.t} + m.\frac{1}{N_0 + t}\right] \\ \frac{\partial k_i}{\partial t} -\mu m.\frac{1}{N_0 + t}= & {} \mu m.\frac{k_i}{m_0+m.t}\\ \frac{1}{k_i}\frac{\partial k_i}{\partial t}= & {} \frac{\mu m}{m_0+m.t} + \mu m.\frac{1}{k_i(N_0 + t)}\\ \frac{1}{k_i}\partial k_i= & {} \frac{\mu m}{m_0+m.t}.\partial t + \mu m.\frac{1}{k_i(N_0 + t)}.\partial t \end{aligned}$$

Integrating both sides in the range \((t_i, t)\) where \(t_i\) is the time at which paper i gets the initial citation. Therefore \(k_i(t_i) = b\) where b is a constant.

$$\begin{aligned} \int _{t_i} ^t \frac{1}{k_i}.\partial k_i= & {} \int _{t_i} ^t \frac{\mu m}{m_0+m.t}.\partial t + \int _{t_i} ^t\mu m.\frac{1}{k_i(N_0 + t)}.\partial t\\ \log {\frac{k_i(t)}{k_i(t_i)}}= & {} \frac{\mu m}{m}\log {\left( \frac{t}{t_i}\right) }+\frac{\mu m}{k_i}\log {\left( \frac{N_0+t}{N_0+t_i}\right) }\\ {\frac{k_i(t)}{b}}= & {} {\left( \frac{t}{t_i}\right) }^\frac{\mu m}{m}+{\left( \frac{N_0+t}{N_0+t_i}\right) }^\frac{\mu m}{k_i}\\ \frac{k_i(t)}{b}= & {} {\left( \frac{t}{t_i}\right) }^{\mu +\frac{\mu m }{k_i}} \end{aligned}$$

Since average \(k_i = m\)

$$\begin{aligned} {\frac{k_i(t)}{b}}= & {} {\left( \frac{t}{t_i}\right) }^{2\mu }\\ k_i(t)= & {} b. {\left( \frac{t}{t_i}\right) }^{2\mu } \end{aligned}$$

Highly connected nodes are the result of the phenomena i.e, older nodes with smaller \(t_i\) values increase their in-degree at expense of recent nodes with larger \(t_i\). This helps us calculate \(p(k_i(t) < k)\) The probability of selecting a node at time t whose degree is less than k.

$$\begin{aligned} p(k_i(t)< k)&= p\left( b. \left( \frac{t}{t_i}\right) ^{2\mu }<k\right) \\&= p\left( t_i>\left( t.b^\frac{-1}{2\mu }.{k}^\frac{-1}{2\mu })\right) \right) \\&= 1- p\left( t_i<\left( t.b^\frac{-1}{w\mu }.{k}^\frac{-1}{2\mu }\right) \right) \end{aligned}$$

Calculating in-degree distribution requires randomly selecting a node at time t with probability \(\frac{1}{N_0 + t}\).

$$\begin{aligned} p(k_i(t) < k) = 1- \frac{t b^\frac{-1}{2\mu }}{N_0 +t}.{k}^\frac{-1}{2\mu } \end{aligned}$$

In-degree distribution of nodes having degree k at time t is given by:

$$\begin{aligned} P(k,t) = \frac{\partial }{\partial k}(p(k_i(t) < k)) \end{aligned}$$

Hence

$$\begin{aligned} P(k,t)= & {} \frac{\partial }{\partial k}\left[ 1- \frac{t b^\frac{-1}{2\mu }}{N_0 +t}.{k}^\frac{-1}{2\mu }\right] \\ P(k,t)= & {} \frac{t b^\frac{-1}{2\mu }}{2\mu (N_0+t)} k ^{-\left( \frac{1}{2\mu }+1\right) } \end{aligned}$$

while \(t\rightarrow \infty\)

$$\begin{aligned} P(k)= & {} \frac{b^\frac{-1}{2\mu }}{2\mu } k ^{-\left( \frac{1}{2\mu }+1\right) }\\ P(k)\sim & {} k^{-\left( \frac{1}{2\mu }+1\right) } \end{aligned}$$

Since \(0<\mu <1\) and \(\frac{1}{2\mu }+1 >=1\) which means when the time stamp effect is removed from our TNE model, it reduces to Price’s classical PA model.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Kammari, M., S, D.B. Time-stamp based network evolution model for citation networks. Scientometrics 128, 3723–3741 (2023). https://doi.org/10.1007/s11192-023-04704-7

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11192-023-04704-7

Keywords

Navigation