Citation Networks

  • Filippo Radicchi
  • Santo Fortunato
  • Alessandro Vespignani
Part of the Understanding Complex Systems book series (UCS)

Abstract

Bibliographic databases contain a huge amount of information on the dissemination of scientific knowledge and the relationships between papers, authors, and scientific work. Large-scale citation networks can be generated from these databases in order to provide a systems-level perspective on the processes at the root of the spreading of ideas, theories, and results in science. Citation networks are therefore one of the main proxies for our understanding of knowledge dynamics as well as invaluable systems for the quantitative analysis of the impact of specific scientific contributions, the emergence of technical and scientific areas, and the ranking of journals, institutions, and scientists. This chapter reviews recent developments made in the study of citation networks, ranging from empirical analyses of real systems and mathematical models of them, to the study of dynamic processes taking place in them and their potential applications. Furthermore, studying citation datasets with the tools of network theory opens new avenues towards a quantitative understanding of the dynamics of popularity with respect to papers, journals, and scientists, possibly leading to novel measures of impact and ranking.

Keywords

  • Citation Count
  • Community Detection
  • Preferential Attachment
  • Citation Network
  • Bibliographic Database

These keywords were added by machine and not by the authors.

Copyright information

© Springer-Verlag Berlin Heidelberg 2012

Authors and Affiliations

  • Filippo Radicchi (1)
  • Santo Fortunato (2, 3)
  • Alessandro Vespignani (4)

  1. Howard Hughes Medical Institute (HHMI), Northwestern University, Evanston, USA
  2. Department of Biomedical Engineering and Computational Science, School of Science, Aalto University, Espoo, Finland
  3. Complex Networks and Systems, ISI Foundation, Torino, Italy
  4. Center for Complex Networks and Systems Research (CNetS), School of Informatics and Computing, Indiana University, Bloomington, USA
