Skip to main content
Log in

A distributed hypergraph model for simulating the evolution of large coauthorship networks

  • Published:
Scientometrics Aims and scope Submit manuscript

Abstract

The coauthorship in a paper set can be expressed by a hypergraph, namely a system with heterogeneously multinary relationship. The coauthorship network of that paper set is the simple graph extracted from that hypergraph. We designed a distributed hypergraph model to simulate the dynamics of large coauthorship networks in a full-scale manner. Its assembly mechanism of hyperedges is driven by Lotka’s law and a cooperative game that maximizes a benefit-cost ratio for coauthoring a paper. The model is built on a circle to express the game, expressing the cost by the distance between nodes. The benefit of coauthoring with a productive author or one with many coauthors is expressed by the cumulative degree or hyperdegree of nodes. The model successfully simulates the multimodal features emerged in the evolutions of coauthorship patterns, the clustering of nodes, the degree assortativity of linked nodes, degree and hyperdegree distributions. This model has the potential to be a null model for studying the complexity of the large social networks arising from other specific collaboration behaviors.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15

Similar content being viewed by others

Notes

  1. https://www.dblp.org.

  2. http://www.webofknowledge.com.

References

  • Aldieri, L., Kotsemir, M., & Vinci, C. P. (2018). The impact of research collaboration on academic performance: An empirical analysis for some European countries. Socio-Economic Planning Sciences, 62, 13–30.

    Google Scholar 

  • Barabási, A. L., Jeong, H., Néda, Z., Ravasz, E., Schubert, A., & Vicsek, T. (2002). Evolution of the social network of scientific collaborations. Physica A: Statistical Mechanics and its Applications, 311(3–4), 590–614.

    MathSciNet  MATH  Google Scholar 

  • Börner, K., Maru, J. T., & Goldstone, R. L. (2004). The simultaneous evolution of author and paper networks. Proceedings of the National Academy of Sciences, 101(suppl 1), 5266–5273.

    Google Scholar 

  • Burt, R. S. (2004). Structural holes and good ideas. American Journal of Sociology, 110(2), 349–399.

    Google Scholar 

  • Catanzaro, M., Caldarelli, G., & Pietronero, L. (2004). Assortative model for social networks. Physical Review E, 70, 037101.

    Google Scholar 

  • Committee on the Science of Team Science. (2015). Enhancing the Effectiveness of Team Science. National Academies Press.

  • Corrêa, J. R. E. A., Silva, F. N., Costa, L. D. F., & Amancio, D. R. (2017). Patterns of authors contribution in scientific manuscripts. Journal of Informetrics, 11(2), 498–510.

    Google Scholar 

  • Ductor, L. (2015). Does co-authorship lead to higher academic productivity? Oxford Bulletin of Economics 530 and Statistics, 77(3), 385–407.

    Google Scholar 

  • Ferligoj, A., Kronegger, L., Mali, F., Snijders, T. A., & Doreian, P. (2015). Scientific collaboration dynamics in a national scientific system. Scientometrics, 104(3), 985–1012.

    Google Scholar 

  • Glänzel, W. (2014). Analysis of co-authorship patterns at the individual level. Transinformacao, 26, 229–238.

    Google Scholar 

  • Glänzel, W., & Schubert, A. (2004). Analysing scientific networks through co-authorship. In H. F. Moed, W. Glänzel, U. Schmoch (Eds),Handbook of Quanitative Science and Technology Research (pp. 257–276). Springer.

  • Glänzel, W., Schubert, A., & Czerwon, H. J. (1999). A bibliometric analysis of international scientific cooperation of the European Union (1985–1995). Scientometrics, 45(2), 185–202.

    Google Scholar 

  • Gomez, I., Fernández, M. T., & Sebastian, J. (1999). Analysis of the structure of international scientific cooperation networks through bibliometric indicators. Scientometrics, 44(3), 441–457.

    Google Scholar 

  • Guimerá, R., Uzzi, B., Spiro, J., & Amaral, L. A. N. (2005). Team assembly mechanisms determine collaboration network structure and team performance. Science, 308, 697–702.

    Google Scholar 

  • Hall, K. L., Stokols, D., Stipelman, B. A., Vogel, A. L., Feng, A., Masimore, B., et al. (2012). Assessing the value of team science: A study comparing center-and investigator-initiated grants. American Journal of Preventive Medicine, 42(2), 157–163.

    Google Scholar 

  • Hoekman, J., Frenken, K., & Tijssen, R. J. (2010). Research collaboration at a distance: Changing spatial patterns of scientific collaboration within Europe. Research Policy, 39(5), 662–673.

    Google Scholar 

  • Hunter, L., & Leahey, E. (2008). Collaborative research in sociology: Trends and contributing factors. The American Sociologist, 39, 290–306.

    Google Scholar 

  • Katz, J. S. (1994). Geographical proximity and scientific collaboration. Scientometrics, 31(1), 31–43.

    Google Scholar 

  • Khor, K. A., & Yu, L. G. (2016). Influence of international co-authorship on the research citation impact of young universities. Scientometrics, 107(3), 1095–1110.

    Google Scholar 

  • Kshemkalyani, A. D., & Singhal, M. (2008). Distributed computing: principles, algorithms, and systems. Cambridge University Press.

  • Leclerc, M., & Gagné, J. (1994). International scientific cooperation: The continentalization of science. Scientometrics, 31(3), 261–292.

    Google Scholar 

  • Lee, S., & Bozeman, B. (2005). The impact of research collaboration on scientific productivity. Social Studies of Science, 35, 673–702.

    Google Scholar 

  • Lehman, H. C. (2017).Age and achievement. (Vol. 4970). Princeton University Press.

  • Li, F., Miao, Y., & Yang, C. (2015). How do alumni faculty behave in research collaboration? An analysis of Chang Jiang Scholars in China. Research Policy, 44(2), 438–450.

    Google Scholar 

  • Lotka, A. J. (1926). The frequency distribution of scientific productivity. Journal of the Washington Academy of Sciences, 16(12), 317–323.

    Google Scholar 

  • Lu, C., Zhang, Y., Ahn, Y. Y., Ding, Y., Zhang, C., & Ma, D. (2019). Co-contributorship network and division of labor in individual scientific collaborations. Journal of the Association for Information Science and 565 Technology. https://doi.org/10.1002/asi.24321.

    Article  Google Scholar 

  • Mali, F., Kronegger, L., Doreian, P., & Ferligoj, A. (2012). Dynamic scientific coauthorship networks. In A. Scharnhorst, K. Börner, & P. V. D. Besselaar (Eds.), Models of science dynamics (pp. 195–232). Springer.

  • Milojević, S. (2014). Principles of scientific research team formation and evolution. Proceedings of the National Academy of Science, 111(11), 3984–3989.

    Google Scholar 

  • Moody, J. (2004). The strucutre of a social science collaboration network: Disciplinery cohesion form 1963 to 1999. American Sociological Review, 69(2), 213–238.

    Google Scholar 

  • Narin, F., Stevens, K., & Whitlow, E. S. (1991). Scientific co-operation in Europe and the citation of multinationally authored papers. Scientometrics, 21(3), 313–323.

    Google Scholar 

  • Newman, M. (2001). Scientific collaboration networks. II. shortest paths, weighted networks, and centrality. Physical Review E, 64, 016132.

    Google Scholar 

  • Newman, M. (2001). Scientific collaboration networks. I. network construction and fundamental results. Physical Review E, 64, 016131.

    Google Scholar 

  • Newman, M. (2001). The structure of scientific collaboration networks. Proceedings of the National Academy of Sciences, 98, 404–409.

    MathSciNet  MATH  Google Scholar 

  • Newman, M. E. (2001). Clustering and preferential attachment in growing networks. Physical Review E, 64(2), 025102.

    Google Scholar 

  • Newman, M. (2002). Assortative mixing in networks. Physical Review Letters, 89, 208701.

    Google Scholar 

  • Newman, M. (2004). Coauthorship networks and patterns of scientific collaboration. Proceedings of the National Academy of Sciences, 101, 5200–5205.

    Google Scholar 

  • Nowak, M. A. (2006). Five rules for the evolution of cooperation. Science, 314(5805), 1560–3.

    Google Scholar 

  • Pennisi, E. (2005). How did cooperative behavior evolve? Science, 309(5731), 93–93.

    Google Scholar 

  • Perc, C. (2010). Growth and structure of Slovenia’s scientific collaboration network. Journal of Informetrics, 4, 475–482.

    Google Scholar 

  • Perc, M. (2014). The Matthew effect in empirical data. Journal of the Royal Society Interface, 11, 20140378.

    Google Scholar 

  • Perc, M., & Szolnoki, A. (2008). Social diversity and promotion of cooperation in the spatial prisoner’s dilemma game. Physical Review E, 77(1), 011904.

    Google Scholar 

  • Perc, M., & Szolnoki, A. (2010). Coevolutionary games—a mini review. BioSystems, 99(2), 109–125.

    Google Scholar 

  • Ponomariov, B., & Boardman, C. (2016). What is co-authorship? Scientometrics, 109(3), 1939–1963.

    Google Scholar 

  • Price, D. J. S. (1963). Little science, big science. Columbia University Press.

  • Qi, M., Zeng, A., Li, M., Fan, Y., & Di, Z. (2017). Standing on the shoulders of giants: The effect of outstanding scientists on young collaborators’ careers. Scientometrics, 111(3), 1839–1850.

    Google Scholar 

  • Russell, J. M. (1995). The increasing role of international cooperation in science and technology research in Mexico. Scientometrics, 34(1), 45–61.

    Google Scholar 

  • Santos, F. C., & Pacheco, J. M. (2005). Scale-free networks provide a unifying framework for the emergence of cooperation. Physical Review Letter, 95(9), 098104.

    Google Scholar 

  • Simonton, D. K. (1984). Creative productivity and age: A mathematical model based on a two-step cognitive process. Developmental Review, 4(1), 77–111.

    Google Scholar 

  • Valderas, J. M. (2007). Why do team-authored papers get cited more? Science, 317(5844), 1496–1498.

    Google Scholar 

  • Van Rijnsoever, F. J., & Hessels, L. K. (2011). Factors associated with disciplinary and interdisciplinary research collaboration. Research Policy, 40(3), 463–472.

    Google Scholar 

  • Vogel, A. L., Stipelman, B. A., Hall, K. L., Nebeling, L., Stokols, D., & Spruijt-Metz, D. (2014). Pioneering the transdisciplinary team science approach: Lessons learned from National Cancer Institute grantees. J Tran Med Epid, 2(2).

  • Wagner, C. S., & Leydesdorff, L. (2005). Network structure, self-organization, and the growth of international collaboration in science. Research Policy, 34(10), 1608–1618.

    Google Scholar 

  • Wuchty, S., Jones, B. F., & Uzzi, B. (2007). The increasing dominance of teams in production of knowledge. Science, 316(5827), 1036–1039.

    Google Scholar 

  • Xie, Z., Li, J.P., & Li, M. (2018). Exploring cooperative game mechanisms of scientific coauthorship networks. Complexity. https://doi.org/10.1155/2018/9173186

    Article  Google Scholar 

  • Xie, Z. (2019). A cooperative game model for the multimodality of coauthorship networks. Scientometrics, 121(1), 503–519.

    Google Scholar 

  • Xie, Z. (2020). Predicting the number of coauthors for researchers: A learning model. Journal of Informetrics, 14(2), 101036.

    Google Scholar 

  • Xie, Z. (2021). A prediction method of publication productivity for researchers. IEEE Transactions on Computational Social Systems, 8(2), 423–433.

    Article  Google Scholar 

  • Xie, Z., Li, M., Li, J. P., Duan, X. J., & Ouyang, Z. Z. (2018). Feature analysis of multidisciplinary scientific collaboration patterns based on pnas. EPJ Data Science, 7, 5.

    Google Scholar 

  • Xie, Z., Ouyang, Z. Z., & Li, J. P. (2016). A geometric graph model for coauthorship networks. Journal of Informetrics, 10, 299–311.

    Google Scholar 

  • Xie, Z., Ouyang, Z. Z., Li, J. P., Dong, E. M., & Yi, D. Y. (2018). Modelling transition phenomena of scientific coauthorship networks. Journal of the Association for Information Science and Technology, 69(2), 305–317.

    Google Scholar 

  • Xie, Z., Xie, Z. L., Li, M., Li, J. P., & Yi, D. Y. (2017). Modeling the coevolution between citations and coauthorship of scientific papers. Scientometrics, 112, 483–507.

    Google Scholar 

  • Zeng, A., Shen, Z., Zhou, J., Wu, J., Fan, Y., Wang, Y., & Stanley, H. E. (2017). The science of science: From the perspective of complex systems. Physics Reports, 714, 1–73.

    MathSciNet  MATH  Google Scholar 

  • Zhang, C., Bu, Y., Ding, Y., & Xu, J. (2018). Understanding scientific collaboration: Homophily, transitivity, and preferential attachment. Journal of the Association for Information Science and Technology, 69(1), 72–86.

    Google Scholar 

Download references

Acknowledgements

The author thanks Professor Jinying Su in the National University of Defense Technology for her helpful comments and feedback. This work is supported by the National Natural Science Foundation of China (Grant No. 61773020) and National Education Science Foundation of China (Grant No. DIA180383).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Zheng Xie.

Appendix

Appendix

We applied the model to simulate the coauthorship networks extracted from the papers of three accredited journals of informetrics, namely the Journal of the Association for Information Science and Technology, Journal of Informetrics, and Scientometrics. The dataset considered here, named “informetr” (Table 3), consists the papers published at the years from 1998 to 2019, where authors’ name is extracted from the term “AF” in the raw data downloaded from Web of ScienceFootnote 2. The papers at 1998–2021 are used to extract nodes’ historical degree and hyperdegree. The coauthorship network extracted from the papers at 2002–2019 will be simulated in full scale by our model. Notably, using the term “AF” suffers the splitting and merging errors on names. Simulating a dataset with errors helps to test the fault tolerance of our model.

Table 3 The information of the dataset informetr

Fig. 16 shows the fitting polynomial of the cumulative number of nodes and that of the annual number of hyperedges in synth-informetr. It indicates that the mathematical formulae underlying our model also hold for this dataset. The number of new nodes at each time step is proportional to the corresponding number of informetr, where the proportion \(\varepsilon =7\). Six processes are used to run the model. When \(\alpha =1.6\), \(\beta =0.0007\), and \(\gamma =0.8\), the model can generate hyperdegree and degree distributions with approximate shapes to those of informetr. When \(\epsilon =0.0019\), the difference between the size of the giant component of informetr 2002–2019 and that of synth-informetr is smaller than 0.1.

Fig. 16
figure 16

The increasing trend of synth-informetr. Panel a shows the cumulative number of nodes and its fitting polynomial \(y(t)=\sum ^2_{l=0}a_lt^l\), where \(a_0=-1.356\)e+03, \(a_1=1.423\)e+03, and \(a_2=1.642\)e+02. Panel b shows that the annual number of hyperedges and its fitting polynomial \(y(t)=\sum ^1_{l=0}b_lt^l\), where \(b_0=1.641\)e+2 and \(b_1=3.219\)e+01. Panel c shows the contributions of the leading terms to these polynomials

Table 4 shows certain statistical indexes of the coauthorship networks extracted from informetr and from synth-informetr. Figs. 17, 18, 19, 20, 21, 2223 and 24 show that the model can captures the coauthorship patterns in informetr, and can reproduce the evolutions of coauthorship patterns, the clustering of nodes, the degree assortativity, the degree distribution, and so on.

Table 4 Network indexes of informetr and synth-informetr
Fig. 17
figure 17

Coauthoring patterns from the perspective of multinary relationship. The panels show the fractions of the patterns, defined in the caption of Fig. 9, of informetr (red bars) and those of synth-informetr (blue bars)

Fig. 18
figure 18

Coauthoring patterns from the perspective of binary relationship. The panels show the fractions of the patterns, defined in the caption of Fig.10, of informetr (red bars) and those of synth-informetr (blue bars)

Fig. 19
figure 19

The proportion of the authors who publish one paper. The \(r_t\) is this proportion of informetr, and \(r_s\) is that of the synth-informetr. The panels show this proportion for the authors in informetr who have the same number of coauthors (red squares) and that for synth-informetr (blue circles)

Fig. 20
figure 20

The correlation between the number of papers and the number of coauthors. The panels show the average number of coauthors of the authors in informetr who have the same number of papers at [2002, y] (red circles) and that in synth-informetr (blue squares). The Spearman correlation coefficient, \(r_t\), for informetr and that, \(r_s\), for synth-informetr are significantly larger than 0, p-value \(<0.05\)

Fig. 21
figure 21

The local clustering coefficient. The panels show the average of this value for the authors in informetr who have degree k at [2002, y] (red circles) and that for synth-informetr (blue circles)

Fig. 22
figure 22

The average degree of coauthors. The panels show the average of this value for the authors in informetr who have degree k at [2002, y] (red circles) and that for synth-informetr (blue squares)

Fig. 23
figure 23

The distribution of the number of an author’s papers. The panels show this distribution for informetr (red circles) and that for synth-informetr (blue squares)

Fig. 24
figure 24

The distribution of the number of an author’s coauthors. The panels show the distribution for informetr (red circles) and that for synth-informetr (blue squares)

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Xie, Z. A distributed hypergraph model for simulating the evolution of large coauthorship networks. Scientometrics 126, 4609–4638 (2021). https://doi.org/10.1007/s11192-021-03991-2

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11192-021-03991-2

Keywords

Navigation