Abstract
The coauthorship in a paper set can be expressed by a hypergraph, namely a system with heterogeneously multinary relationship. The coauthorship network of that paper set is the simple graph extracted from that hypergraph. We designed a distributed hypergraph model to simulate the dynamics of large coauthorship networks in a full-scale manner. Its assembly mechanism of hyperedges is driven by Lotka’s law and a cooperative game that maximizes a benefit-cost ratio for coauthoring a paper. The model is built on a circle to express the game, expressing the cost by the distance between nodes. The benefit of coauthoring with a productive author or one with many coauthors is expressed by the cumulative degree or hyperdegree of nodes. The model successfully simulates the multimodal features emerged in the evolutions of coauthorship patterns, the clustering of nodes, the degree assortativity of linked nodes, degree and hyperdegree distributions. This model has the potential to be a null model for studying the complexity of the large social networks arising from other specific collaboration behaviors.
Similar content being viewed by others
References
Aldieri, L., Kotsemir, M., & Vinci, C. P. (2018). The impact of research collaboration on academic performance: An empirical analysis for some European countries. Socio-Economic Planning Sciences, 62, 13–30.
Barabási, A. L., Jeong, H., Néda, Z., Ravasz, E., Schubert, A., & Vicsek, T. (2002). Evolution of the social network of scientific collaborations. Physica A: Statistical Mechanics and its Applications, 311(3–4), 590–614.
Börner, K., Maru, J. T., & Goldstone, R. L. (2004). The simultaneous evolution of author and paper networks. Proceedings of the National Academy of Sciences, 101(suppl 1), 5266–5273.
Burt, R. S. (2004). Structural holes and good ideas. American Journal of Sociology, 110(2), 349–399.
Catanzaro, M., Caldarelli, G., & Pietronero, L. (2004). Assortative model for social networks. Physical Review E, 70, 037101.
Committee on the Science of Team Science. (2015). Enhancing the Effectiveness of Team Science. National Academies Press.
Corrêa, J. R. E. A., Silva, F. N., Costa, L. D. F., & Amancio, D. R. (2017). Patterns of authors contribution in scientific manuscripts. Journal of Informetrics, 11(2), 498–510.
Ductor, L. (2015). Does co-authorship lead to higher academic productivity? Oxford Bulletin of Economics 530 and Statistics, 77(3), 385–407.
Ferligoj, A., Kronegger, L., Mali, F., Snijders, T. A., & Doreian, P. (2015). Scientific collaboration dynamics in a national scientific system. Scientometrics, 104(3), 985–1012.
Glänzel, W. (2014). Analysis of co-authorship patterns at the individual level. Transinformacao, 26, 229–238.
Glänzel, W., & Schubert, A. (2004). Analysing scientific networks through co-authorship. In H. F. Moed, W. Glänzel, U. Schmoch (Eds),Handbook of Quanitative Science and Technology Research (pp. 257–276). Springer.
Glänzel, W., Schubert, A., & Czerwon, H. J. (1999). A bibliometric analysis of international scientific cooperation of the European Union (1985–1995). Scientometrics, 45(2), 185–202.
Gomez, I., Fernández, M. T., & Sebastian, J. (1999). Analysis of the structure of international scientific cooperation networks through bibliometric indicators. Scientometrics, 44(3), 441–457.
Guimerá, R., Uzzi, B., Spiro, J., & Amaral, L. A. N. (2005). Team assembly mechanisms determine collaboration network structure and team performance. Science, 308, 697–702.
Hall, K. L., Stokols, D., Stipelman, B. A., Vogel, A. L., Feng, A., Masimore, B., et al. (2012). Assessing the value of team science: A study comparing center-and investigator-initiated grants. American Journal of Preventive Medicine, 42(2), 157–163.
Hoekman, J., Frenken, K., & Tijssen, R. J. (2010). Research collaboration at a distance: Changing spatial patterns of scientific collaboration within Europe. Research Policy, 39(5), 662–673.
Hunter, L., & Leahey, E. (2008). Collaborative research in sociology: Trends and contributing factors. The American Sociologist, 39, 290–306.
Katz, J. S. (1994). Geographical proximity and scientific collaboration. Scientometrics, 31(1), 31–43.
Khor, K. A., & Yu, L. G. (2016). Influence of international co-authorship on the research citation impact of young universities. Scientometrics, 107(3), 1095–1110.
Kshemkalyani, A. D., & Singhal, M. (2008). Distributed computing: principles, algorithms, and systems. Cambridge University Press.
Leclerc, M., & Gagné, J. (1994). International scientific cooperation: The continentalization of science. Scientometrics, 31(3), 261–292.
Lee, S., & Bozeman, B. (2005). The impact of research collaboration on scientific productivity. Social Studies of Science, 35, 673–702.
Lehman, H. C. (2017).Age and achievement. (Vol. 4970). Princeton University Press.
Li, F., Miao, Y., & Yang, C. (2015). How do alumni faculty behave in research collaboration? An analysis of Chang Jiang Scholars in China. Research Policy, 44(2), 438–450.
Lotka, A. J. (1926). The frequency distribution of scientific productivity. Journal of the Washington Academy of Sciences, 16(12), 317–323.
Lu, C., Zhang, Y., Ahn, Y. Y., Ding, Y., Zhang, C., & Ma, D. (2019). Co-contributorship network and division of labor in individual scientific collaborations. Journal of the Association for Information Science and 565 Technology. https://doi.org/10.1002/asi.24321.
Mali, F., Kronegger, L., Doreian, P., & Ferligoj, A. (2012). Dynamic scientific coauthorship networks. In A. Scharnhorst, K. Börner, & P. V. D. Besselaar (Eds.), Models of science dynamics (pp. 195–232). Springer.
Milojević, S. (2014). Principles of scientific research team formation and evolution. Proceedings of the National Academy of Science, 111(11), 3984–3989.
Moody, J. (2004). The strucutre of a social science collaboration network: Disciplinery cohesion form 1963 to 1999. American Sociological Review, 69(2), 213–238.
Narin, F., Stevens, K., & Whitlow, E. S. (1991). Scientific co-operation in Europe and the citation of multinationally authored papers. Scientometrics, 21(3), 313–323.
Newman, M. (2001). Scientific collaboration networks. II. shortest paths, weighted networks, and centrality. Physical Review E, 64, 016132.
Newman, M. (2001). Scientific collaboration networks. I. network construction and fundamental results. Physical Review E, 64, 016131.
Newman, M. (2001). The structure of scientific collaboration networks. Proceedings of the National Academy of Sciences, 98, 404–409.
Newman, M. E. (2001). Clustering and preferential attachment in growing networks. Physical Review E, 64(2), 025102.
Newman, M. (2002). Assortative mixing in networks. Physical Review Letters, 89, 208701.
Newman, M. (2004). Coauthorship networks and patterns of scientific collaboration. Proceedings of the National Academy of Sciences, 101, 5200–5205.
Nowak, M. A. (2006). Five rules for the evolution of cooperation. Science, 314(5805), 1560–3.
Pennisi, E. (2005). How did cooperative behavior evolve? Science, 309(5731), 93–93.
Perc, C. (2010). Growth and structure of Slovenia’s scientific collaboration network. Journal of Informetrics, 4, 475–482.
Perc, M. (2014). The Matthew effect in empirical data. Journal of the Royal Society Interface, 11, 20140378.
Perc, M., & Szolnoki, A. (2008). Social diversity and promotion of cooperation in the spatial prisoner’s dilemma game. Physical Review E, 77(1), 011904.
Perc, M., & Szolnoki, A. (2010). Coevolutionary games—a mini review. BioSystems, 99(2), 109–125.
Ponomariov, B., & Boardman, C. (2016). What is co-authorship? Scientometrics, 109(3), 1939–1963.
Price, D. J. S. (1963). Little science, big science. Columbia University Press.
Qi, M., Zeng, A., Li, M., Fan, Y., & Di, Z. (2017). Standing on the shoulders of giants: The effect of outstanding scientists on young collaborators’ careers. Scientometrics, 111(3), 1839–1850.
Russell, J. M. (1995). The increasing role of international cooperation in science and technology research in Mexico. Scientometrics, 34(1), 45–61.
Santos, F. C., & Pacheco, J. M. (2005). Scale-free networks provide a unifying framework for the emergence of cooperation. Physical Review Letter, 95(9), 098104.
Simonton, D. K. (1984). Creative productivity and age: A mathematical model based on a two-step cognitive process. Developmental Review, 4(1), 77–111.
Valderas, J. M. (2007). Why do team-authored papers get cited more? Science, 317(5844), 1496–1498.
Van Rijnsoever, F. J., & Hessels, L. K. (2011). Factors associated with disciplinary and interdisciplinary research collaboration. Research Policy, 40(3), 463–472.
Vogel, A. L., Stipelman, B. A., Hall, K. L., Nebeling, L., Stokols, D., & Spruijt-Metz, D. (2014). Pioneering the transdisciplinary team science approach: Lessons learned from National Cancer Institute grantees. J Tran Med Epid, 2(2).
Wagner, C. S., & Leydesdorff, L. (2005). Network structure, self-organization, and the growth of international collaboration in science. Research Policy, 34(10), 1608–1618.
Wuchty, S., Jones, B. F., & Uzzi, B. (2007). The increasing dominance of teams in production of knowledge. Science, 316(5827), 1036–1039.
Xie, Z., Li, J.P., & Li, M. (2018). Exploring cooperative game mechanisms of scientific coauthorship networks. Complexity. https://doi.org/10.1155/2018/9173186
Xie, Z. (2019). A cooperative game model for the multimodality of coauthorship networks. Scientometrics, 121(1), 503–519.
Xie, Z. (2020). Predicting the number of coauthors for researchers: A learning model. Journal of Informetrics, 14(2), 101036.
Xie, Z. (2021). A prediction method of publication productivity for researchers. IEEE Transactions on Computational Social Systems, 8(2), 423–433.
Xie, Z., Li, M., Li, J. P., Duan, X. J., & Ouyang, Z. Z. (2018). Feature analysis of multidisciplinary scientific collaboration patterns based on pnas. EPJ Data Science, 7, 5.
Xie, Z., Ouyang, Z. Z., & Li, J. P. (2016). A geometric graph model for coauthorship networks. Journal of Informetrics, 10, 299–311.
Xie, Z., Ouyang, Z. Z., Li, J. P., Dong, E. M., & Yi, D. Y. (2018). Modelling transition phenomena of scientific coauthorship networks. Journal of the Association for Information Science and Technology, 69(2), 305–317.
Xie, Z., Xie, Z. L., Li, M., Li, J. P., & Yi, D. Y. (2017). Modeling the coevolution between citations and coauthorship of scientific papers. Scientometrics, 112, 483–507.
Zeng, A., Shen, Z., Zhou, J., Wu, J., Fan, Y., Wang, Y., & Stanley, H. E. (2017). The science of science: From the perspective of complex systems. Physics Reports, 714, 1–73.
Zhang, C., Bu, Y., Ding, Y., & Xu, J. (2018). Understanding scientific collaboration: Homophily, transitivity, and preferential attachment. Journal of the Association for Information Science and Technology, 69(1), 72–86.
Acknowledgements
The author thanks Professor Jinying Su in the National University of Defense Technology for her helpful comments and feedback. This work is supported by the National Natural Science Foundation of China (Grant No. 61773020) and National Education Science Foundation of China (Grant No. DIA180383).
Author information
Authors and Affiliations
Corresponding author
Appendix
Appendix
We applied the model to simulate the coauthorship networks extracted from the papers of three accredited journals of informetrics, namely the Journal of the Association for Information Science and Technology, Journal of Informetrics, and Scientometrics. The dataset considered here, named “informetr” (Table 3), consists the papers published at the years from 1998 to 2019, where authors’ name is extracted from the term “AF” in the raw data downloaded from Web of ScienceFootnote 2. The papers at 1998–2021 are used to extract nodes’ historical degree and hyperdegree. The coauthorship network extracted from the papers at 2002–2019 will be simulated in full scale by our model. Notably, using the term “AF” suffers the splitting and merging errors on names. Simulating a dataset with errors helps to test the fault tolerance of our model.
Fig. 16 shows the fitting polynomial of the cumulative number of nodes and that of the annual number of hyperedges in synth-informetr. It indicates that the mathematical formulae underlying our model also hold for this dataset. The number of new nodes at each time step is proportional to the corresponding number of informetr, where the proportion \(\varepsilon =7\). Six processes are used to run the model. When \(\alpha =1.6\), \(\beta =0.0007\), and \(\gamma =0.8\), the model can generate hyperdegree and degree distributions with approximate shapes to those of informetr. When \(\epsilon =0.0019\), the difference between the size of the giant component of informetr 2002–2019 and that of synth-informetr is smaller than 0.1.
Table 4 shows certain statistical indexes of the coauthorship networks extracted from informetr and from synth-informetr. Figs. 17, 18, 19, 20, 21, 2223 and 24 show that the model can captures the coauthorship patterns in informetr, and can reproduce the evolutions of coauthorship patterns, the clustering of nodes, the degree assortativity, the degree distribution, and so on.
Rights and permissions
About this article
Cite this article
Xie, Z. A distributed hypergraph model for simulating the evolution of large coauthorship networks. Scientometrics 126, 4609–4638 (2021). https://doi.org/10.1007/s11192-021-03991-2
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11192-021-03991-2