A distributed hypergraph model for simulating the evolution of large coauthorship networks

Xie, Zheng

doi:10.1007/s11192-021-03991-2

A distributed hypergraph model for simulating the evolution of large coauthorship networks

Published: 26 April 2021

Volume 126, pages 4609–4638, (2021)
Cite this article

Scientometrics Aims and scope Submit manuscript

Zheng Xie ORCID: orcid.org/0000-0003-0391-8725¹

634 Accesses
5 Citations
Explore all metrics

Abstract

The coauthorship in a paper set can be expressed by a hypergraph, namely a system with heterogeneously multinary relationship. The coauthorship network of that paper set is the simple graph extracted from that hypergraph. We designed a distributed hypergraph model to simulate the dynamics of large coauthorship networks in a full-scale manner. Its assembly mechanism of hyperedges is driven by Lotka’s law and a cooperative game that maximizes a benefit-cost ratio for coauthoring a paper. The model is built on a circle to express the game, expressing the cost by the distance between nodes. The benefit of coauthoring with a productive author or one with many coauthors is expressed by the cumulative degree or hyperdegree of nodes. The model successfully simulates the multimodal features emerged in the evolutions of coauthorship patterns, the clustering of nodes, the degree assortativity of linked nodes, degree and hyperdegree distributions. This model has the potential to be a null model for studying the complexity of the large social networks arising from other specific collaboration behaviors.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A cooperative game model for the multimodality of coauthorship networks

Article 13 July 2019

A hypergraph model for representing scientific output

Article 22 September 2018

Growth patterns and models of real-world hypergraphs

Article 23 August 2022

Notes

References

Aldieri, L., Kotsemir, M., & Vinci, C. P. (2018). The impact of research collaboration on academic performance: An empirical analysis for some European countries. Socio-Economic Planning Sciences, 62, 13–30.
Google Scholar
Barabási, A. L., Jeong, H., Néda, Z., Ravasz, E., Schubert, A., & Vicsek, T. (2002). Evolution of the social network of scientific collaborations. Physica A: Statistical Mechanics and its Applications, 311(3–4), 590–614.
MathSciNet MATH Google Scholar
Börner, K., Maru, J. T., & Goldstone, R. L. (2004). The simultaneous evolution of author and paper networks. Proceedings of the National Academy of Sciences, 101(suppl 1), 5266–5273.
Google Scholar
Burt, R. S. (2004). Structural holes and good ideas. American Journal of Sociology, 110(2), 349–399.
Google Scholar
Catanzaro, M., Caldarelli, G., & Pietronero, L. (2004). Assortative model for social networks. Physical Review E, 70, 037101.
Google Scholar
Committee on the Science of Team Science. (2015). Enhancing the Effectiveness of Team Science. National Academies Press.
Corrêa, J. R. E. A., Silva, F. N., Costa, L. D. F., & Amancio, D. R. (2017). Patterns of authors contribution in scientific manuscripts. Journal of Informetrics, 11(2), 498–510.
Google Scholar
Ductor, L. (2015). Does co-authorship lead to higher academic productivity? Oxford Bulletin of Economics 530 and Statistics, 77(3), 385–407.
Google Scholar
Ferligoj, A., Kronegger, L., Mali, F., Snijders, T. A., & Doreian, P. (2015). Scientific collaboration dynamics in a national scientific system. Scientometrics, 104(3), 985–1012.
Google Scholar
Glänzel, W. (2014). Analysis of co-authorship patterns at the individual level. Transinformacao, 26, 229–238.
Google Scholar
Glänzel, W., & Schubert, A. (2004). Analysing scientific networks through co-authorship. In H. F. Moed, W. Glänzel, U. Schmoch (Eds),Handbook of Quanitative Science and Technology Research (pp. 257–276). Springer.
Glänzel, W., Schubert, A., & Czerwon, H. J. (1999). A bibliometric analysis of international scientific cooperation of the European Union (1985–1995). Scientometrics, 45(2), 185–202.
Google Scholar
Gomez, I., Fernández, M. T., & Sebastian, J. (1999). Analysis of the structure of international scientific cooperation networks through bibliometric indicators. Scientometrics, 44(3), 441–457.
Google Scholar
Guimerá, R., Uzzi, B., Spiro, J., & Amaral, L. A. N. (2005). Team assembly mechanisms determine collaboration network structure and team performance. Science, 308, 697–702.
Google Scholar
Hall, K. L., Stokols, D., Stipelman, B. A., Vogel, A. L., Feng, A., Masimore, B., et al. (2012). Assessing the value of team science: A study comparing center-and investigator-initiated grants. American Journal of Preventive Medicine, 42(2), 157–163.
Google Scholar
Hoekman, J., Frenken, K., & Tijssen, R. J. (2010). Research collaboration at a distance: Changing spatial patterns of scientific collaboration within Europe. Research Policy, 39(5), 662–673.
Google Scholar
Hunter, L., & Leahey, E. (2008). Collaborative research in sociology: Trends and contributing factors. The American Sociologist, 39, 290–306.
Google Scholar
Katz, J. S. (1994). Geographical proximity and scientific collaboration. Scientometrics, 31(1), 31–43.
Google Scholar
Khor, K. A., & Yu, L. G. (2016). Influence of international co-authorship on the research citation impact of young universities. Scientometrics, 107(3), 1095–1110.
Google Scholar
Kshemkalyani, A. D., & Singhal, M. (2008). Distributed computing: principles, algorithms, and systems. Cambridge University Press.
Leclerc, M., & Gagné, J. (1994). International scientific cooperation: The continentalization of science. Scientometrics, 31(3), 261–292.
Google Scholar
Lee, S., & Bozeman, B. (2005). The impact of research collaboration on scientific productivity. Social Studies of Science, 35, 673–702.
Google Scholar
Lehman, H. C. (2017).Age and achievement. (Vol. 4970). Princeton University Press.
Li, F., Miao, Y., & Yang, C. (2015). How do alumni faculty behave in research collaboration? An analysis of Chang Jiang Scholars in China. Research Policy, 44(2), 438–450.
Google Scholar
Lotka, A. J. (1926). The frequency distribution of scientific productivity. Journal of the Washington Academy of Sciences, 16(12), 317–323.
Google Scholar
Lu, C., Zhang, Y., Ahn, Y. Y., Ding, Y., Zhang, C., & Ma, D. (2019). Co-contributorship network and division of labor in individual scientific collaborations. Journal of the Association for Information Science and 565 Technology. https://doi.org/10.1002/asi.24321.
Article Google Scholar
Mali, F., Kronegger, L., Doreian, P., & Ferligoj, A. (2012). Dynamic scientific coauthorship networks. In A. Scharnhorst, K. Börner, & P. V. D. Besselaar (Eds.), Models of science dynamics (pp. 195–232). Springer.
Milojević, S. (2014). Principles of scientific research team formation and evolution. Proceedings of the National Academy of Science, 111(11), 3984–3989.
Google Scholar
Moody, J. (2004). The strucutre of a social science collaboration network: Disciplinery cohesion form 1963 to 1999. American Sociological Review, 69(2), 213–238.
Google Scholar
Narin, F., Stevens, K., & Whitlow, E. S. (1991). Scientific co-operation in Europe and the citation of multinationally authored papers. Scientometrics, 21(3), 313–323.
Google Scholar
Newman, M. (2001). Scientific collaboration networks. II. shortest paths, weighted networks, and centrality. Physical Review E, 64, 016132.
Google Scholar
Newman, M. (2001). Scientific collaboration networks. I. network construction and fundamental results. Physical Review E, 64, 016131.
Google Scholar
Newman, M. (2001). The structure of scientific collaboration networks. Proceedings of the National Academy of Sciences, 98, 404–409.
MathSciNet MATH Google Scholar
Newman, M. E. (2001). Clustering and preferential attachment in growing networks. Physical Review E, 64(2), 025102.
Google Scholar
Newman, M. (2002). Assortative mixing in networks. Physical Review Letters, 89, 208701.
Google Scholar
Newman, M. (2004). Coauthorship networks and patterns of scientific collaboration. Proceedings of the National Academy of Sciences, 101, 5200–5205.
Google Scholar
Nowak, M. A. (2006). Five rules for the evolution of cooperation. Science, 314(5805), 1560–3.
Google Scholar
Pennisi, E. (2005). How did cooperative behavior evolve? Science, 309(5731), 93–93.
Google Scholar
Perc, C. (2010). Growth and structure of Slovenia’s scientific collaboration network. Journal of Informetrics, 4, 475–482.
Google Scholar
Perc, M. (2014). The Matthew effect in empirical data. Journal of the Royal Society Interface, 11, 20140378.
Google Scholar
Perc, M., & Szolnoki, A. (2008). Social diversity and promotion of cooperation in the spatial prisoner’s dilemma game. Physical Review E, 77(1), 011904.
Google Scholar
Perc, M., & Szolnoki, A. (2010). Coevolutionary games—a mini review. BioSystems, 99(2), 109–125.
Google Scholar
Ponomariov, B., & Boardman, C. (2016). What is co-authorship? Scientometrics, 109(3), 1939–1963.
Google Scholar
Price, D. J. S. (1963). Little science, big science. Columbia University Press.
Qi, M., Zeng, A., Li, M., Fan, Y., & Di, Z. (2017). Standing on the shoulders of giants: The effect of outstanding scientists on young collaborators’ careers. Scientometrics, 111(3), 1839–1850.
Google Scholar
Russell, J. M. (1995). The increasing role of international cooperation in science and technology research in Mexico. Scientometrics, 34(1), 45–61.
Google Scholar
Santos, F. C., & Pacheco, J. M. (2005). Scale-free networks provide a unifying framework for the emergence of cooperation. Physical Review Letter, 95(9), 098104.
Google Scholar
Simonton, D. K. (1984). Creative productivity and age: A mathematical model based on a two-step cognitive process. Developmental Review, 4(1), 77–111.
Google Scholar
Valderas, J. M. (2007). Why do team-authored papers get cited more? Science, 317(5844), 1496–1498.
Google Scholar
Van Rijnsoever, F. J., & Hessels, L. K. (2011). Factors associated with disciplinary and interdisciplinary research collaboration. Research Policy, 40(3), 463–472.
Google Scholar
Vogel, A. L., Stipelman, B. A., Hall, K. L., Nebeling, L., Stokols, D., & Spruijt-Metz, D. (2014). Pioneering the transdisciplinary team science approach: Lessons learned from National Cancer Institute grantees. J Tran Med Epid, 2(2).
Wagner, C. S., & Leydesdorff, L. (2005). Network structure, self-organization, and the growth of international collaboration in science. Research Policy, 34(10), 1608–1618.
Google Scholar
Wuchty, S., Jones, B. F., & Uzzi, B. (2007). The increasing dominance of teams in production of knowledge. Science, 316(5827), 1036–1039.
Google Scholar
Xie, Z., Li, J.P., & Li, M. (2018). Exploring cooperative game mechanisms of scientific coauthorship networks. Complexity. https://doi.org/10.1155/2018/9173186
Article Google Scholar
Xie, Z. (2019). A cooperative game model for the multimodality of coauthorship networks. Scientometrics, 121(1), 503–519.
Google Scholar
Xie, Z. (2020). Predicting the number of coauthors for researchers: A learning model. Journal of Informetrics, 14(2), 101036.
Google Scholar
Xie, Z. (2021). A prediction method of publication productivity for researchers. IEEE Transactions on Computational Social Systems, 8(2), 423–433.
Article Google Scholar
Xie, Z., Li, M., Li, J. P., Duan, X. J., & Ouyang, Z. Z. (2018). Feature analysis of multidisciplinary scientific collaboration patterns based on pnas. EPJ Data Science, 7, 5.
Google Scholar
Xie, Z., Ouyang, Z. Z., & Li, J. P. (2016). A geometric graph model for coauthorship networks. Journal of Informetrics, 10, 299–311.
Google Scholar
Xie, Z., Ouyang, Z. Z., Li, J. P., Dong, E. M., & Yi, D. Y. (2018). Modelling transition phenomena of scientific coauthorship networks. Journal of the Association for Information Science and Technology, 69(2), 305–317.
Google Scholar
Xie, Z., Xie, Z. L., Li, M., Li, J. P., & Yi, D. Y. (2017). Modeling the coevolution between citations and coauthorship of scientific papers. Scientometrics, 112, 483–507.
Google Scholar
Zeng, A., Shen, Z., Zhou, J., Wu, J., Fan, Y., Wang, Y., & Stanley, H. E. (2017). The science of science: From the perspective of complex systems. Physics Reports, 714, 1–73.
MathSciNet MATH Google Scholar
Zhang, C., Bu, Y., Ding, Y., & Xu, J. (2018). Understanding scientific collaboration: Homophily, transitivity, and preferential attachment. Journal of the Association for Information Science and Technology, 69(1), 72–86.
Google Scholar

Download references

Acknowledgements

The author thanks Professor Jinying Su in the National University of Defense Technology for her helpful comments and feedback. This work is supported by the National Natural Science Foundation of China (Grant No. 61773020) and National Education Science Foundation of China (Grant No. DIA180383).

Author information

Authors and Affiliations

College of Liberal Arts and Sciences, National University of Defense Technology, Changsha, China
Zheng Xie

Authors

Zheng Xie
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Zheng Xie.

Appendix

We applied the model to simulate the coauthorship networks extracted from the papers of three accredited journals of informetrics, namely the Journal of the Association for Information Science and Technology, Journal of Informetrics, and Scientometrics. The dataset considered here, named “informetr” (Table 3), consists the papers published at the years from 1998 to 2019, where authors’ name is extracted from the term “AF” in the raw data downloaded from Web of Science^{Footnote 2}. The papers at 1998–2021 are used to extract nodes’ historical degree and hyperdegree. The coauthorship network extracted from the papers at 2002–2019 will be simulated in full scale by our model. Notably, using the term “AF” suffers the splitting and merging errors on names. Simulating a dataset with errors helps to test the fault tolerance of our model.

Table 3 The information of the dataset informetr

Full size table

Fig. 16 shows the fitting polynomial of the cumulative number of nodes and that of the annual number of hyperedges in synth-informetr. It indicates that the mathematical formulae underlying our model also hold for this dataset. The number of new nodes at each time step is proportional to the corresponding number of informetr, where the proportion \(\varepsilon =7\). Six processes are used to run the model. When \(\alpha =1.6\), \(\beta =0.0007\), and \(\gamma =0.8\), the model can generate hyperdegree and degree distributions with approximate shapes to those of informetr. When \(\epsilon =0.0019\), the difference between the size of the giant component of informetr 2002–2019 and that of synth-informetr is smaller than 0.1.

Table 4 shows certain statistical indexes of the coauthorship networks extracted from informetr and from synth-informetr. Figs. 17, 18, 19, 20, 21, 22 23 and 24 show that the model can captures the coauthorship patterns in informetr, and can reproduce the evolutions of coauthorship patterns, the clustering of nodes, the degree assortativity, the degree distribution, and so on.

Table 4 Network indexes of informetr and synth-informetr

Full size table

Rights and permissions

Reprints and permissions

About this article

Cite this article

Xie, Z. A distributed hypergraph model for simulating the evolution of large coauthorship networks. Scientometrics 126, 4609–4638 (2021). https://doi.org/10.1007/s11192-021-03991-2

Download citation

Received: 08 April 2020
Accepted: 03 April 2021
Published: 26 April 2021
Issue Date: June 2021
DOI: https://doi.org/10.1007/s11192-021-03991-2

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A distributed hypergraph model for simulating the evolution of large coauthorship networks

Abstract

Access this article

Similar content being viewed by others

A cooperative game model for the multimodality of coauthorship networks

A hypergraph model for representing scientific output

Growth patterns and models of real-world hypergraphs

Notes

References

Acknowledgements