Skip to main content
Log in

On the robustness of centrality measures against link weight quantization in social networks

  • Published:
Computational and Mathematical Organization Theory Aims and scope Submit manuscript

Abstract

In social network analysis, individuals are represented as nodes in a graph, social ties among them are represented as links, and the strength of the social ties can be expressed as link weights. However, in social network analyses where the strength of a social tie is expressed as a link weight, the link weight may be quantized to take only a few discrete values. In this paper, expressing a continuous value of social tie strength as a few discrete value is referred to as link weight quantization, and we study the effects of link weight quantization on centrality measures through simulations and experiments utilizing network generation models that generate synthetic social networks and real social network datasets. Our results show that (1) the effects of link weight quantization on the centrality measures are not significant when determining the most important node in a graph, (2) conversely, a 5–8 quantization level is needed to determine other important nodes, and (3) graphs with a highly skewed degree distribution or with a high correlation between node degree and link weights are robust against link weight quantization.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15

Similar content being viewed by others

References

  • Barrat A, Barthélemy M, Vespignani A (2004) Modeling the evolution of weighted networks. Phys Rev E 70(6):66–149

    Article  Google Scholar 

  • Batallas D, Yassine A (2006) Information leaders in product development organizational networks: social network analysis of the design structure matrix. IEEE Trans Eng Manag 53(4):570–582

    Article  Google Scholar 

  • Berry JW, Hendrickson B, LaViolette RA, Phillips CA (2011) Tolerating the community detection resolution limit with edge weighting. Phys Rev E 83(5):056,119

    Article  Google Scholar 

  • Bonacich P (1972) Factoring and weighting approaches to status scores and clique identification. J Math Sociol 2(1):113–120

    Article  Google Scholar 

  • Bonacich P (1987) Power and centrality: a family of measures. Am J Sociol 92(5):1170–1182

    Article  Google Scholar 

  • Borgatti SP (2006) Identifying sets of key players in a social network. Comput Math Organ Theory 12(1):21–34

    Article  Google Scholar 

  • Borgatti SP, Carley KM, Krackhardt D (2006) On the robustness of centrality measures under conditions of imperfect data. Soc Netw 28(2):124–136

    Article  Google Scholar 

  • Borgatti SP, Mehra A, Brass DJ, Labianca G (2009) Network analysis in the social sciences. Science 323(5916):892–895

    Article  Google Scholar 

  • Cataldo M, Ehrlich K (2012) The impact of communication structure on new product development outcomes. In: Proceedings of the SIGCHI conference on human factors in computing systems (CHI ’12), pp 3081–3090

  • Cattani G, Ferriani S (2008) A core/periphery perspective on individual creative performance: social networks and cinematic achievements in the Hollywood film industry. Organ Sci 19(6):824–844

    Article  Google Scholar 

  • Clauset A, Newman M, Moore C (2004) Finding community structure in very large networks. Phys Rev E 70(6):066,111

    Article  Google Scholar 

  • Colizza V, Pastor-Satorras R, Vespignani A (2007) Reaction-diffusion processes and metapopulation models in heterogeneous networks. Nat Phys 3(4):276–282

    Article  Google Scholar 

  • Costenbader E, Valente T (2003) The stability of centrality measures when networks are sampled. Soc Netw 25(4):283–307

    Article  Google Scholar 

  • Creswick N, Westbrook J (2010) Social network analysis of medication advice-seeking interactions among staff in an Australian hospital. Int J Med Inform 79(6):116–125

    Article  Google Scholar 

  • Cross R, Parker A (2004) The hidden power of social networks: understanding how work really gets done in organizations. Harvard Business Press, Boston

    Google Scholar 

  • De Meo P, Ferrara E, Fiumara G, Provetti A (2014) Mixing local and global information for community detection in large networks. J Comput Syst Sci 80(1):72–87

    Article  Google Scholar 

  • Ehrlich K, Cataldo M (2012) All-for-one and one-for-all?: a multi-level analysis of communication patterns and individual performance in geographically distributed software development. In: Proceedings of the ACM 2012 conference on computer supported cooperative work (CSCW ’12), pp 945–954

  • Frantz TL, Cataldo M, Carley K (2009) Robustness of centrality measures under uncertainty: examining the role of network topology. Comput Math Organ Theory 15:303–328

    Article  Google Scholar 

  • Freeman L (1979) Centrality in social networks conceptual clarification. Soc Netw 1(3):215–239

    Article  Google Scholar 

  • Ghoshal G, Barabási A (2011) Ranking stability and super-stable nodes in complex networks. Nat Commun 2(394):1–7

    Google Scholar 

  • Gómez V, Kaltenbrunner A, López V (2008) Statistical analysis of the social network and discussion threads in slashdot. In: Proceeding of the 17th international conference on world wide web (WWW ’08), pp 645–654

  • Granovetter MS (1973) The strength of weak ties. Am J Sociol 78(6):1360–1380

    Article  Google Scholar 

  • Kamei Y, Matsumoto S, Maeshima H, Onishi Y, Ohira M, Matsumoto K (2008) Analysis of coordination between developers and users in the apache community. Open Sour Dev Communities Qual 275:81–92

    Article  Google Scholar 

  • Kim P, Jeong H (2007) Reliability of rank order in sampled networks. Eur Phys J B-Condens Matter Complex Syst 55(1):109–114

    Article  Google Scholar 

  • Kumpula J, Onnela J, Saramäki J, Kertész J, Kaski K (2009) Model of community emergence in weighted social networks. Comput Phys Commun 180(4):517–522

    Article  Google Scholar 

  • Lee SH, Kim PJ, Jeong H (2006) Statistical properties of sampled networks. Phys Rev 73(1):016,102

    Article  Google Scholar 

  • Leskovec J, Krevl A (2014) SNAP Datasets: Stanford large network dataset collection. http://snap.stanford.edu/data

  • Li C, Chen G (2006) Modelling of weighted evolving networks with community structures. Phys A: Stat Mech Appl 370(2):869–876

    Article  Google Scholar 

  • Meo PD, Ferrara E, Fiumara G, Provetti A (2012) Enhancing community detection using a network weighting strategy. Inf Sci 222(10):648–668

    Google Scholar 

  • Opsahl T, Agneessens F, Skvoretz J (2010) Node centrality in weighted networks: generalizing degree and shortest paths. Soc Netw 32(3):245–251

    Article  Google Scholar 

  • Opsahl T (2013) Triadic closure in two-mode networks: redefining the global and local clustering coefficients. Soc Netw 35(2):159–167

    Article  Google Scholar 

  • Opsahl T, Panzarasa P (2009) Clustering in weighted networks. Soc Netw 31(2):155–163

    Article  Google Scholar 

  • Shetty J, Adibi J (2004) The Enron email dataset database schema and brief statistical report. Information Sciences Institute, University of Southern California, Tech. rep

  • Sun PG, Gao L, Yang Y (2013) Maximizing modularity intensity for community partition and evolution. Inf Sci 236:83–92

    Article  Google Scholar 

  • Sun PG (2014) Weighting links based on edge centrality for community detection. Phys A: Stat Mech Appl 394:346–357

    Article  Google Scholar 

  • Tsugawa S, Ohsaki H, Imase M (2012) Inferring leadership of online development community using topological structure of its social network. J Infosocionomics Soc 7(1):17–27

    Google Scholar 

  • Uzzi B, Spiro J (2005) Collaboration and creativity: the small world problem. Am J Sociol 111(2):447–504

    Article  Google Scholar 

  • Valente T, Watkins S, Jato M, Straten AVD, Tsitsol L (1997) Social network associations with contraceptive use among Cameroonian women in voluntary associations. Soc Sci Med 45(5):677–687

    Article  Google Scholar 

  • Watts DJ (2003) Small worlds: the dynamics of networks between order and randomness. Princeton Univ Press, Princeton

    Google Scholar 

  • Watts DJ (2007) A twenty-first century science. Nature 445(7127):489

    Article  Google Scholar 

Download references

Acknowledgments

The authors would like to thank Dr. Makoto Imase for valuable discussions, and anonymous reviewers for their insightful comments.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Sho Tsugawa.

Appendix: quantization errors of linear and logarithmic quantization

Appendix: quantization errors of linear and logarithmic quantization

We derive the quantization errors of linear and logarithmic quantization. Without loss of generality, we assume link weights are distributed over [1, m]. As a representative skewed distribution, we assume link weights follow a truncated Pareto distribution with probability density function defined as following equation.

$$\begin{aligned} p(x) = {\left\{ \begin{array}{ll} \frac{\alpha x^{-\alpha -1}}{1-m^{-\alpha }} &{} (1 \le x \le m)\\ 0 &{} \text {otherwise} \end{array}\right. } \end{aligned}$$
(4)

Let n be the quantization level. Then, the expected value of the error of linear quantization is given by

$$\sum _{k=1}^n \int _{1+\frac{(m-1)}{n}(k-1)}^{1+\frac{(m-1)}{n}k}p(x) (1+\frac{m-1}{n}k-x) dx,$$
(5)

and the expected value of the error of logarithmic quantization is given by

$$\sum _{k=1}^n \int _{m^{\frac{k-1}{n}}}^{m^{\frac{k}{n}}}p(x) (m^{\frac{k}{n}}-x) dx.$$
(6)

The relation between quantization level n and quantization error is shown in Fig. 16. The relation between the parameter \(\alpha\) and quantization errors is shown in Fig. 17.

Fig. 16
figure 16

Relation between quantization level n and expected value of quantization error (parameter \(\alpha =3\), maximum link weight \(m=10\))

Fig. 17
figure 17

Relation between parameter \(\alpha\) and expected value of quantization error (quantization level \(n=10\), maximum link weight \(m=10\))

These figures show that when the link weight distribution is the truncated Pareto distribution, which is a skewed distribution, quantization error of logarithmic quantization is smaller than that of linear quantization.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Tsugawa, S., Matsumoto, Y. & Ohsaki, H. On the robustness of centrality measures against link weight quantization in social networks. Comput Math Organ Theory 21, 318–339 (2015). https://doi.org/10.1007/s10588-015-9188-7

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10588-015-9188-7

Keywords

Navigation