Skip to main content
Log in

Power-law distributions of attributes in community detection

  • Original Article
  • Published:
Social Network Analysis and Mining Aims and scope Submit manuscript

Abstract

Community detection has drawn significant attention as new media generates big data every day. To provide statistical testing procedures for community detection in social networks, a scanning method has been developed based on the likelihood of Poisson random graph. However, the scan statistics did not consider detecting communities of the attributes with power-law distribution. Power-law distribution, generally followed by network attributes, is conspicuous in many scientific situations. This paper aims at extending the scanning method to analyze a social network in which attributes follow power-law distribution. Besides the theoretical construction, simulation studies are performed to verify the feasibility of the proposed method, and an authorship network is used to demonstrate the proposed method.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5

Similar content being viewed by others

References

  • Balasubramanyan R, Cohen WW (2011) Block-lda: Jointly modeling entity-annotated text and entity-entity links. SDM, SIAM 11:450–461

  • Beebe NH (2002) Nelson hf beebe bibliographies page. http://www.math.utah.edu/~beebe/bibliographies.html

  • Blei DM, Ng AY, Jordan MI (2003) Latent Dirichlet allocation. J Mach Learn Res 3:993–1022

    MATH  Google Scholar 

  • Burroughs SM, Tebbens SF (2001) Upper-truncated power laws in natural systems. Pure Appl Geophys 158(4):741–757

    Article  Google Scholar 

  • Clauset A, Shalizi CR, Newman ME (2009) Power-law distributions in empirical data. SIAM Rev 51(4):661–703

    Article  MATH  MathSciNet  Google Scholar 

  • Erdős P, Rényi A (1959) On random graphs. Publ Math Debr 6:290–297

    Google Scholar 

  • Fortunato S (2010) Community detection in graphs. Phys Rep 486(3):75–174

    Article  MathSciNet  Google Scholar 

  • Fortunato S, Barthélemy M (2007) Resolution limit in community detection. In: Proceedings of the National Academy of Sciences 104:36–41

  • Gillespie CS (2013) Fitting heavy tailed distributions: the poweRlaw package. R package version (20):2

  • Goldstein ML, Morris SA, Yen GG (2004) Problems with fitting to the power-law distribution. Eur Phys J B Condens Syst 41(2):255–258

    Article  Google Scholar 

  • Guimera R, Sales-Pardo M, Amaral LAN (2004) Modularity from fluctuations in random graphs and complex networks. Phys Rev E 70(2):025–101

    Article  Google Scholar 

  • Handcock MS, Raftery AE, Tantrum JM (2007) Model-based clustering for social networks. J Roy Stat Soc A Stat 170(2):301–354

    Article  MathSciNet  Google Scholar 

  • Heard NA, Weston DJ, Platanioti K, Hand DJ (2010) Bayesian anomaly detection methods for social networks. Ann Appl Stat 4(2):645–662

    Article  MATH  MathSciNet  Google Scholar 

  • Kossinets G, Watts DJ (2006) Empirical analysis of an evolving social network. Science 311(5757):88–90

    Article  MATH  MathSciNet  Google Scholar 

  • Kulldorff M (1997) A spatial scan statistic. Commun Stat Theor M 26(6):1481–1496

    Article  MATH  MathSciNet  Google Scholar 

  • Kulldorff M, Huang L, Pickle L, Duczmal L (2006) An elliptic spatial scan statistic. Stat Med 25(22):3929–3943

    Article  MathSciNet  Google Scholar 

  • Liu Y, Niculescu-Mizil A, Gryc W (2009) Topic-link lda: joint models of topic and author community. In: Proceedings of the 26th Annual International Conference on Machine Learning, pp 665–672

  • Louch H (2000) Personal network integration: transitivity and homophily in strong-tie relations. Soc Netw 22(1):45–64

    Article  Google Scholar 

  • McPherson M, Smith-Lovin L, Cook JM (2001) Birds of a feather: Homophily in social networks. Annu Rev Sociol pp 415–444

  • Naus JI (1966) Some probabilities, expectations and variances for the size of largest clusters and smallest intervals. J Am Stat Assoc 61(316):1191–1199

    Article  MATH  MathSciNet  Google Scholar 

  • Newman ME (2004) Fast algorithm for detecting community structure in networks. Phys Rev E 69(6):066–133

    Article  Google Scholar 

  • Newman ME, Girvan M (2004) Finding and evaluating community structure in networks. Phys Rev E 69(2):026–113

    Article  Google Scholar 

  • Tango T, Takahashi K (2005) A flexibly shaped spatial scan statistic for detecting clusters. Int J Health Geogr 4(1):11

    Article  Google Scholar 

  • Wang B, Phillips JM, Schreiber R, Wilkinson DM, Mishra N, Tarjan R (2008) Spatial scan statistics for graph clustering. In: SDM, pp 727–738

  • Yang J, McAuley J, Leskovec J (2013) Community detection in networks with node attributes. In: Data Mining (ICDM), 2013 IEEE 13th International Conference on, pp 1151–1156

  • Zhou Y, Cheng H, Yu JX (2009) Graph clustering based on structural/attribute similarities. In: Proceedings of the VLDB Endowment 2:718–729

Download references

Acknowledgments

This work was supported by (a) Career Development Award of Academia Sinica (Taiwan) grant number 103-CDA-M04 and National Science Council (Taiwan) grant number 102-2628-M-001-002-MY3, for Phoa, (b) Thematic Research Program of Academia Sinica (Taiwan) grant number AS-103-TP-C03 for Phoa and Wang.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Tai-Chi Wang.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Wang, TC., Phoa, F.K.H. & Hsu, TC. Power-law distributions of attributes in community detection. Soc. Netw. Anal. Min. 5, 45 (2015). https://doi.org/10.1007/s13278-015-0283-z

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s13278-015-0283-z

Keywords

Navigation