Data Mining and Knowledge Discovery

, Volume 29, Issue 5, pp 1152–1177 | Cite as

Dynamic inference of social roles in information cascades

  • Sarvenaz Choobdar
  • Pedro Ribeiro
  • Srinivasan Parthasarathy
  • Fernando Silva
Article

Abstract

Nodes in complex networks inherently represent different kinds of functional or organizational roles. In the dynamic process of an information cascade, users play different roles in spreading the information: some act as seeds to initiate the process, some limit the propagation and others are in-between. Understanding the roles of users is crucial in modeling the cascades. Previous research mainly focuses on modeling users behavior based upon the dynamic exchange of information with neighbors. We argue however that the structural patterns in the neighborhood of nodes may already contain enough information to infer users’ roles, independently from the information flow in itself. To approach this possibility, we examine how network characteristics of users affect their actions in the cascade. We also advocate that temporal information is very important. With this in mind, we propose an unsupervised methodology based on ensemble clustering to classify users into their social roles in a network, using not only their current topological positions, but also considering their history over time. Our experiments on two social networks, Flickr and Digg, show that topological metrics indeed possess discriminatory power and that different structural patterns correspond to different parts in the process. We observe that user commitment in the neighborhood affects considerably the influence score of users. In addition, we discover that the cohesion of neighborhood is important in the blocking behavior of users. With this we can construct topological fingerprints that can help us in identifying social roles, based solely on structural social ties, and independently from nodes activity and how information flows.

Keywords

Structural role mining Information cascade Social role Ensemble clustering Complex networks 

References

  1. Adamic L, Adar E (2005) How to search a social network. Soc Netw 27(3):187–203CrossRefGoogle Scholar
  2. Agarwal N, Liu H, Tang L, Yu PS (2008) Identifying the influential bloggers in a community. In: Proceedings of ACM International Conference on Web Search and Data MiningGoogle Scholar
  3. Akaike H (1998) Information theory and an extension of the maximum likelihood principle. Selected Papers of Hirotugu Akaike. Springer, New YorkGoogle Scholar
  4. Bakshy E, Hofman JM, Mason WA, Watts DJ (2011) Everyone’s an influencer: quantifying influence on twitter. In: Proceedings of ACM International Conference on Web Search and Data MiningGoogle Scholar
  5. Bonacich P (2007) Some unique properties of eigenvector centrality. Soc Netw 29(4):555–564CrossRefGoogle Scholar
  6. Cha M, Benevenuto F, Ahn Y-Y, Gummadi KP (2012) Delayed information cascades in flickr: measurement, analysis, and modeling. Comput Netw 56(3):1066–1076CrossRefGoogle Scholar
  7. Cha M, Haddadi H, Benevenuto F, Gummadi PK (2010) Measuring user influence in twitter: the million follower fallacy. In: Proceedings of AAAI International Conference on Weblogs and Social Media, vol. 10Google Scholar
  8. Cha M, Mislove A, Gummadi KP (2009) A measurement-driven analysis of information propagation in the Flickr social network. In: Proceedings of ACM International Conference on World Wide WebGoogle Scholar
  9. Choobdar S, Ribeiro P, Silva F (2012) Event detection in evolving networks. In: Proceedings of IEEE International Conference on Computational Aspects of Social Networks (CASoN), São Carlos, BrazilGoogle Scholar
  10. Choobdar S, Silva F, Ribeiro P (2011) Network node label acquisition and tracking. In: Proceedings of Portuguese Conference on Artificial Intelligence, Progress in Artificial IntelligenceGoogle Scholar
  11. Cormode G, Shkapenyuk V, Srivastava D, Xu B (2009) Forward decay: a practical time decay model for streaming systems. In: Proceedings of IEEE International Conference on Data EngineeringGoogle Scholar
  12. Costa L, Rodrigues F, Hilgetag C, Kaiser M (2009) Beyond the average: detecting global singular nodes from local features in complex networks. Europhys Lett 87(1):18008CrossRefGoogle Scholar
  13. Danilevsky M, Wang C, Desai N, Han J (2013) Entity role discovery in hierarchical topical communities. In: Proceedings of ACM SIGKDD International Conference on Knowledge Discovery and Data MiningGoogle Scholar
  14. Easley D, Kleinberg J (2010) Networks, crowds, and markets, vol 6(1). Cambridge University Press, New YorkCrossRefMATHGoogle Scholar
  15. Easley D, Kleinberg J (2012) Networks, crowds, and markets: reasoning about a highly connected world. Cambridge University Press, CambridgeGoogle Scholar
  16. Fred A, Jain A (2002) Data clustering using evidence accumulation. In: Proceedings of International Conference on Pattern Recognition, vol. 4. Quebec City, CanadaGoogle Scholar
  17. Gallagher B, Eliassi-Rad T (2010) Leveraging label-independent features for classification in sparsely labeled networks: an empirical study. In: Proceedings of International Conference on Advances in Social Network Mining and Analysis. Berlin, GermanyGoogle Scholar
  18. Ghosh R, Lerman K (2010) Predicting influential users in online social networks. In: Proceedings of KDD Workshop on Social Network Analysis (SNA-KDD), July 2010Google Scholar
  19. Ghosh R, Lerman K (2012) Rethinking centrality: the role of dynamical processes in social network analysis. arXiv preprint arXiv:1209.4616
  20. Gionis A, Mannila H, Tsaparas P (2005) Clustering aggregation. In: Proceedings of IEEE International Conference on Data EngineeringGoogle Scholar
  21. Goyal A, Bonchi F, Lakshmanan LV (2010) Learning influence probabilities in social networks. In: Proceedings of ACM International Conference on Web Search and Data MiningGoogle Scholar
  22. Granovetter M (1973) The strength of weak ties. Am J Sociol 78(6):1360–1380CrossRefGoogle Scholar
  23. Granovetter M (1985) Economic action and social structure: the problem of embeddedness. Am J Sociol 91:481–510CrossRefGoogle Scholar
  24. Guo S, Wang M, Leskovec J (2011) The role of social networks in online shopping: information passing, price of trust, and consumer choice. In: Proceedings of the 12th ACM Conference on Electronic CommerceGoogle Scholar
  25. Henderson K, Gallagher B, Eliassi-Rad T, Tong H, Basu S, Akoglu L, Koutra D, Faloutsos C, Li L, Matsubara Y, et al. (2012) Rolx: structural role extraction & mining in large graphs. In: Proceedings of ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. Beijing, ChinaGoogle Scholar
  26. Iribarren JL, Moro E (2009) Impact of human activity patterns on the dynamics of information diffusion. Phys Rev Lett 103(3):038702CrossRefGoogle Scholar
  27. Johnson SC (1967) Hierarchical clustering schemes. Psychometrika 32(3):241–254CrossRefGoogle Scholar
  28. Karypis G, Aggarwal R, Kumar V, Shekhar S (1997) Multilevel hypergraph partitioning: application in vlsi domain. In: Proceedings of the 34th Annual Design Automation Conference, ACMGoogle Scholar
  29. Kempe D, Kleinberg J, Tardos É (2003) Maximizing the spread of influence through a social network. In: Proceedings of ACM SIGKDD International Conference on Knowledge Discovery and Data MiningGoogle Scholar
  30. Kossinets G, Watts DJ (2006) Empirical analysis of an evolving social network. Science 311(5757):88–90MathSciNetCrossRefMATHGoogle Scholar
  31. Kwak H, Lee C, Park H, Moon S (2010) What is twitter, a social network or a news media? In: Proceedings of ACM International Conference on World Wide WebGoogle Scholar
  32. Lancichinetti A, Fortunato S (2012) Consensus clustering in complex networks. Sci Rep 2:336CrossRefGoogle Scholar
  33. Lee C, Kwak H, Park H, Moon S (2010) Finding influentials based on the temporal order of information adoption in twitter. In Proceedings of ACM International Conference on World Wide WebGoogle Scholar
  34. Lerman K, Ghosh R, Surachawala T (2012) Social contagion: an empirical study of information spread on digg and twitter follower graphs. arXiv preprint arXiv:1202.3162
  35. Myers SA, Zhu C, Leskovec J (2012) Information diffusion and external influence in networks. In: Proceedings of ACM SIGKDD International Conference on Knowledge Discovery and Data MiningGoogle Scholar
  36. Rissanen J (1978) Modeling by shortest data description. Automatica 14(5):465–471CrossRefMATHGoogle Scholar
  37. Romero DM, Galuba W, Asur S, Huberman BA (2011) Influence and passivity in social media. In: Proceedings of the ECML/PKDDGoogle Scholar
  38. Rossi R, Gallagher B, Neville J, Henderson K (2012) Role-dynamics: fast mining of large dynamic networks. In: Proceedings of ACM International Conference on World Wide Web. Lyon, FranceGoogle Scholar
  39. Saito K, Nakano R, Kimura M (2008) Prediction of information diffusion probabilities for independent cascade model. Knowledge-based intelligent information and engineering systems. Springer, New YorkGoogle Scholar
  40. Strehl A, Ghosh J (2003) Cluster ensembles—a knowledge reuse framework for combining multiple partitions. J Mach Learn Res 3:583–617MathSciNetMATHGoogle Scholar
  41. Tang J, Sun J, Wang C, Yang Z (2009) Social influence analysis in large-scale networks. In Proceedings of ACM SIGKDD International Conference on Knowledge Discovery and Data MiningGoogle Scholar
  42. Taskar B, Abbeel P, Koller D (2002) Discriminative probabilistic models for relational data. In: Proceedings of Conference on Uncertainty in Artificial Intelligence. Alberta, CanadaGoogle Scholar
  43. Topchy A, Law M, Jain A, Fred A (2004) Analysis of consensus partition in cluster ensemble. In: Proceedings of IEEE International Conference on Data Mining. Brighton, UKGoogle Scholar
  44. Ver Steeg G, Ghosh R, Lerman K (2011) What stops social epidemics? In: Proceedings of AAAI International Conference on Weblogs and Social MediaGoogle Scholar
  45. Von Luxburg U (2007) A tutorial on spectral clustering. Stat Comput 17(4):395–416MathSciNetCrossRefGoogle Scholar
  46. Wang H, Fan W, Yu PS, Han J (2003) Mining concept-drifting data streams using ensemble classifiers. In: Proceedings of ACM SIGKDD International Conference on Knowledge Discovery and Data MiningGoogle Scholar
  47. Wang T, Srivatsa M, Agrawal D, Liu L (2012) Microscopic social influence. In: Proceedings of SIAM International Conference on Data MiningGoogle Scholar
  48. Watts DJ, Strogatz SH (1998) Collective dynamics of ’small-world’ networks. Nature 393(6684):440–442CrossRefGoogle Scholar
  49. Zhao Y, Wang G, Yu PS, Liu S, Zhang S (2013) Inferring social roles and statuses in social networks. In: Proceedings of ACM SIGKDD International Conference on Knowledge Discovery and Data MiningGoogle Scholar
  50. Zhou Y, Liu L (2013) Social influence based clustering of heterogeneous information networks. In: Proceedings of ACM SIGKDD International Conference on Knowledge Discovery and Data MiningGoogle Scholar

Copyright information

© The Author(s) 2015

Authors and Affiliations

  • Sarvenaz Choobdar
    • 1
  • Pedro Ribeiro
    • 1
  • Srinivasan Parthasarathy
    • 2
  • Fernando Silva
    • 1
  1. 1.CRACS and INESC-TEC, Faculdade de Ciencias, Universidade do PortoPortoPortugal
  2. 2.The Ohio State UniversityColumbusUSA

Personalised recommendations