Science China Information Sciences

, 61:092101 | Cite as

Inferring diffusion networks with life stage heterogeneity

Research Paper
  • 68 Downloads

Abstract

A network inference problem focuses on discovering the structure of a diffusion network from observed cascades. This problem is significantly more challenging in several settings in which this type of an inference is desirable or necessary because of heterogeneity in the diffusion process. The heterogeneity of the diffusion process in different life stages results in the inaccuracy of a common assumption of constant influence strength. In this study, a Life Stage Heuristic (LSH) method is proposed to model life stage heterogeneity by decoupling the popularity level of an item under propagation from a true strength of social ties to improve inference accuracy. The proposed LSH is incorporated into almost all existing state-of-the-art network inference algorithms to improve estimation accuracy with only minimal changes in the implementation and maintaining the same running time. Additionally, NetRate, NetInf, and ConNIe are used as three examples to demonstrate the power of the proposed method. Furthermore, clustering of cascades prior to the LSH is proposed to eliminate noise, and the optimized method is termed as Clustered Life Stage Heuristic (CLSH). Extensive experiments on synthetic and real world datasets indicate that both LSH and CLSH methods significantly improve the accuracy of network inference.

Keywords

network inference life stage heterogeneity social influence information diffusion clustering cascade 

Notes

Acknowledgements

This work was supported by National Natural Science Foundation of China (Grant No. 61572041), Beijing Natural Science Foundation (Grant No. 4152023), and National High Technology Research and Development Program of China (863 Program) (Grant No. 2014AA015103).

References

  1. 1.
    Domingos P, Richardson M. Mining the network value of customers. In: Proceedings of ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, 2001. 57–66Google Scholar
  2. 2.
    Kempe D, Kleinberg J, Tardos E. Maximizing the spread of influence through a social network. In: Proceedings of ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Washington, 2003. 137–146MATHGoogle Scholar
  3. 3.
    Leskovec J, Adamic L A, Huberman B A. The dynamics of viral marketing. In: Proceedings of ACM Transactions on the Web, New York, 2007. 1–5Google Scholar
  4. 4.
    Adar E, Zhang L, Adamic L, et al. Implicit structure and the dynamics of blogspace. In: Proceedings of Workshop on the Weblogging Ecosystem, New York, 2004. 13: 16989–16995Google Scholar
  5. 5.
    Gruhl D, Guha R, Liben-Nowell D, et al. Information diffusion through blogspace. In: Proceedings of the International Conference on World Wide Web, New York, 2004. 491–501Google Scholar
  6. 6.
    Leskovec J, Kleinberg J, Faloutsos C. Graphs over time: densification laws, shrinking diameters and possible explanations. In: Proceedings of ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Chicago, 2005. 177–187Google Scholar
  7. 7.
    Dong X L, Berti-Equille L, Srivastava D. Truth discovery and copying detection in a dynamic world. Proc VLDB Endow, 2009, 2: 562–573CrossRefGoogle Scholar
  8. 8.
    Daneshmand H, Rodriguez M G, Song L, et al. Estimating diffusion network structures: Recovery conditions, sample complexity and soft-thresholding algorithm. In: Proceedings of International Conference on Machine Learning, Beijing, 2014. 793–801Google Scholar
  9. 9.
    Du N, Song L, Woo H, et al. Uncover topic-sensitive information diffusion networks. In: Proceedings of International Conference on Artificial Intelligence and Statistics, Scottsdale, 2013. 229–237Google Scholar
  10. 10.
    Du N, Song L, Yuan S, et al. Learning networks of heterogeneous influence. In: Proceedings of Neural Information Processing Systems Conference, Lake Tahoe, 2012. 2780–2788Google Scholar
  11. 11.
    Rodriguez M G, Balduzzi D, Schölkopf B. Uncovering the temporal dynamics of diffusion networks. In: Proceedings of International Conference on Machine Learning, Bellevue, 2011. 561–568Google Scholar
  12. 12.
    Rodriguez M G, Leskovec J, Krause A. Inferring networks of diffusion and influence. In: Proceedings of ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Beijing, 2012. 5: 21Google Scholar
  13. 13.
    Wang S, Hu X, Yu P S, et al. Mmrate: inferring multi-aspect diffusion networks with multi-pattern cascades. In: Proceedings of ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Beijing, 2014. 1246–1255Google Scholar
  14. 14.
    Yang S H, Zha H. Mixture of mutually exciting processes for viral diffusion. In: Proceedings of International Conference on Machine Learning, Atlanta, 2013. 28: 1–9Google Scholar
  15. 15.
    Zhou K, Zha H, Song L. Learning social infectivity in sparse low-rank network using multi-dimensional hawkes processes. In: Proceedings of International Conference on Machine Learning, Atlanta, 2013. 31: 641–649Google Scholar
  16. 16.
    Dou P, Du S Z, Song G J. Inferring diffusion network on incomplete cascade data. In: Proceedings of International Conference on Web-Age Information Management, Nanchang, 2016. 325–337Google Scholar
  17. 17.
    Pasumarthi R K, Karthik S, Choure A, et al. Online network inference under dynamic cascade updates: A node-centric approach. In: Proceedings of International Conference on Communication Systems and Networks, Bangalore, 2014. 1–6Google Scholar
  18. 18.
    Rodriguez M G, Scholkopf B. Submodular inference of diffusion networks from multiple trees. In: Proceedings of International Conference on Machine Learning, Edinburgh, 2012. 1019–1028Google Scholar
  19. 19.
    Myers S A, Leskovec J. On the convexity of latent social network inference. In: Proceedings of Neural Information Processing Systems Conference, Whistler, 2010. 1741–1749Google Scholar
  20. 20.
    Yang J, Leskovec J. Patterns of temporal variation in online media. In: Proceedings of International Conference on Web Search and Data Mining, New York, 2011. 177–186Google Scholar
  21. 21.
    Matsubara Y, Sakurai Y, Prakash B A, et al. Rise and fall patterns of information diffusion: Model and implications. In: Proceedings of ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Beijing, 2012. 6–14Google Scholar
  22. 22.
    Evans J D. Straightforward Statistics for the Behavioral Sciences. California: Brooks/Cole Publishing, 1996Google Scholar
  23. 23.
    Leskovec J, Chakrabarti D, Kleinberg J, et al. Kronecker graphs: an approach to modeling networks. J Mach Learn Res, 2010, 11: 985–1042MATHMathSciNetGoogle Scholar
  24. 24.
    Leskovec J, Backstrom L, Kleinberg J. Meme-tracking and the dynamics of the news cycle. In: Proceedings of ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Paris, 2009. 497–506Google Scholar

Copyright information

© Science China Press and Springer-Verlag GmbH Germany, part of Springer Nature 2017

Authors and Affiliations

  1. 1.Key Laboratory of Machine Perception (MOE)Peking UniversityBeijingChina
  2. 2.Computer Science DepartmentUniversity of Southern CaliforniaLos AngelesUSA

Personalised recommendations