On the role of conductance, geography and topology in predicting hashtag virality

  • Siddharth Bora
  • Harvineet Singh
  • Anirban Sen
  • Amitabha Bagchi
  • Parag Singla
Original Article
  • 294 Downloads

Abstract

We focus on three aspects of the early spread of a hashtag in order to predict whether it will go viral: the network properties of the subset of users tweeting the hashtag, its geographical properties, and, most importantly, its conductance-related properties. One of our significant contributions is to discover the critical role played by the conductance-based features for the successful prediction of virality. More specifically, we show that the second derivative of the conductance gives an early indication of whether the hashtag is going to go viral or not. We present a detailed experimental evaluation of the effect of our various categories of features on the virality prediction task. When compared to the baselines and the state-of-the-art techniques proposed in the literature our feature set is able to achieve significantly better accuracy on a large dataset of 7.7 million users and all their tweets over a period of month, as well as on existing datasets.

Keywords

Trend prediction Information diffusion Virality  Classification Twitter social network 

References

  1. Agarwal P (2013) Prediction of trends in online social network. Master’s thesis, Indian Institute of Technology, DelhiGoogle Scholar
  2. Aral S, Walker D (2011) Creating social contagion through viral product design: a randomized trial of peer influence in networks. Manag Sci 57(9):1623–1639CrossRefGoogle Scholar
  3. Ardon S, Bagchi A, Mahanti A, Ruhela A, Seth A, Tripathy RM, Triukose S (2013) Spatio-temporal and events based analysis of topic popularity in Twitter. In: Proceedings of 22nd ACM international conference on information and knowledge management (CIKM 2013). ACM, pp 219–228Google Scholar
  4. Berger J, Milkman KL (2012) What makes online content viral? J Mark Res 49(2):192–205CrossRefGoogle Scholar
  5. Bright P (2011) How the London riots showed us two sides of social networking. http://arstechnica.com/. Accessed 11 Aug 2011
  6. Cheng J, Adamic LA, Dow PA, Kleinberg JM, Leskovec J (2014) Can cascades be predicted? In: Proceedings of 23rd international World Wide Web conference (WWW ’14), pp 925–936Google Scholar
  7. Chierichetti F, Lattanzi S, Panconesi A (2010) Almost tight bounds for rumour spreading with conductance. In: Proceedings of 42nd ACM symposium on theory of computing (STOC ’10), pp 399–408Google Scholar
  8. Davis J, Goadrich M (2006) The relationship between precision-recall and roc curves. In: Proceedings of the 23rd international conference on machine learning. ACM, pp 233–240Google Scholar
  9. Drummond C, Holte RC (2003) C4.5, class imbalance, and cost sensitivity: why under-sampling beats over-sampling. In: ICML workshop on learning from imbalanced datasets, pp 1–8Google Scholar
  10. Ghosh R, Lerman K (2011) A framework for quantitative analysis of cascades on networks. In: Proceedings of 4th ACM international conference on Web search and data mining (WSDM ’11), pp 665–674Google Scholar
  11. Gleich DF, Seshadhri C (2012) Vertex neighborhoods, low conductance cuts, and good seeds for local community methods. In: Proceedings of 18th ACM SIGKDD international conference on knowledge discovery and data mining (KDD ’12), pp 597–605Google Scholar
  12. Guerini M, Strapparava C, Özbal G (2011) Exploring text virality in social networks. In: Proceedings of international AAAI conference on weblogs and social media (ICWSM 2011)Google Scholar
  13. Guerini M, Staiano J, Albanese D (2013) Exploring image virality in Google Plus. In: Proceedings of ASE/IEEE international conference on social computing (SocialCom 2013), pp 671–678Google Scholar
  14. Guruswami V (2000) Rapidly mixing Markov chains: a comparison of techniques. http://www.cs.cmu.edu/~venkatg/pubs/pubs.html. Accessed 20 Dec 2014
  15. Hansen LK, Arvidsson A, Nielsen FÅ, Colleoni E, Etter M (2011) Good friends, bad news-affect and virality in twitter. In: Future information technology. Springer, pp 34–43Google Scholar
  16. Harenberg S, Bello G, Gjeltema L, Ranshous S, Harlalka J, Seay R, Padmanabhan K, Samatova N (2014) Community detection in large-scale networks: a survey and empirical evaluation. WIREs Comput Stat 6:426–439CrossRefGoogle Scholar
  17. Jenders M, Kasneci G, Naumann F (2013) Analyzing and predicting viral tweets. In: WWW (companion volume), pp 657–664Google Scholar
  18. Jerrum MR, Sinclair AJ (1989) Approximating the permanent. SIAM J Comput 18:1149–1178MATHMathSciNetCrossRefGoogle Scholar
  19. Kitsak M, Gallos LK, Havlin S, Liljeros F, Muchnik L, Stanley HE, Makse HA (2010) Identification of influential spreaders in complex networks. Nat Phys 6(11):888–893CrossRefGoogle Scholar
  20. Kulshrestha J, Kooti F, Nikravesh A, Gummadi KP (2012) Geographic dissection of the Twitter network. In: Proceedings of ICWSM 2012Google Scholar
  21. Kwak H, Lee C, Park H, Moon S (2010) What is Twitter, a social network or a news media? In: Proceedings of 19th international conference on World Wide Web (WWW ’10), pp 591–600Google Scholar
  22. Lerman K, Hogg T (2010) Using a model of social dynamics to predict popularity of news. In: Proceedings of 19th international conference on World Wide Web (WWW ’10). ACM, pp 621–630Google Scholar
  23. Leskovec J, Backstrom L, Kleinberg J (2009) Meme-tracking and the dynamics of the news cycle. In: Proceedings of KDD ’09. ACM, pp 497–506Google Scholar
  24. Leskovec J, Lang KJ, Mahoney MW (2010) Empirical comparison of algorithms for network community detection. In: Proceedings of 19th international conference on World Wide Web (WWW ’10), pp 631–640Google Scholar
  25. Ma Z, Sun A, Cong G (2013) On predicting the popularity of newly emerging hashtags in twitter. J Assoc Inf Sci Technol 64(7):1399–1410CrossRefGoogle Scholar
  26. McGee J, Caverlee J, Cheng Z (2013) Location prediction in social media based on tie strength. In: Proceedings of 22nd ACM international conference on information and knowledge management (CIKM 2013), pp 459–468Google Scholar
  27. Myers SA, Zhu C, Leskovec J (2012) Information diffusion and external influence in networks. In: Proceedings of KDD ’12, pp 33–41Google Scholar
  28. Oh O, Agrawal M, Rao HR (2013) Rumor and communication in Asia in the internet age, chap 8. Taylor and Francis, LondonGoogle Scholar
  29. Rajyalakshmi S, Bagchi A, Das S, Tripathy RM (2012) Topic diffusion and emergence of virality in social networks. arXiv:1202.2215. Accessed 12 Feb 2015
  30. Romero DM, Meeder B, Kleinberg J (2011) Differences in the mechanics of information diffusion across topics: idioms, political hashtags, and complex contagion on twitter. In: Proceedings of 20th international conference on World Wide Web (WWW ’11), pp 695–704Google Scholar
  31. Suh B, Hong L, Pirolli P, Chi EH (2010) Want to be retweeted? large scale analytics on factors impacting retweet in twitter network. In: IEEE/ASE SocialCom 2010. IEEE, pp 177–184Google Scholar
  32. Szabo G, Huberman BA (2010) Predicting the popularity of online content. Commun ACM 53(8):80–88CrossRefGoogle Scholar
  33. Totti LC, Costa FA, de Avila SEF, Valle E, Jr WM, Almeida V (2014) The impact of visual attributes on online image diffusion. In: Proceedings of ACM Web science conference (WebSci ’14), pp 42–51Google Scholar
  34. Weng L, Flammini A, Vespignani A, Menczer F (2012) Competitions among topics in a world with limited attention. Sci Rep 2, Article number 335Google Scholar
  35. Weng L, Menczer F, Ahn YY (2013) Virality prediction and community structure in social networks. Sci Rep 3, Article number 2522Google Scholar
  36. Weng L, Menczer F, Ahn YY (2014) Predicting successful memes using network and community structure. In: 8th international AAAI conference on weblogs and social media (ICWSM 2014)Google Scholar
  37. Wu F, Huberman BA (2007) Novelty and collective attention. Proc Natl Acad Sci USA 104(45):17599–17601CrossRefGoogle Scholar
  38. Zaman T, Fox EB, Bradlow ET (2014) A Bayesian approach for predicting the popularity of tweets. Ann Appl Stat 8(3):1583–1611MATHMathSciNetCrossRefGoogle Scholar

Copyright information

© Springer-Verlag Wien 2015

Authors and Affiliations

  • Siddharth Bora
    • 1
  • Harvineet Singh
    • 1
  • Anirban Sen
    • 1
  • Amitabha Bagchi
    • 1
  • Parag Singla
    • 1
  1. 1.Indian Institute of TechnologyNew DelhiIndia

Personalised recommendations