Abstract
Popularity prediction in microblogging networks aims to predict the future popularity of a tweet from observations made in its early stages. Existing studies have investigated many features for prediction. However, features of the users who have the potential to retweet a tweet have not been fully explored for this problem, and the impact of a tweet's post time on its early-stage popularity has been neglected. To address these issues, we study two prediction tasks in this paper: predicting the popularity of a tweet from the observations made in the first hour after it is posted (PP1H), and predicting it from the observation of its first k retweets (PPkR). We investigate a wide spectrum of features to identify the effective ones for each task. We extract structural features, including retweet network features and border network features, from the underlying user network, and temporal features from the observed retweets. To mitigate the impact of a tweet's post time on its early-stage popularity, we introduce the notion of tweet time and use it to measure the temporal features. We treat both prediction tasks as classification problems and apply five standard classifiers (naive Bayes, k-nearest-neighbor, support vector machine, logistic regression and bagging decision trees). Experiments on Sina Weibo show that, for the PP1H task, bagging decision trees with all features yield the best performance, and border network features outperform the other feature groups. For the PPkR task, we find that satisfactory prediction performance can be obtained using only the temporal features of the first 10 retweets. Furthermore, introducing tweet time significantly improves the prediction performance of the temporal features.
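As a rough illustration of the PPkR setting, the sketch below computes a few temporal features from the first k retweets of a single tweet. The function name `temporal_features` and the four features it returns are hypothetical choices made for this example and are not taken from the paper. The abstract does not define tweet time, so the sketch measures everything in plain wall-clock seconds.

```python
from statistics import mean


def temporal_features(post_time, retweet_times, k=10):
    """Summarize the timing of the first k retweets of one tweet.

    post_time and retweet_times are timestamps in seconds (an assumption
    of this sketch). Returns None if fewer than k retweets were observed.
    """
    first_k = sorted(retweet_times)[:k]
    if len(first_k) < k:
        return None

    # Gaps between consecutive events, starting from the original post.
    events = [post_time] + first_k
    gaps = [later - earlier for earlier, later in zip(events, events[1:])]

    # Illustrative feature set only; the paper's actual features may differ.
    return {
        "time_to_first": gaps[0],
        "time_to_kth": first_k[-1] - post_time,
        "mean_gap": mean(gaps),
        "max_gap": max(gaps),
    }


if __name__ == "__main__":
    # A tweet posted at t=0 with four observed retweets, using k=4.
    print(temporal_features(0, [30, 60, 120, 240], k=4))
```

A feature dictionary of this kind could then be passed, together with a popularity class label, to any of the five classifiers named in the abstract.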
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Gao, S., Ma, J., Chen, Z. (2014). Popularity Prediction in Microblogging Network. In: Chen, L., Jia, Y., Sellis, T., Liu, G. (eds) Web Technologies and Applications. APWeb 2014. Lecture Notes in Computer Science, vol 8709. Springer, Cham. https://doi.org/10.1007/978-3-319-11116-2_33
DOI: https://doi.org/10.1007/978-3-319-11116-2_33
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-11115-5
Online ISBN: 978-3-319-11116-2
eBook Packages: Computer Science, Computer Science (R0)