Abstract
In Chap. 3, we identify some influential factors that have an positive impact on individual retweeting behavior, such as topical relevance, information richness, soical tie strength, etc. One may wonder whether these factors only play an important role in theroy or, are these factors still important when predicting individual retweeting behavior? Furthermoer, to the best of our knowledge, virtually no scholarly effort has been undertaken to figure out the relative importance of those factors when predicting individual retweeting decision. Instead, a large number of features are indiscriminately introduced into the prediction model without examining the relevance of these features. The existence of redundant features not only increases data collection cost, but also tends to generate an overfitted model which predicts poorly on future observations not used in model training, known as the curse of dimensionality. Thus, it is necessary to rank the priority of these factors and find out the dominating ones. To tackle the above problems, we first pick out a specific user to illustrate the feature (also called factor in the monograph) selection process. The results confirm that only a small subset of predictors have an influential impact on individual retweeting behavior. And then, based on a large sample, we commit ourselves to find out factors that are not only important in theory in terms of explaining individual retweeting behavior, but also important in practice in terms of predicting individual retweeting behavior. Finally, we obtain a subset of dominating factors which not only save the cost of collecting trivial features but also improve the prediction performance to some extent, under certain classification algorithms such as support vector classification (SVC) or logistic.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
- 2.
0.00–0.19: very weak; 0.20–0.39: weak; 0.40–0.59: moderate; 0.60–0.79: strong; 0.80–1.00: very strong.
References
Zhang, J., Tang, J., Li, J., Liu, Y., Xing, C.: Who influenced you? Predicting retweet via social influence locality. ACM Trans. Knowl. Disc. Data (TKDD) 9(3), 25 (2014)
Xu, Z., Yang, Q.: Analyzing user retweet behavior on twitter. In: Proceedings of the 2012 International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2012), pp. 46–50. IEEE Computer Society (2012)
Tang, X., Miao, Q., Quan, Y., Tang, J., Deng, K.: Predicting individual retweet behavior by user similarity: a multi-task learning approach. Knowl.-Based Syst. 89, 681–688 (2015)
Deerwester, S., Dumais, S.T., Furnas, G.W., Landauer, T.K., Harshman, R.: Indexing by latent semantic analysis. J. Am. Soc. Inform. Sci. 41(6), 391 (1990)
Hong, L., Davison, B.D.: Empirical study of topic modeling in twitter. In: Proceedings of the First Workshop on Social Media Analytics, pp. 80–88. ACM (2010)
Xu, Z., Lu, R., Xiang, L., Yang, Q.: Discovering user interest on twitter with a modified author-topic model. In: 2011 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT), vol. 1, pp. 422–429. IEEE (2011)
Feng, W., Wang, J.: Retweet or not?: personalized tweet re-ranking. In: Proceedings of the sixth ACM international conference on Web search and data mining, pp. 577–586. ACM (2013)
Macskassy, S.A., Michelson, M.: Why do people retweet? Anti-homophily wins the day! In: Proceedings of the Fifth International AAAI Conference on Weblogs and Social Media, pp. 209–216 (2011)
Stieglitz, S., Dang-Xuan, L.: Emotions and information diffusion in social media–sentiment of microblogs and sharing behavior. J. Manag. Inform. Syst. 29(4), 217–248 (2013)
James, G., Witten, D., Hastie, T., Tibshirani, R.: An Introduction to Statistical Learning. Springer (2013)
Weng, J., Lim, E.P., Jiang, J., He, Q.: Twitterrank: finding topic-sensitive influential twitterers. In: Proceedings of the third ACM International Conference on Web Search and Data Mining, pp. 261–270. ACM (2010)
Leavitt, A., Burchard, E., Fisher, D., Gilbert, S.: The influentials: new approaches for analyzing influence on twitter. Web Ecol. Project 4(2), 1–18 (2009)
Dash, M., Liu, H.: Feature selection for classification. Intell. Data Anal. 1(1–4), 131–156 (1997)
Song, L., Smola, A., Gretton, A., Borgwardt, K.M., Bedo, J.: Supervised feature selection via dependence estimation. In: Proceedings of the 24th International Conference on Machine Learning, pp. 823–830. ACM (2007)
Weston, J., Elisseeff, A., Schölkopf, B., Tipping, M.: Use of the zero-norm with linear models and kernel methods. J. Mach. Learn. Res. 3, 1439–1461 (2003)
Mitra, P., Murthy, C., Pal, S.K.: Unsupervised feature selection using feature similarity. IEEE Trans. Pattern Anal. Mach. Intell. 24(3), 301–312 (2002)
Dy, J.G., Brodley, C.E.: Feature selection for unsupervised learning. J. Mach. Learn. Res. 5, 845–889 (2004)
Xu, Z., King, I., Lyu, M.R.T., Jin, R.: Discriminative semi-supervised feature selection via manifold regularization. IEEE Trans. Neural Netw. 21(7), 1033–1047 (2010)
Zhao, Z., Liu, H.: Semi-supervised feature selection via spectral analysis. In: Proceedings of the 2007 SIAM International Conference on Data Mining, pp. 641–646. SIAM (2007)
Cohen, J.: Statistical Power Analysis for the Behavioral Sciences. Academic Press (2013)
Evans, J.D.: Straightforward Statistics for the Behavioral Sciences. Brooks/Cole (1996)
Deutsch, M., Gerard, H.B.: A study of normative and informational social influences upon individual judgment. J. Abnormal Soc. Psychol. 51(3), 629 (1955)
Peng, H., Long, F., Ding, C.: Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy. IEEE Trans. Pattern Anal. Mach. Intell. 27(8), 1226–1238 (2005)
Ivakhnenko, A., Ivakhnenko, G.: The review of problems solvable by algorithms of the group method of data handling (GMDH). Pattern Recogn. Image Anal. C/C Of Raspoznavaniye Obrazov I Analiz Izobrazhenii 5, 527–535 (1995)
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
Copyright information
© 2020 The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this chapter
Cite this chapter
Shi, J., Lai, K.K., Chen, G. (2020). Dominating Factors Affecting Individual Retweeting Behavior. In: Individual Retweeting Behavior on Social Networking Sites. Springer, Singapore. https://doi.org/10.1007/978-981-15-7376-7_4
Download citation
DOI: https://doi.org/10.1007/978-981-15-7376-7_4
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-7375-0
Online ISBN: 978-981-15-7376-7
eBook Packages: Literature, Cultural and Media StudiesLiterature, Cultural and Media Studies (R0)