Abstract
Twitter is a social platform that helps share ideas quickly and concisely. Although the network offers equal rights to post short texts, the attention these messages attract frequently depends on a user’s status in the real world. Thus the tweets of real life high-profile opinion makers will a priori have a higher probability of spurring the interest of society than the messages from the so-called grassroots. The paper elaborates on the developed classifier that detects automatically such opinion makers on Twitter. The approach exploits the Mixed Effect Random Forests method combined with the features engineered from the Twitter data. The accuracy and the sensitivity of the proposed technique outperform the results of the other machine learning classifiers on the out-of-sample data.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Chawla, N. V., Bowyer, K. W., Hall, L. O., & Kegelmeyer, W. P. (2002). SMOTE: synthetic minority over-sampling technique. Journal of artificial intelligence research, 16, 321–357.
Dhamal, S., Prabuchandran, K. J., & Narahari, Y. (2016). Information diffusion in social networks in two phases. IEEE Transactions on Network Science and Engineering, 3(4), 197–210.
Ferrara, E., Varol, O., Menczer, F., & Flammini, A. (2016). Detection of promoted social media campaigns. In tenth international AAAI conference on web and social media.
Ghosh, R., & Lerman, K. (2010). Predicting influential users in online social networks. arXiv preprint arXiv:1005.4882.
Ghosh, R., Surachawala, T., & Lerman, K. (2011). Entropy-based classification of ’retweeting’ activity on twitter. arXiv preprint arXiv:1106.0346.
Hajjem, A., Bellavance, F., & Larocque, D. (2014). Mixed-effects random forest for clustered data. Journal of Statistical Computation and Simulation, 84(6), 1313–1328.
Huang, Wenhao, Guojie Song, Man Li, Weisong Hu, & Kunqing Xie. (2013). Adaptive weight optimization for classification of imbalanced data. In International Conference on Intelligent Science and Big Data Engineering, pp. 546–553. Berlin, Heidelber: Springer.
Hutto, C. J., & Gilbert, E. (2014). Vader: A parsimonious rule-based model for sentiment analysis of social media text. In Eighth international AAAI conference on weblogs and social media.
Lahuerta-Otero, E., & Cordero-Gutiérrez, R. (2016). Looking for the perfect tweet. The use of data mining techniques to find influencers on Twitter, Computers in Human Behavior, 64, 575–583.
Lematre, G., Nogueira, F., & Aridas, C. K. (2017). Imbalanced-learn: A python toolbox to tackle the curse of imbalanced datasets in machine learning. The Journal of Machine Learning Research, 18(1), 559–563.
Ludu, P. S. (2015). Inferring latent attributes of an Indian twitter user using celebrities and class influencers. Proceedings of the 1st ACM Workshop on Social Media World Sensors, 9–15.
Nebot, V., Rangel, F., Berlanga, R., & Rosso, P. (2018). Identifying and classifying influencers in twitter only with textual information. International Conference on Applications of Natural Language to Information Systems, 28–39.
Puigbo, J. Y., Sánchez-Hernández, G., Casabayó, M., & Agell, N. (2014). Influencer detection approaches in social networks: A current state-of-the-art. CCIA, 261–264.
Smedt, T. D., & Daelemans, W. (2012). Pattern for python. Journal of Machine Learning Research, 2063–2067.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Galeshchuk, S., Qiu, J. (2021). Identification of Opinion Makers on Twitter. In: Mariani, P., Zenga, M. (eds) Data Science and Social Research II. DSSR 2019. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Cham. https://doi.org/10.1007/978-3-030-51222-4_15
Download citation
DOI: https://doi.org/10.1007/978-3-030-51222-4_15
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-51221-7
Online ISBN: 978-3-030-51222-4
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)