Skip to main content

Identification of Opinion Makers on Twitter

  • Conference paper
  • First Online:
Data Science and Social Research II (DSSR 2019)

Included in the following conference series:

  • 914 Accesses

Abstract

Twitter is a social platform that helps share ideas quickly and concisely. Although the network offers equal rights to post short texts, the attention these messages attract frequently depends on a user’s status in the real world. Thus the tweets of real life high-profile opinion makers will a priori have a higher probability of spurring the interest of society than the messages from the so-called grassroots. The paper elaborates on the developed classifier that detects automatically such opinion makers on Twitter. The approach exploits the Mixed Effect Random Forests method combined with the features engineered from the Twitter data. The accuracy and the sensitivity of the proposed technique outperform the results of the other machine learning classifiers on the out-of-sample data.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 149.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 199.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    http://nlp.uned.es/replab2014/.

  2. 2.

    https://www.clips.uantwerpen.be/pattern.

  3. 3.

    https://pyvideo.org/pydata-la-2018/attacking-clustered-data-with-a-mixed-effects-random-forests-model-in-python.html.

References

  • Chawla, N. V., Bowyer, K. W., Hall, L. O., & Kegelmeyer, W. P. (2002). SMOTE: synthetic minority over-sampling technique. Journal of artificial intelligence research, 16, 321–357.

    Article  Google Scholar 

  • Dhamal, S., Prabuchandran, K. J., & Narahari, Y. (2016). Information diffusion in social networks in two phases. IEEE Transactions on Network Science and Engineering, 3(4), 197–210.

    Article  MathSciNet  Google Scholar 

  • Ferrara, E., Varol, O., Menczer, F., & Flammini, A. (2016). Detection of promoted social media campaigns. In tenth international AAAI conference on web and social media.

    Google Scholar 

  • Ghosh, R., & Lerman, K. (2010). Predicting influential users in online social networks. arXiv preprint arXiv:1005.4882.

  • Ghosh, R., Surachawala, T., & Lerman, K. (2011). Entropy-based classification of ’retweeting’ activity on twitter. arXiv preprint arXiv:1106.0346.

  • Hajjem, A., Bellavance, F., & Larocque, D. (2014). Mixed-effects random forest for clustered data. Journal of Statistical Computation and Simulation, 84(6), 1313–1328.

    Article  MathSciNet  Google Scholar 

  • Huang, Wenhao, Guojie Song, Man Li, Weisong Hu, & Kunqing Xie. (2013). Adaptive weight optimization for classification of imbalanced data. In International Conference on Intelligent Science and Big Data Engineering, pp. 546–553. Berlin, Heidelber: Springer.

    Google Scholar 

  • Hutto, C. J., & Gilbert, E. (2014). Vader: A parsimonious rule-based model for sentiment analysis of social media text. In Eighth international AAAI conference on weblogs and social media.

    Google Scholar 

  • Lahuerta-Otero, E., & Cordero-Gutiérrez, R. (2016). Looking for the perfect tweet. The use of data mining techniques to find influencers on Twitter, Computers in Human Behavior, 64, 575–583.

    Google Scholar 

  • Lematre, G., Nogueira, F., & Aridas, C. K. (2017). Imbalanced-learn: A python toolbox to tackle the curse of imbalanced datasets in machine learning. The Journal of Machine Learning Research, 18(1), 559–563.

    Google Scholar 

  • Ludu, P. S. (2015). Inferring latent attributes of an Indian twitter user using celebrities and class influencers. Proceedings of the 1st ACM Workshop on Social Media World Sensors, 9–15.

    Google Scholar 

  • Nebot, V., Rangel, F., Berlanga, R., & Rosso, P. (2018). Identifying and classifying influencers in twitter only with textual information. International Conference on Applications of Natural Language to Information Systems, 28–39.

    Google Scholar 

  • Puigbo, J. Y., Sánchez-Hernández, G., Casabayó, M., & Agell, N. (2014). Influencer detection approaches in social networks: A current state-of-the-art. CCIA, 261–264.

    Google Scholar 

  • Smedt, T. D., & Daelemans, W. (2012). Pattern for python. Journal of Machine Learning Research, 2063–2067.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Svitlana Galeshchuk .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Galeshchuk, S., Qiu, J. (2021). Identification of Opinion Makers on Twitter. In: Mariani, P., Zenga, M. (eds) Data Science and Social Research II. DSSR 2019. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Cham. https://doi.org/10.1007/978-3-030-51222-4_15

Download citation

Publish with us

Policies and ethics