Skip to main content
Log in

Improving association discovery through multiview analysis of social networks

  • Original Article
  • Published:
Social Network Analysis and Mining Aims and scope Submit manuscript

Abstract

The rise of social networks has brought about a transformative impact on communication and the dissemination of information. However, this paradigm shift has also introduced many challenges in discerning valuable conversation threads amidst fake news, malicious accounts, background noise, and trolling. In this study, we address these challenges by focusing on propagating fake news labels. We evaluate the efficacy of community-based modeling in effectively addressing these challenges within the context of social network discussions using the state-of-the-art benchmark. Through a comprehensive analysis of millions of users engaged in discussions on a specific topic, we unveil compelling evidence demonstrating that community-based modeling techniques yield precision, recall, and accuracy levels comparable to those achieved by lexical classifiers. Remarkably, these promising results are achieved even without considering the textual content of tweets beyond the information conveyed by hashtags. Moreover, we explore the effectiveness of fusion techniques in tweet classification and underscore the superiority of a combined community and lexical approach, which consistently delivers the most robust outcomes and exhibits the highest performance measures. We illustrate this capability with specific network graphs constructed based on Twitter interactions related to the COVID-19 pandemic, showcasing the practicality and relevance of our proposed methodology. To demonstrate the excellent performance achieved with the fusion of modalities, we show an improvement of the combined lexical and community method that achieves up to 60% both for precision and recall measures.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10

Similar content being viewed by others

References

  • Ahmed W, Vidal-Alaball J, Downing J, Seguí FL (2020) Covid-19 and the 5g conspiracy theory: a social network analysis of twitter data. J Med Internet Res 22(5):19458

    Article  Google Scholar 

  • Aynaud T (2020) python-louvain 0.14: Louvain algorithm for community detection. https://github.com/taynaud/python-louvain

  • Baldi P, Brunak S, Chauvin Y, Andersen CA, Nielsen H (2000) Assessing the accuracy of prediction algorithms for classification: an overview. Bioinformatics 16(5):412–424

    Article  CAS  PubMed  Google Scholar 

  • Bansal S (2020) A mutli-task mutlimodal framework for tweet classification based on cnn (grand challenge). In: 2020 IEEE sixth international conference on multimedia big data (BigMM), pp 456–460

  • Bhatia T, Manaskasemsak B, Rungsawang A (2023) Detecting fake news sources on twitter using deep neural network. In: 2023 11th international conference on information and education technology (ICIET), pp 508–512. https://doi.org/10.1109/ICIET56899.2023.10111446

  • Blondel VD, Guillaume J-L, Lambiotte R, Lefebvre E (2008) Fast unfolding of communities in large networks. J Stat Mech Theory Exp 2008(10):10008. https://doi.org/10.1088/1742-5468/2008/10/p10008

    Article  Google Scholar 

  • Bovet A, Makse HA (2019) Influence of fake news in Twitter during the 2016 us presidential election. Nat Commun 10(1):1–14

    Article  ADS  Google Scholar 

  • Bruijn JA, Moel H, Weerts AH, Ruiter MC, Basar E, Eilander D, Aerts JCJH (2020) Improving the classification of flood tweets with contextual hydrological information in a multimodal neural network. Comput Geosci 140:104485

    Article  Google Scholar 

  • Campello RJGB, Moulavi D, Sander J (2013) Density-based clustering based on hierarchical density estimates. In: Pei J, Tseng VS, Cao L, Motoda H, Xu G (eds) Adv Knowl Discov Data Min. Springer, Berlin, Heidelberg, pp 160–172

    Chapter  Google Scholar 

  • Chicco D, Jurman G (2020) The advantages of the Matthews correlation coefficient (MCC) over f1 score and accuracy in binary classification evaluation. BMC Genom 21(1):1–13

    Article  Google Scholar 

  • Dai E, Aggarwal C, Wang S (2021) Nrgnn : learning a label noise resistant graph neural network on sparsely and noisily labeled graphs. In: Proceedings of the 27th ACM SIGKDD conference on knowledge discovery and data mining, pp 227–236

  • Dai E, Wang S (2021) Say no to the discrimination: learning fair graph neural networks with limited sensitive attribute information. In: Proceedings of the 14th ACM international conference on web search and data mining, pp 680–688. Association for Computing Machinery, New York, NY, USA. https://doi.org/10.1145/3437963.3441752

  • Gangireddy SCR, PD, Long C, Chakraborty T (2020) Unsupervised fake news detection: a graph-based approach. In: Proceedings of the 31st ACM conference on hypertext and social media. HT ’20, pp 75–83. Association for Computing Machinery, New York, NY, USA. https://doi.org/10.1145/3372923.3404783

  • Gao W, Li L, Zhu X, Wang Y (2020) Detecting disaster-related tweets via multimodal adversarial neural network. IEEE MultiMed 27(4):28–37

    Article  Google Scholar 

  • Gao D, Li K, Wang R, Shan S, Chen X (2020) Multi-modal graph neural network for joint reasoning on vision and scene text. In: 2020 IEEE/CVF conference on computer vision and pattern recognition (CVPR), pp 12743–12753

  • Geeng C, Yee S, Roesner F (2020) Fake news on Facebook and witter: investigating how people (don’t) investigate. In: Proceedings of the 2020 CHI conference on human factors in computing systems. CHI ’20, pp 1–14. Association for Computing Machinery, New York, NY, USA. https://doi.org/10.1145/3313831.3376784

  • Hamilton W, Ying Z, Leskovec J (2017) Inductive representation learning on large graphs. In: Guyon I, Luxburg UV, Bengio S, Wallach H, Fergus R, Vishwanathan S, Garnett R (Eds.) Advances in neural information processing systems, vol 30. Curran Associates, Inc

  • Indra I, et al. (2016) Using logistic regression method to classify tweets into the selected topics. In: International conference on advanced computer science and information systems (ICACSIS), pp 385–390. IEEE, NY

  • Kumar S et al (2020) An anatomical comparison of fake news and trusted-news sharing patterns on Twitter. Comput Math Organ Theory 27:109–133

    Article  Google Scholar 

  • Lim WL, Ho CC, Ting C (2020) Sentiment analysis by fusing text and location features of geo-tagged tweets. IEEE Access 8:181014–181027

    Article  Google Scholar 

  • Liu Z, Nguyen T-K, ang Y (2021) Tail-gnn : tail-node graph neural networks. In: Proceedings of the 27th ACM SIGKDD conference on knowledge discovery and data mining, pp 1109–1119

  • Magill A, Tomasso M (2020) Fake news twitter data analysis. https://github.com/DataLab12/fakenews

  • Monti F et al. (2019) Fake news detection on social media using geometric deep learning

  • Nguyen DQ, Vu T, Nguyen AT (2020) BERTweet: a pre-trained language model for English tweets. In: Proceedings of the 2020 conference on empirical methods in natural language processing: system demonstrations, pp 9–14

  • Nguyen V-H, Sugiyama K, Nakov P, Kan M-Y (2020) Fang: leveraging social context for fake news detection using graph representation. In: Proceedings of the 29th ACM international conference on information and knowledge management. CIKM ’20, pp 1165–1174. Association for Computing Machinery, New York, NY, USA. https://doi.org/10.1145/3340531.3412046

  • Nogueira L (2020) Social network analysis at scale: graph-based analysis of Twitter trends and communities. Master’s thesis, Texas State University (Dec 2020). https://digital.library.txstate.edu/handle/10877/12933

  • Nogueira L pytwanalysis package. https://pypi.org/project/pytwanalysis/

  • Nogueira L, Tešić J (2021) pytwanalysis: twitter data management and analysis at scale. In: International conference on social network analysis management and security (SNAMS2021). https://emergingtechnet.org/SNAMS2021/

  • Osmundsen M, Bor A, Vahlstrup PB, Benchmann A, Petersen MB (2021) Partisan polarization is the primary psychological motivation behind political fake news sharing on Twitter. Am Political Sci Rev 115(3):999–1015. https://doi.org/10.1017/S0003055421000290

    Article  Google Scholar 

  • Pogorelov K, Schroeder DT, Burchard L, Moe J, Brenner S, Filkukova P, Langguth J (2020) Fake news: coronavirus and 5g conspiracy task at MediaEval 2020. In: Working Notes proceedings of the MediaEval 2020 workshop. MediaEval. http://ceur-ws.org/Vol-2882/

  • Schroeder DT, Pogorelov K, Langguth J (2019) Fact: a framework for analysis and capture of Twitter graphs. In: 2019 sixth international conference on social networks analysis, management and security (SNAMS), pp 134–141. https://doi.org/10.1109/SNAMS.2019.8931870

  • Sha H, Hasan MA, Mohler G, Brantingham PJ (2020) Dynamic topic modeling of the Tcovid-19 Twitter narrative among us governors and cabinet executives. arXiv preprint arXiv:2004.11692

  • Su T (2022) Automatic fake news detection on Twitter. PhD thesis, University of Glasgow

  • Suman C, Naman A, Saha S, Bhattacharyya P (2021) A multimodal author profiling system for tweets. IEEE Trans Comput Soc Syst 8(6):1407–1416

    Article  Google Scholar 

  • Vosoughi S et al (2018) The spread of true and false news online. Science 359(6380):1146–1151

    Article  ADS  CAS  PubMed  Google Scholar 

  • Wang J, Wang Y, Yang Z, Yang L, Guo Y (2021) Bi-gcn: binary graph convolutional network. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition (CVPR), pp 1561–1570

  • Yang X, Deng C, Dang Z, Wei K, Yan J (2021) Selfsagcn: self-supervised semantic alignment for graph convolution network. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition (CVPR), pp 16775–16784

  • Zhou X, Zafarani R (2019) Fake news detection: an interdisciplinary research. In: WWW proceedings, p 1292. ACM, NY

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jelena Tešić.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Shebaro, M., de Moura, L.N. & Tešić, J. Improving association discovery through multiview analysis of social networks. Soc. Netw. Anal. Min. 14, 42 (2024). https://doi.org/10.1007/s13278-023-01197-3

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s13278-023-01197-3

Keywords

Navigation