Abstract
Easy propagation and access to information on the web has the potential to become a serious issue when it comes to disinformation. The term “fake news” describes the intentional propagation of news with the intention to mislead and harm the public and has gained more attention recently. This paper proposes a style-based Machine Learning (ML) approach, which relies on the textual information from news, such as manually extracted lexical features e.g. part of speech counts, and evaluates the performance of several ML algorithms. We identified a subset of the best performing linguistic features, using information-based metrics, which tend to agree with the literature. We also, combined Named Entity Recognition (NER) functionality with the Frequent Pattern (FP) Growth association rule algorithm to gain a deeper perspective of the named entities used in the two classes. Both methods reinforce the claim that fake and real news have limited differences in content, setting limitations to style-based methods. Results showed that convolutional neural networks resulted in the best accuracy, outperforming the rest of the algorithms.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Alzanin, S.M., Azmi, A.M.: Detecting rumors in social media: a survey. Procedia Comp. Sci. 142, 294–300 (2018)
Ghafari, S.M., Tjortjis, C.: A survey on association rules mining using heuristics. WIREs Data Min. Knowl. Disc. 9(4), e1307 (2019)
Golbeck, J., et al.: Fake news vs satire: a dataset and analysis. In Proceedings of 10th ACM Conference on Web Science, pp. 17–21 (2018)
Gravanis, G., Vakali, A., Diamantaras, K., Karadais, P.: Behind the cues: a benchmarking study for fake news detection. Expert Syst. Appl. 128, 201–213 (2019)
Horne, B. D., Adali, S., Sikdar, S.: Identifying the social signals that drive online discussions: a case study of reddit communities. In 26th IEEE International Conference on Computer Communication and Networks (ICCCN), pp. 1–9 (2017)
Khan, J.Y., Khondaker, M., Islam, T., Iqbal, A., Afroz, S.: A benchmark study on machine learning methods for fake news detection. arXiv preprint arXiv:1905.04749 (2019)
Kiesel, J., et al.: Semeval-2019 task 4: hyperpartisan news detection. In: Proceedings of 13th International Workshop on Semantic Evaluation, pp. 829–839 (2019)
Koukaras, P., Tjortjis, C., Rousidis, D.: Social media types: introducing a data driven taxonomy. Computing 102(1), 295–340 (2020). https://doi.org/10.1007/s00607-019-00739-y
Liu, Y., Xu, S.: Detecting rumors through modeling information propagation networks in a social media environment. IEEE Trans. Comput. Soc. Syst. 3(2), 46–62 (2016)
Orso, D., Federici, N., Copetti, R., Vetrugno, L., Bove, T.: Infodemic and the spread of fake news in the COVID-19-era. Eur. J. Emerg. Med. (2020)
Pérez-Rosas, V., Kleinberg, B., Lefevre, A., Mihalcea, R.: Automatic detection of fake news. arXiv preprint arXiv:1708.07104 (2017)
Petty, R.E., Cacioppo, J.T.: The elaboration likelihood model of persuasion. In: Communication and Persuasion, pp. 1–24. Springer, New York (1986). https://doi.org/10.1007/978-1-4612-4964-1_1
Potthast, M., Kiesel, J., Reinartz, K., Bevendorff, J., Stein, B.: A stylometric inquiry into hyperpartisan and fake news. arXiv preprint arXiv:1702.05638 (2017)
Reis, J.C., Correia, A., Murai, F., Veloso, A., Benevenuto, F.: Supervised learning for fake news detection. IEEE Intell. Syst. 34(2), 76–81 (2019)
Rousidis, D., Koukaras, P., Tjortjis, C.: Social media prediction a literature review. Multimedia Tools Appl. 79(9–10), 6279–6311 (2020)
Rubin, V. L., Conroy, N., Chen, Y., Cornwell, S.: Fake news or truth? Using satirical cues to detect potentially misleading news. In: Proceedings of 2nd Workshop Computational Approaches to Deception Detection, pp. 7–17 (2016)
Ruchansky, N., Seo, S., Liu, Y.: CSI: a hybrid deep model for fake news detection. In: Proceedings of 2017 ACM Conference on Information and Knowledge Management, pp. 797–806 (2017)
Shahsavari, S., Holur, P., Tangherlini, T. R., Roychowdhury, V.: Conspiracy in the time of corona: automatic detection of covid-19 conspiracy theories in social media and the news. arXiv preprint arXiv:2004.13783 (2020)
Sharma, K., Qian, F., Jiang, H., Ruchansky, N., Zhang, M., Liu, Y.: Combating fake news: a survey on identification and mitigation techniques. ACM Trans. Intell. Syst. Technol. (TIST) 10(3), 1–42 (2019)
Shu, K., Mahudeswaran, D., Wang, S., Lee, D., Liu, H.: FakeNewsNet: a data repository with news content, social context and dynamic information for studying fake news on social media arXiv:1809.01286 (2018)
Tsiara, E., Tjortjis, C.: Using Twitter to predict chart position for songs. In: Proceedings of 16th IFIP International Conference on Artificial Intelligence Applications and Innovations, pp. 62–72 (2020)
Tversky, A., Kahneman, D.: Judgment under uncertainty: heuristics and biases. Science 185(4157), 1124–1131 (1974)
Wang, L.X., Ramachandran, A., Chaintreau, A.: Measuring click and share dynamics on social media: a reproducible and validated approach. In: 10th International AAAI Conference on Web and Social Media (2016)
Wu, L., Li, J., Hu, X., Liu, H.: Gleaning wisdom from the past: early detection of emerging rumors in social media. In: Proceedings 2017 SIAM International Conference on Data Mining, pp. 99–107 (2017)
Zhang, Y., Wallace, B.: A sensitivity analysis of (and practitioners’ guide to) convolutional neural networks for sentence classification. arXiv preprint arXiv:1510.03820 (2015)
Zhou, L., Twitchell, D.P., Qin, T., Burgoon, J.K., Nunamaker, J.F.: An exploratory study into deception detection in text-based computer-mediated communication. In: Proceedings of 36th IEEE International Conference on System Sciences, p. 10 (2003)
Zhou, X., Zafarani, R., Shu, K., Liu, H.: Fake news: fundamental theories, detection strategies and challenges. In: Proceedings of 12th ACM International Conference on Web Search and Data Mining, pp. 836–837 (2019)
Zubiaga, A., Aker, A., Bontcheva, K., Liakata, M., Procter, R.: Detection and resolution of rumours in social media: a survey. ACM Comput. Surv. 51(2), 1–36 (2018)
Acknowledgments
The authors would like to thank the Hellenic Artificial Intelligence Society (EETN) for covering part of their expenses to participate in AIAI 2021.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 IFIP International Federation for Information Processing
About this paper
Cite this paper
Kasseropoulos, D.P., Tjortjis, C. (2021). An Approach Utilizing Linguistic Features for Fake News Detection. In: Maglogiannis, I., Macintyre, J., Iliadis, L. (eds) Artificial Intelligence Applications and Innovations. AIAI 2021. IFIP Advances in Information and Communication Technology, vol 627. Springer, Cham. https://doi.org/10.1007/978-3-030-79150-6_51
Download citation
DOI: https://doi.org/10.1007/978-3-030-79150-6_51
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-79149-0
Online ISBN: 978-3-030-79150-6
eBook Packages: Computer ScienceComputer Science (R0)