Abstract
Fake news is a growing challenge for social networks and media. Detection of fake news always has been a problem for many years, but the evolution of social networks and increasing speed of news dissemination in recent years has been considered again. There are several approaches to solving this problem, one of which is to detect fake news based on its text style using deep neural networks. In recent years, transfer learning with transformers is one of the most used forms of deep neural networks for natural language processing. BERT is one of the most promising transformers that outperforms other models in many NLP benchmarks. In this article, we introduce MWPBert, which uses two parallel BERT networks to perform veracity detection on full-text news articles. One of the BERT networks encodes news headline, and another encodes news bodies. Since the input length of the BERT network is limited and constant and the news body is usually a long text, we cannot feed the whole text into the BERT. Therefore, using the MaxWorth algorithm, we selected the part of the news text that is more valuable for fact-checking, and fed it into the BERT network. Finally, we encode the output of the two BERT networks to an output network to classify the news. The experiment results showed that the proposed model outperformed previous models regarding accuracy and other performance measures.
Similar content being viewed by others
Data Availability
The datasets analysed during the current study are available in this repository: https://github.com/KaiDMML/FakeNewsNet
Code Availability
The code are available from the corresponding author on reasonable request.
References
Abouelenien M, Pérez-Rosas V, Zhao B et al. (2017) Gender-based multimodal deception detection. In: Proceedings of the ACM Symposium on Applied Computing, vol Part F1280. Association for Computing Machinery, New York, New York, USA, pp 137–144. https://doi.org/10.1145/3019612.3019644, http://dl.acm.org/citation.cfm?doid=3019612.3019644
Albawi S, Mohammed TA, Al-Zawi S (2018) Understanding of a convolutional neural network. In: Proceedings of 2017 International Conference on Engineering and Technology, ICET 2017, Ieee, pp 1–6. https://doi.org/10.1109/ICEngTechnol.2017.8308186
Bahdanau D, Cho KH, Bengio Y (2015) Neural machine translation by jointly learning to align and translate. In: 3rd International Conference on Learning Representations, ICLR 2015 - Conference Track Proceedings. arXiv:1409.0473
Bordes A, Usunier N, Garcia-Durán A et al. (2013) Translating embeddings for modeling multi-relational data. In: Advances in Neural Information Processing Systems
Bronstein MM, Bruna J, Lecun Y et al. (2017) Geometric Deep Learning: Going beyond Euclidean data. https://doi.org/10.1109/MSP.2017.2693418, arXiv:1611.08097
Dagan I, Dolan B, Magnini B et al. (2009) Recognizing textual entailment: Rational, evaluation and approaches. https://doi.org/10.1017/S1351324909990209
Devlin J, Chang MW, Lee K et al. (2019) BERT: Pre-training of deep bidirectional transformers for language understanding. NAACL HLT 2019 - 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies - Proceedings of the Conference 1(Mlm):4171–4186. arXiv:1810.04805
Gravanis G, Vakali A, Diamantaras K et al (2019) Behind the cues: A benchmarking study for fake news detection. Exp Syst Appl 128:201–213. https://doi.org/10.1016/j.eswa.2019.03.036
Hassan N, Arslan F, Li C et al (2017) Toward automated fact-checking: Detecting check-worthy factual claims by claimbuster. Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining Part F1296:1803–1812. https://doi.org/10.1145/3097983.3098131
Kaliyar RK, Goswami A, Narang P (2021) FakeBERT: Fake news detection in social media with a BERT-based deep learning approach. Multimed Tools Appl 80(8):11,765–11,788. https://doi.org/10.1007/s11042-020-10183-2
Karimi H, Tang J (2019) Learning hierarchical discourse-level structure for fake news detection. NAACL HLT 2019 - 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies - Proceedings of the Conference 1:3432–3442. https://doi.org/10.18653/v1/n19-1347, arXiv:1903.07389
Kowsari K, Meimandi KJ, Heidarysafa M et al. (2019) Text classification algorithms: A survey. Inf (Switzerland) 10(4):150. https://doi.org/10.3390/info10040150, arXiv:1904.08067
Kwon S, Cha M, Jung K et al. (2013) Prominent features of rumor propagation in online social media. In: Proceedings - IEEE International Conference on Data Mining, ICDM, pp 1103–1108. https://doi.org/10.1109/ICDM.2013.61
Li L, Qin B, Ren W et al (2017) Document representation and feature combination for deceptive spam review detection. Neurocomput 254:33–41. https://doi.org/10.1016/j.neucom.2016.10.080
Long Y, Lu Q, Xiang R et al. (2017) Fake News Detection Through Multi-Perspective Speaker Profiles. Proceedings of the Eighth International Joint Conference on Natural Language Processing Volume 2:(8):252–256. http://www.aclweb.org/anthology/I17-2043
Luong MT, Pham H, Manning CD (2015) Effective approaches to attention-based neural machine translation. In: Conference Proceedings - EMNLP 2015: Conference on Empirical Methods in Natural Language Processing, pp 1412–1421, https://doi.org/10.18653/v1/d15-1166. arXiv:1508.04025
Meel P, Vishwakarma DK (2020) Fake news, rumor, information pollution in social media and web: A contemporary survey of state-of-the-arts, challenges and opportunities. https://doi.org/10.1016/j.eswa.2019.112986
Mikolov T, Sutskever I, Chen K et al. (2013) Distributed representations ofwords and phrases and their compositionality. In: Advances in Neural Information Processing Systems
Monti F, Frasca F, Eynard D et al. (2019) Fake News Detection on Social Media using Geometric Deep Learning. Arxiv 1902-06673v1 pp 1–15. arXiv:1902.06673
Ott M, Choi Y, Cardie C et al. (2011) Finding deceptive opinion spam by any stretch of the imagination. In: ACL-HLT 2011 - Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, pp 309–319. arXiv:1107.4557
Palani B, Elango S, Vignesh Viswanathan K (2022) CB-Fake: A multimodal deep learning framework for automatic fake news detection using capsule neural network and BERT. Multimed Tools Appl 81(4):5587–5620. https://doi.org/10.1007/s11042-021-11782-3. https://link.springer.com/article/10.1007/s11042-021-11782-3
Pan JZ, Pavlova S, Li C et al. (2018) Content based fake news detection using knowledge graphs. In: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol 11136 LNCS. Springer, pp 669–683. https://doi.org/10.1007/978-3-030-00671-6_39, http://link.springer.com/10.1007/978-3-030-00671-6_39
Panigrahi S, Nanda A, Swarnkar T (2021) A Survey on Transfer Learning. Smart Innov Syst Technol 194(10):781–789. https://doi.org/10.1007/978-981-15-5971-6_83
Pennington J, Socher R, Manning CD (2014) GloVe: Global vectors for word representation. In: EMNLP 2014 - 2014 Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference, pp 1532–1543. https://doi.org/10.3115/v1/d14-1162
Pérez-Rosas V, Kleinberg B, Lefevre A et al. (2018) Automatic detection of fake news. https://www.aclweb.org/anthology/C18-1287/. arXiv:1708.07104
Peters ME, Neumann M, Zettlemoyer L et al. (2018) Dissecting contextual word embeddings: Architecture and representation. In: Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, EMNLP 2018, pp 1499–1509, https://doi.org/10.18653/v1/d18-1179. arXiv:1808.08949
Pierri F, Ceri S (2019) False news on social media: A data-driven survey. SIGMOD Record 48(2):18–32. https://doi.org/10.1145/3377330.3377334, arXiv:1902.07539
Rajpurkar P, Zhang J, Lopyrev K et al. (2016) SQuad: 100,000+ questions for machine comprehension of text. In: EMNLP 2016 - Conference on Empirical Methods in Natural Language Processing, Proceedings, pp 2383–2392. https://doi.org/10.18653/v1/d16-1264, arXiv:1606.05250
Roy A, Basak K, Ekbal A et al. (2019) A Deep Ensemble Framework for Fake News Detection and Multi-Class Classification of Short Political Statements. In: Proceedings of the 16th International Conference on Natural Language Processing
Sadeghi F, Bidgoly AJ, Amirkhani H (2022) Fake news detection on social media using a natural language inference approach. Multimed Tools Appl 81(23):33,801–33,821. https://doi.org/10.1007/s11042-022-12428-8, https://link.springer.com/article/10.1007/s11042-022-12428-8
Shu K, Mahudeswaran D, Liu H (2019) FakeNewsTracker: a tool for fake news collection, detection, and visualization. Comput Math Org Theory 25(1):60–71. https://doi.org/10.1007/s10588-018-09280-3
Shu K, Mahudeswaran D, Wang S et al. (2020) FakeNewsNet: A Data Repository with News Content, Social Context, and Spatiotemporal Information for Studying Fake News on Social Media. Big Data 8(3):171–188. https://doi.org/10.1089/big.2020.0062, arXiv:1809.01286
Singhania S, Fernandez N, Rao S (2017) 3HAN: A Deep Neural Network for Fake News Detection. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) 10635 LNCS:572–581. https://doi.org/10.1007/978-3-319-70096-0_59
Thorne J, Vlachos A (2018) Automated fact checking: Task formulations, methods and future directions. COLING 2018 - 27th International Conference on Computational Linguistics, Proceedings pp 3346–3359. arXiv:1806.07687
Thota A, Tilak P, Ahluwalia S et al. (2018) Fake news detection: a deep learning approach. SMU Data Sci Rev 1(3):10. https://scholar.smu.edu/datasciencereview/vol1/iss3/10
Vaswani A, Shazeer N, Parmar N, et al (2017) Attention is all you need. In: Advances in Neural Information Processing Systems, pp 5999–6009. arXiv:1706.03762
Vosoughi S, Roy D, Aral S (2018) The spread of true and false news online. Sci 359(6380):1146–1151. https://doi.org/10.1126/science.aap9559
Wang B, Feng Y, cai Xiong X et al. (2022) Multi-modal transformer using two-level visual features for fake news detection. Appl Intell pp 1–15. https://doi.org/10.1007/s10489-022-04055-5, https://link.springer.com/article/10.1007/s10489-022-04055-5
Wang WY (2017) "Liar, liar pants on fire": A new benchmark dataset for fake news detection. In: ACL 2017–55th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference (Long Papers), pp 422–426. https://doi.org/10.18653/v1/P17-2067, arXiv:1705.00648
Williams A, Nangia N, Bowman SR (2018) A broad-coverage challenge corpus for sentence understanding through inference. In: NAACL HLT 2018 - 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies - Proceedings of the Conference, pp 1112–1122. https://doi.org/10.18653/v1/n18-1101, arXiv:1704.05426
Wolf T, Debut L, Sanh V et al. (2020) Transformers: State-of-the-Art Natural Language Processing. In: Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, pp 38–45. https://doi.org/10.18653/v1/2020.emnlp-demos.6
Wu K, Yang S, Zhu KQ (2015) False rumors detection on Sina Weibo by propagation structures. In: Proceedings - International Conference on Data Engineering, pp 651–662. https://doi.org/10.1109/ICDE.2015.7113322
Zhou X, Zafarani R (2020) A Survey of Fake News: Fundamental Theories, Detection Methods, and Opportunities. ACM Comput Surv 53(5). https://doi.org/10.1145/3395046, arXiv:1812.00315
Zhou X, Cao J, Jin Z et al. (2015) Real-time news certification system on sina weibo. In: WWW 2015 Companion - Proceedings of the 24th International Conference on World Wide Web, pp 983–988. https://doi.org/10.1145/2740908.2742571
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Farokhian, M., Rafe, V. & Veisi, H. Fake news detection using dual BERT deep neural networks. Multimed Tools Appl 83, 43831–43848 (2024). https://doi.org/10.1007/s11042-023-17115-w
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-023-17115-w