Skip to main content
Log in

Sport-fanaticism lexicons for sentiment analysis in Arabic social text

  • Original Article
  • Published:
Social Network Analysis and Mining Aims and scope Submit manuscript

Abstract

Sport-fanaticism is one of the social problems. Studying this problem in social network sites such as Twitter becomes important where social sites provide a mean for people to communicate and share emotions. Hence, a huge amount of data is posted on social media every day where text mining and sentiment analysis are essential to automatically analyze such data to extract the desired information and knowledge. In this paper, two main contributions are introduced. The first contribution is that we generated twelve large-scale fanatic-lexicons that can help in building fanatic-classification to automatically classify Arabic social text (e.g., tweets) into fanatic-text or non-fanatic text. The generated fanatic-lexicons can help in building anti-fanatic tools and automatically detecting and measuring sport-fanaticism in Arabic social text. As far as we know, the generated fanatic-lexicons are the first large-scale fanatic-lexicons. The generated resources are publicly available for research purpose. The second contribution is that we proposed a new method to automatically generate sentiment lexicons which is called Term Frequency-Inverse Context Frequency (TFICF). The performance of the proposed-TFICF method is analyzed and compared with one of the common methods in this path which is called Pointwise Mutual Information (PMI). Our proposed-TFICF method showed better performance where the highest accuracy of TFICF is 89%, and the highest accuracy of PMI is 82%.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3

Similar content being viewed by others

References

  • Aldayel HK, Azmi AM (2016) Arabic tweets sentiment analysis–a hybrid scheme. J Inf Sci 42(6):782–797

    Article  Google Scholar 

  • Al-Moslmi T, Albared M, Al-Shabi A, Omar N, Abdullah S (2017) Arabic senti-lexicon: constructing publicly available language resources for arabic sentiment analysis. J Inf Sci. https://doi.org/10.1177/0165551516683908

    Article  Google Scholar 

  • Alqmase MM (2019) Building Sentiment Resources, In: Sentiment analysis for sports fanaticism in Arabic social media text (https://eprints.kfupm.edu.sa/id/eprint/141005), King Fahd University of Petroleum & Minerals

  • Alqmase MM (2021) “Anti-fanatic Repository,” Anti-Fanatic, [Online]. https://github.com/qumasi/Anti-fanatic-resources. Accessed 22 May 2022

  • Alqmase M, Al-Muhtasab H, Rabaan H (2021) Sports-fanaticism formalism for sentiment analysis in Arabic Text. Soc Netw Anal Min (SNAM). https://doi.org/10.1007/s13278-021-00757-9

    Article  Google Scholar 

  • Alshahrani HA, Fong AC (2018) Arabic domain-oriented sentiment lexicon construction using latent Dirichlet Allocation, In: 2018 IEEE International Conference on Electro/Information Technology (EIT)

  • Al-Twairesh N, Al-Khalifa H, Al-Salman A (2014) Subjectivity and sentiment analysis of Arabic: trends and challenges, In: IEEE/ACS 11th International Conference on Computer Systems and Applications (AICCSA), Doha, Qatar

  • Al-Twairesh N, Al-Khalifa H, AlSalman A (2016) Arasenti: large-scale twitter-specific arabic sentiment lexicons, In: Proceedings of the 54th annual meeting of the association for computational linguistics

  • Church KW, Hanks P (1990) Word association norms, mutual information, and lexicography. Comput Linguist 16(1):22–29

    Google Scholar 

  • Darwish K, Mubarak H (2016) "Farasa: A new fast and accurate Arabic word segmenter," In: Proceedings of the Tenth International Conference on Language Resources and Evaluation

  • E--Beltagy SR (2017) WeightedNileULex: a scored Arabic sentiment lexicon for improved sentiment analysis. In: Gayar NE, Suen CY (eds) Language processing, pattern recognition and intelligent systems. Special issue on computational linguistics, speech and image processing for Arabic language. World Scientific Publishing Co

    Google Scholar 

  • El-Beltagy SR (2016) NileULex: a phrase and word level sentiment lexicon for Egyptian and Modern Standard Arabic, In: LREC

  • El-Beltagy SR, Ali A (2013a) "Open issues in the sentiment analysis of Arabic social media: A case study," In: 2013a 9th International Conference on Innovations in Information Technology (IIT)

  • El-Beltagy S, Ali A (2013b) “unWeighted opinion mining Lexicon (Egyptian Arabic)

  • El-Beltagy S, Ali A (2013c) “unWeighted Opinion Mining Lexicon (Egyptian Arabic),” [Online]. http://bit.ly/MGtMqU. Accessed 4 Apr 2019

  • El-Masri M, Altrabsheh N, Mansour H (2017) Successes and challenges of Arabic sentiment analysis research: a literature review. Soc Netw Anal Min Springer 1(7):54

    Article  Google Scholar 

  • ElSahar H, El-Beltagy SR (2014) A fully automated approach for arabic slang lexicon extraction from microblogs," In: International conference on intelligent text processing and computational linguistics

  • ElSahar H, El-Beltagy SR (2015) Building large arabic multi-domain resources for sentiment analysis, In: International Conference on Intelligent Text Processing and Computational Linguistics

  • Esuli A, Sebastiani F (2006) Sentiwordnet: a publicly available lexical resource for opinion mining, In: LREC, Citeseer, pp 417–422

  • Hu M, Liu B (2014) Mining and summarizing customer reviews, In: Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining

  • Kiritchenko S, Zhu X, Mohammad SM (2014) Sentiment analysis of short informal texts. J Artif Intell Res 50:723–762

    Article  Google Scholar 

  • Liu B (2012) Sentiment analysis and opinion mining, Morgan and Claypool Publishers

  • Mahyoub FH, Siddiqui MA, Dahab MY (2014) Building an Arabic sentiment lexicon using semi-supervised learning. J King Saud Univ Comput Inf Sci 26(4):417–424

    Google Scholar 

  • Mataoui M, Zelmati O, Boumechache M (2016) A proposed lexicon-based sentiment analysis approach for the vernacular Algerian Arabic. Res Comput Sci 110:55–70

    Article  Google Scholar 

  • Mohammad S, Turney P (2010) Emotions evoked by common words and phrases: Using mechanical turk to create an emotion lexicon, In: Proceedings of the NAACL HLT 2010 workshop on computational approaches to analysis and generation of emotion in text

  • Mohammad SM, Kiritchenko S, Zhu X (2013) Nrc-canada: Building the state-of-the-art in sentiment analysis of tweets, arXiv preprint arXiv:1308.6242

  • Mohammad S, Salameh M, Kiritchenko S (2016) Sentiment Lexicons for Arabic Social Media, In: LREC

  • Mohammad H, Sulaiman MN (2015) A review on evaluation metrics for data classification evaluations. Int J Data Min Knowl Manag Process 5(2):1

    Article  Google Scholar 

  • Mohammad SM, Turney PD (2013) Crowdsourcing a word–emotion association lexicon. Comput Intell 29(3):436–465

    Article  MathSciNet  Google Scholar 

  • Nielsen F (2011) A new ANEW: Evaluation of a word list for sentiment analysis in microblogs, arXiv preprint arXiv:1103.2903

  • Niwa Y, Nitta Y (1994) Co-occurrence vectors from corpora vs. distance vectors from dictionaries, In: Proceedings of the 15th conference on Computational linguistics

  • Pasha A, Al-Badrashiny M, Mohamed MT, El Kholy A, Eskander R, Habash N, Pooleery M, Rambow O, Roth R (2014) Madamira: a fast, comprehensive tool for morphological analysis and disambiguation of arabic," In: LREC

  • QCRI Arabic Language Technologies Tools & Demos (2016) “FARASA” developed by Arabic Language Technologies Group at Qatar Computing Research Institute (QCRI), website accessed (2022) [Online]. https://farasa.qcri.org/. Accessed 13 May 2022

  • Refaee E, Rieser V (2014) An arabic twitter corpus for subjectivity and sentiment analysis, In: LREC

  • Wilson T, Wiebe J, Hoffmann P (2005) Recognizing contextual polarity in phrase-level sentiment analysis, In: Proceedings of Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing

  • Youssef M, El-Beltagy SR (2018) MoArLex: an Arabic sentiment lexicon built through automatic lexicon expansion. Procedia Comput Sci 142:94–103

    Article  Google Scholar 

Download references

Acknowledgements

The authors would like to acknowledge the support provided by King Fahd University of Petroleum & Minerals.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Mohammed Alqmase.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Alqmase, M., Al-Muhtaseb, H. Sport-fanaticism lexicons for sentiment analysis in Arabic social text. Soc. Netw. Anal. Min. 12, 56 (2022). https://doi.org/10.1007/s13278-022-00871-2

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s13278-022-00871-2

Keywords

Navigation