Towards Improving Sentiment Analysis in Arabic

Siddiqui, Sanjeera; Monem, Azza Abdel; Shaalan, Khaled

doi:10.1007/978-3-319-48308-5_12

Sanjeera Siddiqui⁷,
Azza Abdel Monem⁸ &
Khaled Shaalan^7,9

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 533))

Included in the following conference series:

International Conference on Advanced Intelligent Systems and Informatics

2603 Accesses
3 Citations

Abstract

This paper presents an innovative approach that explores the role of lexicalization for Arabic sentiment analysis. Sentiment Analysis in Arabic is hindered due to lack of resources, language in use with sentiment lexicons, pre-processing of dataset as a must and major concern is repeatedly following same approaches. One of the key solution found to resolve these problems include applying the extension of lexicon to include more words not restricted to Modern Standard Arabic. Secondly, avoiding pre-processing of dataset. Third, and the most important one, is investigating the development of an Arabic Sentiment Analysis system using a novel rule-based approach. This approach uses heuristics rules in a manner that accurately classifies tweets as positive or negative. The manner in which a series of abstraction occurs resulting in an end-to-end mechanism with rule-based chaining approach. For each lexicon, this chain specifically follows a chaining of rules, with appropriate positioning and prioritization of rules. Expensive rules in terms of time and effort thus resulted in outstanding results. The results with end-to-end rule chaining approach achieved 93.9 % accuracy when tested on baseline dataset and 85.6 % accuracy on OCA, the second dataset. A further comparison with the baseline showed huge increase in accuracy by 23.85 %.

Download conference paper PDF

Evaluation and Enrichment of Arabic Sentiment Analysis

Combining Lexical Features and a Supervised Learning Approach for Arabic Sentiment Analysis

Sentiment Analysis in Arabic

Keywords

1 Introduction

The mission of Sentiment Analysis is to perceive the content with suppositions and mastermind them in a way complying with the extremity, which incorporates: negative, positive or nonpartisan. Organization’s are taken to huge prestige by living up to the opinions from different people [9, 17]. Subjectivity and Sentiment Analysis characterization are prepared in four measurements: (1) subjectivity arrangement, to estimate on Subjective or Objective, (2) Sentiment Analysis, to anticipate on the extremity that could be negative, positive, or impartial, (3) the level in view of record, sentence, word or expression order, and (4) the methodology that is tailed; it could be standard based, machine learning, or half breed [10].

Arabic is a Semitic dialect, which is distinctive as far as its history, structure, diglossic nature and unpredictability Farghaly and Shaalan [8]. Arabic is broadly talked by more than 300 million individuals. Arabic Natural Language Processing (NLP) is testing and Arabic Sentiment Analysis is not a special case. Arabic is exceptionally inflectional [4, 11] because of the fastens which incorporates relational words and pronouns. Arabic morphology is intricate because of things and verbs bringing about 10,000 root [6]. Arabic morphology has 120 examples. Beesley [6] highlighted the hugeness of 5000 roots for Arabic morphology. No capitalization makes Arabic named substance acknowledgment a troublesome mission [14]. Free order of Arabic Language brings in additional challenges with regards to Sentiment Analysis, as the words in the sentence can be swapped without changing the structure and the meaning. Arabic Sentiment Analysis has been a gigantic center for scientists [2].

The target of this study is to examine procedures that decide negative and positive extremity of the information content. One of the critical result would be to recognize the proposed end-to-end principle binding way to deal with other dictionary based and machine learning-construct approaches in light of the chosen dataset.

The rest of this paper is organized as follows. Related work is covered in Sect. 2, Data collection is covered in Sect. 3 followed by system implementation in Sect. 4. Section 5 covers evaluation and results, Sect. 6 covers error analysis and lastly Sect. 7 depicts conclusion.

2 Related Work

To take a shot at Sentiment Analysis, the key parameter is the dataset. Late endeavors by Farra et al. [13] outlined the significance of crowdsourcing as an extremely fruitful technique for commenting on dataset. Bolster Vector Machine classifier accomplished 72.6 % precision on twitter dataset of 1000 tweets [15].

The record level assessment investigation utilizing a joined methodology comprising of a dictionary and Machine Learning approach with K-Nearest Neighbors and Maximum Entropy on a blended area corpus involving training, legislative issues and games achieved an F-measure of 80.29 % [7]. Shoukry and Refea [15] achieved a precision of 72.6 % with the corpus based method. The vocabulary and estimation examination device with an exactness of 70.05 % on tweeter dataset also, 63.75 % on Yahoo Maktoob dataset [3].

With a mixed approach that is Lexical and Support Vector Machine classifier created 84.01 % exactness [1]. 79.90 % exactness was shown with Hybrid methodology which involved lexical, entropy and K-closest neighbor [5]. Shoukry and Rafea [16] independently sent two methodologies, one being Support Vector Machine accomplished a precision of 78.80 % and other one being Lexical with an exactness of 75.50 %.

3 Data Collection

The dataset utilized as a part of this paper is Twitter Tweets and film surveys. The dataset is taken from [3, 12]. These datasets are utilized as these are accessible, rich and enough to reached a conclusion, also electronic assets, for example, vocabulary is given.

Abdullah et al. [3]’s dataset contains 1000 positive tweets and 1000 negative tweets with length ranging from one word to sentences. 7189 words in positive tweets and 9769 words in negative tweets. The tweets were manually collected belong to Modern Standard Arabic and Jordanian Dialect, which covers Levantine language family. The months-long segregation procedure of the tweets was physically led by two human specialists (local speakers of Arabic).

OCA corpus, termed as Opinion Corpus for Arabic, was presented by [12]. This corpus contains total 500 opinions, of which there are 250 positive opinions and 250 negative opinions. The procedure followed by Rushdi et al. [12] included gathering surveys from a few Arabic online journal locales and site pages utilizing a straightforward bash script for slithering. At that point, they expelled HTML labels and unique characters, and spelling mix-ups were adjusted physically. Next, a preparing of each survey was done, which included tokenizing, evacuating Arabic stop words, and stemming and sifting those tokens whose length was under two characters. In their trials, they have utilized the Arabic stemmer of RapidMiner and the Arabic stop word list. At last, three distinctive N-gram plans were created (unigrams, bigrams, and trigrams) and cross validation was used to assess the corpus for which they have achieved 90.6 % accuracy.

The vocabulary utilized as a part of this paper contains the dictionary used by [3], which included opinions, named substances and a few haphazardly set words. In view of the circumspection of reiteration of the words found in negative or positive reviews and their arrangement, they were incorporated into both the rundown that is positive and negative lists.

4 Implementation of Arabic Sentiment Analysis

This paper is a significant extension of [13]. Siddiqui et al. [13] research introduced a system, which contains only two type of rules - “equal to” and “within the text rules”, so as to examine whether the tweet is either negative or positive. The rules include a 360-degree coverage with an improvised segment that is end to end rule chaining principle: (1) in the middle, we termed as “within the text”, (2) at the boundary we termed as either “ending with the text” or “beginning with the text”, and (3) full coverage, we termed as “equal to the text”. Figure 1 delineates the 360-degree rules coverage. The End-to-End mechanism with rule chaining approach introduced in this paper, includes the chaining of rules based on the positioning of the polarities in the tweets. The key underlying base ground factors which helped us formulate appropriate rules includes analysis of the tweets and the extension of positive and negative lexicons. The analysis of tweets resulted in identifying relations pertaining to words which were either disjoint, intersected or coexisted.

The words which were disjoint that is completely indicating either positive or negative polarity were included in their respective lexicons. The words which intersected that is the ones which were found to be common in both negative and positive reviews were included in positive as well as negative lexicons. The words which coexisted at the same place in the negative and positive reviews, that is the ones which appeared at the beginning or ending were placed in either positive or negative lexicon, based on the highest frequency of the word in the respective reviews.

Rules handling intersection with the end-to-end chaining mechanism: As an example consider the following positive tweet (the decision was suspended for protecting the motherland), the word (suspended) appeared in the beginning of this tweet and the same word was found in the negative tweets. Hence, this set of situations was handle with the very use of positioning and chaining of rules. The steps involved to achieve the positioning and chaining of rules includes:

Rules Formation: With the logical discretion of the word (suspended) being seen at the “beginning of” positive tweets and “within the text” for the negative tweets, the rules thus formed were “beginning with” for positive tweets and “within the text” for negative tweets. With the correct positioning and chaining of rules this problem was resolved. In the current example “beginning with” rule needs to be positioned and chained in an orderly fashioned with the rule “within the text” so as to satisfy both positive and negative reviews. So, the rule “beginning with” was chained with the rule “within the text” for the word (suspended) by positioning the rule “beginning with” first followed by “within the text” rule. Hence, the rules are chained and positioned for the words which are found to repeat themselves in both positive and negative tweets.

Rules handling coexistence: The lexicons which are not seen to repeat themselves in either positive or negative tweets where handled with the rule – “within the text”. For example, (Thieves) is set for search “within the text” (Thieves), is majorly found within the text in negative review rather than in positive ones. Hence based on the frequency of (Thieves) the rule is set. Example tweet: (I am not responsible for that thieves spend lavishly from the funds of the country and I beg.)

Rules handling disjoint: The cases wherein the words were not repeated in positive and negative cases and were found to have their significance at the end of the tweet, were handled using “ending with” rule. In the following positive tweet example – (Remember who continues to speak falsehood he is recorded with Allah as a great liar – but who persists in speaking the truth he is recorded with Allah as an honest man). The word (Honest) appeared at the end of this tweet. Likewise, (Honest) was found to appear at the end in majority of positive reviews. Hence, the “end with” rule is set to search for indicating the system that if the review ends with the word (Honest) then it should be considered as a positive tweet.

Derivation of rules: Table 1 depicts the skeletal examples of derived rules. Conditional search includes two key phases, one includes a condition which checks on the entered tweet mapping with the rules and the second phase is the color coded output which changes its font color to - “Green fill with Green fill text” for negative tweets and “Light red fill with light red fill text” for positive tweets.

Table 1. Derivation of lexicalized rule

Full size table

With reference to rule 1.A in Table 1 in the primary column, if the word showed up inside of the content in a positive tweet. With reference to manage 2.B or 3.B in this table, if the same word showed up toward the end or toward the starting in the negative tweet, then situating and anchoring of these two rules are finished. For this case the positive principle was situated and fastened underneath the negative guideline. The principle “starting with” or “finishing with” not at all like “inside of the content” search for that word in the first place or end which will tag the tweet as negative. In the event that the situation of guideline is turned around then “inside of the content” as situated and affixed before the tenets “starting” or “consummation” with, when a negative tweet is gone through this, the tweet will be labeled positive. As this is a chained approach for a specific word, this checks its importance at the ending or starting rule, on the off chance that it doesn’t have a place with that then it goes through within the text.

End-to-end mechanism with rule chaining approach.: After precisely executing the rules taking into account the fulfillment of disjoint word(s) which existed together or basic words in negative and positive surveys, the guideline anchoring was dealt with. End-to-end component with rule chaining approach fulfills a word which has a place with a positive and negative tweet regardless of its real extremity. Consequently, a negative word in a positive audit and a positive word in a negative tweet is fulfilled through apt chaining as examined beneath with the guide of case from negative and positive tweets. Example 1 Rules in use – Rule A and Rule B.

Rule A in Table 1: In the negative tweet (There is no need to protect our motherland from hands of hypocrites Jordanian), contains the word (hypocrites) within the text. Rule B in Table 1: In the positive tweet (O Allah! Place us not with the people who are hypocrites) contains the word (hypocrites) at the last. If Rule A is not chained with Rule B, then only one of the rule will be satisfied. To satisfy both the rules, that is to correctly identify negative and positive polarity, Rule A is chained with Rule B by positioning Rule A Below Rule B, so as to allow the search to first visit the end rule first, then the within the text rule.

Example 2 with explanation on how the system works: For example, the word (pessimism) was found to be part of both positive and negative reviews with a slight variation. In the positive reviews appears only in the middle whereas in the negative review (pessimism) was found to appear at the beginning. Hence the rules thus were created covering “within the text” (refer rule 1.A in Table 1) and “beginning with” (refer rule 2.B in Table 1), but the positioning was varied. By positioning the rule “beginning with” first and then the rule “within the text” helped in satisfying both positive and negative reviews. As the system checked for the word beginning with (pessimism) in the entered review and if found then the review was tagged as negative. Likewise, the system proceeded with the entered review containing the word (pessimism), if the word (pessimism) was not found at the beginning then the search proceeded further and identified it as positive.

5 Evaluation and Results

To quantify the change also the nature of being trusted and had faith in, assessment plays an essential part. Cross-Validation and accuracy information are regularly used to assess the outcomes in estimation investigation. The exactness measures – Precision, Recall and Accuracy, which are generally being used was conveyed to measure the execution of the instruments utilized as a part of both the analyses. Precision, Recall and Accuracy were utilized to look at the outcomes by [3, 12, 16]. The condition is as per the following:

Precision = TP/(TP + FP)
Recall = = TP/(TP + FN)
Accuracy = (TP + TN)/(TP + TN + FP + FN)
Where:
TP – True Positive, all the tweets which were characterized accurately as positive
TN – True Negative, all the tweets which were accurately named negative
FP – False Positive, all the tweets which were mistakenly named positive
FN – False Negative, all the tweets which were mistakenly delegated negative

Results: The results fuse the relationship of the impressive number of tests coordinated in this paper. To do the connection, the accuracy of the extensive number of tests are used. Siddiqui et al. [13] system and System 2 (introduced in this paper) were attempted on [3, 12] dataset. Table 2 obviously takes after the outperformance of rules made in Siddiqui et al. [13] with enormous accuracy for Abdullah et al. [3] dataset when contrasted with the results on [12] OCA dataset.

Table 2. Comparison of System1 [13] Versus System 2

Full size table

System 1 Vs System 2 Results Comparison: Clear importance in expansion in accuracy in System 2 is seen for Abdullah et al. [3] and OCA dataset. The examination of Siddiqui et al. [13]’s System 1 and our System 2 doubtlessly answers that the end to end rule chaining improved the performance of sentiment analysis. Siddiqui et al. [13]’s-system 1 bound to limits with two norms sort with no attaching and System 2 variation of System 1 with authoritative and reasonable arranging of the standards. System 2 ended up being remarkable for both the datasets. 93.9 % accuracy for [3] dataset wherein for OCA dataset 85.6 % accuracy was measured. Still the accuracy for our system2 when tested on Abdullah et al. [3] was high with 8.3 % more exactness than OCA dataset. Recall is high for both the datasets with 3.3 % for Abdullah et al. [3] and 22.6 % for Rushdi et al. [12] as very less number of tweets are mistakenly delegated negative.

6 Error Analysis

Case 1: In the first case, a negative review included a positive word within the text and the same positive word was mostly found within the text in positive reviews affecting positioning of rules. Table 3 illustrates an example. Hence, the rule can either satisfy a positive review or negative review. Chain was not feasible to form in this case.

Table 3. Unhandled exceptions – Case 1

Full size table

Case 2: Likewise, in the second case, a positive review included a negative word within the text and the same negative word was mostly found within the text in the negative reviews affecting positioning of rules. Rule again restricted to either positive or negative review. Table 4 illustrates an example.

Table 4. Unhandled exceptions – Case 1

Full size table

7 Conclusion

This paper beats the vocabulary building process through the fitting position of words too not barring the basic words found in both the tweets for the vocabularies. The outperformance of rule chaining approach that is System 2 brought about 23.85 % in results when contrasted with Abdullah et al. [3]’s vocabulary based methodology. The incorporation of normal words in light of the examination of tweets in the negative and positive vocabulary list upgraded the general result when contrasted with the gauge dataset. Last yet not the minimum, the situating of principles has a gigantic effect to the rule based methodology as proper situating brought about fulfilling words which were observed to be regular in both negative and positive dictionaries. By and by, the end-to-end rule chaining methodology was the most difficult and costly regarding time and exertion, yet adds to the headways in the cutting edge for Arabic Sentiment Analysis, through the organized set standards and through the right utilization of various principles including “contains content”, “equivalent to”, “starting with” and “finishing with”. In reality, displayed the recently created assessment investigation framework- System2, which beat in both arrangements of examinations when contrasted with [3]. System 2 with guidelines reached out to cover all territories was demonstrated to expand the exactness of OCA corpus by 39.8 % and 4.3 % accuracy for Abdullah et al. [3] when contrasted with System 1’s principles of Siddiqui et al. [13]. Thus, starting a wakeup require every one of the scientists to redirect their enthusiasm to lead based methodology. The unmistakable hugeness in results along these lines acquired through the tenets made makes the principle based methodology the most alluring methodology.

References

Aldayel, H.K., Azmi, A.M.: Arabic tweets sentiment analysis–a hybrid scheme. J. Inf. Sci. (2015). doi:10.1177/0165551515610513
Google Scholar
Muhammad, A.-M., Kübler, S., Diab, M.: Samar: a system for subjectivity and sentiment analysis of Arabic social media. In: Proceedings of the 3rd Workshop in Computational Approaches to Subjectivity and Sentiment Analysis, pp. 19–28. Association for Computational Linguistics (2012)
Google Scholar
Abdulla, N.A., Ahmed, N.A., Shehab, M.A., Al-Ayyoub, M., Al-Kabi, M.N., Al-rifai, S.: Towards improving the lexicon-based approach for Arabic sentiment analysis. Int. J. Inf. Technol. Web Eng. (IJITWE) 9(3), 55–71 (2014)
Article Google Scholar
Hammo, B., Abu-Salem, H., Lytinen, S., Evens, M.: QARAB: a question answering system to support the Arabic language. In: The Proceedings of Workshop on Computational Approaches to Semitic Languages, ACL 2002, Philadelphia, PA, pp. 55–65, July 2002
Google Scholar
Beesley, K.R.: Arabic finite-state morphological analysis and generation. In: Proceedings of the 16th Conference on Computational Linguistics, vol. 1, pp. 89–94. Association for Computational Linguistics, August 1996
Google Scholar
Darwish, K.: Building a shallow Arabic morphological analyzer in one day. In: Proceedings of the ACL-2002 Workshop on Computational Approaches to Semitic languages, pp. 1–8. Association for Computational Linguistics, July 2002
Google Scholar
Alaa, E.-H.: Arabic opinion mining using combined classification approach (2011)
Google Scholar
Farghaly, A., Shaalan, K.: Arabic natural language processing: challenges and solutions. ACM Trans. Asian Lang. Inf. Process. (TALIP) 8(4), 1–22 (2009). The Association for Computing Machinery (ACM)
Article Google Scholar
Feldman, R.: Techniques and applications for sentiment analysis. Commun. ACM 56(4), 82–89 (2013)
Article Google Scholar
Korayem, M., Crandall, D., Abdul-Mageed, M.: Subjectivity and sentiment analysis of arabic: a survey. In: Hassanien, A.E., Salem, A.-B.M., Ramadan, R., Kim, T.-h. (eds.) AMLTA 2012. CCIS, vol. 322, pp. 128–139. Springer, Heidelberg (2012)
Chapter Google Scholar
Syiam, M., Fayed, Z., Habib, M.: An intelligent system for Arabic text categorization. Int. J. Intell. Comput. Inf. 6(1) (2006)
Google Scholar
Rushdi-Saleh, M., Martín-Valdivia, M.T., Ureña-López, L.A., Perea-Ortega, J.M.: OCA: opinion corpus for Arabic. J. Am. Soc. Inf. Sci. Technol. 62(10), 2045–2054 (2011)
Article Google Scholar
Siddiqui, S., Monem, A.A., Shaalan, K.: Sentiment analysis in Arabic. In: Métais, E., Meziane, F., Saraee, M., Sugumaran, V., Vadera, S. (eds.) Proceedings of the 21st International Conference on the Application of Natural Language to Information Systems (NLDB 2016). LNCS. Springer, Heidelberg (2016)
Google Scholar
Khaled, S.: A survey of Arabic named entity recognition and classification. Comput. Linguist. 40(2), 469–510 (2014)
Article Google Scholar
Shoukry, A., Rafea, A.: Sentence-level Arabic sentiment analysis. In: 2012 International Conference on Collaboration Technologies and Systems (CTS), pp. 546–550. IEEE (2012)
Google Scholar
Shoukry, A., Rafea, A.: Preprocessing Egyptian dialect tweets for sentiment mining. In: The 4th Workshop on Computational Approaches to Arabic Script-Based Languages, pp. 47–56 (2012)
Google Scholar
Taboada, M., Brooke, J., Tofiloski, M., Voll, K., Stede, M.: Lexicon-based methods for sentiment analysis. Comput. Linguist. 37(2), 9–27 (2011)
Article Google Scholar

Download references

Author information

Authors and Affiliations

British University in Dubai, Block 11, 1st and 2nd Floor, Dubai International Academic City, Dubai, UAE
Sanjeera Siddiqui & Khaled Shaalan
Faculty of Computer and Information Sciences, Ain Shams University, Abbassia, Cairo, 11566, Egypt
Azza Abdel Monem
School of Informatics, University of Edinburgh, Edinburgh, UK
Khaled Shaalan

Authors

Sanjeera Siddiqui
View author publications
You can also search for this author in PubMed Google Scholar
Azza Abdel Monem
View author publications
You can also search for this author in PubMed Google Scholar
Khaled Shaalan
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Sanjeera Siddiqui .

Editor information

Editors and Affiliations

Faculty of Computers & Information, Cairo University, Giza, Egypt
Aboul Ella Hassanien
Dubai International Academic City, The British University, Dubai, United Arab Emirates
Khaled Shaalan
CS Dept. Faculty of Computers and Inform, Suez Canal University CS Dept. Faculty of Computers and Inform, Ismailia, Egypt
Tarek Gaber
Ahmed Orabi Square , Menouf, Egypt
Ahmad Taher Azar
Faculty of Computer & Information Scienc, Ain Shams University Faculty of Computer & Information Scienc, Cairo, Egypt
M. F. Tolba

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Siddiqui, S., Monem, A.A., Shaalan, K. (2017). Towards Improving Sentiment Analysis in Arabic. In: Hassanien, A., Shaalan, K., Gaber, T., Azar, A., Tolba, M. (eds) Proceedings of the International Conference on Advanced Intelligent Systems and Informatics 2016. AISI 2016. Advances in Intelligent Systems and Computing, vol 533. Springer, Cham. https://doi.org/10.1007/978-3-319-48308-5_12

Download citation

DOI: https://doi.org/10.1007/978-3-319-48308-5_12
Published: 18 October 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-48307-8
Online ISBN: 978-3-319-48308-5
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics

Towards Improving Sentiment Analysis in Arabic

Abstract

Similar content being viewed by others

Evaluation and Enrichment of Arabic Sentiment Analysis