Opinion mining for national security: techniques, domain applications, challenges and research opportunities

Razali, Noor Afiza Mat; Malizan, Nur Atiqah; Hasbullah, Nor Asiakin; Wook, Muslihah; Zainuddin, Norulzahrah Mohd; Ishak, Khairul Khalil; Ramli, Suzaimah; Sukardi, Sazali

doi:10.1186/s40537-021-00536-5

Survey Paper
Open access
Published: 04 December 2021

Volume 8, article number 150, (2021)
Journal of Big Data

Noor Afiza Mat Razali ORCID: orcid.org/0000-0001-5149-3907¹,
Nur Atiqah Malizan¹,
Nor Asiakin Hasbullah¹,
Muslihah Wook¹,
Norulzahrah Mohd Zainuddin¹,
Khairul Khalil Ishak²,
Suzaimah Ramli¹ &
…
Sazali Sukardi³

Abstract

Background

Opinion mining, or sentiment analysis, is a field in Natural Language Processing (NLP). It extracts people’s thoughts, including assessments, attitudes, and emotions toward individuals, topics, and events. The task is technically challenging but highly useful. With the explosive growth of digital platforms in cyberspace, such as blogs and social networks, individuals and organisations are increasingly utilising public opinion for their decision-making. In recent years, significant research has explored mining people’s sentiments from text in cyberspace. Researchers have applied numerous opinion mining techniques, including machine learning and lexicon-based approaches, to analyse and classify people’s sentiments based on text and to discuss the existing gaps. This creates an opportunity for other researchers to investigate and propose improved methods and new domain applications to fill those gaps.

Methods

In this paper, a structured literature review of 122 articles was conducted to examine relevant research in the field of opinion mining applications and the suggested Kansei approach to solving the challenges that occur in mining sentiments based on text in cyberspace. Five databases were systematically searched for publications between 2015 and 2021: ACM (Association for Computing Machinery), IEEE (Institute of Electrical and Electronics Engineers), ScienceDirect, SpringerLink, and Scopus.

Results

This study analyses various techniques of opinion mining as well as the Kansei approach, which can help to enhance techniques for mining people’s sentiment and emotion in cyberspace. Most of the studies addressed methods including machine learning, the lexicon-based approach, the hybrid approach, and the Kansei approach for mining sentiment and emotion based on text. The possible societal impacts of current opinion mining techniques, including machine learning and the Kansei approach, along with major trends and challenges, are highlighted.

Conclusion

This study summarises various applications of opinion mining techniques for mining people’s sentiment and emotion according to the research objective, the method used, and the dataset. It serves as a theoretical analysis of the opinion mining method complemented by the Kansei approach in classifying people’s sentiments based on text in cyberspace. The Kansei approach, which measures people’s impressions of artefacts based on senses including sight, feeling and cognition, has been reported to give precise results in the assessment of human emotion. Therefore, this research suggests that the Kansei approach should be a complementary factor, including in the development of a dictionary focusing on emotion in the national security domain. This theoretical analysis will also act as a reference for researchers regarding the Kansei approach as one of the techniques to improve hybrid approaches in opinion mining.

Introduction

Nowadays, cyberspace is loaded with applications and digital media where people with various backgrounds and expertise share their thoughts and opinions on numerous topics and events. Usually, the information shared is in textual form [1]. Sharing can be done through any digital media application, such as online news, blogs, and social media. Countless blogs, social media platforms, forums, news reports, e-commerce websites, and other online resources therefore allow people to express opinions. Such information can be utilised to understand public and consumer opinions regarding product preferences, political movements, social events, marketing campaigns and company strategies, and to monitor reputations. People are often unaware that the opinions they express can have a negative impact on national security. A negative opinion can cause chaos and disputes within a community and create opposing views among people of other countries, thereby threatening a state’s national security [2].

To address this issue, communities of researchers and academicians have been rigorously working on sentiment analysis for the last decade and a half. Sentiment analysis (SA) is a computational assessment of the sentiments, opinions and emotions conveyed in texts and aimed at a certain entity [3]. Sentiment analysis (also called review mining, opinion mining, attitude analysis or appraisal extraction) is the task of detecting, extracting and classifying opinions, sentiments and attitudes concerning different topics, as expressed in textual input [4].

Opinion mining or sentiment analysis helps in achieving various goals such as observing public mood regarding political movements [5], customer satisfaction measurement [6], and movie sales prediction [7]. However, existing opinion mining methods alone, which include machine learning and the lexicon-based approach, cannot effectively analyse and classify people’s sentiments and emotions in cyberspace for the national security domain, because some opinion mining methods focus only on established domains such as business and education. This paper suggests that the Kansei approach can be a complementary factor in mining and classifying people’s sentiment in other domains, such as the national security domain, by analysing suitable references for this approach.

The Kansei method can apply conventional techniques, such as consumer surveys and expert interviews, to understand people’s reactions towards a certain entity or event with the use of artefacts [8]. Kansei Engineering (KE) is one of the methods based on the Kansei approach and has been employed in diverse research on emotional design. KE is capable of measuring people’s feelings and emotional states. These emotional and sensory outcomes are then translated into perceptual design elements of the product or artefact [9]. Typically, Kansei words have proven to be effective in describing affective needs, and mapping the relationships between Kansei words and design elements helps achieve customers’ emotional satisfaction with product specifications. Nowadays, the Kansei approach can be used in different research areas, such as education and information technology, because the KE research method captures the relationship between emotional responses and the attributes of an entity. Researchers are using this method in the information technology domain to analyse design elements of websites. Therefore, this research explores the possible utilisation of KE in combination with other opinion mining methods to analyse emotions from text.

This paper is structured as follows: Sect. “Introduction” provides a brief introduction to opinion mining and the Kansei approach and their functionality and application in mining people’s sentiments in cyberspace. Section “Method” presents the research methodology employed in this paper. Section “Result” states the results of the reviewed articles, and Sect. “Discussion” explains and discusses those results in depth. Section “Discussion” also discusses the findings by highlighting the functionalities of sentiment analysis/opinion mining and the Kansei approach as a new mechanism for mining people’s sentiment and emotions in the national security domain, and it presents the challenges of applying machine learning, the lexicon-based approach and the Kansei method for opinion mining based on text in cyberspace. Section “Future research directions of opinion mining for national security” discusses future research utilising the hybrid approach of machine learning, the lexicon-based approach and the Kansei approach for opinion mining in the national security domain. Section “Limitation” sets out the limitations of our research. Section “Conclusion” summarises and concludes the work.

Method

To examine the literature on opinion mining/sentiment analysis and the Kansei approach in mining sentiments based on text in cyberspace, we conducted a systematic literature review (SLR). The following research questions are the focus of this paper:

1. How can opinion mining techniques and the Kansei approach enhance the methods of mining people’s sentiments and emotions in cyberspace?
2. What are the most relevant sectors that benefit from opinion mining which includes the Kansei approach?
3. What are the techniques used for opinion mining in various domain applications?
4. What are the challenges and future scope of research for opinion mining techniques that include the Kansei approach?

To answer the research questions above, we conducted the SLR by following the guidelines for performing systematic literature reviews in software engineering published by Kitchenham and Charters in 2007. A search was conducted on five platforms: ACM (Association for Computing Machinery), IEEE (Institute of Electrical and Electronics Engineers), ScienceDirect, SpringerLink, and Scopus. Figure 1 presents the research methodology employed to find related articles.

Several keywords were selected for this research: “opinion mining,” “sentiment analysis,” “polarity,” “emotion,” and “Kansei.” Boolean operators such as ‘OR’ and ‘AND’ were used in combination with the selected keywords to search for relevant publications. Depending on the search platform, the search was run against the keywords, title, or abstract.
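As a rough illustration of how such keywords can be combined, the sketch below joins synonyms with OR and joins groups with AND. The grouping of terms and the helper name `build_query` are assumptions made for this example; the exact query strings used on each platform are not given in the text.

```python
def build_query(groups):
    """Join terms inside a group with OR, then join the groups with AND."""
    clauses = []
    for terms in groups:
        quoted = " OR ".join(f'"{term}"' for term in terms)
        clauses.append(f"({quoted})")
    return " AND ".join(clauses)


# Hypothetical grouping of the selected keywords (an assumption, not the
# authors' actual search string).
keyword_groups = [
    ["opinion mining", "sentiment analysis"],
    ["polarity", "emotion", "Kansei"],
]

query = build_query(keyword_groups)
print(query)
```

Each database has its own field syntax for restricting a query to title, abstract or keywords, so a string like this would still need to be adapted per platform.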

Then, the search results were filtered through the inclusion and exclusion criteria. To be included, a paper had to be published between 2015 and 2021, be written in English, and focus on opinion mining techniques based on text in cyberspace. Papers from a variety of disciplines, such as computer science, business, psychology, and medicine, were accepted. Publications in the form of books, posters, and literature reviews were disregarded.

As the selection result, an initial set of 1556 research documents was identified. This was reduced to 1475 documents from the preliminary keyword search on the selected platforms. Duplicated documents were then removed, leaving a total of 1324 documents. The remaining 1324 documents were checked and read against the inclusion and exclusion criteria. After that process, a total of 1428 was excluded. A final set of 122 relevant papers, selected after reading the full text, was included in this research. The subsequent sections of this review analyse these 122 articles.

Result

In this paper, we study 122 papers across numerous subjects. We outline descriptive statistics from the reviewed articles: subject-wise, year-wise, and country-wise analyses. The chart in Fig. 2 shows the subject-wise classification; it reveals that Computer Science and Engineering are the major areas in which related research has been published. Social Sciences, Biomedical Science (Medicine), Health, Psychology, Business, Management, and Accounting and Decision Sciences have also seen an increase in the number of research publications on opinion mining/sentiment analysis and the Kansei approach for mining people’s sentiments in cyberspace.

Based on the year-wise analysis, significant research in opinion mining for analysing sentiments in cyberspace began from 2015 onwards. We can observe a substantial growth in the number of publications from 2015 to 2018. In 2020, a sharp increase can be seen, with more papers published than in 2018, indicating a growing trend in this research area, as shown in Fig. 3. A closer look at the research shows that many studies also concentrate on mining sentiments in cyberspace. This indicates that opinion mining is being explored at a considerably faster rate across multiple industries, partially due to its growing use in various applications.

Figure 4 illustrates the country-wise analysis; it shows that India has the largest number of published studies on opinion mining or sentiment analysis. The United States (US) is also increasingly contributing to the research. This shows that research on opinion mining has the potential to move further in enhancing the detection of people’s opinions in various domains. Asian and European nations such as Malaysia, Vietnam, South Korea, the United Kingdom (UK), and Italy also contribute significantly to this research area.

Discussion

Opinion mining overview

Sentiment analysis, also known as opinion mining, has been used to extract and interpret public sentiments and opinions for over a half-century by research communities, academics, government, and service industries. The task of opinion mining is both technically demanding and extremely practical [10].

According to Liu [11], opinion mining/sentiment analysis is the computational study of people’s views, appraisals, attitudes and emotions toward individuals, problems, events, subjects, and their attributes. It is also the study of people’s opinions based on the sentiments, attitudes, or emotions expressed about a product [12].

‘A thought, opinion, or concept based on a feeling about a situation’ is the definition of the term “sentiment” according to the Cambridge dictionary [13]. Opinion mining involves extracting opinions and categorising them according to their polarity, whether positive, negative, or another emotion. It can be performed at different levels: document-level, sentence-level, and feature- or aspect-level sentiment analysis.

Opinion mining has been a research interest since the early twenty-first century. In 2003, Dave et al. [14] discussed opinion mining and proposed a model for document polarity classification (either recommended or not recommended) based on feedback analysis towards certain entities. From that research onwards, other researchers became interested in applying opinion mining in their text mining studies, and it became an extensive research area in the following years. In 2004, Hu and Liu [15] investigated a mining approach to summarise product reviews by identifying opinion sentences in each review and deciding whether each opinion sentence is positive or negative. In 2008, Abbasi et al. conducted research on sentiment analysis techniques and their applications [16, 17]. In 2009, Tang et al. [18] discussed document sentiment classification and opinion extraction and experimented with classifying web review opinions for consumer product analysis. In 2010, Chen and Zimbra [19] assessed the opinions of various business constituents regarding a company by employing an analysis framework that applied automatic topic and sentiment extraction methods to various online discussions. Based on the review of selected articles, this research found that from 2016 until today, opinion mining has remained an active subject area for researchers.

Classification in opinion mining

Various classification techniques exist for sentiment or opinion mining. In classification, content polarity has been identified as a suitable way to analyse people’s opinions expressed in text. Usually, three classes are used: positive, negative and neutral. According to the literature, most researchers have classified sentiments into these three classes. Singh et al. [20] and Akila et al. [21] concluded in their findings that positive, negative and neutral opinions toward their entities are adequate. The classification algorithms used for sentiment analysis depend on the method employed, such as the supervised or unsupervised method.

Techniques in performing opinion mining

To conduct opinion mining, researchers have recently applied various methods in the classification of opinions based on textual data. The supervised and unsupervised methods have been used as the classification algorithms. In the basic process of opinion mining, there are two well-known approaches. The unsupervised lexicon-based approach is one approach in which the process is guided by rules and heuristics derived from linguistic knowledge. Another approach is the supervised machine learning approach, where algorithms retrieve inherent information from existing labelled data in order to classify newer, unlabelled data [22].

This subsection addresses the research question “What are the techniques used for opinion mining in various domain applications?” All of the reviewed papers used machine learning techniques, the lexicon-based approach, or a mixture of both when executing sentiment analysis. The results reveal that opinion mining or sentiment analysis was conducted using machine learning techniques in 64 papers, while 23 of the reviewed papers applied the lexicon-based approach and 30 papers presented a hybrid approach combining both methods. Figure 5 displays the number of reviewed papers according to the type of opinion mining technique. Other techniques were also discussed in these papers, such as the Kansei approach: five related papers employed the Kansei approach for mining people’s opinions and emotions.

Machine learning

The machine learning method is divided into three approaches: supervised learning, unsupervised learning and semi-supervised learning. Supervised learning uses labelled data that allow algorithms to learn and predict the sentiment of the text. In unsupervised learning, the textual data are not labelled, so the focus is on finding patterns and gaining insight from the data. Based on the reviewed papers, most researchers used machine learning techniques to analyse people’s opinions in the business domain, extracting opinions from reviews left on e-commerce platforms. Machine learning techniques have been applied to mine opinions on businesses and products such as skin care, mobile phones, movies, banking and train services. Machine learning techniques are also used in the health and education domains. In the health domain, machine learning has been used to mine people’s opinions on health-related issues such as COVID-19 and medicine reviews. In the education sector, researchers have focused on the e-learning environment, analysing student reviews of e-learning. Government-related domains, such as politics and the economy, also apply machine learning techniques.

Under supervised learning, machine learning methods include the Naïve Bayes Classifier, Support Vector Machine, Decision Tree and Maximum Entropy. Based on the reviewed articles, the methods most often employed by researchers were the Naïve Bayes Classifier and the Support Vector Machine. In the transportation domain, Mogaji and Erkan [23] used the Naïve Bayes algorithm to identify which sentiment category (positive, negative, or neutral) textual data on Twitter fell into, according to consumer experiences of United Kingdom (UK) train services. The limitation highlighted by that research was that the automated process was prone to error: it needs human involvement to oversee the process, and human emotion does not fit into just three categories of positive, negative, or neutral sentiment. Kaur and Kumar [24] implemented the Naïve Bayes Classifier differently, analysing public opinions on a crisis based on a social media platform. That research enhanced the method by adding unigram features, which help in detecting sentiment that can provide useful information to the government in managing crisis situations. However, the researchers noted the need for comparison with other approaches, such as the Support Vector Machine (SVM), to find the appropriate sentiment classifier for the natural disaster domain.
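To make the Naïve Bayes idea concrete, the following is a minimal sketch of a multinomial Naïve Bayes classifier over unigram counts with add-one (Laplace) smoothing. It is not the implementation used in [23] or [24]; the function names and the tiny training set are invented for illustration only.

```python
import math
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase whitespace tokenizer producing unigram features."""
    return text.lower().split()


def train_nb(samples):
    """Count documents per class and unigram occurrences per class."""
    class_docs = Counter()
    word_counts = defaultdict(Counter)
    vocab = set()
    for text, label in samples:
        class_docs[label] += 1
        for tok in tokenize(text):
            word_counts[label][tok] += 1
            vocab.add(tok)
    return class_docs, word_counts, vocab


def predict_nb(model, text):
    """Return the class with the highest log posterior for the text."""
    class_docs, word_counts, vocab = model
    total_docs = sum(class_docs.values())
    best_label, best_score = None, -math.inf
    for label in class_docs:
        score = math.log(class_docs[label] / total_docs)  # log prior
        total_words = sum(word_counts[label].values())
        for tok in tokenize(text):
            if tok not in vocab:
                continue  # ignore words never seen in training
            count = word_counts[label][tok]
            # Add-one smoothing avoids zero probabilities.
            score += math.log((count + 1) / (total_words + len(vocab)))
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# Hypothetical labelled examples (not taken from any reviewed dataset).
training = [
    ("the train was clean and on time", "positive"),
    ("friendly staff and comfortable seats", "positive"),
    ("the train was late and crowded", "negative"),
    ("rude staff and dirty seats", "negative"),
    ("the train leaves at nine", "neutral"),
]
nb_model = train_nb(training)
print(predict_nb(nb_model, "clean and comfortable"))
print(predict_nb(nb_model, "late and dirty"))
```

A real study would train on thousands of labelled posts and evaluate on held-out data; the toy set here only shows the mechanics of counting and scoring.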

In 2017, Sabuj et al. [25] used SVM to mine opinions from web data and obtained satisfactory results when SVM was applied as a polarity classifier. Based on the accuracy comparison, they found that SVM outperformed Naïve Bayes. SVM was also employed by Zhang et al. [26] to explore negative-sentiment tweets on Twitter. Even though that research contributes to identifying the negative features of text on Twitter, it was observed that a more detailed classification of emotions, such as positive, could be identified by this sentiment analysis method. Ameur et al. [27] used the SVM classifier to determine the polarity ("positive or negative") of comments on Facebook.
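For readers unfamiliar with SVM as a polarity classifier, the sketch below trains a linear SVM on bag-of-words features by sub-gradient descent on the regularised hinge loss. This is a simplified stand-in and not the solver or feature set used in [25], [26] or [27]; the learning rate, regularisation strength, epoch count and example sentences are all assumptions.

```python
from collections import defaultdict


def featurize(text):
    """Bag-of-words counts for a lowercase, whitespace-split text."""
    feats = defaultdict(float)
    for tok in text.lower().split():
        feats[tok] += 1.0
    return feats


def train_linear_svm(samples, epochs=50, lr=0.1, reg=0.01):
    """Sub-gradient descent on hinge loss; labels are +1 or -1."""
    w = defaultdict(float)
    for _ in range(epochs):
        for text, y in samples:
            x = featurize(text)
            margin = y * sum(w[t] * v for t, v in x.items())
            # L2 regularisation shrinks every weight a little.
            for t in list(w):
                w[t] *= 1.0 - lr * reg
            # Examples inside the margin push the weights toward their label.
            if margin < 1.0:
                for t, v in x.items():
                    w[t] += lr * y * v
    return w


def predict_svm(w, text):
    """Return +1 for positive polarity and -1 for negative polarity."""
    score = sum(w.get(t, 0.0) * v for t, v in featurize(text).items())
    return 1 if score >= 0 else -1


# Hypothetical training comments (+1 positive, -1 negative).
svm_samples = [
    ("great service friendly staff", 1),
    ("train was late again", -1),
    ("comfortable and clean train", 1),
    ("rude staff and dirty seats", -1),
]
svm_weights = train_linear_svm(svm_samples)
print(predict_svm(svm_weights, "clean friendly"))
print(predict_svm(svm_weights, "late dirty"))
```

In practice a library solver with tuned hyperparameters and richer features would normally be preferred; the hand-written loop is only meant to expose the margin-based update.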

Researchers also use or combine more than one machine learning technique. Based on the reviewed articles, the Naïve Bayes algorithm and the Support Vector Machine were most often used together to extract opinions and sentiments from textual data in various datasets and social media. Using more than one method was the most common practice in machine learning studies because the predicted outcomes were accurate. Dhahi and Waleed [28], who employed Naïve Bayes and SVM as machine learning classifiers to extract sentiment from tweet datasets, found that Naïve Bayes shows acceptable results. This differs from the research in [29], where SVM performed slightly better than NB after adding stemmed unigram features, which made the precision of the SVM method higher than that of NB. Even though these are the two methods most frequently used in mining opinions, other methods such as maximum entropy and decision trees have also been employed to determine positive and negative opinions in textual datasets, but less often because of their lower accuracy. In 2019, Elhadad et al. [30] proposed an efficient approach for handling tweets in Arabic and English with different processing techniques, such as Decision Trees and Naïve Bayes. The Decision Tree obtained the lowest accuracy, with precision also used as a performance measure for those methods.
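Since the comparisons above rest on accuracy and precision, a short sketch of how these two measures are computed may help. The label lists are invented for illustration and do not reproduce any result from the cited studies.

```python
def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    correct = sum(1 for t, p in zip(y_true, y_pred) if t == p)
    return correct / len(y_true)


def precision(y_true, y_pred, positive="positive"):
    """Of the items predicted as `positive`, the fraction that truly are."""
    truths_for_predicted = [t for t, p in zip(y_true, y_pred) if p == positive]
    if not truths_for_predicted:
        return 0.0  # nothing was predicted positive
    hits = sum(1 for t in truths_for_predicted if t == positive)
    return hits / len(truths_for_predicted)


# Hypothetical gold labels and classifier outputs.
gold = ["positive", "positive", "negative", "negative",
        "positive", "negative", "negative", "negative"]
pred = ["positive", "negative", "negative", "positive",
        "positive", "negative", "negative", "negative"]

acc = accuracy(gold, pred)
prec = precision(gold, pred)
print(acc, prec)
```

Accuracy can look high on imbalanced data even when the positive class is predicted poorly, which is one reason studies report precision alongside it.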

The supervised learning technique has limitations because it relies on training and testing. Researchers need to conduct a time-consuming training phase to get results. Moreover, the training and testing datasets are usually prepared from existing datasets because the machine learning method needs labelled data to train classifiers. Datasets used in the experiment must be labelled with an opinion flag. For example, Twitter and movie review datasets contain positive and negative reviews and are therefore made available with polarity labels (positive, negative, and neutral). Since the classification of sentiments within sentences usually uses machine learning algorithms, the input dataset needs to be labelled.

Random forest, an ensemble learning technique, is another method that researchers have implemented in previous studies. In 2018, Khanvilkar and Vora [31] proposed the use of random forest to classify sentiments in product reviews. The researchers stated that the random forest algorithm will help improve sentiment analysis for product recommendations using multiclass classification. In 2020, Suganya and Vijayarani [32] used deep learning in opinion mining. They found that the execution time of random forest was longer than that of the CNN, one of the deep learning methods. Deep learning is a subfield of machine learning that employs deep neural networks and can be applied in supervised, unsupervised and semi-supervised settings. Recently, deep learning algorithms have been widely used in opinion mining, and this section provides an overview of papers that have applied them. Imran et al. [33] used deep learning in the health domain, employing a deep long short-term memory (LSTM) network to detect polarity and emotion in COVID-19 related tweets. That article successfully detected the correlation between the sentiments and emotions of people in neighbouring countries during the coronavirus (COVID-19) outbreak from their tweets but had some limitations in understanding tweet context.

Other researchers have also used deep learning methods (such as CNN and LSTM) to analyse emotional reactions to events of mass violence and to enhance the capability and accuracy of opinion mining on textual datasets by considering the properties of users and events and generalising conclusions across several events [34]. The researchers observed that the CNN model was an appropriate method with meaningful and representative features for prediction. The deep learning method proved to be capable of classifying opinions into positive, negative, and other emotions. However, these supervised algorithms require a large dataset to predict accurate results, which makes the method time-consuming [35].

Datasets from social media platforms such as Twitter, Facebook and Tumblr are the textual datasets used by researchers. The text mostly consists of user comments, reviews or topic-related words on businesses, products, or events. Researchers have also used existing datasets from websites, such as the IMDB and Amazon review datasets. Several researchers have also applied other sources such as text in news, articles and emails. Figures 6, 7 and 8 present the distribution of articles according to application, technique and dataset platform. The machine learning techniques used in opinion mining from text are summarized in Tables 1 to 6 below.

Table 1 summarizes the Naïve Bayes/Bayesian techniques used in opinion mining based on text.

Table 1 Summary of Naïve Bayes/Bayesian techniques used in opinion mining from text

Table 2 summarizes the Support Vector Machine (SVM) techniques used in opinion mining based on text.

Table 2 Summary of Support Vector Machine (SVM) techniques used in opinion mining from text

Table 3 summarizes the Random Forest (RF) techniques used in opinion mining based on text.

Table 3 Summary of random forest (RF) techniques used in opinion mining from text

Table 4 summarizes the Decision Tree (DT) techniques used in opinion mining based on text.

Table 4 Summary of decision tree (DT) techniques used in opinion mining from text

Table 5 summarizes the Deep learning techniques used in opinion mining based on text.

Table 5 Summary of Deep learning techniques used in opinion mining from text

Table 6 summarizes the logistic regression techniques used in opinion mining based on text.

Table 6 Summary of logistic regression used in opinion mining from text

Lexicon-based approach

Another method for opinion mining or sentiment analysis is the lexicon-based approach. The lexicon-based approach employs a dictionary that records the polarity of each word it contains. Each word found in a text is looked up in the dictionary, and the corresponding sentiment score is applied. The sentiment of the text is then computed from the overall polarity of the words it contains.

The lexicon-based approach can be classified under the unsupervised method. This method involves counting the positive and negative words in the data. It relies on a lexicon, also known as a dictionary, which can be created manually or derived automatically from existing dictionaries. The difference between this method and machine learning is that it does not require any training data, since it only employs the dictionary.
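The counting procedure described above can be sketched in a few lines. The dictionary `TOY_LEXICON` and the function name are hypothetical and far smaller than any real sentiment lexicon; the snippet only illustrates the lookup-and-sum idea under those assumptions.

```python
# Hypothetical polarity dictionary: +1 for positive words, -1 for negative.
TOY_LEXICON = {
    "good": 1, "great": 1, "clean": 1,
    "bad": -1, "late": -1, "dirty": -1,
}


def lexicon_polarity(text, lexicon=TOY_LEXICON):
    """Sum word polarities and map the total to a three-class label."""
    tokens = [tok.strip(".,!?") for tok in text.lower().split()]
    score = sum(lexicon.get(tok, 0) for tok in tokens)  # unknown words score 0
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"


print(lexicon_polarity("The train was clean and the staff were great."))
print(lexicon_polarity("The train was late and dirty."))
print(lexicon_polarity("The train leaves at nine."))
```

No training step is involved: the quality of the result depends entirely on the coverage and domain fit of the dictionary, and plain counting like this ignores negation and context.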

In this research, 23 articles that use the lexicon-based approach for opinion mining or sentiment analysis were reviewed. These articles implemented the approach to conduct emotion analysis and determine the sentiments and opinions in textual datasets. Based on the reviewed articles, most research utilises the lexicon-based approach to extract opinions in the business, product and e-commerce domains. Half of the reviewed articles used a lexicon-based approach for analysing sentiment and emotion data on products and services such as cameras, mobile phones, laptops, tablets, TVs, video surveillance devices and movie reviews. Several studies have also focused on the education and health domains. Researchers also employ this approach to analyse people’s opinions on topics related to government issues, such as political issues, election-related matters, and environmental and energy resources.

For the lexicon-based approach, two techniques have been used by researchers: the dictionary-based approach and the corpus-based approach. The first technique, the dictionary-based approach, is employed to pinpoint the opinion words and their polarities.

Usually, to determine the sentiment or opinion of a word, the dictionary-based approach looks up synonyms, antonyms and hierarchies in existing lexicons that carry sentiment information. In the existing lexicon, three numerical sentiment scores are used: Obj(s), Pos(s) and Neg(s), which signify the objective, positive and negative scores of a synset s. This method is utilised to tag polarity values using the sentiment dictionary, also known as the sentiment lexicon. Fernández-Gavilanes et al. [35] employed the dictionary-based approach to detect opinions in online text such as tweets and reviews. The researchers stated that an advantage of this method is that it can be applied to subject domains other than the one it was designed for, and that it addresses the lack of context in generic lexicons by employing a context-based algorithm that helps create a dictionary/lexicon for a particular context.
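One common reading of the dictionary-based technique is that a small set of seed words is expanded through synonym and antonym links: synonyms inherit the polarity of a known word, while antonyms receive the opposite polarity. The sketch below illustrates that idea with a hand-made toy graph; the word lists and the function name `expand_lexicon` are assumptions for this example and are not drawn from WordNet or from any cited study.

```python
from collections import deque

# Hypothetical synonym and antonym links standing in for a real thesaurus.
SYNONYMS = {
    "good": ["fine", "nice"],
    "nice": ["pleasant"],
    "bad": ["poor"],
    "poor": ["awful"],
}
ANTONYMS = {"good": ["bad"]}


def expand_lexicon(seeds, synonyms, antonyms):
    """Breadth-first propagation of polarity from seed words."""
    polarity = dict(seeds)
    queue = deque(seeds)
    while queue:
        word = queue.popleft()
        for syn in synonyms.get(word, []):
            if syn not in polarity:
                polarity[syn] = polarity[word]  # same polarity as the source
                queue.append(syn)
        for ant in antonyms.get(word, []):
            if ant not in polarity:
                polarity[ant] = -polarity[word]  # flipped polarity
                queue.append(ant)
    return polarity


expanded = expand_lexicon({"good": 1}, SYNONYMS, ANTONYMS)
print(expanded)
```

Starting from a single seed, the expansion reaches every word connected to it, which is also why errors in the underlying links can spread to many entries.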

Abd et al. [80] aimed to recognise the emotional segmentation of movie reviewers in the entertainment domain by using this approach to extract sentiments from a given text and classify them. The lexicon-based approach helped them achieve a significant result by identifying the contextual polarity for a large subset of sentiments. They suggested combining this dictionary idea with machine learning to enhance the accuracy of the result. The researchers also used existing dictionaries such as WordNet and SentiWordNet.

According to the papers reviewed, SentiWordNet is the most used lexicon for the lexicon-based approach. SentiWordNet is a lexical resource derived from WordNet that assigns numerical values to each synset, representing its positivity, negativity and objectivity scores [81]. Each score has a value between 0 and 1, and the positivity, negativity and objectivity scores of a synset sum to 1. For example, Khan et al. [82] used SentiWordNet to create their own sentiment dictionary, which enhanced polarity classification on a movie review dataset and extended the capability of SentiWordNet.
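The score triple described above can be illustrated with a minimal sketch. The lexicon entries below are made-up values chosen for illustration and are not real SentiWordNet entries; only the constraint that the three scores lie between 0 and 1 and sum to 1 follows the description in the text. The function names are hypothetical.

```python
# Toy SentiWordNet-style lexicon: word -> (positive, negative, objective).
# The values are illustrative assumptions, not real SentiWordNet entries.
TOY_LEXICON = {
    "good": (0.75, 0.0, 0.25),
    "bad": (0.0, 0.625, 0.375),
    "boring": (0.0, 0.5, 0.5),
    "plot": (0.0, 0.0, 1.0),
}


def check_lexicon(lexicon):
    """Verify that each entry's three scores lie in [0, 1] and sum to 1."""
    for word, scores in lexicon.items():
        assert all(0.0 <= s <= 1.0 for s in scores), word
        assert abs(sum(scores) - 1.0) < 1e-9, word


def polarity(text, lexicon=TOY_LEXICON):
    """Sum (positive - negative) over all tokens; unknown words count as objective."""
    total = 0.0
    for token in text.lower().split():
        pos, neg, _obj = lexicon.get(token, (0.0, 0.0, 1.0))
        total += pos - neg
    return total


def label(text, lexicon=TOY_LEXICON):
    """Map the summed polarity to a coarse positive/negative/neutral label."""
    score = polarity(text, lexicon)
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"
```

In this sketch, a text is labelled by the sign of the summed difference between positive and negative scores. This is one simple aggregation choice among many and is not claimed to be the scheme used in the reviewed papers.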

SentiWordNet is the most frequently used lexicon because of its improved usability in opinion mining. Other lexicons, such as MPQA, WordNet, Vader and the Pattern lexicon, were selected less often by researchers because of their more limited capabilities in opinion classification. However, they can still be applied to opinion mining. For instance, WordNet was used as an association list for an opinion classifier of user comments on online media platforms. It was observed that the dictionary enables the classification of irrelevant comments with high precision but is less accurate in finding relevant and positive comments [83]. Recently, Dey et al. [84] compared the Vader lexicon, another type of dictionary, with two other classification methods, the n-gram based SO-CAL approach and the Senti-N-Gram lexicon, in determining the polarity of opinions in movie reviews. The results show that the Vader lexicon scored lower on accuracy than those two methods.

Other researchers used an existing dictionary called the NRC emotion lexicon to classify opinions or polarity according to emotions. The NRC emotion lexicon is a list of words and their corresponding emotions. It includes eight emotions (fear, sadness, disgust, anger, trust, surprise, anticipation and joy) and two sentiments (positive and negative). In 2019, Swain and Seeja [85] employed this lexicon to develop a web-based application that predicts polarity and emotion from Twitter data. The lexicon helps classify people’s opinions into these emotion and sentiment categories and helps governments analyse people’s perceptions through sentiment analysis. However, the web application was only tested on tweets about demonetization in India, not on other domains or issues.
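A word-emotion lexicon of this kind can be sketched as a mapping from words to sets of emotion and sentiment categories. The entries below are hypothetical and are not taken from the NRC emotion lexicon; the function name `emotion_profile` is likewise an assumption made for this illustration.

```python
from collections import Counter

# Hypothetical excerpt in the style of a word-emotion association lexicon.
# The associations are illustrative assumptions, not real NRC entries.
TOY_EMOTION_LEXICON = {
    "cash": {"anticipation", "trust", "positive"},
    "shortage": {"fear", "sadness", "negative"},
    "queue": {"anger", "negative"},
    "relief": {"joy", "positive"},
}


def emotion_profile(text, lexicon=TOY_EMOTION_LEXICON):
    """Count how many tokens are associated with each emotion or sentiment."""
    counts = Counter()
    for token in text.lower().split():
        # Words missing from the lexicon contribute nothing.
        counts.update(lexicon.get(token, ()))
    return counts
```

For a text such as "Cash shortage and long queue", the profile counts two negative associations and one each for fear, sadness and anger, which shows how a single lookup can yield both emotion and polarity information.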

As previously mentioned, the other method in the lexicon-based approach is the corpus-based approach. It exploits co-occurrence patterns of words found in unstructured textual documents: a new sentiment word is recognised based on its relationship with other words. This approach can use an existing dictionary or generate a new lexicon for the research domain to clarify the opinion or sentiment. Deng et al. [86] developed a corpus on a vital social media research topic and used it to extract people’s opinions. Their results indicate that the approach is helpful for domain-specific sentiment classification built on existing sentiment lexicons. Still, the effectiveness of the method depends on a heuristic assumption, namely that frequently co-occurring words are likely to have a similar sentiment orientation. The corpus-based approach can be used to analyse the diversity of online opinions that have a potential impact in commercial, industrial and academic environments. However, the extraction and processing of opinions are complex and difficult tasks.
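The co-occurrence heuristic can be sketched with point-wise mutual information (PMI) against a few seed words. This is a generic illustration and not the method of Deng et al. [86]; the corpus, the seed words, the document-level co-occurrence window and the smoothing constant are all assumptions chosen for the example.

```python
import math

# Tiny illustrative corpus and seed words (assumptions for this sketch).
CORPUS = [
    "great sturdy camera",
    "sturdy and excellent lens",
    "poor flimsy tripod",
    "flimsy and terrible strap",
    "great lens",
]
POS_SEEDS = {"great", "excellent"}
NEG_SEEDS = {"poor", "terrible"}


def semantic_orientation(word, documents, pos_seeds, neg_seeds, smoothing=0.01):
    """Return PMI(word, positive seeds) - PMI(word, negative seeds).

    Co-occurrence is counted at document level; `smoothing` avoids log(0).
    """
    docs = [set(doc.lower().split()) for doc in documents]
    n_docs = len(docs)
    with_word = sum(1 for d in docs if word in d)

    def pmi(seeds):
        with_seed = sum(1 for d in docs if d & seeds)
        with_both = sum(1 for d in docs if word in d and d & seeds)
        return math.log2(
            (with_both + smoothing) * n_docs
            / ((with_word + smoothing) * (with_seed + smoothing))
        )

    return pmi(set(pos_seeds)) - pmi(set(neg_seeds))
```

A positive result suggests that the word tends to appear alongside positive seeds, and a negative result the opposite. In this toy corpus "sturdy" receives a positive orientation and "flimsy" a negative one, although neither word is in the seed lists.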

The lexicon-based approach depends on lexical resources, and its overall success is highly dependent on their quality. It assumes that the polarity of a line of text can be determined from the polarity of the words that constitute it. Because of the complex nature of natural language, this approach does not address all aspects of language, particularly slang, irony and negation. Relying on sentiment words alone is insufficient. Some words have varying meanings depending on the application, and some phrases that include emotion words might not express any opinion or emotion. As a result, this technique can have low recall and low accuracy. However, the lexicon-based approach has its own advantages: it can simply count positive and negative words, it is adaptable to many languages, and it is fast in terms of processing because it does not require training data. The following tables summarise the reviewed papers on the lexicon-based approach used in opinion mining.
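The negation limitation noted above can be made concrete with a small sketch. The word lists are hypothetical, and the one-token negation window is a common heuristic assumed here for illustration; it is not drawn from the reviewed papers and does not handle irony or slang.

```python
# Hypothetical word lists for illustration only.
POSITIVE = {"good", "great", "helpful"}
NEGATIVE = {"bad", "poor", "slow"}
NEGATORS = {"not", "never", "no"}


def count_score(text):
    """Naive lexicon score: positive word count minus negative word count."""
    tokens = text.lower().split()
    return sum(t in POSITIVE for t in tokens) - sum(t in NEGATIVE for t in tokens)


def negation_aware_score(text):
    """Same count, but flip a word's polarity if the previous token is a negator."""
    tokens = text.lower().split()
    score = 0
    for i, token in enumerate(tokens):
        value = (token in POSITIVE) - (token in NEGATIVE)
        if i > 0 and tokens[i - 1] in NEGATORS:
            value = -value
        score += value
    return score
```

For "the service was not good", plain counting yields a positive score because it only sees "good", while the negation-aware variant yields a negative one. Longer-range negation ("not at all good") would still be missed by this simple window.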

We found that the most used dataset platform for the lexicon-based approach is Twitter, followed by movie review datasets. Researchers also frequently use datasets from websites such as online shopping sites. Facebook and blogs have been utilised to a lesser extent, depending on the specific research domain. Figs. 9, 10 and 11 present the distribution of articles according to their application, technique and dataset platforms. Tables 7 and 8 below show the details of articles that employ the dictionary-based approach and the corpus-based approach.

Table 7 Summary of the lexicon-based approach (dictionary based approach) used for opinion mining

Table 8 Summary of the lexicon-based approach (Corpus based approach) used for opinion mining

Hybrid approach

Researchers have also implemented the hybrid approach in performing opinion mining. The hybrid approach compensates for the shortcomings of the machine learning and lexicon-based approaches by combining two or more methods to achieve better accuracy in extracting and classifying people’s opinions. Based on the reviewed research papers, most researchers use the hybrid approach for opinion mining of products and businesses such as cameras, hairdryers, aircraft, IKEA products and the stock market. It has also been employed in the education and health sectors. We found that the most used machine learning techniques in the hybrid approach are the Naïve Bayes classifier and Support Vector Machine. Other methods such as fuzzy rule-based systems, random forest and deep learning have also been combined with the lexicon-based approach. The most used lexicon/dictionary in the hybrid approach is SentiWordNet, which 16 papers implemented. Other lexicons such as WordNet, the Pattern lexicon, VADER and the NRC emotion lexicon were also used. Mahajan and Rana [103] applied eight emotions from the NRC emotion lexicon to quantify public emotion. Several studies have also used existing sentiment lexicon packages (such as “sentiment r”) and existing dictionaries (such as English and Dutch sentiment dictionaries). In addition, many articles built their own lexicon and combined it with a machine learning method.

In the business/tourism domain, Chen et al. [104] implemented the hybrid approach to construct a tourism sentiment model for text sentiment classification that accurately captures tourist emotions and benefits management and business operations. The first method was the dictionary-based method, one of the lexicon-based approaches, which calculated the sentiment value of a single-sentence text. For the second method, the Naïve Bayes (NB) machine learning algorithm was used to construct the classifier. The researchers observed that using a dictionary method alone gave unacceptable corpus classification results, and that classification improved when the NB classifier was used. Keyvanpour et al. [105] implemented a hybrid approach based on lexicon and machine learning to recognise people’s opinions on social networks. The polarity of opinions toward a target word was determined using a lexicon-based method. The textual features of words, sentences and opinions were analysed and classified using a deep learning method (neural-fuzzy network). The method was compared with other supervised methods and found to be slightly slower, because its meta-heuristic algorithm repeatedly calculates the cost of each member of the population using a cost function until it determines optimum values for the parameters.
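One simple way to combine a dictionary score with a Naïve Bayes classifier is sketched below. It is not the pipeline of Chen et al. [104] or Keyvanpour et al. [105]: the lexicon, the training sentences, the class `TinyNaiveBayes` and the rule "use the lexicon when it gives a non-zero score, otherwise ask the classifier" are all assumptions made for illustration.

```python
import math
from collections import Counter, defaultdict

# Hypothetical lexicon and training data (assumptions for this sketch).
LEXICON = {"lovely": 1, "clean": 1, "dirty": -1, "rude": -1}
TRAIN = [
    ("the queue was short and the guide was friendly", "positive"),
    ("friendly staff and short wait", "positive"),
    ("long queue and crowded bus", "negative"),
    ("crowded and noisy hall", "negative"),
]


def lexicon_score(text):
    """Sum the lexicon values of all tokens; unknown words score 0."""
    return sum(LEXICON.get(t, 0) for t in text.lower().split())


class TinyNaiveBayes:
    """Multinomial Naive Bayes with add-one (Laplace) smoothing."""

    def fit(self, texts, labels):
        self.class_counts = Counter(labels)
        self.word_counts = defaultdict(Counter)
        for text, label in zip(texts, labels):
            self.word_counts[label].update(text.lower().split())
        self.vocab = {w for c in self.word_counts.values() for w in c}
        return self

    def predict(self, text):
        total_docs = sum(self.class_counts.values())
        best_label, best_logp = None, -math.inf
        for label, n_docs in self.class_counts.items():
            counts = self.word_counts[label]
            denom = sum(counts.values()) + len(self.vocab)
            logp = math.log(n_docs / total_docs)
            for token in text.lower().split():
                if token in self.vocab:  # ignore words never seen in training
                    logp += math.log((counts[token] + 1) / denom)
            if logp > best_logp:
                best_label, best_logp = label, logp
        return best_label


def hybrid_label(text, classifier):
    """Use the lexicon when it gives a non-zero score; otherwise ask the classifier."""
    score = lexicon_score(text)
    if score != 0:
        return "positive" if score > 0 else "negative"
    return classifier.predict(text)
```

A usage example under the same assumptions: after `clf = TinyNaiveBayes().fit(*zip(*TRAIN))`, the call `hybrid_label("dirty room", clf)` is decided by the lexicon, while `hybrid_label("friendly guide", clf)` falls through to the classifier because no lexicon word is present. Other combinations, such as feeding the lexicon score to the classifier as a feature, are equally plausible.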

In contrast, Hamad et al. [106] used more than one machine learning technique in their hybrid approach for research based on product reviews in social networks. The flow is similar to the lexicon-based approach: the first phase uses a lexicon dictionary to determine the sentiment polarity of a sentence. The difference is that machine learning methods are then used to find and classify the accurate polarity and emotion label of each sentence. This research employed ZeroR, NB, K-NN and linear SVM as the machine learning methods and compared the performance of the classifiers. It was observed that K-NN, NB, SVM and ZeroR all have a reasonable accuracy rate. However, K-NN outperformed NB, SVM and ZeroR in terms of accuracy and model training time. K-NN achieved the highest accuracy rates of 96.58% and 99.94% for the iPad and iPhone emotion data sets. Despite these results, the researchers highlight remaining challenges for this approach, such as handling implicit product attributes, building a summary of opinions based on product attributes, and dealing with negated opinion expressions. Tables 9 and 10 present a summary of the reviewed papers on the hybrid approach used in opinion mining.

Table 9 Summary of hybrid approach (combination of only one machine learning method with the lexicon-based approach)

Table 10 Summary of hybrid approach (combination of more than one machine learning method with the lexicon-based approach)

The combination of the lexicon-based approach with machine learning is well suited to mining people’s opinions and emotions from textual datasets in specific research domains. Based on the reviewed papers, datasets from social media platforms such as Twitter and Facebook were the most popular. The IMDB movie review dataset comes next, followed by travel review datasets, which have become well-known datasets for applying the hybrid approach. Figs. 12, 13 and 14 present the distribution of articles according to application, technique and dataset platforms. The chart in Fig. 14 shows that NB is the most employed machine learning technique and that SentiWordNet is one of the most popular lexicons used by researchers. NB is applied to opinion prediction in various domains because of its simplicity and fast processing time. The simple structure of this method makes it easy to implement and results in a high level of effectiveness. Meanwhile, the ease of using SentiWordNet to look up opinions contributed to its frequent use by researchers. Researchers combine the lexicon-based approach with either one or several machine learning methods. For example, several researchers employed only NB or SVM together with a dictionary-based approach, using SentiWordNet and the NRC emotion lexicon as the dictionary. Others combine more than one machine learning method, such as Naïve Bayes, Support Vector Machine and Decision Tree (J48), with the dictionary-based approach as their hybrid approach.

Kansei approach

Recently, researchers have begun to apply the Kansei approach as a new method in the opinion mining-related domain. The Kansei approach has been used to study emotions toward certain entities based on textual data, such as product reviews. After reviewing papers that utilised the Kansei approach, we found that most research focused on using emotions as the mechanism for measuring people’s expressions toward certain entities. This makes the Kansei approach one of the possible opinion mining approaches that can help enhance and improve techniques to mine people’s opinions. Among the existing Kansei approaches, the most frequently used are Kansei Engineering (Type 1) and Kansei evaluation model techniques.

One study used the Kansei approach to study visual content and investigate the emotions evoked in younger viewers by extremist YouTube videos [133]. The method helps in finding the specific emotions related to content on an online social platform, but it does not compute any emotion score that could help enhance the accuracy of emotion classification. In contrast, other researchers used the Kansei approach to construct a Kansei evaluation model for analysing product design from product reviews on the web by applying NLP methods in the business/product domain [134]. With these methods, the model can calculate and recognise the related scores evaluated by subjective experiments. The method is useful for product designs that are closely related to people’s feelings. However, it focused only on finding people’s opinions about product design from reviews on online platforms.

Opinion mining using Kansei has not been fully explored yet, but recently several articles have combined the Kansei methodology with text mining techniques. In the business/services domain, Hsiao et al. [135] used Kansei Engineering and text mining to analyse opinions regarding hotel services from people’s online review comments. Kansei Engineering, which is one of the methods in the Kansei approach, also uses emotions as the mechanism for evaluating people’s perceptions toward certain entities to mine people’s opinions from text datasets. The combination of Kansei Engineering and text mining was effective in extracting and analysing the relationship between consumers’ emotions and service characteristics, which can help improve the development of services and products in the hotel domain. However, this method did not assign any degree values to the extracted emotions, nor did it involve polarity classification. More recently, new research has integrated the Kansei approach and machine learning in mining people’s opinions. Li et al. [136] combined Kansei Engineering with machine learning techniques such as Support Vector Machine (SVM) to analyse reviews from online shopping web pages, and included polarity classification of degree words. The integrated method helped close the gap left by opinion mining that focuses only on positive or negative polarity classification of review texts. It also effectively assisted designers and manufacturers in recognising customers’ emotions toward product design from the review texts, which facilitates the product design process. The research of Hsiao et al. and Li et al. provides a relevant foundation for applying the Kansei approach to other domains. For instance, the combination of the Kansei approach and machine learning techniques for opinion mining in the national security domain is a matter that can be further explored. Table 11 presents the list of reviewed articles regarding the Kansei approach.

Table 11 Summary of papers reviewed using the Kansei approach for mining people’s opinions

Drawbacks of opinion mining

Opinions and emotions can be extracted from textual datasets, such as sentences from reviews, text in online news and blogs, and social media posts, using opinion mining techniques. However, the results of opinion mining usually take the form of sentiments or opinions that are positive, negative or neutral. Specific emotions, such as anger and sadness, have not been fully explored in opinion mining for the national security domain. Several researchers have extracted emotions from text. However, this is challenging because more than one technique is needed, which can require significant time, and a lexical library is required to look up the emotion associated with each word. Issues also exist in finding the best technique and method for classifying and extracting people’s opinions and emotions. Each opinion mining technique has its own difficulties and deficiencies. Opinion mining techniques that use machine learning and the lexicon-based approach do not assign identified emotions to specific domains. It would be helpful to mine people’s opinions within text according to specific domains.

Based on the research discussed in this study, Kansei Engineering has proven to be a potential method for evaluating the emotions evoked by a certain entity. Overall, there is a gap to be addressed: combining Kansei Engineering with the opinion mining hybrid approach (the combination of machine learning techniques and the lexicon-based approach) to extract and mine emotions and opinions within text in cyberspace according to specific domains, such as national security. Kansei Engineering involves several steps to assess emotions towards a specimen. In preparing the assessment, human involvement is needed to collect a set of evaluation words suitable for the specimens of interest, arrange the evaluation word space, and choose suitable evaluation words for the assessment. The words collected through this approach can be used to develop a dictionary that acts as a lexicon in mining people’s opinions. This is similar to existing lexicons such as the NRC emotion lexicon, which was constructed in a comparable way: its word list was created with human involvement in finding the words and evaluating the related emotions.
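How human ratings of evaluation words might be turned into a lexicon can be sketched as follows. The words, the 5-point rating scale, the ratings themselves and the threshold of 3.5 are all hypothetical; the sketch only illustrates the idea of aggregating human judgements into word-emotion entries and is not a procedure reported in the reviewed papers.

```python
from statistics import mean

# Hypothetical annotator ratings on a 5-point scale (1 = no association,
# 5 = strong association) for candidate evaluation words and emotions.
RATINGS = {
    ("riot", "fear"): [5, 4, 5],
    ("riot", "anger"): [4, 4, 3],
    ("riot", "joy"): [1, 1, 2],
    ("ceasefire", "trust"): [4, 5, 4],
    ("ceasefire", "fear"): [2, 1, 2],
}


def build_emotion_lexicon(ratings, threshold=3.5):
    """Keep (word, emotion) pairs whose mean rating reaches the threshold."""
    lexicon = {}
    for (word, emotion), scores in ratings.items():
        average = mean(scores)
        if average >= threshold:
            lexicon.setdefault(word, {})[emotion] = round(average, 2)
    return lexicon
```

With these made-up ratings, "riot" is kept with fear and anger, "ceasefire" is kept with trust, and weakly rated pairs are dropped. The retained mean ratings could then serve as intensity values in a domain dictionary.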

Challenges for utilising machine learning, lexicon-based and Kansei approach in opinion mining

Researchers have been using opinion mining in the business and product development sectors because it can help in mining people’s opinions regarding products, and the results can be used to enhance product capability. Opinion mining is also used in government and health, and its application is still expanding. However, challenges exist in opinion mining applications, such as the need for a dictionary that can be used across different domains to produce a polarity score for a dataset. For example, Fischer and Steiger [72] stated that, in the health sector, the available dictionaries limited their research: their problem was finding a specific dictionary for classifying medical literature. In addition, extracting emotions from text is challenging due to the limited availability of domain-specific emotion words, since it depends on an existing library for scoring the opinions and emotions of words. Asghar et al. [138] found that, when extracting emotion from sentences, there is a limitation in incorporating domain-specific words and scoring such words automatically without performing a lookup operation in an existing library, such as SWN.

There is also a problem with the methods used for mining people’s opinions and emotions. Although the Kansei approach has proven capable of determining people’s emotions regarding certain entities or artefacts, several challenges require further enhancement of this technique. Most researchers have adopted manual methods, such as questionnaires, and finding the right emotion in this way requires significant time. For example, traditional SD questionnaires are widely used in the Kansei approach. This method is reliable but cumbersome because some research can take several years to complete, and hundreds of respondents must be involved [139]. It is challenging because Kansei is still a new approach and has limitations such as the lack of a systematic method for assigning scores to entities in emotion evaluation experiments. In 2018, Yamada et al. [134] implemented a text mining technique to perform Kansei evaluation for product design. They found that the method is useful and automatic. However, they stated that some problems remain to be fixed, such as the need to provide an appropriate score to entities used in the subjective evaluation experiment.

Future research directions of opinion mining for national security

Future work should build on the theoretical findings on opinion mining methods and the systematic literature review (SLR) accomplished in this research. Our analysis shows that opinion mining has been utilised in several popular domains such as business, the stock market and entertainment. Most of the articles surveyed in this SLR reported successful experiments using various techniques to mine people’s opinions based on text in cyberspace. The lack of domain-specific emotion words is a limitation when extracting emotions from text because of the high dependency on existing libraries to determine the opinions and emotions of words. The Kansei approach has the potential to address this gap. These findings encouraged us to explore enhanced techniques for opinion mining-related work in the domain of national security.

National security overview

The term “national security” came to prominence in American politics at the end of World War II and has held the attention of many ever since. The early development of national security focused more on the military. Nowadays, the concept covers a broad range of non-military aspects, and it will continue to develop to adapt to current events around the world. National security is a category in political science [140]. It is a dynamic situation in which the state and society are protected from threats of armed aggression, political dictatorship and economic coercion. Two main concepts define national security: ensuring the nation’s security and securing its citizens [141].

When a country confronts direct and indirect threats, the government must mobilise its national security system [142]. National security refers to a country’s ability to be free from internal or external threats to its core values. For example, social threats may include hostility from neighbouring nations, invasion by a terrorist group and global economic trends that have an impact on the country’s well-being. In other cases, the threat may be a natural disaster or an outbreak of viral disease. Threats may affect the harmony and sovereignty of the country. Economic, political and social issues are of high interest and often debated in many nations since they can influence the elements of national security. The basic national security elements are military and non-military. Military security is the ability of a nation to defend itself against or intercept military violence from outside. The non-military element relates to political security, food security, economic security, human security, energy and natural resources security, environmental security, border security, cybersecurity and health security [143]. Thus, the association between national security elements and citizens’ emotions must be studied so that efforts to maintain and strengthen these elements can be implemented [144].

Hybrid approach of machine learning, lexicon-based and Kansei approaches for opinion mining in national security domain

Opinion mining is an emerging field of data mining that can be utilised to extract information, such as people’s opinions and emotions, from a vast volume of reviews and text on social platforms regarding any product or topic. Based on the reviewed articles, several methods have been used for opinion mining, such as the machine learning technique, the lexicon-based approach, the hybrid approach and the Kansei approach.

Various studies have reported drawbacks and difficulties in opinion mining techniques, such as the lack of specific emotions in opinion mining research and the limited efficiency of machine learning techniques and lexicon-based approaches. Therefore, this research suggests employing the Kansei approach combined with the machine learning technique and the lexicon-based approach as a hybrid approach. The limitation of the Kansei approach lies in its use of emotions and in the evaluation process needed to determine the right and specific emotions of people towards an artefact. Although the Kansei method is not annotated with polarity scores, this can be solved by combining it with the machine learning technique and lexicon-based approach to establish a dictionary for the national security domain. The machine learning technique and lexicon-based approach will help to calculate the text polarity score and enhance the accuracy of the opinion result. Therefore, this research presents a new domain: using the hybrid approach for opinion mining in national security.

Based on the review of the selected papers in the previous sections, machine learning, the lexicon-based approach and the Kansei approach have demonstrated their capability of extracting people’s emotions in opinion mining. However, the lack of domain-specific emotion words is a limitation when extracting emotions from text, due to the high dependency on existing libraries for scoring the opinions and emotions of words. The existing libraries that include emotions are the NRC Word-Emotion Association Lexicon (known as the NRC Emotion Lexicon or EmoLex) and the NRC Emotion Intensity Lexicon (also called the Affect Intensity Lexicon). The NRC Word-Emotion Association Lexicon is an emotion lexicon constructed for the English language. It can classify text into eight emotions (anger, anticipation, disgust, fear, joy, sadness, surprise and trust) and two sentiments (positive and negative). The NRC Emotion Intensity Lexicon differs in that it cannot classify text as positive or negative sentiment, because it contains a list of English words and their associations with only the eight basic emotions (anger, anticipation, disgust, fear, joy, sadness, surprise, trust).

Thus, the Kansei approach can be utilised to fill this gap by developing a dictionary that incorporates domain-specific words for a specific domain, such as national security, in opinion mining. For future research, this study suggests adopting a hybrid approach that combines the machine learning method and the lexicon-based approach with the Kansei approach to mine people’s opinions and emotions for national security. Emotions can be used as parameters related to national security risk in various scenarios, such as anger and fear toward certain political issues that can bring unwanted risks such as riots, coups, terrorism and civil war.

Machine learning and the lexicon-based approach can classify and predict people’s opinions, while the Kansei approach can be used to clarify people’s emotions in the national security domain. This hybrid approach will enable researchers, businesses and governments to observe sentiments and emotions simultaneously for national security observation purposes. The expected output of this combination is an evaluation of people’s sentiments and emotions that includes a polarity score for each national security element.
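The expected output described above might take a shape like the one in the following sketch. Everything in it is hypothetical: the domain dictionary, the polarity values, the mapping of words to national security elements and the function name `summarise`. It covers only the dictionary side of the proposed hybrid; a trained classifier could replace or complement the lookup.

```python
from collections import Counter, defaultdict

# Hypothetical domain dictionary: word -> (security element, polarity, emotion).
DOMAIN_LEXICON = {
    "riot": ("political", -0.8, "fear"),
    "coup": ("political", -0.9, "fear"),
    "corruption": ("political", -0.7, "anger"),
    "shortage": ("food", -0.6, "fear"),
    "harvest": ("food", 0.5, "joy"),
}


def summarise(posts, lexicon=DOMAIN_LEXICON):
    """Aggregate polarity and emotion counts per national security element."""
    polarity = defaultdict(float)
    emotions = defaultdict(Counter)
    for post in posts:
        for token in post.lower().split():
            if token in lexicon:
                element, score, emotion = lexicon[token]
                polarity[element] += score
                emotions[element][emotion] += 1
    return {
        element: {
            "polarity": round(polarity[element], 2),
            "dominant_emotion": emotions[element].most_common(1)[0][0],
        }
        for element in polarity
    }
```

Given a handful of posts, the function returns one record per element that was mentioned, each with a summed polarity score and the most frequent emotion. Elements with no matching words are simply absent from the result.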

Benefits of performing opinion mining in national security

Various activities in cyberspace pose a risk to national security, such as cyber rumours, fake news websites and hate speech [145]. These types of threats can be significant risks to national security [146]. Individuals involved in such activities can indirectly become conspirators since every cyberspace user has a distinct persona, opinion, religion and emotion. They can willingly or unwillingly believe false rumours and continue to endorse and share them with others. These human emotions and behaviours can affect cyberspace. Thus, emotion is deemed a crucial mechanism for detecting threats to national security. Since cyberspace is an emotionally rich space where people can express their emotions, sentiments and opinions, the connection between emotion and hate speech in cyberspace is undeniable [147]. Related research on emotion in the national security field found that fear and anger affect politics, which is one element of national security [148]. The relation between emotion and national security elements can also be seen in how humans react to issues related to environmental security: one study found that ‘hope’ is a reaction that people have towards climate change [149].

Implementing opinion mining in the national security domain is highly beneficial because most information in online systems is displayed in textual form. A substantial amount of textual data is generated since individuals or personas in cyberspace usually express emotions through words or text [150]. By utilising opinion mining to detect threats in cyberspace, the state of national security can be strengthened.

Limitation

This research intends to incorporate all published literature, such as articles, press articles, and research papers, referring to the implementation and application of opinion mining techniques in cyberspace, including the utilisation of the Kansei approach. It uses a systematic literature search methodology to collect valuable information from a collection of available literature. It reveals current developments of opinion mining and the Kansei approach in mining people’s sentiment, paving the road forward for further research. The scope of this work is restricted to the technique of opinion mining and the Kansei approach in mining people’s sentiments based on text to implement in the national security domain. Since 2003, research in this field has been growing and continues at a steady pace of development.

Conclusion

Opinion mining has been a helpful mechanism for finding people’s sentiments and emotions based on text in cyberspace. The reviewed papers show that opinion mining is employed in various domains, such as business/products, transportation, health, government, entertainment and education. However, the application of opinion mining techniques has several drawbacks, which have been discussed in this research. This study can therefore serve as a reference for future research on determining a suitable method for new research domains such as national security. Although mining people’s opinions and emotions for national security is a relatively new research area, it should be explored and investigated by researchers to enhance the literature within the national security field. This will further secure and strengthen a state’s national security against unwanted threats. This research suggests that the combination of the machine learning method, the lexicon-based approach and the Kansei approach can be a possible mechanism for evaluating people’s emotions within text, including the text’s opinion polarity and possible emotion flags that can influence people’s acceptance of information in cyberspace.

Availability of data and materials

All papers studied in this systematic review are available in Scopus, IEEE Xplore, ACM Digital Library, SpringerLink and ScienceDirect. Please see the references below.

Abbreviations

ACM: Association for Computing Machinery
IEEE: Institute of Electrical and Electronics Engineers
NLP: Natural Language Processing
COVID-19: Coronavirus disease 2019
KE: Kansei Engineering
LSTM: Long short-term memory
LR: Logistic Regression
NB: Naïve Bayes
SVM: Support Vector Machines
DT: Decision Tree
SGD: Stochastic Gradient Descent
NNs: Neural Networks
RF: Random Forest
LDA: Latent Dirichlet Allocation
KNN: K-Nearest Neighbour
ML-KNN: Multilabel K-Nearest Neighbours
ME: Maximum Entropy
CRF: Conditional Random Fields
AdaBoost: Adaptive Boosting
BFTree: Best-First Decision Tree
OneR: One Rule
CNN: Convolutional Neural Network
ANN: Artificial Neural Network
DBN: Deep Belief Network
DNN: Deep Neural Network
RNN: Recurrent Neural Network
GRU: Gated Recurrent Unit
BERT: Bidirectional Encoder Representations from Transformers
BPNN: Back-Propagation Neural Networks
SD: Semantic differential
IMDB: Internet Movie Database
PMI: Point-wise mutual information
MPQA: Multi-Perspective Question Answering
SWN: SentiWordNet
Vader: Valence Aware Dictionary and Sentiment Reasoner

References

Hannigan TR, et al. Topic modeling in management research: rendering new theory from textual data. Acad Manag Ann. 2019;13(2):586–632. https://doi.org/10.5465/annals.2017.0099.
Stevens D, Vaughan-Williams N. Citizens and security threats: issues, perceptions and consequences beyond the national frame. Br J Polit Sci. 2014;46(1):149–75. https://doi.org/10.1017/S0007123414000143.
Cambria E. Affective computing and sentiment analysis. IEEE Intell Syst. 2016;31:102–7. https://doi.org/10.1109/MIS.2016.31.
Zhang L, Liu B. Sentiment analysis and opinion mining. In: Sammut C, Webb GI, editors. Encyclopedia of machine learning and data mining. Boston: Springer US; 2017. p. 1152–61.
Preoţiuc-Pietro D, Liu Y, Hopkins D, Ungar L. Beyond binary labels: political ideology prediction of Twitter users. 2017. https://doi.org/10.18653/v1/p17-1068.
Xu F, Pan Z, Xia R. E-commerce product review sentiment classification based on a naïve Bayes continuous learning framework. Inf Process Manag. 2020;57(5): 102221. https://doi.org/10.1016/j.ipm.2020.102221.
Rachiraju SC, Revanth M. Feature extraction and classification of movie reviews using advanced machine learning models. 2020. https://doi.org/10.1109/iciccs48265.2020.9120919.
Yang YP, Chen DK, Gu R, Gu YF, Yu SH. Consumers’ Kansei needs clustering method for product emotional design based on numerical design structure matrix and genetic algorithms. Comput Intell Neurosci. 2016;2016:1–11. https://doi.org/10.1155/2016/5083213.
Nagamachi M. Kansei/affective engineering and history of Kansei/affective engineering in the world. In: Kansei/affective engineering. CRC Press, 2010, pp. 1–12.
Aggarwal CC, Zhai C, editors. Mining text data. Springer US, 2012.
Liu B. Opinion mining and sentiment analysis. In: Web data mining. Springer Berlin Heidelberg, 2011, pp. 459–526.
Isabelle G, Maharani W, Asror I. Analysis on opinion mining using combining lexicon-based method and multinomial Naïve Bayes. vol. 2, No. IcoIESE 2018, pp. 214–219, 2019. https://doi.org/10.2991/icoiese-18.2019.38.
Cambridge Dictionary. Sentiment definition. 2021. https://dictionary.cambridge.org/dictionary/english/sentiment.
Dave K, Lawrence S, Pennock D. Mining the peanut gallery: opinion extraction and semantic classification of product reviews. In: Proceedings of the 12th International Conference on World Wide Web. 2003, pp. 519–528. https://doi.org/10.1145/775152.775226.
Hu M, Liu B. Mining and Summarizing Customer Reviews. In: Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2004, pp. 168–177. https://doi.org/10.1145/1014052.1014073.
Abbasi A, Chen H, Salem A. Sentiment analysis in multiple languages: feature selection for opinion classification in Web forums. ACM Trans Inf Syst. 2008;26(3):1–34. https://doi.org/10.1145/1361684.1361685.
Zhang C, Zuo W, Peng T, He F. Sentiment classification for Chinese reviews using machine learning methods based on string kernel. 2008. https://doi.org/10.1109/iccit.2008.51.
Tang H, Tan S, Cheng X. A survey on sentiment detection of reviews. Expert Syst Appl. 2009;36(7):10760–73. https://doi.org/10.1016/j.eswa.2009.02.063.
Chen H, Zimbra D. AI and opinion mining. IEEE Intell Syst. 2010;25(3):74–80. https://doi.org/10.1109/MIS.2010.75.
Singh N, Sharma N, Juneja A. Sentiment score analysis for opinion mining. In: Advances in intelligent systems and computing. Springer Singapore, 2018, pp. 363–374.
Akila R, Revathi S, Shreedevi G. Opinion mining on food services using topic modeling and machine learning algorithms. 2020. https://doi.org/10.1109/icaccs48705.2020.9074428.
Pang B, Lee L, Vaithyanathan S. Thumbs up? Sentiment classification using machine learning techniques. In: Proceedings of the 2002 Conference on Empirical Methods in Natural Language Processing (EMNLP 2002), 2002, pp. 79–86. https://doi.org/10.3115/1118693.1118704.
Mogaji E, Erkan I. Insight into consumer experience on UK train transportation services. Travel Behav Soc. 2019;14:21–33. https://doi.org/10.1016/j.tbs.2018.09.004.
Kaur HJ, Kumar R. Sentiment analysis from social media in crisis situations. 2015. https://doi.org/10.1109/ccaa.2015.7148383.
Sabuj MS, Afrin Z, Hasan KMA. Opinion Mining Using Support Vector Machine with Web Based Diverse Data. In: Lecture Notes in Computer Science. Springer International Publishing, 2017, pp. 673–678.
Zhang L, Dong W, Mu X. Analysing the features of negative sentiment tweets. Electron Libr. 2018;36(5):782–99. https://doi.org/10.1108/EL-05-2017-0120.
Ameur H, Jamoussi S, Ben Hamadou A. Sentiment lexicon enrichment using emotional vector representation. 2017. https://doi.org/10.1109/aiccsa.2017.151.
Dhahi SH, Waleed J. Emotions polarity of tweets based on semantic similarity and user behavior features. 2020. https://doi.org/10.1109/it-ela50150.2020.9253088.
Banik N, Rahman MHH. Evaluation of Naïve Bayes and Support Vector Machines on Bangla Textual Movie Reviews. 2018. https://doi.org/10.1109/icbslp.2018.8554497.
Elhadad MK, Li KF, Gebali F. Sentiment Analysis of Arabic and English Tweets. In: Advances in Intelligent Systems and Computing, Springer International Publishing, 2019, pp. 334–348.
Khanvilkar G, Vora D. Sentiment analysis for product recommendation using random forest. Int J Eng Technol. 2018;7(3):87–9. https://doi.org/10.14419/ijet.v7i3.3.14492.
Suganya E, Vijayarani S. Sentiment Analysis for Scraping of Product Reviews from Multiple Web Pages Using Machine Learning Algorithms. In: Advances in Intelligent Systems and Computing, Springer International Publishing, 2019, pp. 677–685.
Imran AS, Daudpota SM, Kastrati Z, Batra R. Cross-cultural polarity and emotion detection using sentiment analysis and deep learning on COVID-19 related tweets. IEEE Access. 2020;8(October):181074–90. https://doi.org/10.1109/ACCESS.2020.3027350.
Harb JGD, Ebeling R, Becker K. A framework to analyze the emotional reactions to mass violent events on Twitter and influential factors. Inf Process Manag. 2020;57(6): 102372. https://doi.org/10.1016/j.ipm.2020.102372.
Fernández-Gavilanes M, Álvarez-López T, Juncal-Martínez J, Costa-Montenegro E, Javier González-Castaño F. Unsupervised method for sentiment analysis in online texts. Expert Syst Appl. 2016;58:57–75. https://doi.org/10.1016/j.eswa.2016.03.031.
Kunal S, Saha A, Varma A, Tiwari V. Textual dissection of live Twitter reviews using Naive Bayes. Procedia Comput Sci. 2018;132:307–13. https://doi.org/10.1016/j.procs.2018.05.182.
Lee J, Benjamin S, Childs M. Unpacking the emotions behind TripAdvisor travel reviews: the case study of Gatlinburg, Tennessee. Int J Hosp Tour Adm. 2020;00(00):1–18. https://doi.org/10.1080/15256480.2020.1746219.
Sathya V, Venkataramanan A, Tiwari A, DD PS. Ascertaining Public Opinion Through Sentiment Analysis. 2019. https://doi.org/10.1109/iccmc.2019.8819738.
Anand D, Naorem D. Semi-supervised aspect based sentiment analysis for movies using review filtering. Procedia Comput Sci. 2016;84:86–93. https://doi.org/10.1016/j.procs.2016.04.070.
Chawla S, Dubey G, Rana A. Product opinion mining using sentiment analysis on smartphone reviews. 2017. https://doi.org/10.1109/icrito.2017.8342455.
Bhargava MG, Rao DR. Sentimental analysis on social media data using R programming. Int J Eng Technol. 2018;7(2):80–4. https://doi.org/10.14419/ijet.v7i2.31.13402.
Pugsee P, Nussiri V, Kittirungruang W. Opinion Mining for Skin Care Products on Twitter. In: Communications in Computer and Information Science. Springer Singapore, 2018, pp. 261–271.
Rane PS, Khan RA. Ranked rule based approach for sentiment analysis. 2018. https://doi.org/10.1109/icrieece44171.2018.9008647.
Al-Saffar A, Awang S, Tao H, Omar N, Al-Saiagh W, Al-bared M. Malay sentiment analysis based on combined classification approaches and Senti-lexicon algorithm. PLoS ONE. 2018;13(4):1–18. https://doi.org/10.1371/journal.pone.0194852.
Shrestha H, Dhasarathan C, Munisamy S, Jayavel A. Natural language processing based sentimental analysis of Hindi (SAH) script an optimization approach. Int J Speech Technol. 2020;23(4):757–66. https://doi.org/10.1007/s10772-020-09730-x.
Arif F, Dulhare UN. A machine learning based approach for opinion mining on social network data. In: Lecture Notes in Networks and Systems. Springer Singapore, 2017, pp. 135–147.
Tripathy A, Agrawal A, Rath SK. Classification of sentiment reviews using n-gram machine learning approach. Expert Syst Appl. 2016;57:117–26. https://doi.org/10.1016/j.eswa.2016.03.028.
Elzayady H, Badran KM, Salama GI. Sentiment analysis on Twitter data using Apache Spark Framework. 2018. https://doi.org/10.1109/icces.2018.8639195.
Singh J, Singh G, Singh R, Singh P. Optimizing accuracy of sentiment analysis using deep learning based classification technique. Commun Comput Inf Sci. 2018;799:516–32. https://doi.org/10.1007/978-981-10-8527-7_43.
Bansal B, Srivastava S. Sentiment classification of online consumer reviews using word vector representations. Procedia Comput Sci. 2018;132:1147–53. https://doi.org/10.1016/j.procs.2018.05.029.
Akkarapatty N, Raj N. A machine learning approach for classification of sentence polarity. 2016, pp. 316–321. https://doi.org/10.1109/SPIN.2016.7566711.
Hiremath BN, Patil MM. Enhancing optimized personalized therapy in clinical decision support system using natural language processing. J King Saud Univ Comput Inf Sci. 2020. https://doi.org/10.1016/j.jksuci.2020.03.006.
Yueyang L, Wang YZ. Detecting opinion polarities using ensemble of classification algorithms. J Phys Conf Ser. 2019;1229(1):012065. https://doi.org/10.1088/1742-6596/1229/1/012065.
Poornima A, Priya KS. A comparative sentiment analysis of sentence embedding using machine learning techniques. 2020. https://doi.org/10.1109/icaccs48705.2020.9074312.
Charoensuk J, Sornil O. A hierarchical emotion classification technique for Thai reviews. J ICT Res Appl. 2018;12(3):280–96. https://doi.org/10.5614/itbj.ict.res.appl.2018.12.3.6.
Ruseti S, Sirbu MD, Calin MA, Dascalu M, Trausan-Matu S, Militaru G. Comprehensive exploration of game reviews extraction and opinion mining using nlp techniques. Adv Intell Syst Comput. 2020;1041(February):323–31. https://doi.org/10.1007/978-981-15-0637-6_27.
Barrón Estrada ML, Zatarain Cabada R, Oramas Bustillos R, Graff M. Opinion mining and emotion recognition applied to learning environments. Expert Syst Appl. 2020;150:113265. https://doi.org/10.1016/j.eswa.2020.113265.
Rathee N, Joshi N, Kaur J. Sentiment analysis using machine learning techniques on Python. 2018. https://doi.org/10.1109/iccons.2018.8663224.
Jayakrishnan R, Gopal GN, Santhikrishna MS. Multi-class emotion detection and annotation in malayalam novels. 2018. https://doi.org/10.1109/iccci.2018.8441492.
Babu MY, Vijaya Pal Reddy P, Shoba Bindu C. Aspect category detection using multi label multi class support vector machine with semantic and lexical features. J Adv Res Dyn Contr Syst. 2020;12(1):398–405. https://doi.org/10.5373/JARDCS/V12I1/20201920.
Gan D, Shen J, Xu M, Stamenkovic Z. Adaptive learning emotion identification method of short texts for online medical knowledge sharing community. Comput Intell Neurosci. 2019;2019:1–10. https://doi.org/10.1155/2019/1604392.
Varsha K, Monica R. Analyzing of premier institution using twitter data on real-time basis. 2017. https://doi.org/10.1109/icecds.2017.8389968.
Kumar HMK, Harish BS, Kumar SVA, Aradhya VNM. Classification of sentiments in short-text. 2018. https://doi.org/10.1145/3184066.3184074.
Krishna BV, Pandey AK, Kumar APS. Feature based opinion mining and sentiment analysis using fuzzy logic. In: Cognitive Science and Artificial Intelligence, Springer Singapore, 2017, pp. 79–89.
Zhan M, Tu R, Yu Q. Understanding Readers. 2018. https://doi.org/10.1145/3297156.3297270.
Hitesh MSR, Vaibhav V, Kalki YJA, Kamtam SH, Kumari S. Real-time sentiment analysis of 2019 election tweets using Word2vec and random forest model. 2019. https://doi.org/10.1109/icct46177.2019.8969049.
Halim Z, Waqar M, Tahir M. A machine learning-based investigation utilizing the in-text features for the identification of dominant emotion in an email. Knowledge-Based Syst. 2020;208: 106443. https://doi.org/10.1016/j.knosys.2020.106443.
Mahalakshmi S, Elango S. Cross domain sentiment analysis using different machine learning techniques. 2015, pp. 77–87.
Shrivastava A, Regunathan R, Pant A, Srujan CS. Document-level analysis of sentiments for various emotions using hybrid variant of recursive neural network. Adv Intell Syst Comput. 2019;828:641–9. https://doi.org/10.1007/978-981-13-1610-4_65.
Zheng Y. Opinion mining from news articles. In: Advances in intelligent systems and computing. Springer Singapore, 2018, pp. 447–453.
Saleh SN, Lehmann CU, McDonald SA, Basit MA, Medford RJ. Understanding public perception of coronavirus disease 2019 (COVID-19) social distancing on Twitter. Infect Control Hosp Epidemiol. 2021;42(2):131–8. https://doi.org/10.1017/ice.2020.406.
Fischer I, Steiger HJ. Toward automatic evaluation of medical abstracts: the current value of sentiment analysis and machine learning for classification of the importance of PubMed abstracts of randomized trials for stroke. J Stroke Cerebrovasc Dis. 2020;29(9): 105042. https://doi.org/10.1016/j.jstrokecerebrovasdis.2020.105042.
Medford RJ, Saleh SN, Sumarsono A, Perl TM, Lehmann CU. An ‘Infodemic’: leveraging high-volume twitter data to understand early public sentiment for the Coronavirus disease 2019 outbreak. Open Forum Infect. Dis. 2020;7(7). https://doi.org/10.1093/ofid/ofaa258.
Zhang X, Li W, Ying H, Li F, Tang S, Lu S. Emotion detection in online social networks: a multilabel learning approach. IEEE Internet Things J. 2020;7(9):8133–43. https://doi.org/10.1109/JIOT.2020.3004376.
Sankar H, Subramaniyaswamy V, Vijayakumar V, Arun Kumar S, Logesh R, Umamakeswari A. Intelligent sentiment analysis approach using edge computing-based deep learning technique. Softw Pract Exp. 2020;50(5):645–57. https://doi.org/10.1002/spe.2687.
Gopalakrishnan V, Ramaswamy C. Patient opinion mining to analyze drugs satisfaction using supervised learning. J Appl Res Technol. 2017;15(4):311–9. https://doi.org/10.1016/j.jart.2017.02.005.
Rao G, Huang W, Feng Z, Cong Q. LSTM with sentence representations for document-level sentiment classification. Neurocomputing. 2018;308:49–57. https://doi.org/10.1016/j.neucom.2018.04.045.
Le NC, Lam NT, Nguyen SH, Nguyen DT. On Vietnamese sentiment analysis: a transfer learning method. 2020. https://doi.org/10.1109/rivf48685.2020.9140757.
Kalaivani P, Dinesh D. Machine learning approach to analyze classification result for twitter sentiment. 2020. https://doi.org/10.1109/icosec49089.2020.9215278.
Abd DH, Abbas AR, Sadiq AT. Analyzing sentiment system to specify polarity by lexicon-based. Bull Electr Eng Informatics. 2021;10(1):283–9. https://doi.org/10.11591/eei.v10i1.2471.
Rao VA, Anuranjana K, Mamidi R. A Sentiwordnet strategy for curriculum learning in sentiment analysis. In: Natural language processing and information systems. 2020, pp. 170–178.
Khan FH, Qamar U, Bashir S. SentiMI: introducing point-wise mutual information with SentiWordNet to improve sentiment polarity detection. Appl Soft Comput J. 2016;39:140–53. https://doi.org/10.1016/j.asoc.2015.11.016.
Kavitha KM, Shetty A, Abreo B, D’Souza A, Kondana A. Analysis and classification of user comments on YouTube videos. Procedia Comput Sci. 2020;177:593–8. https://doi.org/10.1016/j.procs.2020.10.084.
Dey A, Jenamani M, Thakkar JJ. Senti-N-Gram: an n-gram lexicon for sentiment analysis. Expert Syst Appl. 2018;103:92–105. https://doi.org/10.1016/j.eswa.2018.03.004.
Swain S, Seeja KR. TWEESENT: a Web application on sentiment analysis. Adv Intell Syst Comput. 2019;851:393–400. https://doi.org/10.1007/978-981-13-2414-7_36.
Deng S, Sinha AP, Zhao H. Adapting sentiment lexicons to domain-specific social media texts. Decis Support Syst. 2017;94:65–76. https://doi.org/10.1016/j.dss.2016.11.001.
Azizan A, Jamal NNSK, Abdullah MN, Mohamad M, Khairudin N. Lexicon-based sentiment analysis for movie review tweets. 2019. https://doi.org/10.1109/aidas47888.2019.8970722.
Khan FH, Qamar U, Bashir S. eSAP: A decision support framework for enhanced sentiment analysis and polarity classification. Inf Sci (Ny). 2016;367–368:862–73. https://doi.org/10.1016/j.ins.2016.07.028.
Abdar M, et al. Energy choices in Alaska: mining people’s perception and attitudes from geotagged tweets. Renew Sustain Energy Rev. 2020;124: 109781. https://doi.org/10.1016/j.rser.2020.109781.
Wook M, et al. Opinion mining technique for developing student feedback analysis system using lexicon-based approach (OMFeedback). Educ Inf Technol. 2020;25(4):2549–60. https://doi.org/10.1007/s10639-019-10073-7.
Aslam F, Awan TM, Syed JH, Kashif A, Parveen M. Sentiments and emotions evoked by news headlines of coronavirus disease (COVID-19) outbreak. Humanit Soc Sci Commun. 2020;7(1):1–10. https://doi.org/10.1057/s41599-020-0523-3.
Garcia MB. Sentiment analysis of tweets on coronavirus disease 2019 (COVID-19) pandemic from Metro Manila, Philippines. Cybern Inf Technol. 2020;20(4):141–55. https://doi.org/10.2478/cait-2020-0052.
Song C, Guo J, Zhuang J. Analyzing passengers’ emotions following flight delays- a 2011–2019 case study on SKYTRAX comments. J Air Transp Manag. 2020;89:101903. https://doi.org/10.1016/j.jairtraman.2020.101903.
Abdullah M, Hadzikadic M. Sentiment analysis of twitter data: emotions revealed regarding donald trump during the 2015–16 primary debates. 2017. https://doi.org/10.1109/ictai.2017.00120.
Hoffmann T. ‘Too many Americans are trapped in fear, violence and poverty’: A psychology-informed sentiment analysis of campaign speeches from the 2016 US presidential election. Linguist Vanguard. 2018;4(1):1–9. https://doi.org/10.1515/lingvan-2017-0008.
Giachanou A, Gonzalo J, Crestani F. Propagating sentiment signals for estimating reputation polarity. Inf Process Manag. 2019;56(6): 102079. https://doi.org/10.1016/j.ipm.2019.102079.
Bose R, Aithal PS, Roy S. Sentiment analysis on the basis of tweeter comments of application of drugs by customary language toolkit and textblob opinions of distinct countries. Int J Emerg Trends Eng Res. 2020;8(7):3684–96. https://doi.org/10.30534/ijeter/2020/129872020.
Muhammad A, Wiratunga N, Lothian R. Contextual sentiment analysis for social media genres. Knowledge-Based Syst. 2016;108:92–101. https://doi.org/10.1016/j.knosys.2016.05.032.
Rodrigues RG, das Dores RM, Camilo-Junior CG, Rosa TC. SentiHealth-Cancer: a sentiment analysis tool to help detecting mood of patients in online social networks. Int J Med Inform. 2016;85(1):80–95. https://doi.org/10.1016/j.ijmedinf.2015.09.007.
Severyn A, Moschitti A, Uryupina O, Plank B, Filippova K. Multi-lingual opinion mining on YouTube. Inf Process Manag. 2016;52(1):46–60. https://doi.org/10.1016/j.ipm.2015.03.002.
Asghar MZ, Khan A, Zahra SR, Ahmad S, Kundi FM. Aspect-based opinion mining framework using heuristic patterns. Clust Comput. 2019;22:7181–99. https://doi.org/10.1007/s10586-017-1096-9.
Giatsoglou M, Vozalis MG, Diamantaras K, Vakali A, Sarigiannidis G, Chatzisavvas KC. Sentiment analysis leveraging emotions and word embeddings. Expert Syst Appl. 2017;69:214–24. https://doi.org/10.1016/j.eswa.2016.10.043.
Mahajan P, Rana A. Sentiment classification-how to quantify public emotions using twitter. Int J Sociotechnology Knowl Dev. 2018;10(1):57–71. https://doi.org/10.4018/IJSKD.2018010104.
Chen B, Fan L, Fu X. Sentiment classification of tourism based on rules and LDA topic model. 2019. https://doi.org/10.1109/eei48997.2019.00108.
Keyvanpour M, Karimi Zandian Z, Heidarypanah M. OMLML: a helpful opinion mining method based on lexicon and machine learning in social networks. Soc Netw Anal Min. 2020;10(1). https://doi.org/10.1007/s13278-019-0622-6.
Hamad RA, Alqahtani SM, Torres MT. Emotion and polarity prediction from Twitter. 2017. https://doi.org/10.1109/sai.2017.8252118.
Riaz S, Fatima M, Kamran M, Nisar MW. Opinion mining on large scale data using sentiment analysis and k-means clustering. Cluster Comput. 2019;22(s3):7149–64. https://doi.org/10.1007/s10586-017-1077-z.
Chowdhury SMMH, Ghosh P, Abujar S, Arina Afrin M, Akhter HS. Sentiment analysis of tweet data: the study of sentimental state of human from tweet text. Adv Intell Syst Comput. 2019;813:3–14. https://doi.org/10.1007/978-981-13-1498-8_1.
Tran YH, Tran QN. Estimating public opinion in social media content using aspect-based opinion mining. In: Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering. Springer International Publishing, 2018, pp. 101–115.
Meddeb I, Lavandier C, Kotzinos D. Using Twitter Streams for Opinion Mining: A Case Study on Airport Noise. In: Communications in Computer and Information Science, Springer International Publishing, 2020, pp. 145–160.
Madani Y, Erritali M, Bengourram J, Sailhan F. A hybrid multilingual fuzzy-based approach to the sentiment analysis problem using SentiWordNet. Int J Uncertainty Fuzz Knowl Based Syst. 2020;28(3):361–90. https://doi.org/10.1142/S0218488520500154.
Gopi AP, Jyothi RNS, Narayana VL, Sandeep KS. Classification of tweets data based on polarity using improved RBF kernel of SVM. Int J Inf Technol. 2020. https://doi.org/10.1007/s41870-019-00409-4.
Mandal L, Das R, Bhattacharya S, Basu PN. Intellimote: a hybrid classifier for classifying learners’ emotion in a distributed e-learning environment. Turkish J Electr Eng Comput Sci. 2017;25(3):2084–95. https://doi.org/10.3906/elk-1510-120.
Van De Kauter M, Breesch D, Hoste V. Fine-grained analysis of explicit and implicit sentiment in financial news articles. Expert Syst Appl. 2015;42(11):4999–5010. https://doi.org/10.1016/j.eswa.2015.02.007.
Sotirakou C, Germanakos P, Holzinger A, Mourlas C. Feedback Matters! Predicting the Appreciation of Online Articles A Data-Driven Approach. In: Lecture Notes in Computer Science. Springer International Publishing, 2018, pp. 147–159.
Ali F, Kwak D, Khan P, Islam SMR, Kim KH, Kwak KS. Fuzzy ontology-based sentiment analysis of transportation and city feature reviews for safe traveling. Transp Res Part C Emerg Technol. 2017;77:33–48. https://doi.org/10.1016/j.trc.2017.01.014.
Noferesti S, Shamsfard M. Using Linked Data for polarity classification of patients’ experiences. J Biomed Inform. 2015;57:6–19. https://doi.org/10.1016/j.jbi.2015.06.017.
Ahuja S, Dubey G. Clustering and sentiment analysis on Twitter data. 2017, https://doi.org/10.1109/tel-net.2017.8343568.
Song C, Wang XK, Fei Cheng P, Qiang Wang J, Li L. SACPC: a framework based on probabilistic linguistic terms for short text sentiment analysis. Knowl Based Syst. 2020;194:105572. https://doi.org/10.1016/j.knosys.2020.105572.
Vashishtha S, Susan S. Fuzzy rule based unsupervised sentiment analysis from social media posts. Expert Syst Appl. 2019;138: 112834. https://doi.org/10.1016/j.eswa.2019.112834.
Derakhshan A, Beigy H. Sentiment analysis on stock social media for stock price movement prediction. Eng Appl Artif Intell. 2019;85:569–78. https://doi.org/10.1016/j.engappai.2019.07.002.
Valdivia A, et al. Inconsistencies on TripAdvisor reviews: a unified index between users and sentiment analysis methods. Neurocomputing. 2019;353:3–16. https://doi.org/10.1016/j.neucom.2018.09.096.
Jiang H, Kwong CK, Okudan Kremer GE, Park WY. Dynamic modelling of customer preferences for product design using DENFIS and opinion mining. Adv Eng Informatics. 2019;42:100969. https://doi.org/10.1016/j.aei.2019.100969.
Wankhede R, Thakare AN. Design approach for accuracy in movies reviews using sentiment analysis. 2017. https://doi.org/10.1109/iceca.2017.8203652.
Hiriyannaiah S, Siddesh GM, Srinivasa KG. Real-Time streaming data analysis using a three-Way classification method for sentimental analysis. Int J Inf Technol Web Eng. 2018;13(3):99–111. https://doi.org/10.4018/IJITWE.2018070107.
Sahu TP, Ahuja S. Sentiment analysis of movie reviews: a study on feature selection and classification algorithms. 2016. https://doi.org/10.1109/microcom.2016.7522583.
Lima ACES, De Castro LN, Corchado JM. A polarity analysis framework for Twitter messages. Appl Math Comput. 2015;270:756–67. https://doi.org/10.1016/j.amc.2015.08.059.
Li Y, Fleyeh H. Twitter sentiment analysis of new IKEA stores using machine learning. 2018. https://doi.org/10.1109/comapp.2018.8460277.
Ahmed S, Danti A. Effective sentimental analysis and opinion mining of web reviews using rule based classifiers. Adv Intell Syst Comput. 2016;410:171–9. https://doi.org/10.1007/978-81-322-2734-2_18.
Vashishtha S, Susan S. Sentiment Cognition from Words Shortlisted by Fuzzy Entropy. IEEE Trans Cogn Dev Syst. 2020;12(3):541–50. https://doi.org/10.1109/TCDS.2019.2937796.
Vashishtha S, Susan S. Highlighting keyphrases using senti-scoring and fuzzy entropy for unsupervised sentiment analysis. Expert Syst Appl. 2021;169: 114323. https://doi.org/10.1016/j.eswa.2020.114323.
Sarkar K, Bhowmick M. Sentiment polarity detection in Bengali tweets using multinomial Naïve Bayes and support vector machines. 2017. https://doi.org/10.1109/calcon.2017.8280690.
Rosli RM, Lokman AM, Aris SRS. Analysis of evoked emotions in extremist YouTube videos through Kansei evaluation. Adv Intell Syst Comput. 2018;739:740–7. https://doi.org/10.1007/978-981-10-8612-0_77.
Yamada A, Hashimoto S, Nagata N. A text mining approach for automatic modeling of Kansei evaluation from review texts. Adv Intell Syst Comput. 2018;739:319–28. https://doi.org/10.1007/978-981-10-8612-0_34.
Hsiao Y-H, Chen M-C, Lin M-K. Kansei Engineering with Online Review Mining for Hotel Service Development. 2017. https://doi.org/10.1109/iiai-aai.2017.12.
Li Z, Tian ZG, Wang JW, Wang WM. Extraction of affective responses from customer reviews: an opinion mining and machine learning approach. Int J Comput Integr Manuf. 2020;33(7):670–85. https://doi.org/10.1080/0951192X.2019.1571240.
Hsiao YH, Chen M-C. Kansei Engineering with Online Content Mining for Cross-Border Logistics Service Design. 2016. https://doi.org/10.1109/iiai-aai.2016.12.
Asghar DM, Khan A, Bibi A, Kundi F, Ahmad H. Sentence-level emotion detection framework using rule-based classification. Cognit Comput. 2017;9:1–27. https://doi.org/10.1007/s12559-017-9503-3.
Su Z, Yu S, Chu J, Zhai Q, Gong J, Fan H. A novel architecture: Using convolutional neural networks for Kansei attributes automatic evaluation and labeling. Adv Eng Informatics. 2020;44:101055.
Somayeh G. An investigation of the components of political developments on the national security of the Islamic Republic of Iran from the perspective of policy experts. J Polit Sci Public Aff. 2017;5(2). https://doi.org/10.4172/2332-0761.1000244.
Fjäder C. The nation-state, national security and resilience in the age of globalisation. Resilience. 2014;2(2):114–29. https://doi.org/10.1080/21693293.2014.914771.
Chandra S, Bhonsle R. National security: concept, measurement and management. Strateg Anal. 2015;39(4):337–59. https://doi.org/10.1080/09700161.2015.1047217.
Muguruza CC. Human security as a policy framework: critics and challenges. Deusto J Hum Rights. 2017;4:15–35. https://doi.org/10.18543/aahdh-4-2007pp15-35.
Teivans-Treinovskis J, Jefimovs N. State national security: aspect of recorded crime. J Secur Sustain Issues. 2012;2:41–8. https://doi.org/10.9770/jssi.2012.2.2(4).
Hazel Kwon K, Raghav Rao H. Cyber-rumor sharing under a homeland security threat in the context of government Internet surveillance: the case of South-North Korea conflict. Gov Inf Q. 2017;34(2):307–16. https://doi.org/10.1016/j.giq.2017.04.002.
Koujalagi DA, Thrupti NS, Kurbet K. Security threats in Indian cyberspace by social media and cyberhoaxes. Int J Trend Sci Res Dev. 2018;2(4):598–600. https://doi.org/10.31142/ijtsrd13040.
Yassine M, Hajj H. A framework for emotion mining from text in online social networks. Proc. IEEE Int. Conf. Data Mining, ICDM, pp. 1136–1142, 2010. https://doi.org/10.1109/ICDMW.2010.75.
Coan TG, Merolla JL, Zechmeister EJ, Zizumbo-Colunga D. Emotional responses shape the substance of information seeking under conditions of threat. Polit Res Q. 2020. https://doi.org/10.1177/1065912920949320.
Lorenzoni I, Nicholson-Cole S, Whitmarsh L. Barriers perceived to engaging with climate change among the UK public and their policy implications. Glob Environ Chang. 2007;17(3–4):445–59. https://doi.org/10.1016/j.gloenvcha.2007.01.004.
Nasir AFA, et al. Text-based emotion prediction system using machine learning approach. IOP Conf Ser Mater Sci Eng. 2020;769:12022. https://doi.org/10.1088/1757-899x/769/1/012022.

Acknowledgements

This research is fully supported by the National Defence University of Malaysia (UPNM) and the Ministry of Higher Education Malaysia (MOHE) under FRGS/1/2021/ICT07/UPNM/02/1. The authors fully acknowledge UPNM and MOHE for the approved fund, which made this research viable and effective.

Funding

National Defence University of Malaysia (UPNM) under Grant FRGS/1/2021/ICT07/UPNM/02/1.

Author information

Authors and Affiliations

National Defence University of Malaysia, Kuala Lumpur, Malaysia
Noor Afiza Mat Razali, Nur Atiqah Malizan, Nor Asiakin Hasbullah, Muslihah Wook, Norulzahrah Mohd Zainuddin & Suzaimah Ramli
Management and Science University, Selangor, Malaysia
Khairul Khalil Ishak
CyberSecurity Malaysia, Selangor, Malaysia
Sazali Sukardi

Contributions

NAM conducted the systematic literature review, examined various techniques related to opinion mining, and took part in drafting the manuscript. NAMR wrote the first draft of the manuscript and introduced the topic to NAH, NMZ and MW. NAH, NMZ and MW made significant contributions to discussing the structure of the review paper. KKI, SR and SS took part in reviewing the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Noor Afiza Mat Razali.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Razali, N.A.M., Malizan, N.A., Hasbullah, N.A. et al. Opinion mining for national security: techniques, domain applications, challenges and research opportunities. J Big Data 8, 150 (2021). https://doi.org/10.1186/s40537-021-00536-5

Received: 26 June 2021
Accepted: 08 November 2021
Published: 04 December 2021
DOI: https://doi.org/10.1186/s40537-021-00536-5