“Harnessing Customer Feedback for Product Recommendations: An Aspect-Level Sentiment Analysis Framework”

Yadav, Nimesh Bali

doi:10.1007/s44230-023-00018-2

“Harnessing Customer Feedback for Product Recommendations: An Aspect-Level Sentiment Analysis Framework”

Research Article
Open access
Published: 27 March 2023

Volume 3, pages 57–67, (2023)
Cite this article

Download PDF

You have full access to this open access article

Human-Centric Intelligent Systems Aims and scope Submit manuscript

“Harnessing Customer Feedback for Product Recommendations: An Aspect-Level Sentiment Analysis Framework”

Download PDF

Nimesh Bali Yadav ORCID: orcid.org/0000-0001-7807-1667¹

1991 Accesses
4 Citations
Explore all metrics

Abstract

This research paper presents a novel approach for recommending products to customers based on their cared aspects by performing sentiment analysis on customer feedback. The proposed approach utilizes the WordNet database to identify and extract aspects from customer reviews and feedback, and then applies sentiment analysis techniques to determine the sentiment associated with each aspect. The resulting sentiment scores are then used to generate personalized product recommendations that align with the customer’s preferences and priorities. Here we extract the comments from an e-commerce website that is Amazon, and we then choose the most cared aspects from those comments. The dataset is publicly available online which contains reviews of each product. The chosen most cared aspects are price, colour, battery, and screen. These cared aspects are keywords that shopping online and recommending, will help to categorize the comments based on price, colour, battery, and screen. After categorizing the comments, it will be defined as the set of explicit comments. After an explicit comment set is defined, sentiment analysis is performed to systematically identify the interest of the customer through comments. Here the comments are classified into the polarity of given texts in an explicit comment set into positive, negative, and neutral. Finally, scores were calculated for all brands which will help to recommend the product.

A difference of multimedia consumer’s rating and review through sentiment analysis

Article 24 March 2020

Framework for Customers’ Sentiment Analysis

Sentiment Analysis on Online Product Reviews

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

Comments are very important for the customers and an e-commerce website because they help customers to know what all products or services are best [9]. The most challenging thing faced by e-commerce websites is to make sure that they are providing the best customer services to their customers, Plus making them feel more comfortable shopping online and recommending them based on the customer’s likes, dislikes, and comments will help the customers and even the shop sell more products [14]. Hence, there is a need for a recommendation system for the customers in an e-commerce website.

Sentiment analysis and opinion mining is the study of people’s opinions, emotions towards a particular entity. An entity can be individuals, events, or topics [1]. However, some research says that opinion mining and sentiment analysis have slight differences [2]. Opinion mining is used to extract and analyse people’s opinions and sentiment analysis is used to identify the sentiments that are expressed by the text then analyse it [15].

The product reviews are important for business holders as they can help them to make decisions according to the analysis result of customers' opinion of the products. Sentiment analysis can also be performed on various datasets like on stock markets or news articles [3, 4].

In this paper, the Amazon dataset is used to classify the brands into positive, negative, and neutral [16]. The dataset contains more than 65,000 reviews. The dataset is achieved from www.amazon.com. Classification can also be done in two classes called the binary task of classifying, and for classifying tasks into three or more we say it as a multi-class task of classifying [17]. Classification is done for each aspect into a three-way task of classifying that is positive, negative, and neutral. After classifying the sentiment, only positive comments are considered highly weighted among the total number of comments of particular aspects, for calculating the score [18]. The scores are calculated for every brand depending on the positive comments, which will help recommend the product based on the aspect [19].

The system has been evaluated on a dataset of customer feedback, and the results demonstrate that the proposed approach outperforms traditional product recommendation systems [20]. The approach is found to be effective in identifying the aspects that are most important to customers and in generating accurate and personalized recommendations [21].

Sentiment analysis can be broadly classified into three types: fine-grained sentiment analysis, emotion detection sentiment analysis, and aspect-based sentiment analysis [23].

In fine-grained sentiment analysis, we focus simply on polarities of the comment that are positive, negative, and neutral [22]. For more opinions we can dig customer’s experience polarities to very positive, positive, neutral, negative, very negative [24].

In emotion detection sentiment analysis, we detect emotions of texts. It helps us identify the happy, sad, angry emotions of the customers [25].

Lastly, Aspect-based sentiment analysis (ABSA) is a subfield of natural language processing (NLP) that focuses on analysing the sentiment associated with specific aspects or features of a product or service [26]. ABSA aims to identify the aspects or features that are mentioned in a customer's feedback or review, and then determine the sentiment associated with each aspect or feature [27].

The findings of an aspect-based sentiment analysis can provide valuable insights into the aspects or features of a product or service that customers care about most and the sentiment associated with each aspect or feature [28]. This information can be used to improve the overall quality of the product or service and to generate more accurate and useful recommendations to customers.

For example, in the context of recommending products to customers based on their cared aspects, an aspect-based sentiment analysis could be used to identify the aspects or features of a product that are most important to customers and the sentiment associated with each aspect [29]. This information could then be used to generate personalized product recommendations that align with the customer's preferences and priorities.

In Sect. 2, the basics of sentiment analysis are discussed in detail. In Sect. 3, the overview of the recent techniques that are used by some researchers for recommendation is discussed. In Sect. 4, the detailed description of the methodology for the paper is elaborated. In Sect. 5, the results of the paper have been discussed which elaborates the scoring method which helps recommendation of products.

2 Background

A basic task of sentiment analysis is classifying the text into the emotions of the customer. The emotions of the customer can be classified into positive, negative, and neutral. That is sentiment analysis classifies the given text’s polarities [30]. A positive polarity indicates that the comment has a positive impact on the particular product. For example, the comment contains the words like good, best, like, or love then it will suggest to us the positive emotion of the customer towards the product [31]. Likewise, for negative polarity, the comment harms a particular product, for example, the words like bad, worst, or hate suggest to us the negative emotion of the customer towards the product. Some comments have both emotions for that we use the neutral polarity. Therefore, we can say that sentiment analysis is a feedback process that digs into the comment to find out the emotions ‘happy’, or ‘sad’ of the customer for a particular product [32].

Sentiment analysis can be broadly classified into three types: fine-grained sentiment analysis, emotion detection sentiment analysis, and aspect-based sentiment analysis. In fine-grained sentiment analysis, we focus simply on polarities of the comment that are positive, negative, and neutral [33]. If we want to find more opinions we can dig more into and expand the polarities to very positive, positive, neutral, negative, very negative. In emotion detection sentiment analysis, we detect emotions of texts [34]. It helps us identify the happy, sad, angry emotions of the customers. It uses lexicons and machine learning to detect emotions. In aspect-based sentiment analysis, when a customer comments about the product, they mention specific aspects of the product, aspects can be a screen, battery, and many more. Aspect-based sentiment analysis is necessary because it helps to have a better understanding of the product. In this paper, aspect-based sentiment analysis is proposed to help us recommend the product [35].

3 Related Work

Sentiment analysis is a natural language processing task. It can be used at many levels of classification tasks. For the document-level classification task, it can be handled at a sentence level [6, 7] and it can be handled at the parsing level [5, 8, 9].

The recent paper uses parse level sentence classification [5] it uses rhetorical structure theory that can improve document-level sentiment analysis. Rhetorical structure theory (RST) is a parser that offers significant improvement at the parsing level [5].

Comments are posted by plenty of the customers on an e-commerce website, this helps other customers to gain insights into the product by other people’s experience. And it also helps the business side to know the viewpoint of customers. Some of the early and recent results on sentiment analysis of e-commerce data [10] used a fuzzy decision support model with sentiment analysis for item comparison in e-commerce. It uses probability multivalued neutrosophic linguistic numbers (PMVNLNs) to characterize online reviews. PMVNLN can reflect similarities and differences in positive (negative) information.

To classify the sentiments of movie reviews was discussed by [13] in the paper sentiment classification using the Machine learning technique. The authors of that paper compared various machine learning techniques for sentiment classification of movie reviews. They are Naive Bayes, maximum entropy classification, and support vector machines.

Some of the research has been done on Twitter data for sentiment analysis. One of the significant efforts of sentiment classification is done by [11]. In their work, they use polarity predictions as noisy labels to train a model from the three websites and they used around 1000 manually labelled tweets for tuning and more than 1000 manually labelled tweets for testing [11].

One of the research done by [12] is a cornerstone of sentiment analysis. It uses the new technique compared to [13]. The authors discussed applying sentiment analysis and machine learning methods to study the relationship between the online reviews for a movie and the movie’s box office revenue performance. They used document-level sentiment analysis that consists of Term Frequency and Inverse Document Frequency values as features along with Fuzzy Clustering which results in positive and negative sentiments [12].

4 Methodology

In this paper, Amazon dataset is used. The dataset contains two.csv file items and reviews of each item. Text analysis is performed on the comment set, later the aspect-based comments are extracted from those comment sets. Finally, sentiment analysis is performed to find the emotions of each comment based on a few particular aspects [36]. We have defined price, screen, battery, colour to be the most cared aspects for cell-phone-based products. Let us understand the methodology by going through the design flow of the paper (Fig. 1). Design Flow is a representation of the paper in a flowchart.

4.1 Dataset

Amazon Dataset is used for cell-phone-based products. It is a publicly available dataset. In this dataset, we have two.csv files, one file shows the items with a brand name, URL, review URL, image rating, total reviews, and price and the other file shows the reviews with brand name ratings and reviews. We are merging these two files so that all the items will be mapped with the respective reviews.

4.2 Cleansing of Reviews

We cannot deal with the text directly without cleaning it. Therefore, cleansing of the reviews is very crucial and it is an important step before text analysis [37]. In this we are cleaning the string that is we are cleaning each comment by lower-casing, trimming, removing a non-alphanumeric character, replacing non-letters with spaces. Lower-casing the text helps us to avoid any case-sensitive process and it additionally helps us to detect the stopwords in later steps [38]. Removing a non-alphanumeric character helps us to find the words without punctuations, commas, and quotes. One way we can avoid that is by replacing the non-alphanumeric characters with white spaces.

4.3 Text Analysis

There is a huge amount of unstructured data in the form of comments for the particular product. Text analysis is used to avoid challenges faced to analyse by reading each comment. It’s really hard to manually work on comments therefore we use the text analysis technique [39]. We can use text analysis to extract specific information about products like keywords, brand name, features, aspects. In this paper few methods like Punkt, wordnet, and stopwords for analysing text is used.

4.3.1 Punkt

Is a sentence tokenizer that helps us divide the text into a list of sentences, it uses an unsupervised algorithm. The NLTK data package has a pre-trained Punkt tokenizer for the English language [40].

4.3.2 Wordnet

Is also an NLTK corpus reader, it is used to find the meaning of the word, synonyms, and antonyms. Wordnet is a huge database that contains words of the English language defining Nouns, Adjectives, Adverbs, and Verbs [41]. These are classified into some groups of synonyms, which are called synsets.

4.3.3 Stopwords

Is a group of words in any language that are often used. For instance, in the English language the words like, “the”, “is” and “and”, can be easily called stop words. In NLP, stopwords are often used to eliminate words with less importance, allowing us to concentrate on the important words [42]. In this paper, we have defined the stopwords for the analysis procedure.

4.4 Cared Aspect Identification

Before the text analysis procedure, the aspects that are cared for in particular products by the user are defined. Aspects like ‘price’, ‘colour’, ‘battery’, ‘speed’ can be the most cared for aspects of buying the cell phone. After the text analysis procedure, we got the comments categorized based on those four aspects for the particular product type. In this step, the comments are categorized based on cared aspects. The entity extraction technique has been used for categorizing comments based on aspects [43].

4.5 Categorizing reviews into aspects

This paper uses entity extraction techniques to categorize the comments into aspects. This technique identifies the key elements from the text. These key elements are our entities. The entities that are defined as our aspects that are price, battery, colour, and screen [44]. According to this technique the comments of a particular brand or a product will be categorized into price, battery, colour, and screen. There are two types of entity extraction techniques that are predefined and custom [45]. Predefined models are ready to use, and custom models need to be trained. In this paper custom entity extraction technique is used. In this technique, the model is first built by defining keywords and then trained. This model is built to categorize aspects related to comments.

4.6 Sentiment Analysis

In this step, sentiment analysis is performed to find the polarities of each comment of the product. After getting aspect-based comments, sentiment analysis is performed and achieved the positive, negative, and neutral scores of each aspect of the particular product [46]. The scores are analysed after getting the polarities of each comment.

Rule-based sentiment analysis model is used to identify sentiments of the comments. This approach uses human crafted rules to classify the sentiments into positive, negative, and neutral. It basically checks and counts the positive, negative polarities from one comment and then classifies the comment into those polarities. For example, if one comment is in paragraph and it has more positives than negatives feedback, then using this approach it will return it as positive sentiments and vice-versa [47]. Therefore, we proposed a sent-token rule-based sentiment analysis approach, in which we perform sentence tokenization that is breaking the paragraphs and making one sentence as one token and then we are performing a rule based sentiment analysis approach to classify into positive, negative, and neutral [48].

4.7 Recommendation System Based on Weighted Score

After achieving sentiment scores of aspect-based comments, it has been represented so that it will help the e-commerce side predict which product should be highly recommended for another customer. This will help the business to sell the products to the customer. If for example, customer A cares more about price than battery then based on the aspect-based representation it will predict to show them the products which have positive polarity for price.

4.7.1 Calculating the Sentiment Score

Calculate the sentiment score of each review based on their aspect. In this step, we count the total number of positive, negative, and neutral comments. The values are represented in numerical format.

4.7.2 Weighting the Aspects for Every Brand

Aspects are weighted for each aspect of every brand. The weights are calculated by focussing on the positive comments from the total number of comments of a particular aspect.

$$Weight\left( {aspect_{i} } \right)\; = \;\left( {positive \, reviews/total \, reviews} \right)$$

4.7.3 Recommending Based on Weighted Score

4.7.3.1 Brand Weighted Score

In this step, it will predict the customer’s likes based on their comments so that we can recommend the products of specific brands to the customers based on the aspect which he/she is focusing on.

4.7.3.2 Product Weighted Score

In this step, it will calculate the sentiments of each product based on the aspects, and then it will predict which product is good for the customer based on the aspect which he or she likes.

The above-proposed idea is good for this dataset as it contains multiple reviews for every product. But if we have the dataset in which there is only one comment for one product it will give 100% recommendation if the review is positive [49]. Therefore we have proposed a preference-based recommendation system that will help to overcome this limitation.

4.8 Preference-Based Recommendation System

A preference-based recommendation system is used to overcome the limitation that we get from the above proposed method. This method will predict the consumer’s product choice based on previous reviews. The ranking mechanism is done based on the preferences concerning the customer’s degree of experience. The degrees can be proficiency (high degree), intermediate (medium degree), novice (starter degree). The below steps give a detailed idea of the flow.

1.
If the products have multiple comments, then,
a.
Calculate the sentiments of the comments positive, negative, and neutral.
b.
It will recommend, based on the product-weighted score as mentioned in method 1.
2.
If product A has only one comment by Customer A, then,
a.
Calculate the total number of comments by customer A. It should be greater than 5 or else the customer would be in novice degree.
b.
Find the sentiments of each comment given by the customer.
c.
Calculate the overall sentiments of the comments of every product (that customer A commented on).
d.
Compare the opinion given by customer A with the opinions, given by all other customers.
e.
For all the opinions that matched the value would be 1 and for all that did not match the value would be 0.
f.
Calculate the value of the customer that will help us know the degree of the customer’s experience.
g.
For proficiency, the value is above 80%, for an intermediate degree, the value is above 60%, and for a novice, it would be below 60%. It will recommend only from the customers which have proficiency and intermediate.

5 Results and Discussion

5.1 Data Description

The dataset that has been used is Amazon Dataset. It has reviews for multiple mobile brands like Motorola, Nokia, Samsung, HUAWEI, Sony, Apple, Google, OnePlus, Xiaomi, and ASUS. The dataset contains approximately 68,000 reviews. The dataset has multiple products for each brand type.

It is a publicly available dataset. In this dataset, we have two.csv files, one file shows the items with a brand name, URL, review URL, image rating, total reviews, and price and the other file shows the reviews with brand name ratings and reviews. We have merged these two files so that all the items will be mapped with the respective reviews.

Merging of two csv files is done because we have brand names in one csv file and we have reviews in the other csv file. We merge it to get the products mapped to the reviews. After merging these two csv files will just use the brand and the reviews of the particular items of the respective brand. Brand and Body shows that the brand is mapped to the reviews. For instance, consider the following table with brand and body as our brand of the product and reviews of each product respectively Table 1.

Table 1 Brand and their reviews

Full size table

5.2 Cleansing of Reviews

This step is very important, it cleans each comment by lower-casing, trimming, removing a non-alphanumeric character, replacing non-letters with spaces Table 2.

Table 2 A cleansed review

Full size table

The cleansed comment will avoid all wanted data from the comments. Table 3 shows the example of cleansed comments.

Table 3 Comments categorized based on price

Full size table

As it is shown in the above table the cleansed comment avoids all unwanted data by lowercasing, trimming, removing all non-alphanumeric characters, and replacing it with spaces. We need cleaning reviews for analysing the comments more accurately. It helps to focus on important terms rather than unwanted data.

5.3 Text Analysis and Cared Aspect Identification

Wordnet is an NLTK corpus reader, it is used to find the meaning of the word, synonyms, and antonyms. WordNet is a large lexical database of English that was developed at Princeton University. It is a computerized thesaurus that provides a semantic hierarchy of words and their inter-relationships. WordNet consists of synsets, which are sets of words that are semantically related and have similar meanings.

Synset is a set of synonyms that are grouped to express the same meaning. A synset, or “synonym set,” is a group of words that are considered to be synonyms or semantically related. Each synset in WordNet represents a distinct concept, and the words in the synset share a common meaning. Synsets are organized into a hierarchy, with broader concepts at the top and more specific concepts at the bottom. This hierarchical structure allows for efficient searching and retrieval of related words and concepts. Synsets are the building blocks of WordNet and are the primary means by which words are connected and related to one another in the database.

In this paper, we identify the aspects and after identifying the aspects it will group them. Figure 2 shows the algorithm for identifying the sentiment score of price aspect. Similarly, the same procedure was performed to identify the sentiment scores of other most cared aspects like screen, battery, and colour.

Synset is used for identifying the aspects from the comments. In this paper, we define our most cared aspect to be price, battery, screen, and colour. Synset was used to find these aspects. The function of Synset is a special kind of interface that is present in the Natural Language Tool Kit to look up words in wordnet. It is a group of synonyms of words which has the same meaning and the same concept. In simple words, in wordnet we have similar words that are grouped into a set called synset, A synset contains name, POS, and a number. The words in a synset are called Lemmas. The function ‘wordnet.synsets (“word”)’ will return an array containing all the synonyms related to that word passed to it as the argument. In this case, we will define the word to be price, battery, screen, and colour. Then it will find the synonyms of the word price, battery, screen, and colour.

After identifying the price, battery, screen, colour aspects, sentence tokenization is performed to display the comments based on the respective aspects. Sentence tokenization helps to form the tokens in the form of sentences. Every sentence becomes a token. After doing all these we get comments categorized based on the most cared aspect. For instance, the following Table 3 shows the comments based on the price aspect for the Samsung brand. The table only showcases four comments among a lot of comments of Samsung brand based on price aspect. Similarly, it has been done to find for all other brands and their most cared aspects.

5.4 Sentiment Analysis

In this step, sentiment analysis is performed to find the polarities of each comment. The polarities are positive, negative, and neutral. For example, consider the brand “Samsung” it has 2836 positive comments, 1105 negative comments, and 935 neutral comments for only the aspect ‘battery’. Similarly, sentiment analysis was done on battery, colour, and price aspects. Table 4 below shows the sentiment analysis of the Samsung brand on these four aspects.

Table 4 Sentiment Analysis based on Aspects for Samsung brand

Full size table

Similarly, we have performed sentiment analysis and calculated the sentiments of the comments, and categorized it as positive, negative, and neutral on various brands that has been mentioned earlier in this report, based on the most cared aspects.

5.5 Output: Aspect Based Representation

Below Fig. 3 is a graphical representation of the sentiments that has been achieved after performing sentiment analysis for the price aspect of the Samsung brand.

Figure 3 shows the counts of positive reviews around 1400, negative around 200, and neutral around 300. As we can see that there are more positive reviews for the price aspect it means that the customer is satisfied with the price.

The above Fig. 4 is a graphical representation of the sentiments that has been achieved after performing sentiment analysis for the screen aspect of the Samsung brand. It has more than 2500 positive comments and around 1000 negative comments, and 1000 neutral comments. Display sentiment graph shows the screen aspect for Samsung brand. According to the graph, it does not have a lot of positive reviews. It is just an average, that is the people on the business side will have to focus on this aspect.

The above Fig. 5 is a graphical representation of the sentiments that has been achieved after performing sentiment analysis for the battery aspect of the Samsung brand. It has more than 3000 positive comments and around 1000 negative and neutral comments. Since it has more positive comments that means the customer is satisfied by the battery aspect of the Samsung brand.

The above Fig. 6 is a graphical representation of the sentiments that has been achieved after performing sentiment analysis for the colour aspect of the Samsung brand. It has around 160 positive reviews and around 30 negative and neutral comments. Since it has more positive comments compared to negative and neutral ones, it is satisfactory at the customer’s side.

Similarly, we can see the graphical representation of each brand based on their aspects.

5.6 Recommending Based on Weighted Score

After analysing the sentiments, we have calculated the scores for each aspect of every brand as shown in Table 5 below. The scores have been calculated focussing on the positive comments from the total number of comments of a particular aspect. This will help us to predict the customer’s likes based on their comments so that we can recommend the products of specific brands to the customers based on the aspect which he/she is focusing on.

Table 5 Calculating Scores for each Aspect (brand level)

Full size table

For instance, if customer A cares about ‘price’ more than the other aspects of the product then we can recommend to the customer the products of the brand which has the highest score for the ‘price’ aspect. Here according to the comments, we have the ‘Xiaomi’ brand that has the highest score for the ‘price’ aspect. Therefore, we can recommend Xiaomi products to Customer A.

For instance, if the customer wants a particular brand and wants the best product to be suggested to them, then it will do product-based recommendation as shown in the above Table 6.

Table 6 Calculating Scores for each Aspect (product level)

Full size table

Here according to the comments, we have a OnePlus 7 pro to be recommended if the customer cares about the screen, battery, and colour. If the customer cares about the price, then it will recommend OnePlus 7.

5.7 Preference-Based Recommendation System

A preference-based recommendation system is used to overcome the limitation that we get from the above proposed method. This method will predict the consumer’s product choice based on previous reviews. The ranking mechanism is done based on the preferences concerning the customer’s degree of experience. The degrees can be proficiency (high degree), intermediate (medium degree), novice (starter degree).

We will check if there are multiple comments for a particular product or just one comment for a particular product. If there are multiple comments for the particular product then the above method to calculate the scores based on a product level is implemented.

Method 1 has limitations that have been overcome by improvising the algorithm. The limitation was if there is only one comment for a particular product.T algorithm will not accurately recommend the product. Therefore we proposed an improvised version of method 1 that is a preference-based recommendation system.

For instance, Janet commented on iPhone 12 pro. And there is only one comment for that product. She commented, “Battery performance is awesome”. Firstly, the algorithmic flow for this approach would be to check if the product ‘iPhone 12’ has multiple comments. Since it has only one comment it will go to step 2.

Calculate the total number of comments given by Janet for various products. For example, Janet gave around 10 comments for various products. Now it will check if the total number of comments given by Janet is greater than 5. Since it’s greater than 5 we will proceed to the next steps of the algorithm.

Perform sentiment analysis to find the opinions of Janet for every product she has commented on. For example, Janet commented on products like the OnePlus 6, iPhone 7, Samsung A50, Samsung S10, iPhone 11, Motorola G6, Motorola G4, OnePlus 7 pro, Samsung A70, and Motorola G5. And sentiments calculated from them are positive, negative, positive, positive, positive, negative, positive, negative, positive, and negative respectively.

Calculate the overall sentiments of the comments of every product (that customer A commented on). In this step, the sentiment analysis is performed and scores of each product have been calculated.

After that comparison is done between the opinion given by Janet and the opinions given by all other customers. Janet gave positive, negative, positive, positive, positive, negative, positive, negative, positive, and negative. As shown in Table 7 we will see whether positive reviews are greater than the other reviews, then we will assign the opinion as positive to the products. Finally, we will compare the opinion of Janet with the assigned opinions of the product. If it’s equal then we will assign 1 or else 0. So in this case our values will be (1, 0, 1, 1, 1, 0, 1, 0, 1, 0).

Table 7 Sentiment analysis of products

Full size table

Now, to calculate the customer’s value based on their experience the following formula has been used.

$$Cust\_Value_{j} \; = \;\left( {Sum \, of Cust\_op_{j} } \right)/\left( {Total \, reviews \, of \, Cust_{j} } \right)$$

where j is a particular customer.

$$Cust\_Value_{janet} \; = \;\left( {1 + 0 + 1 + 1 + 1 + 0 + 1 + 0 + 1 + 0} \right)/10 = 0.6$$

Now to determine the degree of experience of Janet we have to assign the cust_value to the degree. Cust_value of Janet is 0.6 which is 60% she will be in an intermediate degree. Since she belongs to an intermediate degree, we can recommend that particular product to the other customers.

6 Conclusion and Future Work

In this paper, the most cared aspects that has been considered are price, battery, colour, and screen for the cell-phone-based dataset. This dataset is an Amazon dataset, and it’s available to use. We have grouped the comments of each brand and product based on the most cared aspects. Then sentiment analysis is performed on those aspect-based comments and found whether the comments are positive, negative, and neutral. We then calculated the polarities for the same. Finally, for all brands, we have calculated the scores for each aspect. This helped to predict what products to be recommended to the customer based on their most cared aspects. The above-proposed idea is good for the dataset that has been used, as it contains multiple reviews for every product. But if we have the dataset in which there is only one comment for one product it will give 100% recommendation if the review is positive. Therefore, we have proposed a preference-based recommendation system that will help to overcome this limitation. We can further perform the same method on various datasets according to the customer’s cared aspects. Thus we see that our proposed method acts as a boon to product based companies to actually target customers with products based on their most cared aspects. This not only decreases the marketing cost at the end of the e-commerce company but also saves customers valuable time. For future work, firstly, we can define more aspects to be the most cared aspects by the customer and get product recommendation for them as well. Secondly, we can develop more sophisticated and accurate sentiment analysis algorithms for better accuracy. Lastly, we can also expand the scope of the recommendation system- The current system is focused on product recommendations based on customer feedback, but it can be expanded to include other aspects such as geographic location, purchasing history, and social media activity to provide more accurate and personalized recommendations.

Data availability

The datasets generated and/or analysed during the current study are available in the Data Folder repository, http://archive.ics.uci.edu/ml/datasets/Amazon+Commerce+reviews+set#.

References

Medhat W, Hassan A, Korashy H. Sentiment analysis algorithms and applications, a survey. Ain Shams Eng J. 2014;5(4):1093–113. https://doi.org/10.1016/j.asej.2014.04.011.
Article Google Scholar
Tsytsarau M, Palpanas T. Survey on mining subjective data on the web. Data Min Knowl Disc. 2012;24:478–514. https://doi.org/10.1007/s10618-011-0238-6.
Article MATH Google Scholar
Liang-Chih Yu, Jheng-Long Wu, Chang P-C, Chu H-S. Using a contextual entropy model to expand emotion words and their intensity for the sentiment classification of stock market news. Knowled-Based Syst. 2013;41:89–97. https://doi.org/10.1016/j.knosys.2013.01.001.
Article Google Scholar
Tao Xu, Peng Q, Cheng Y. Identifying the semantic orientation of terms using S-HAL for sentiment analysis. Knowled-Based Syst. 2012;35:279–89. https://doi.org/10.1016/j.knosys.2012.04.011.
Article Google Scholar
Bhatia P, Ji Y, Eisenstein J. Better Document-level Sentiment Analysis from RST Discourse Parsing. Published at Empirical Methods in Natural Language Processing (EMNLP 2015) (2015). https://doi.org/10.48550/arXiv.1509.01599
Turney P Thumbs up or thumbs down? semantic orientation applied to unsupervised classification of reviews. ACL (2002)
Pang and Lee L A sentimental education: Sentiment analysis using subjectivity analysis using subjectivity summarization based on minimum cuts. ACL (2004)
Wilson T, Wiebe J, Hoffman P Recognizing contextual polarity in phrase level sentiment analysis. ACL (2005)
Apoorv Agarwal, Fadi Biadsy, Kathleen Mckeown Contextual phrase-level polarity analysis using lexical affect scoring and syntactic n-grams. Proceedings of the 12th Conference of the European Chapter of the ACL (EACL 2009). 24–32 (2009)
P. Ji, H. -Y. Zhang and J. -Q. Wang “A Fuzzy Decision Support Model With Sentiment Analysis for Items Comparison in e-Commerce: The Case Study of http://PConline.com.” In: IEEE Transactions on Systems, Man, and Cybernetics: Systems. 1993–2004 (2019).
Luciano Barbosa, Junlan Feng Robust sentiment detection on twitter from biased and noisy data. Proceedings of the 23rd International Conference on Computational Linguistics: Posters. 36–44 (2010).
Nagamma P, Pruthvi HR, Nisha KK, Shwetha NH. An improved sentiment analysis of online movie reviews based on clustering for box-office prediction. Int Conf Comput Commun Automat. 2015. https://doi.org/10.1109/CCAA.2015.7148530.
Article Google Scholar
Pang Bo, Lee L, Vaithyanathan S. Thumbs up? Sentiment classification using machine learning techniques. EMNLP. 2002. https://doi.org/10.3115/1118693.1118704.
Article Google Scholar
Ben Kharrat F, Elkhleifi A, Faiz R “Recommendation system based contextual analysis of Facebook comment.” IEEE/ACS 13th International Conference of Computer Systems and Applications (AICCSA). Agadir, Morocco. 1-6 (2016)
Decker R, Trusov M. Estimating aggregate consumer preferences from online product reviews. Int J Res Market. 2010;27(4):293–307. https://doi.org/10.1016/j.ijresmar.2010.09.001.
Article Google Scholar
Cheng L-C, Wang H-A. A fuzzy recommender system based on the integration of subjective preferences and objective information. Appl Soft Comput. 2014;18:290–301. https://doi.org/10.1016/j.asoc.2013.09.004.
Article Google Scholar
Zhang H, Li J, Ji Y, Ye Y “Content-based movie recommending using a Triple Wing Harmonium model.” 2015 IEEE 13th International Conference on Industrial Informatics (INDIN). Cambridge, UK. 2015. pp 1096–1101.
Hill W, Stead L, Rosenstein M, Furnas G “Recommending and Evaluating Choices in a Virtual Community of Use.” In: Proceedings of CHI. pp 95.
Linden G, Smith B, York J. Amazon.com, “Amazon.com Recommendations Item-to-Item Collaborative Filtering.” IEEE Int Comput. 2003;7(1):76–80.
Article Google Scholar
Luo Yi, Fan Miao, Zhou Xiaoxia, “The Design and Implementation of Feature-Grading Recommendation System for ECommerce,” Proceeding of the IEEE International Conference on Information and Automation Shenzhen, China June 2011.
Yifan Hu, Yehuda Koren, Chris Volinsky, “Collaborative Filtering for Implicit Feedback Datasets,” AT&T Labs—Research in 2012.
Miao Fan, Guoshi Wu, Jing Li, “Feature-Item Recommender System for E-Commerce,” 2011 International ConferenceonComputer Control and Automation.241
Adomavicius G, Tuzhilin A Toward the next generation of recommender systems. IEEE Transactions on Knowledge and Data Engineering, 2005.
Ben Schafer J, Konstan J, Riedl J Recommender systems in e-commerce. In: EC ’99: Proceedings of the 1st ACM conference on Electronic commerce, NY, ACM, pp. 158, (1999).
Brin S, Page L. The anatomy of a large-scale hypertextual web search engine. Comput Network ISDN Syst. 1998;30(1–7):107–17.
Article Google Scholar
Burke R. Hybrid recommender systems: survey and experiments. User Model User-Adapt Int. 2002;21(4):331–70.
Article MATH Google Scholar
Dixit VS, Bedi P, Mehta H. Generation of web recommendations using implicit user feedback and normalized mutual information. Int J Knowled Web Intell. 2013;4(2/3):113–41.
Article Google Scholar
Ekstrand, MD., Ludwig, M., Konstan, JA. and Riedl, JT, Rethinking the recommender research ecosystem: reproducibility, openness, and lenskit. In: Proceedings of the fifth ACM Conference on Recommender Systems, pp. 133–140, (2011)
Fellbaum C, Grabowski J, Landes S. Performance and confidence in a semantic annotation task. MIT Press Cambrid Massachuset Chapt. 1998;9:216–37.
Google Scholar
Funk S [online]. http://sifter.org/~simon/journal/20061211.html (Accessed Dec. 2022).
Baatarjav E-A, Phithakkitnukoon S, Et R. Dantu. Group recommendation system for facebook. pp. 211–219, (2008)
Schroder M, Baggia P, Burkhardt F, Oltramari A, Pelachaud C, Peter C, Zovato E, Emotion Markup Language (EmotionML) 1.0. W3C Working Draft, 2010. http://www.w3.org/TR/emotionml/.
M. Schroeder, H. Pirker, M. Lamolle, F. Burkhardt, C. Peter, E. Zovato. 2011 Representing emotions and related states in technological systems. In: Emotion-Oriented Systems. Springer. Heidelberg. 369-387.
Hill W, Terveen L (1996) 'Using frequency-of-mention in public conversations for social filtering. In: Proceedings of the 1996 ACM conference on Computer supported cooperative work. New York, USA. 106–112.
Goldberg D, Nichols D, Oki B, Douglas T (1992) Using collaborative filtering to weave an information tapestry. In: communications of the ACM. 61–70.
Goldberg K, Roeder T, Gupta D, Perkins C. Eigentaste: a constant time collaborative filtering algorithm. In Inf Retr. 2001;4(2):133–51.
Article MATH Google Scholar
Konstan J, Miller B, Maltz D, Herlocker J, Gordon L, Riedl J. Grouplens: applying collaborative filtering to usenet news. Commun ACM. 1997;40(3):77–87.
Article Google Scholar
Lang K (1995) Newsweeder: learning to filter netnews. In: Proceedings of the 12th Interna-tional Conference on Machine Learning. 331–339.
Pang B, Lee L Opinion Mining and Sentiment Analysis. Now Publishers Inc., 2008.
Lianga J, Rakesh K, Keith W-R. The fasttrack overlay: a measurement study. J Comput Networks. 2006;50(6):842–58.
Article Google Scholar
Paterek A (2007) Improving regularized singular value decomposition for collaborative filtering. In KDD Cup and Workshop.
Pazzani M, Billsus D. Learning and revising user profiles:t identification of interesting websites. J Mach Learn. 1997;27(3):313–31.
Article Google Scholar
Chouhan K, Yadav M, Rout RK, Sahoo KS, Jhanjhi N, et al. Sentiment analysis with tweets behaviour in twitter streaming api. Comput Syst Sci Eng. 2023;45(2):1113–28.
Article Google Scholar
Almuayqil SN, Humayun M, Jhanjhi NZ, Almufareh MF, Javed D. Framework for improved sentiment analysis via random minority oversampling for user tweet review classification. Electronics. 2022;11(19):3058.
Article Google Scholar
Yadav S, Shah D. News summarization using text mining. Int Res J Eng Technol. 2018;5(11):202–6.
Google Scholar
Yadav S, Shah K (2018b) ‘To analyze student’s learning experience in social media’. IC-CSOD-2018 Conference Proceedings.
Yadav S, Shah K (2018c) ‘To understand drug usage by mining social media data’. IC-CSOD-2018 Conference Proceedings.
Yadav S, Yadav S. Text mining of Voot application reviews on Google Play Store. Int Res J Eng Technol e-ISSN. 2018;5(1):56–72.
Google Scholar
Zhang M, et al. AsU-OSum: aspect-augmented unsupervised opinion summarization. Inf Proc Manage. 2023;60(1):103138.
Article Google Scholar

Download references

Acknowledgements

Not applicable.

Funding

This study did not receive any specific funding.

Author information

Authors and Affiliations

MPSTME, NMIMS University, Mumbai, India
Nimesh Bali Yadav

Authors

Nimesh Bali Yadav
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

NY: Data collection and interpretation; Scientific Writing including initial draft preparation and manuscript revision and editing; Table and Figure preparation; Literature review; Contributed to the interpretation of the results and critical revision of the manuscript for important intellectual content. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Nimesh Bali Yadav.

Ethics declarations

Conflict of interest

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Yadav, N.B. “Harnessing Customer Feedback for Product Recommendations: An Aspect-Level Sentiment Analysis Framework”. Hum-Cent Intell Syst 3, 57–67 (2023). https://doi.org/10.1007/s44230-023-00018-2

Download citation

Received: 20 November 2022
Accepted: 21 February 2023
Published: 27 March 2023
Issue Date: June 2023
DOI: https://doi.org/10.1007/s44230-023-00018-2

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

“Harnessing Customer Feedback for Product Recommendations: An Aspect-Level Sentiment Analysis Framework”

Abstract

Similar content being viewed by others

A difference of multimedia consumer’s rating and review through sentiment analysis

Framework for Customers’ Sentiment Analysis

Sentiment Analysis on Online Product Reviews

Explore related subjects

1 Introduction

2 Background

3 Related Work

4 Methodology

4.1 Dataset

4.2 Cleansing of Reviews

4.3 Text Analysis

4.3.1 Punkt

4.3.2 Wordnet

4.3.3 Stopwords

4.4 Cared Aspect Identification

4.5 Categorizing reviews into aspects

4.6 Sentiment Analysis

4.7 Recommendation System Based on Weighted Score

4.7.1 Calculating the Sentiment Score

4.7.2 Weighting the Aspects for Every Brand

4.7.3 Recommending Based on Weighted Score

4.7.3.1 Brand Weighted Score

4.7.3.2 Product Weighted Score

4.8 Preference-Based Recommendation System

5 Results and Discussion

5.1 Data Description

5.2 Cleansing of Reviews

5.3 Text Analysis and Cared Aspect Identification

5.4 Sentiment Analysis

5.5 Output: Aspect Based Representation

5.6 Recommending Based on Weighted Score

5.7 Preference-Based Recommendation System

6 Conclusion and Future Work

Data availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation