Ensemble application of bidirectional LSTM and GRU for aspect category detection with imbalanced data

Kumar, J. Ashok; Abirami, S.

doi:10.1007/s00521-021-06100-9

Original Article
Published: 12 May 2021

Volume 33, pages 14603–14621 (2021)

Neural Computing and Applications

Abstract

E-commerce websites generate a large number of online reviews, posts, and comments about products and services. These reviews help consumers decide whether to buy or recommend a product. Consumers, however, typically express their views on a specific aspect category of a product. Aspect category detection is a subtask of aspect-based sentiment analysis that classifies a given text into a set of predefined aspects. A class imbalance problem arises naturally in real-world applications. Class imbalance has been studied over the last two decades using machine learning algorithms, but there is very little empirical research on the class imbalance problem in deep learning. In this paper, we propose bidirectional LSTM and GRU networks to deal with imbalanced aspect categories. The proposed method applies a data-level technique to reduce class imbalance; specifically, we employ stratified sampling to deal with imbalanced classes. Moreover, we create word vectors with corpus-specific word embeddings and pre-trained word embeddings. These word representations are fed into the proposed networks under four merge modes: addition, multiplication, average, and concatenation. The performance of the method is evaluated with a confusion matrix and with precision, recall, and F1-score under micro-average, macro-average, and weighted average. The experimental results suggest that the proposed method performs best with pre-trained word embeddings.
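
The abstract names three building blocks without giving detail: stratified sampling, the four merge modes of a bidirectional network, and micro-, macro-, and weighted-averaged scores. The sketch below is an illustration in plain Python and is not the authors' implementation. The function names (`merge`, `stratified_split`, `f1_averages`), the 20% test fraction, and the single-label setting are all assumptions made for this example. Each hidden state is represented as a plain list of floats.

```python
import random
from collections import Counter, defaultdict


def merge(forward, backward, mode):
    """Combine forward and backward hidden-state vectors of equal length."""
    if len(forward) != len(backward):
        raise ValueError("forward and backward states must have equal length")
    if mode == "add":
        return [f + b for f, b in zip(forward, backward)]
    if mode == "mul":
        return [f * b for f, b in zip(forward, backward)]
    if mode == "ave":
        return [(f + b) / 2 for f, b in zip(forward, backward)]
    if mode == "concat":
        # Concatenation doubles the output dimension; the others keep it.
        return list(forward) + list(backward)
    raise ValueError(f"unknown merge mode: {mode}")


def stratified_split(labels, test_fraction=0.2, seed=0):
    """Return (train_idx, test_idx) so each class keeps roughly its share.

    Assumption: every class contributes at least one test example, so a
    class with a single example ends up in the test set only.
    """
    by_class = defaultdict(list)
    for i, y in enumerate(labels):
        by_class[y].append(i)
    rng = random.Random(seed)
    train, test = [], []
    for y in sorted(by_class):
        idx = by_class[y]
        rng.shuffle(idx)
        n_test = max(1, round(len(idx) * test_fraction))
        test.extend(idx[:n_test])
        train.extend(idx[n_test:])
    return sorted(train), sorted(test)


def f1_averages(y_true, y_pred):
    """Micro, macro, and support-weighted F1 for single-label predictions."""
    classes = sorted(set(y_true) | set(y_pred))
    support = Counter(y_true)
    tp, fp, fn = Counter(), Counter(), Counter()
    for t, p in zip(y_true, y_pred):
        if t == p:
            tp[t] += 1
        else:
            fp[p] += 1  # predicted class gains a false positive
            fn[t] += 1  # true class gains a false negative

    def f1(tp_, fp_, fn_):
        denom = 2 * tp_ + fp_ + fn_
        return 2 * tp_ / denom if denom else 0.0

    per_class = {c: f1(tp[c], fp[c], fn[c]) for c in classes}
    micro = f1(sum(tp.values()), sum(fp.values()), sum(fn.values()))
    macro = sum(per_class.values()) / len(classes)
    weighted = sum(per_class[c] * support[c] for c in classes) / len(y_true)
    return {"micro": micro, "macro": macro, "weighted": weighted}
```

On imbalanced data the three averages can diverge noticeably. The macro average weights every class equally, so a poorly predicted minority class lowers it more than it lowers the micro or weighted average. That difference is one reason to report all three.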

Acknowledgements

We thank the University Grants Commission (UGC), Government of India for supporting this work under the UGC National Fellowship. Also, we thank editors and reviewers for their valuable comments and guidance to improve the earlier version of this paper.

Author information

Authors and Affiliations

Department of Information Science and Technology, Anna University, Chennai, 600025, India
J. Ashok Kumar & S. Abirami

Corresponding author

Correspondence to J. Ashok Kumar.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

About this article

Cite this article

Kumar, J.A., Abirami, S. Ensemble application of bidirectional LSTM and GRU for aspect category detection with imbalanced data. Neural Comput & Applic 33, 14603–14621 (2021). https://doi.org/10.1007/s00521-021-06100-9

Received: 03 November 2019
Accepted: 01 May 2021
Published: 12 May 2021
Issue Date: November 2021
DOI: https://doi.org/10.1007/s00521-021-06100-9
