Skip to main content
Log in

Sentence Annotation for Aspect-oriented Sentiment Analysis: A Lexicon based Approach with Marathi Movie Reviews

  • ORIGINAL CONTRIBUTION
  • Published:
Journal of The Institution of Engineers (India): Series B Aims and scope Submit manuscript

Abstract

Aspect oriented sentiment analysis is a research area in natural language processing concerned with analyzing opinions at aspect/feature level. In this paper we present the construction of sentiment annotation system designed for aspect oriented sentiment analysis of Marathi movie reviews. The objective of the work is to create annotated dataset for opinion summarization system. For opinion summarization systems, it is better to use aspect based approach and build summaries around specific aspects and provide in depth analysis of the review. Since manual annotation of large data is difficult and time consuming, we try to automate the task using lexicon based approach. With the help of aspect and sentiment lexicons, we demonstrate a rule based annotation scheme designed for movie reviews domain in Marathi language. The annotated dataset is evaluated for aspect identification and aspect-sentiment pairing. The system reported an average F1 of 0.76 and 0.71 respectively for both the tasks. The proposed model can easily be adopted for sentiment analysis in other domains in Marathi language.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2

Similar content being viewed by others

References

  1. P. Karagoz, B. Kama, M. Ozturk, I.H. Toroslu, D. Canturk, A framework for aspect based sentiment analysis on turkish informal texts. J. Intell. Inform. Syst. 53(3), 431–451 (2019)

    Article  Google Scholar 

  2. J.S. Kessler, N. Nicolov, The JDPA sentiment corpus for the automotive domain. In handbook of linguistic annotation, (Springer, 2017) p 833–854

  3. W. Liao, Bi. Zeng, X. Yin, P. Wei, An improved aspect-category sentiment analysis model for text sentiment analysis based on roberta. Appl. Intell. 51(6), 3522–3533 (2021)

    Article  Google Scholar 

  4. N. Zainuddin, A. Selamat, R. Ibrahim, Hybrid sentiment classification on twitter aspect-based sentiment analysis. Appl. Intell. 48(5), 1218–1232 (2018)

    Google Scholar 

  5. D. Zhang, Z. Zhu, S. Kang, G. Zhang, P. Liu, Syntactic and semantic analysis network for aspect-level sentiment classification. Appl. Intell. 51(8), 6136–6147 (2021)

    Article  Google Scholar 

  6. A. Goud B. Garg, A novel framework for aspect based sentiment analysis using a hybrid bert (hybbert) model. Multimed. Tools Appl. 1–33 (2023)

  7. B. Daille, E. Dubreil, L. Monceaux, M. Vernier, Annotating opinion— evaluation of blogs: the blogoscopy corpus. Lang. Resour. Eval. 45(4), 409–437 (2011)

    Article  Google Scholar 

  8. J. Read, J. Carroll, Annotating expressions of appraisal in english. Lang. Resour. Eval. 46(3), 421–447 (2012)

    Article  Google Scholar 

  9. J. Ramirez, M. Garber, X. Wang, SoccEval: an annotation schema for rating soccer players. In Proceedings of ACL 2017, Student Research Workshop, p 89–94 (2017)

  10. J. Barnes, P. Lambert, T. Badia, MultiBooked: a corpus of basque and catalan hotel reviews annotated for aspect-level sentiment classification. arXiv preprint arXiv:1803.08614, (2018)

  11. D. Cirqueira, L. Vinícius, M. Pinheiro, A. Jacob Junior, F. Lobato, Á. Santana, Opinion label: A gamified crowdsourcing system for sentiment. In Anais Estendidos do XXIII Simpósio Brasileiro de Sistemas Multimídia e Web, p 209–213. SBC, (2017)

  12. T. Sutabri, M. Ardiansyah, Framework of sentiment annotation for document specification in indonesian language base on topic modeling and machine learning. In Cyber and IT Service Management (CITSM), 2017 5th International Conference on, 1–6 IEEE, (2017)

  13. K. Garg, D.K. Lobiyal, Hindi emotionnet: a scalable emotion lexicon for sentiment classification of Hindi text. ACM Trans. Asian Low-Resour. Lang. Inform. Process. (TALLIP) 19(4), 1–35 (2020)

    Article  Google Scholar 

  14. V. Gupta, N. Jain, S. Shubham, A. Madan, A. Chaudhary, Q. Xin, Toward integrated cnn-based sentiment analysis of tweets for scarce-resource language—hindi. Trans. Asian Low-Resour. Lang. Inform. Process. 20(5), 1–23 (2021)

    Article  Google Scholar 

  15. S. Rani, P. Kumar, Aspect-based sentiment analysis using dependency parsing. Trans. Asian Low-Resour. Lang. Inform. Process. 21(3), 1–19 (2021)

    Google Scholar 

  16. S.S. Mukku, R.M. Actsa: Annotated corpus for telugu sentiment analysis. In Proceedings of the first workshop on building linguistically generalizable NLP systems, p 54– 58 (2017)

  17. Manisha Satish Divate, Sentiment analysis of marathi news using lstm. Int. J. Inf. Technol. 13(5), 2069–2074 (2021)

    Google Scholar 

  18. R.S. Patil, S.R. Kolhe, Supervised classifiers with tf-idf features for sentiment analysis of marathi tweets. Soc. Net. Anal. Mining 12(1), 1–16 (2022)

    Google Scholar 

  19. M. Kabir, O.B. Mahfuz, S.R. Raiyan, H. Mahmud, Md K. Hasan. BanglaBook: a large-scale Bangla dataset for sentiment analysis from book reviews. In Findings of the association for computational linguistics: ACL 2023, Toronto, Canada, Association for computational linguistics (July 2023) p 1237–1247

  20. A. Kulkarni, M. Mandhane, M. Likhitkar, G. Kshirsagar, R. Joshi, L3CubeMahaSent: A Marathi tweet-based sentiment analysis dataset. In Proceedings of the eleventh workshop on computational approaches to subjectivity, sentiment and social media analysis. Association for computational linguistics, (April 2021) p 213–220

  21. N. Mhaske, A. Patil, Issues and challenges in analyzing opinions in marathi text. IJCSI Int. J. Comput. Sci. Issue 13(2), 19–25 (2016)

    Google Scholar 

  22. N.T. Mhaske, A.S. Patil, Resource creation for opinion mining: a case study with marathi movie reviews. Int. J. Inform. Technol. 13(4), 1521–1529 (2021)

    Google Scholar 

  23. B. Pang, L. Lee, S. Vaithyanathan, Thumbs up?: sentiment classification using machine learning techniques. In Proceedings of the ACL-02 conference on empirical methods in natural language processing-Vol 10, 2002 p 79–86

  24. H. Yu, V. Hatzivassiloglou, Towards answering opinion questions: separating facts from opinions and identifying the polarity of opinion sentences. In Proceedings of the 2003 conference on Empirical methods in natural language processing, association for computational linguistics, 2003 p 129–136

Download references

Funding

No funding was provided for the completion of this study.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to N. T. Mhaske.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Mhaske, N.T., Patil, A.S. Sentence Annotation for Aspect-oriented Sentiment Analysis: A Lexicon based Approach with Marathi Movie Reviews. J. Inst. Eng. India Ser. B (2024). https://doi.org/10.1007/s40031-024-01072-5

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s40031-024-01072-5

Keywords

Navigation