Abstract
Extracting opinion targets (features) and opinion words is a core task in aspect-level sentiment analysis. In this paper, we propose a hybrid approach for mining features and opinion words. It builds on an upgraded “double propagation” algorithm, adding rules that exploit the semantic relations between parts of speech in a sentence, together with regular-expression rules and ontologies. We employ the HITS algorithm to prune features. Experiments on multiple data sets show that the approach gives acceptable results.
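The two ideas named in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation: it assumes that (opinion word, target) pairs have already been extracted from parsed sentences, it replaces the rule set with a single "linked pair" rule, and it omits the regular-expression rules and ontologies. The function names, the seed word, and the sample pairs are all hypothetical.

```python
from math import sqrt


def double_propagation(pairs, seed_opinions):
    """Simplified double propagation over (opinion_word, target) pairs.

    Assumption: one propagation rule only. A target linked to a known
    opinion word becomes a feature, and an opinion word linked to a
    known feature becomes a known opinion word.
    """
    opinions = set(seed_opinions)
    features = set()
    changed = True
    while changed:  # repeat until no new word is discovered
        changed = False
        for opinion, target in pairs:
            if opinion in opinions and target not in features:
                features.add(target)
                changed = True
            if target in features and opinion not in opinions:
                opinions.add(opinion)
                changed = True
    return features, opinions


def _normalize(scores):
    """Scale a score dictionary to unit Euclidean length."""
    norm = sqrt(sum(v * v for v in scores.values())) or 1.0
    return {k: v / norm for k, v in scores.items()}


def hits_feature_scores(pairs, features, opinions, iterations=50):
    """HITS on the bipartite graph: opinion words are hubs, features are authorities."""
    edges = [(o, f) for o, f in pairs if o in opinions and f in features]
    hub = {o: 1.0 for o in opinions}
    auth = {f: 1.0 for f in features}
    for _ in range(iterations):
        # Authority update: sum of hub scores of the linked opinion words.
        new_auth = {f: 0.0 for f in features}
        for o, f in edges:
            new_auth[f] += hub[o]
        # Hub update: sum of the fresh authority scores of the linked features.
        new_hub = {o: 0.0 for o in opinions}
        for o, f in edges:
            new_hub[o] += new_auth[f]
        auth = _normalize(new_auth)
        hub = _normalize(new_hub)
    return auth


# Hypothetical pairs, as if read off adjective-noun dependencies in reviews.
sample_pairs = [
    ("great", "screen"), ("great", "battery"),
    ("amazing", "screen"), ("amazing", "battery"), ("amazing", "thing"),
    ("slow", "delivery"),
]
found_features, found_opinions = double_propagation(sample_pairs, {"great"})
feature_scores = hits_feature_scores(sample_pairs, found_features, found_opinions)
for name in sorted(feature_scores, key=feature_scores.get, reverse=True):
    print(name, round(feature_scores[name], 3))
```

In this toy run, "delivery" is never reached because its only opinion word is not connected to the seed, and "thing" receives a lower authority score than "screen" or "battery". Pruning would then drop candidates below a chosen score threshold; that threshold is a tuning decision this sketch does not attempt to fix.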
Acknowledgement
This work was supported by research project C2016-20-32, funded by Vietnam National University Ho Chi Minh City (VNU-HCM).
Cite this article
Tran, T.K., Phan, T.T. Mining opinion targets and opinion words from online reviews. Int. j. inf. tecnol. 9, 239–249 (2017). https://doi.org/10.1007/s41870-017-0032-9
DOI: https://doi.org/10.1007/s41870-017-0032-9