Abstract
The advent of Web 2.0 has led to an increase in user-generated content on the Web. This has provided an extensive collection of free-style texts with opinion expressions that could influence the decisions and actions of their readers. Providers of such content exert a certain level of influence on the receivers and this is evident from blog sites having effect on their readers’ purchase decisions, political view points, financial planning, and others. By detecting the opinion expressed, we can identify the sentiments on the topics discussed and the influence exerted on the readers. In this paper, we introduce an automatic approach in deriving polarity pattern rules to detect sentiment polarity at the phrase level, and in addition consider the effects of the more complex relationships found between words in sentiment polarity classification. Recent sentiment analysis research has focused on the functional relations of words using typed dependency parsing, providing a refined analysis on the grammar and semantics of textual data. Heuristics are typically used to determine the typed dependency polarity patterns, which may not comprehensively identify all possible rules. We study the use of class sequential rules (CSRs) to automatically learn the typed dependency patterns, and benchmark the performance of CSR against a heuristic method. Preliminary results show CSR leads to further improvements in classification performance achieving over 80% F1 scores in the test cases. In addition, we observe more complex relationships between words that could influence phrase sentiment polarity, and further discuss on possible approaches to handle the effects of these complex relationships.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Adar E, Adamic L A. Tracking information epidemics in blogspace. In Proc. Int. Conf. Web Intelligence, Washington, DC, USA, Sept. 2005, pp.207–214.
Agarwal N, Liu H, Tang L, Yu P S. Identifying the influential bloggers in a community. In Proc. WSDM 2008, New York, USA, Feb. 2008, pp.207–218.
Tan L K W, Na J C, Theng Y L. Influence detection between blog posts through blog features, content analysis, and community identity. Online Information Review, 2011, 35(3): 425–442.
Abbasi A, Chen H, Salem A. Sentiment analysis in multiple languages: Feature selection for opinion classification in Web forums. Trans. Inf. Syst., 2008, 26(3): Article No. 12.
Demartini G, Siersdorfer S. Dear search engine: What’s your opinion about…?: Sentiment analysis for semantic enrichment of web search results. In Proc. SEMSEARCH 2010, New York, USA, April 2010, Article No.4.
Devitt A, Ahmad K. Sentiment polarity identification in financial news: A cohesion-based approach. In Proc. ACL 2007, Prague, Czech Republic, June 2007, pp.984–991.
O’Hare N, Davy M, Bermingham A, Ferguson P, Sheridan P, Gurrin C, Smeaton A F. Topic-dependent sentiment analysis of financial blogs. In Proc. CIKM Workshop on TSA 2009, New York, USA, Nov. 2009, pp.9–16.
Ding X, Liu B, Yu P S. A holistic lexicon-based approach to opinion mining. In Proc. WSDM 2008, New York, USA, April 2008, pp.231–240.
Morinaga S, Yamanishi K, Tateishi K, Fukushima T. Mining product reputations on the Web. In Proc. SIGKDD 2002, New York, USA, July 2002, pp.341–349.
Riloff E, Wiebe J. Learning extraction patterns for subjective expressions. In Proc. EMNLP 2003, Stroudsburg, PA, USA, July 2003, pp.105–112.
Turney P D. Thumbs up or thumbs down?: Semantic orientation applied to unsupervised classification of reviews. In Proc. ACL 2002, Stroudsburg, PA, USA, July 2002, pp.417–424.
Pang B, Lee L. A sentimental education: Sentiment analysis using subjectivity summarization based on minimum cuts. In Proc. ACL 2004, Barcelona, Spain, July 2004, pp.271–278.
Thet T T, Na J C, Khoo C S G. Aspect-based sentiment analysis of movie reviews on discussion boards. Journal of Information Science, 2010, 36(6): 823–848.
Wilson T, Wiebe J, Hoffmann P. Recognizing contextual polarity in phrase-level sentiment analysis. In Proc. HLTEMNLP 2005, Vancouver, British Columbia, Canada, Oct. 2005, pp.347–354.
Wilson T, Wiebe J, Hwa R. Recognizing strong and weak opinion clauses. Computational Intelligence, 2006, 22(2): 73–99.
Nivre J. Dependency grammar and dependency parsing. Technical Report MSI report 05133, Växjö University, School of Mathematics and Systems Engineering, 2005.
Jakob N, Weber S H, Muller M C, Gurevych I. Beyond the stars: Exploiting free-text user reviews to improve the accuracy of movie recommendations. In Proc. CIKM Workshop on TSA 2009, Hong Kong, China, Nov. 2009, pp.57–64.
Shaikh M A M, Prendinger H, Ishizuka M. Sentiment assessment of text by analyzing linguistic features and contextual valence assignment. Appl. Artif. Intell., 2008, 22(6): 558–601.
Liu B. Web Data Mining: Exploring Hyperlinks, Contents and Usage Data (1st edition). Springer Berlin Heidelberg, New York, 2006, pp.37–54.
Osman D J, Yearwood J, Vamplew P. Weblogs for market research: Finding more relevant opinion documents using system fusion. Online Information Review, 2009, 33(5): 873–888.
Hu M, Liu B. Mining and summarizing customer reviews. In Proc. the 10th SIGKDD, Seattle, WA, USA, Aug. 2004, pp.168–177.
Kim S M, Hovy E. Determining the sentiment of opinions. In Proc. the 20th COLING, Geneva, Switzerland, 2004, pp.1367–1373.
Zhang C, Zeng D, Li J, Wang F Y, Zuo W. Sentiment analysis of Chinese Documents: From sentence to document level. Journal of the American Society for Information Science and Technology, 2009, 60(12): 2474–2487.
Na J C, Thet T T, Khoo C. Comparing sentiment expression in movie reviews from four online genres. Online Information Review, 2010, 34(2): 317–338.
Moilanen K, Pulman S. Sentiment composition. In Proc. RANLP 2007, Borovets, Bulgaria, Sept. 2007, pp.378–382.
Cohen J. A coe±cient of agreement for nominal scales. Educational and Psychological Measurement, 1960, 20(1): 37–46.
Joshi M, Penstein-Rose C. Generalizing dependency features for opinion mining. In Proc. ACL-IJCNLP 2009, Suntec, Singapore, Aug. 2009, pp.313–316.
Agrawal R, Srikant R. Fast algorithms for mining association rules in large databases. In Proc. VLDB 1994, Santiago de Chile, Chile, Sept. 1994, pp.487–499.
Wong K W, Zhou S, Yang Q, Yeung J M S. Mining customer value: From association rules to direct marketing. Data Mining and Knowledge Discovery, 2005, 11(1): 57–79.
Polanyi L, Zaenen A. Computing attitude and affect in text: Theory and applications. Computing Attitude and Affect in Text: Theory and Applications, 2006, 20: 1–10.
Quirk R, Greenbaum S, Leech G, Svartvik J. A Comprehensive Grammar of the English Language, Longman, 1985.
Tan L K W, Na J C, Theng Y L, Chang K Y. Sentence-level sentiment polarity classification using a linguistic approach. In Proc. ICADL 2011, Beijing, China, Oct. 2011, pp.77–87.
Author information
Authors and Affiliations
Corresponding author
Electronic Supplementary Material
Below is the link to the electronic supplementary material.
Rights and permissions
About this article
Cite this article
Tan, L.KW., Na, JC., Theng, YL. et al. Phrase-Level Sentiment Polarity Classification Using Rule-Based Typed Dependencies and Additional Complex Phrases Consideration. J. Comput. Sci. Technol. 27, 650–666 (2012). https://doi.org/10.1007/s11390-012-1251-y
Received:
Revised:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11390-012-1251-y