Towards a Robust Metric of Polarity

Nigam, Kamal; Hurst, Matthew

doi:10.1007/1-4020-4102-0_20

Kamal Nigam⁴ &
Matthew Hurst⁴

Part of the book series: The Information Retrieval Series ((INRE,volume 20))

1397 Accesses
6 Citations

Abstract

This chapter describes an automated system for detecting polar expressions about a specified topic. The two elementary components of this approach are a shallow NLP polar language extraction system and a machine learning based topic classifier. These components are composed together by making a simple but accurate collocation assumption: if a topical sentence contains polar language, the polarity is associated with the topic. We evaluate our system, components and assumption on a corpus of online consumer messages.

Based on these components, we discuss how to measure the overall sentiment about a particular topic as expressed in online messages authored by many different people. We propose to use the fundamentals of Bayesian statistics to form an aggregate authorial opinion metric. This metric would propagate uncertainties introduced by the polarity and topic modules to facilitate statistically valid comparisons of opinion across multiple topics.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Hardcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

10. References

Agrawal, R., Rajagopalan, S., Srikant, R., and Xu, Y. (2003) Mining newsgroups using networks arising from social behavior. In Proceedings of the 12th World Wide Web Conference.
Google Scholar
Banfield, A. (1982) Unspeakable Sentences. Boston: Routledge and Kegan Paul.
Google Scholar
Blum, A. (1997) Empirical support for Winnow and weighted-majority based algorithms: Results on a calendar scheduling domain. Machine Learning 26:5–23.
Article MathSciNet Google Scholar
Dagan, I., Karov, Y, and Roth, D. (1997) Mistake-driven learning in text categorization. In EMNLP’ 97, 2nd Conference on Empirical Methods in Natural Language Processing.
Google Scholar
Dave, K., Lawrence, S., and Pennock, D. M. (2003) Mining the peanut gallery: Opinion extraction and semantic classification of product reviews. In Proceedings of the 12th World Wide Web Conference.
Google Scholar
Engstrom, C. (2004) Topic Dependence in Sentiment Classification. Master’s thesis, Cambridge University.
Google Scholar
GoogleMovies. http://24.60.188.10:8080/demos/googlemovies/googlemovies.cgi.
Google Scholar
Hurst, M., and Nigam, K. (2004) Retrieving topical sentiment from online document collections. In Proceedings of the 11th Conference on Document Recognition and Retrieval.
Google Scholar
Joachims, T. (1998) Text categorization with support vector machines: Learning with many relevant features. In Machine Learning: ECML-98 Tenth European Conference on Machine Learning, 137–142.
Google Scholar
Littlestone, N. (1998). Learning quickly when irrelevant features abound: A new linear-threshold algorithm. Machine Learning 2:285–318.
Google Scholar
Nasukawa, T., and Yi, J. (2003) Sentiment analysis: Capturing favorability using natural language processing. In Proceedings of K-CAP’ 03.
Google Scholar
Pang, B., Lee, L., and Vaithyanathan, S. (2002) Thumbs up? Sentiment classification using machine learning techniques. In Proceedings of EMNLP 2002.
Google Scholar
Wiebe, J., Wilson, T., and Bell, M. (2001) Identifying collocations for recognizing opinions. In Proceedings of ACL/EACL’ 01 Workshop on Collocation.
Google Scholar
Yang, Y. (1999) An evaluation of statistical approaches to text categorization. Information Retrieval 1(1/2): 67–88.
Article MATH Google Scholar

Download references

Author information

Authors and Affiliations

Intelliseek Applied Research Center, 5001 Baum Blvd, Suite 644, Pittsburgh, PA, 15213, USA
Kamal Nigam & Matthew Hurst

Authors

Kamal Nigam
View author publications
You can also search for this author in PubMed Google Scholar
Matthew Hurst
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Clairvoyance Cooperation, Pittsburgh, PA, USA
James G. Shanahan & Yan Qu &
University of Pittsburgh, PA, USA
Janyce Wiebe

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Nigam, K., Hurst, M. (2006). Towards a Robust Metric of Polarity. In: Shanahan, J.G., Qu, Y., Wiebe, J. (eds) Computing Attitude and Affect in Text: Theory and Applications. The Information Retrieval Series, vol 20. Springer, Dordrecht. https://doi.org/10.1007/1-4020-4102-0_20

Download citation

DOI: https://doi.org/10.1007/1-4020-4102-0_20
Publisher Name: Springer, Dordrecht
Print ISBN: 978-1-4020-4026-9
Online ISBN: 978-1-4020-4102-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics