Skip to main content

Topic-Sentiment Mining from Multiple Text Collections

  • Conference paper
  • First Online:
  • 1756 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 10035))

Abstract

Topic-sentiment mining is a challenging task for many applications. This paper presents a topic-sentiment joint model in order to mine topics and their sentimental polarities from multiple text collections. Text collections are represented with a mixture of components and modeled via the hierarchical Dirichlet process which can determine the number of components automatically. Each component consists of topic words and its sentiments. The model can mine topics with different proportions and different sentimental polarities as well as one positive and one negative topic for each collection. Experiments on two text collections from Chinese news media and microblog show that our model can find meaningful topics and their different sentimental polarities. Experiments on Multi-Domain Sentiment Dataset show that our model is better than the JST-alike models on parameter settings for topic-sentiment mining.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

  1. 1.

    These collections may vary a lot in the content, but refer to similar topics.

  2. 2.

    http://www.cs.jhu.edu/~mdredze/datasets/sentiment/.

  3. 3.

    http://ir.dlut.edu.cn/EmotionOntologyDownload.aspx (in Chinese).

  4. 4.

    http://keenage.com/html/c_bulletin_2007.htm (in Chinese).

References

  1. Blei, D.: Probabilistic topic models. Commun. ACM 55(4), 77–84 (2012)

    Article  MathSciNet  Google Scholar 

  2. Blei, D., Ng, A., Jordan, M.: Latent dirichlet allocation. J. Mach. Learn. Res. 3, 993–1022 (2003)

    MATH  Google Scholar 

  3. Blitzer, J., Dredze, M., Pereira, F.: Biographies, bollywood, boom-boxes and blenders: domain adaptation for sentiment classification. In: Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics, vol. 7, pp. 440–447. ACL, Prague, CZ (2007)

    Google Scholar 

  4. Cano, E., He, Y., Liu, K., Zhao, J.: A weakly-supervised Bayesian model for violence detection from social media. In: 6th International Joint Conference on Natural Language Processing (IJCNLP). ACL, Nagoya, October 2013

    Google Scholar 

  5. Fang, Y., Si, L., Somasundaram, N., Yu, Z.: Mining contrastive opinions on political texts using cross-perspective topic model. In: Proceedings of the Fifth ACM International Conference on Web Search and Data Mining, pp. 63–72. ACM, Seattle (2012)

    Google Scholar 

  6. Friedman, H.H., Amoo, T.: Rating the rating scales. J. Mark. Manag. 9(3), 114–123 (1999)

    Google Scholar 

  7. Jo, Y., Oh, A.H.: Aspect and sentiment unification model for online review analysis. In: Proceedings of the Fourth ACM International Conference on Web Search and Data Mining, pp. 815–824. ACM, Hong Kong (2011)

    Google Scholar 

  8. Krippendorff, K.: Content Analysis: An Introduction to its Methodology. Sage Publications, Beverly Hills (1980)

    MATH  Google Scholar 

  9. Lin, C., He, Y.: Joint sentiment/topic model for sentiment analysis. In: Proceedings of the 18th ACM Conference on Information and Knowledge Management, pp. 375–384. ACM, Hong Kong (2009)

    Google Scholar 

  10. Lin, C., He, Y., Everson, R., Ruger, S.: Weakly supervised joint sentiment-topic detection from text. IEEE Trans. Knowl. Data Eng. 24(6), 1134–1145 (2012)

    Article  Google Scholar 

  11. Liu, B.: Sentiment analysis and subjectivity. In: Indurkhya, N., Damerau, F.J. (eds.) Handbook of Natural Language Processing, 2nd edn, pp. 627–666. Chapman and Hall/CRC, Boca Raton (2010)

    Google Scholar 

  12. Mei, Q., Ling, X., Wondra, M., Su, H., Zhai, C.: Topic sentiment mixture: modeling facets and opinions in weblogs. In: Proceedings of the 16th International Conference on World Wide Web, pp. 171–180. ACM, Banff (2007)

    Google Scholar 

  13. Paul, M.J., Zhai, C., Girju, R.: Summarizing contrastive viewpoints in opinionated text. In: Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, pp. 66–76. Association for Computational Linguistics, ACL, Massachusetts (2010)

    Google Scholar 

  14. Teh, Y.W., Jordan, M.I.: Hierarchical Bayesian Nonparametric Models with Applications. Cambridge University Press, Cambridge (2010)

    Book  Google Scholar 

  15. Teh, Y., Jordan, M., Beal, M., Blei, D.: Hierarchical dirichlet processes. J. Am. Stat. Assoc. 101(476), 1566–1581 (2006)

    Article  MathSciNet  MATH  Google Scholar 

  16. Xu, L., Lin, H., Pan, Y., Ren, H., Chen, J.: Constructing the affective lexicon ontology. J. China Soc. Sci. Tech. Inf. 27(2), 180–185 (2008). (In Chinese)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Qifeng Zhu .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing AG

About this paper

Cite this paper

Zhu, Q., Li, F. (2016). Topic-Sentiment Mining from Multiple Text Collections. In: Sun, M., Huang, X., Lin, H., Liu, Z., Liu, Y. (eds) Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data. NLP-NABD CCL 2016 2016. Lecture Notes in Computer Science(), vol 10035. Springer, Cham. https://doi.org/10.1007/978-3-319-47674-2_22

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-47674-2_22

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-47673-5

  • Online ISBN: 978-3-319-47674-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics