Abstract
Topic-sentiment mining is a challenging task for many applications. This paper presents a topic-sentiment joint model in order to mine topics and their sentimental polarities from multiple text collections. Text collections are represented with a mixture of components and modeled via the hierarchical Dirichlet process which can determine the number of components automatically. Each component consists of topic words and its sentiments. The model can mine topics with different proportions and different sentimental polarities as well as one positive and one negative topic for each collection. Experiments on two text collections from Chinese news media and microblog show that our model can find meaningful topics and their different sentimental polarities. Experiments on Multi-Domain Sentiment Dataset show that our model is better than the JST-alike models on parameter settings for topic-sentiment mining.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsNotes
- 1.
These collections may vary a lot in the content, but refer to similar topics.
- 2.
- 3.
http://ir.dlut.edu.cn/EmotionOntologyDownload.aspx (in Chinese).
- 4.
http://keenage.com/html/c_bulletin_2007.htm (in Chinese).
References
Blei, D.: Probabilistic topic models. Commun. ACM 55(4), 77–84 (2012)
Blei, D., Ng, A., Jordan, M.: Latent dirichlet allocation. J. Mach. Learn. Res. 3, 993–1022 (2003)
Blitzer, J., Dredze, M., Pereira, F.: Biographies, bollywood, boom-boxes and blenders: domain adaptation for sentiment classification. In: Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics, vol. 7, pp. 440–447. ACL, Prague, CZ (2007)
Cano, E., He, Y., Liu, K., Zhao, J.: A weakly-supervised Bayesian model for violence detection from social media. In: 6th International Joint Conference on Natural Language Processing (IJCNLP). ACL, Nagoya, October 2013
Fang, Y., Si, L., Somasundaram, N., Yu, Z.: Mining contrastive opinions on political texts using cross-perspective topic model. In: Proceedings of the Fifth ACM International Conference on Web Search and Data Mining, pp. 63–72. ACM, Seattle (2012)
Friedman, H.H., Amoo, T.: Rating the rating scales. J. Mark. Manag. 9(3), 114–123 (1999)
Jo, Y., Oh, A.H.: Aspect and sentiment unification model for online review analysis. In: Proceedings of the Fourth ACM International Conference on Web Search and Data Mining, pp. 815–824. ACM, Hong Kong (2011)
Krippendorff, K.: Content Analysis: An Introduction to its Methodology. Sage Publications, Beverly Hills (1980)
Lin, C., He, Y.: Joint sentiment/topic model for sentiment analysis. In: Proceedings of the 18th ACM Conference on Information and Knowledge Management, pp. 375–384. ACM, Hong Kong (2009)
Lin, C., He, Y., Everson, R., Ruger, S.: Weakly supervised joint sentiment-topic detection from text. IEEE Trans. Knowl. Data Eng. 24(6), 1134–1145 (2012)
Liu, B.: Sentiment analysis and subjectivity. In: Indurkhya, N., Damerau, F.J. (eds.) Handbook of Natural Language Processing, 2nd edn, pp. 627–666. Chapman and Hall/CRC, Boca Raton (2010)
Mei, Q., Ling, X., Wondra, M., Su, H., Zhai, C.: Topic sentiment mixture: modeling facets and opinions in weblogs. In: Proceedings of the 16th International Conference on World Wide Web, pp. 171–180. ACM, Banff (2007)
Paul, M.J., Zhai, C., Girju, R.: Summarizing contrastive viewpoints in opinionated text. In: Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, pp. 66–76. Association for Computational Linguistics, ACL, Massachusetts (2010)
Teh, Y.W., Jordan, M.I.: Hierarchical Bayesian Nonparametric Models with Applications. Cambridge University Press, Cambridge (2010)
Teh, Y., Jordan, M., Beal, M., Blei, D.: Hierarchical dirichlet processes. J. Am. Stat. Assoc. 101(476), 1566–1581 (2006)
Xu, L., Lin, H., Pan, Y., Ren, H., Chen, J.: Constructing the affective lexicon ontology. J. China Soc. Sci. Tech. Inf. 27(2), 180–185 (2008). (In Chinese)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing AG
About this paper
Cite this paper
Zhu, Q., Li, F. (2016). Topic-Sentiment Mining from Multiple Text Collections. In: Sun, M., Huang, X., Lin, H., Liu, Z., Liu, Y. (eds) Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data. NLP-NABD CCL 2016 2016. Lecture Notes in Computer Science(), vol 10035. Springer, Cham. https://doi.org/10.1007/978-3-319-47674-2_22
Download citation
DOI: https://doi.org/10.1007/978-3-319-47674-2_22
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-47673-5
Online ISBN: 978-3-319-47674-2
eBook Packages: Computer ScienceComputer Science (R0)