Abstract
Nowadays plenty of user-generated posts, e.g., tweets and sina weibos, are published on social media and the posts imply the public’s opinions towards various topics. Joint sentiment/topic models are widely applied in detecting sentiment-aware topics on the lengthy documents. However, the characteristics of posts, i.e., short texts, on social media pose new challenges: (1) context sparsity problem of posts makes traditional sentiment-topic models inapplicable; (2) conventional sentiment-topic models are designed for flat documents without structure information, while publishing users, publishing timeslices and hashtags of posts provide rich structure information for these posts. In this paper, we firstly devise a method to mine potential hashtags, based on explicit hashtags, to further enrich structure information for posts, then we propose a novel Sentiment Topic Model for Posts (STMP) which aggregates posts with the structure information, i.e., timeslices, users and hashtags, to alleviate the context sparsity problem. Experiments on Sentiment140 and Twitter7 show STMP outperforms previous models both in sentiment classification and sentiment-aware topic extraction.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent dirichlet allocation. J. Mach. Learn. Res. 3, 993–1022 (2003)
Bollen, J., Mao, H., Zeng, X.: Twitter mood predicts the stock market. J. Comput. Sci. 2(1), 1–8 (2011)
Chen, Z., Mukherjee, A., Liu, B., Hsu, M., Castellanos, M., Ghosh, R.: Leveraging multi-domain prior knowledge in topic models. In: Proceedings of IJCAI, pp. 2071–2077. AAAI (2013)
Dermouche, M., Khouas, L., Velcin, J., Loudcher, S.: A joint model for topic-sentiment modeling from text. In: Proceedings of SAC, pp. 819–824. ACM (2015)
Dermouche, M., Velcin, J., Khouas, L., Loudcher, S.: A joint model for topic-sentiment evolution over time. In: Proceedings of ICDM, pp. 773–778. IEEE (2014)
Diao, Q., Jiang, J., Zhu, F., Lim, E.-P.: Finding bursty topics from microblogs. In: Proceedings of ACL, pp. 536–544. ACL (2012)
Go, A., Bhayani, R., Huang, L.: Twitter sentiment classification using distant supervision. CS224N Project Report, Stanford, pp. 1–12 (2009)
Hofmann, T.: Probabilistic latent semantic indexing. In: Proceedings of SIGIR, pp. 50–57. ACM (1999)
Jo, Y., Oh, A.H.: Aspect and sentiment unification model for online review analysis. In: Proceedings of WSDM, pp. 815–824. ACM (2011)
Kiritchenko, S., Zhu, X., Cherry, C., Mohammad, S.: NRC-Canada-2014: detecting aspects and sentiment in customer reviews. In: SemEval, pp. 437–442. ACL (2014)
Lim, K.W., Buntine, W.: Twitter opinion topic model: extracting product opinions from tweets by leveraging hashtags and sentiment lexicon. In: Proceedings of CIKM, pp. 1319–1328. ACM (2014)
Lin, C., He, Y.: Joint sentiment/topic model for sentiment analysis. In: Proceedings of CIKM, pp. 375–384. ACM (2009)
Lin, C., He, Y., Everson, R., Rüger, S.: Weakly supervised joint sentiment-topic detection from text. IEEE Trans. Knowl. Data Eng. 24(6), 1134–1145 (2012)
Lu, B., Ott, M., Cardie, C., Tsou, B.K.: Multi-aspect sentiment analysis with topic models. In: Proceedings of ICDMW, pp. 81–88. IEEE (2011)
Mei, Q., Ling, X., Wondra, M., Su, H., Zhai, C.: Topic sentiment mixture: modeling facets and opinions in weblogs. In: Proceedings of WWW, pp. 171–180. ACM (2007)
Mimno, D., Wallach, H.M., Talley, E., Leenders, M., McCallum, A.: Optimizing semantic coherence in topic models. In: Proceedings of EMNLP, pp. 262–272. ACL (2011)
Mukherjee, S., Basu, G., Joshi, S.: Joint author sentiment topic model. In: SDM, pp. 370–378. SIAM (2014)
Nguyen, T.H., Shirai, K.: Topic modeling based sentiment analysis on social media for stock market prediction. In: Proceedings of ACL, pp. 1354–1364. ACL (2015)
Pennington, v., Socher, R., Manning. C.D.: Glove: global vectors for word representation. In: Proceedings of EMNLP, pp. 1532–1543. ACL (2014)
Rong, X.: Word2Vec parameter learning explained. arXiv preprint arXiv:1411.2738 (2014)
Tsur, O., Rappoport, A.: What’s in a hashtag? Content based prediction of the spread of ideas in microblogging communities. In: Proceedings of WSDM, pp. 643–652. ACM (2012)
Turney, P.D., Littman, M.L.: Measuring praise and criticism: inference of semantic orientation from association. ACM Trans. Inf. Syst. 21(4), 315–346 (2003)
Wallach, H.M., Mimno, D.M., McCallum, A.: Rethinking LDA: why priors matter. In: NIPS, pp. 1973–1981 (2009)
Wang, Y., Liu, J., Huang, Y., Feng, X.: Using hashtag graph-based topic model to connect semantically-related words without co-occurrence in microblogs. IEEE Trans. Knowl. Data Eng. 28(7), 1919–1933 (2016)
Wang, Y., Liu, J., Qu, J., Huang, Y., Chen, J., Feng, X.: Hashtag graph based topic model for tweet mining. In: Proceedings of ICDM, pp. 1025–1030. IEEE (2014)
Xu, K., Qi, G., Huang, J., Wu, T.: A joint model for sentiment-aware topic detection on social media. In: Procedings of ECAI, pp. 338–346. IOS Press (2016)
Zhang, Q., Gong, Y., Sun, X., Huang, X.: Time-aware personalized hashtag recommendation on social media. In: Proceedings of COLING, pp. 203–212. ACL (2014)
Zhao, W.X., Jiang, J., Yan, H., Li, X.: Jointly modeling aspects and opinions with a MaxEnt-LDA hybrid. In: Proceedings of EMNLP, pp. 56–65. ACL (2010)
Zhao, W.X., Jiang, J., Weng, J., He, J., Lim, E.-P., Yan, H., Li, X.: Comparing Twitter and traditional media using topic models. In: Clough, P., Foley, C., Gurrin, C., Jones, G.J.F., Kraaij, W., Lee, H., Mudoch, V. (eds.) ECIR 2011. LNCS, vol. 6611, pp. 338–349. Springer, Heidelberg (2011). https://doi.org/10.1007/978-3-642-20161-5_34
Zheng, C., Chengtao, L., Jian-Tao, S., Zhang, J.: Sentiment topic model with decomposed prior. In: Proceedings of SDM, pp. 767–775. SIAM (2013)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Xu, K., Huang, J., Qi, G. (2017). A New Sentiment and Topic Model for Short Texts on Social Media. In: Wang, Z., Turhan, AY., Wang, K., Zhang, X. (eds) Semantic Technology. JIST 2017. Lecture Notes in Computer Science(), vol 10675. Springer, Cham. https://doi.org/10.1007/978-3-319-70682-5_12
Download citation
DOI: https://doi.org/10.1007/978-3-319-70682-5_12
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-70681-8
Online ISBN: 978-3-319-70682-5
eBook Packages: Computer ScienceComputer Science (R0)