A New Sentiment and Topic Model for Short Texts on Social Media

Xu, Kang; Huang, Junheng; Qi, Guilin

doi:10.1007/978-3-319-70682-5_12

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 10675)

Included in the following conference series:

Joint International Semantic Technology Conference

Abstract

Large numbers of user-generated posts, e.g., tweets and Sina Weibo posts, are published on social media, and these posts reflect the public’s opinions on various topics. Joint sentiment/topic models are widely applied to detect sentiment-aware topics in lengthy documents. However, the characteristics of social media posts, i.e., short texts, pose new challenges: (1) the context sparsity of posts makes traditional sentiment-topic models inapplicable; (2) conventional sentiment-topic models are designed for flat documents without structure information, whereas the publishing users, publishing timeslices and hashtags of posts provide rich structure information. In this paper, we first devise a method to mine potential hashtags, based on explicit hashtags, to further enrich the structure information of posts. We then propose a novel Sentiment Topic Model for Posts (STMP), which aggregates posts using this structure information, i.e., timeslices, users and hashtags, to alleviate the context sparsity problem. Experiments on Sentiment140 and Twitter7 show that STMP outperforms previous models in both sentiment classification and sentiment-aware topic extraction.
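The aggregation idea in the abstract can be illustrated with a minimal sketch. This is not the STMP model itself, whose generative process is not given here; it only shows the general pooling step, in which short posts that share a user, a timeslice or a hashtag are merged into longer pseudo-documents so that word co-occurrence is less sparse. The function name `pool_posts`, the post fields (`user`, `timeslice`, `text`) and the sample posts are all hypothetical, and only explicit hashtags are used (the potential-hashtag mining step is not attempted).

```python
import re
from collections import defaultdict

# Explicit hashtags only; mining "potential" hashtags is out of scope here.
HASHTAG_RE = re.compile(r"#(\w+)")


def pool_posts(posts):
    """Merge post tokens into pseudo-documents keyed by (kind, value).

    Each post contributes its tokens to one pool for its user, one pool
    for its timeslice, and one pool per explicit hashtag it contains.
    """
    pools = defaultdict(list)
    for post in posts:
        text = post["text"].lower()
        tokens = text.split()
        pools[("user", post["user"])].extend(tokens)
        pools[("timeslice", post["timeslice"])].extend(tokens)
        for tag in HASHTAG_RE.findall(text):
            pools[("hashtag", tag)].extend(tokens)
    return dict(pools)


# Hypothetical sample posts, for illustration only.
posts = [
    {"user": "alice", "timeslice": "day1", "text": "love the new phone #tech"},
    {"user": "bob", "timeslice": "day1", "text": "battery life is awful #tech"},
    {"user": "alice", "timeslice": "day2", "text": "great camera though"},
]

pools = pool_posts(posts)
for key in sorted(pools):
    print(key, len(pools[key]))
```

Each pooled pseudo-document is longer than any single post, which is the property a topic model would rely on when the individual texts are too short to provide useful context.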

Author information

Authors and Affiliations

Department of Computer Science, Southeast University, Nanjing, China
Kang Xu, Junheng Huang & Guilin Qi

Corresponding author

Correspondence to Kang Xu.

Editor information

Editors and Affiliations

Griffith University, Brisbane, Queensland, Australia
Zhe Wang
Dresden University of Technology, Dresden, Germany
Anni-Yasmin Turhan
Griffith University, Brisbane, Queensland, Australia
Kewen Wang
Tianjin University, Tianjin, China
Xiaowang Zhang

About this paper

Cite this paper

Xu, K., Huang, J., Qi, G. (2017). A New Sentiment and Topic Model for Short Texts on Social Media. In: Wang, Z., Turhan, AY., Wang, K., Zhang, X. (eds) Semantic Technology. JIST 2017. Lecture Notes in Computer Science, vol 10675. Springer, Cham. https://doi.org/10.1007/978-3-319-70682-5_12

DOI: https://doi.org/10.1007/978-3-319-70682-5_12
Published: 08 November 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-70681-8
Online ISBN: 978-3-319-70682-5
eBook Packages: Computer Science, Computer Science (R0)
