Skip to main content

A New Sentiment and Topic Model for Short Texts on Social Media

  • Conference paper
  • First Online:
Semantic Technology (JIST 2017)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 10675))

Included in the following conference series:

  • 1008 Accesses

Abstract

Nowadays plenty of user-generated posts, e.g., tweets and sina weibos, are published on social media and the posts imply the public’s opinions towards various topics. Joint sentiment/topic models are widely applied in detecting sentiment-aware topics on the lengthy documents. However, the characteristics of posts, i.e., short texts, on social media pose new challenges: (1) context sparsity problem of posts makes traditional sentiment-topic models inapplicable; (2) conventional sentiment-topic models are designed for flat documents without structure information, while publishing users, publishing timeslices and hashtags of posts provide rich structure information for these posts. In this paper, we firstly devise a method to mine potential hashtags, based on explicit hashtags, to further enrich structure information for posts, then we propose a novel Sentiment Topic Model for Posts (STMP) which aggregates posts with the structure information, i.e., timeslices, users and hashtags, to alleviate the context sparsity problem. Experiments on Sentiment140 and Twitter7 show STMP outperforms previous models both in sentiment classification and sentiment-aware topic extraction.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    https://snap.stanford.edu/data/twitter7.html.

  2. 2.

    http://help.sentiment140.com/for-students/.

  3. 3.

    https://en.wikipedia.org/wiki/List_of_emotions.

References

  1. Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent dirichlet allocation. J. Mach. Learn. Res. 3, 993–1022 (2003)

    MATH  Google Scholar 

  2. Bollen, J., Mao, H., Zeng, X.: Twitter mood predicts the stock market. J. Comput. Sci. 2(1), 1–8 (2011)

    Article  Google Scholar 

  3. Chen, Z., Mukherjee, A., Liu, B., Hsu, M., Castellanos, M., Ghosh, R.: Leveraging multi-domain prior knowledge in topic models. In: Proceedings of IJCAI, pp. 2071–2077. AAAI (2013)

    Google Scholar 

  4. Dermouche, M., Khouas, L., Velcin, J., Loudcher, S.: A joint model for topic-sentiment modeling from text. In: Proceedings of SAC, pp. 819–824. ACM (2015)

    Google Scholar 

  5. Dermouche, M., Velcin, J., Khouas, L., Loudcher, S.: A joint model for topic-sentiment evolution over time. In: Proceedings of ICDM, pp. 773–778. IEEE (2014)

    Google Scholar 

  6. Diao, Q., Jiang, J., Zhu, F., Lim, E.-P.: Finding bursty topics from microblogs. In: Proceedings of ACL, pp. 536–544. ACL (2012)

    Google Scholar 

  7. Go, A., Bhayani, R., Huang, L.: Twitter sentiment classification using distant supervision. CS224N Project Report, Stanford, pp. 1–12 (2009)

    Google Scholar 

  8. Hofmann, T.: Probabilistic latent semantic indexing. In: Proceedings of SIGIR, pp. 50–57. ACM (1999)

    Google Scholar 

  9. Jo, Y., Oh, A.H.: Aspect and sentiment unification model for online review analysis. In: Proceedings of WSDM, pp. 815–824. ACM (2011)

    Google Scholar 

  10. Kiritchenko, S., Zhu, X., Cherry, C., Mohammad, S.: NRC-Canada-2014: detecting aspects and sentiment in customer reviews. In: SemEval, pp. 437–442. ACL (2014)

    Google Scholar 

  11. Lim, K.W., Buntine, W.: Twitter opinion topic model: extracting product opinions from tweets by leveraging hashtags and sentiment lexicon. In: Proceedings of CIKM, pp. 1319–1328. ACM (2014)

    Google Scholar 

  12. Lin, C., He, Y.: Joint sentiment/topic model for sentiment analysis. In: Proceedings of CIKM, pp. 375–384. ACM (2009)

    Google Scholar 

  13. Lin, C., He, Y., Everson, R., Rüger, S.: Weakly supervised joint sentiment-topic detection from text. IEEE Trans. Knowl. Data Eng. 24(6), 1134–1145 (2012)

    Article  Google Scholar 

  14. Lu, B., Ott, M., Cardie, C., Tsou, B.K.: Multi-aspect sentiment analysis with topic models. In: Proceedings of ICDMW, pp. 81–88. IEEE (2011)

    Google Scholar 

  15. Mei, Q., Ling, X., Wondra, M., Su, H., Zhai, C.: Topic sentiment mixture: modeling facets and opinions in weblogs. In: Proceedings of WWW, pp. 171–180. ACM (2007)

    Google Scholar 

  16. Mimno, D., Wallach, H.M., Talley, E., Leenders, M., McCallum, A.: Optimizing semantic coherence in topic models. In: Proceedings of EMNLP, pp. 262–272. ACL (2011)

    Google Scholar 

  17. Mukherjee, S., Basu, G., Joshi, S.: Joint author sentiment topic model. In: SDM, pp. 370–378. SIAM (2014)

    Google Scholar 

  18. Nguyen, T.H., Shirai, K.: Topic modeling based sentiment analysis on social media for stock market prediction. In: Proceedings of ACL, pp. 1354–1364. ACL (2015)

    Google Scholar 

  19. Pennington, v., Socher, R., Manning. C.D.: Glove: global vectors for word representation. In: Proceedings of EMNLP, pp. 1532–1543. ACL (2014)

    Google Scholar 

  20. Rong, X.: Word2Vec parameter learning explained. arXiv preprint arXiv:1411.2738 (2014)

  21. Tsur, O., Rappoport, A.: What’s in a hashtag? Content based prediction of the spread of ideas in microblogging communities. In: Proceedings of WSDM, pp. 643–652. ACM (2012)

    Google Scholar 

  22. Turney, P.D., Littman, M.L.: Measuring praise and criticism: inference of semantic orientation from association. ACM Trans. Inf. Syst. 21(4), 315–346 (2003)

    Article  Google Scholar 

  23. Wallach, H.M., Mimno, D.M., McCallum, A.: Rethinking LDA: why priors matter. In: NIPS, pp. 1973–1981 (2009)

    Google Scholar 

  24. Wang, Y., Liu, J., Huang, Y., Feng, X.: Using hashtag graph-based topic model to connect semantically-related words without co-occurrence in microblogs. IEEE Trans. Knowl. Data Eng. 28(7), 1919–1933 (2016)

    Article  Google Scholar 

  25. Wang, Y., Liu, J., Qu, J., Huang, Y., Chen, J., Feng, X.: Hashtag graph based topic model for tweet mining. In: Proceedings of ICDM, pp. 1025–1030. IEEE (2014)

    Google Scholar 

  26. Xu, K., Qi, G., Huang, J., Wu, T.: A joint model for sentiment-aware topic detection on social media. In: Procedings of ECAI, pp. 338–346. IOS Press (2016)

    Google Scholar 

  27. Zhang, Q., Gong, Y., Sun, X., Huang, X.: Time-aware personalized hashtag recommendation on social media. In: Proceedings of COLING, pp. 203–212. ACL (2014)

    Google Scholar 

  28. Zhao, W.X., Jiang, J., Yan, H., Li, X.: Jointly modeling aspects and opinions with a MaxEnt-LDA hybrid. In: Proceedings of EMNLP, pp. 56–65. ACL (2010)

    Google Scholar 

  29. Zhao, W.X., Jiang, J., Weng, J., He, J., Lim, E.-P., Yan, H., Li, X.: Comparing Twitter and traditional media using topic models. In: Clough, P., Foley, C., Gurrin, C., Jones, G.J.F., Kraaij, W., Lee, H., Mudoch, V. (eds.) ECIR 2011. LNCS, vol. 6611, pp. 338–349. Springer, Heidelberg (2011). https://doi.org/10.1007/978-3-642-20161-5_34

    Chapter  Google Scholar 

  30. Zheng, C., Chengtao, L., Jian-Tao, S., Zhang, J.: Sentiment topic model with decomposed prior. In: Proceedings of SDM, pp. 767–775. SIAM (2013)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Kang Xu .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Xu, K., Huang, J., Qi, G. (2017). A New Sentiment and Topic Model for Short Texts on Social Media. In: Wang, Z., Turhan, AY., Wang, K., Zhang, X. (eds) Semantic Technology. JIST 2017. Lecture Notes in Computer Science(), vol 10675. Springer, Cham. https://doi.org/10.1007/978-3-319-70682-5_12

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-70682-5_12

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-70681-8

  • Online ISBN: 978-3-319-70682-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics