Topic Extraction on Twitter Considering Author’s Role Based on Bipartite Networks

  • Takako HashimotoEmail author
  • Tetsuji Kuboyama
  • Hiroshi Okamoto
  • Kilho Shin
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 10558)


This paper proposes a quality topic extraction on Twitter based on author’s role on bipartite networks. We suppose that author’s role which means who were in what group, affects the quality of extracted topics. Our proposed method expresses relations between authors and words as bipartite networks, explores author’s role by forming clusters using our original community detection technique, and finds quality topics considering the semantic accuracy of words and author’s role.


Topic extraction Social media analysis Twitter analysis Bipartite network Data mining Community detection 



This paper was supported by the Grant-in-Aid for Scientific Research (KAKENHI Grant Numbers 26280090, 15K00314, and 17H00762) from the Japan Society for the Promotion of Science.


  1. 1.
    Qiu, X., Inagi, A.S., Nukui, S., Murata, T., Okamoto, H.: Random walk based community detection from bipartite networks. In: Proceedings of The 30th Annual Conference of the Japanese Society for Artificial Intelligence (2016)Google Scholar
  2. 2.
    Hashimoto, T., Kuboyama, T., Okamoto, H., Shin, K.: Topic extraction from millions of tweets based on community detection in bipartite networks. In: Proceedings of 27th International Conference on Information Modelling and Knowledge Bases, pp. 409–424 (2017)Google Scholar
  3. 3.
    Mimno, D., Wallach, H.M., Talley, E., Leenders, M., McCallum, A.: Optimizing semantic coherence in topic models. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing, pp. 262–272. Association for Computational Linguistics (2011)Google Scholar
  4. 4.
    Jaccard, P.: The distribution of the flora in the alpine zone. New Phytol. 11(2), 37–50 (1912)CrossRefGoogle Scholar
  5. 5.
    Sayyadi, H., Raschid, L.: A graph analytical approach for topic detection. ACM Trans. Internet Technol. 13(2), 4:1–4:23 (2013). Article No. 4CrossRefGoogle Scholar
  6. 6.
    Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent Dirichlet allocation. J. Mach. Learn. Res. 3(4–5), 993–1022 (2003). doi: 10.1162/jmlr.2003.3.4-5.993 zbMATHGoogle Scholar
  7. 7.
    Wu, H.C., Luk, R.W.P., Wong, K.F., Kwok, K.L.: Interpreting TF-IDF term weights as making relevance decisions. ACM Trans. Inf. Syst. 26(3), 13:1–13:37 (2008). doi: 10.1145/1361684.1361686. Article No. 13CrossRefGoogle Scholar
  8. 8.
    Blei, D.M., Lafferty, J.D.: Topic models. In: Srivastava, A., Sahami, M. (eds.) Text Mining: Theory and Applications. Taylor and Francis, UK (2009)Google Scholar
  9. 9.
    Bhattacharya, I., Sil, J.: Query classification using LDA topic model and sparse representation based classifier. In: Proceedings of the 3rd IKDD Conference on Data Science 2016, p. 24. ACM (2014)Google Scholar
  10. 10.
    Endo, Y., Toda, H., Koike, Y.: What’s hot in the theme: query dependent emerging topic extraction from social streams. In: Proceedings of the 24th International Conference on World Wide Web, pp. 31–32. ACM (2015)Google Scholar
  11. 11.
    Fujino, I., Hoshino, Y.: A method for identifying topics in twitter and its application for analyzing the transition of topics. In: Proceedings of DEIM Forum 2014. C4-2 (2014)Google Scholar
  12. 12.
    Wang, Y., Agichtein, E., Benzi, M.: TM-LDA: efficient online modeling of latent topic transitions in social media. In: Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 123–131. ACM (2012)Google Scholar
  13. 13.
    Zhao, W.X., Jiang, J., Weng, J., He, J., Lim, E.-P., Yan, H., Li, X.: Comparing twitter and traditional media using topic models. In: Clough, P., Foley, C., Gurrin, C., Jones, G.J.F., Kraaij, W., Lee, H., Mudoch, V. (eds.) ECIR 2011. LNCS, vol. 6611, pp. 338–349. Springer, Heidelberg (2011). doi: 10.1007/978-3-642-20161-5_34 CrossRefGoogle Scholar
  14. 14.
    Newman, M.E.J.: Communities, modules and large-scale structure in networks. Nature Phys. 8(1), 25–31 (2012)CrossRefGoogle Scholar
  15. 15.
  16. 16.
    MeCab: Yet Another Part-of-Speech and Morphological Analyzer.

Copyright information

© Springer International Publishing AG 2017

Authors and Affiliations

  • Takako Hashimoto
    • 1
    Email author
  • Tetsuji Kuboyama
    • 2
  • Hiroshi Okamoto
    • 3
  • Kilho Shin
    • 4
  1. 1.Chiba University of CommerceIchikawaJapan
  2. 2.Gakushuin UniversityTokyoJapan
  3. 3.RIKEN Brain Science InstituteSaitamaJapan
  4. 4.University of HyogoKobeJapan

Personalised recommendations