Automatic Detection of Hidden Communities in the Texts of Russian Social Network Corpus

Conference paper
Part of the Communications in Computer and Information Science book series (CCIS, volume 1292)


This paper proposes a linguistically-rich approach to hidden community detection which was tested in experiments with the Russian corpus of VKontakte posts. Modern algorithms for hidden community detection are based on graph theory, these procedures leaving out of account the linguistic features of analyzed texts. The authors have developed a new hybrid approach to the detection of hidden communities, combining author-topic modeling and automatic topic labeling. Specific linguistic parameters of Russian posts were revealed for correct language processing. The results justify the use of the algorithm that can be further integrated with already developed graph methods.


Hidden communities Corpus linguistics Social networks Author-topic models Automatic topic labeling 


  1. 1.
    Aletras, N., Stevenson, M.: Representing topics using images. In: NAACL-HLT 2013, pp. 158–167 (2013).
  2. 2.
    Allahyari, M., Pouriyeh, S., Kochut, K., Arabnia, H.: A knowledge-based topic modeling approach for automatic topic labeling. Int. J. Adv. Comput. Sci. Appl. 8, 335–349 (2017). Scholar
  3. 3.
    Alymov, A., Babiychuk, G.: Analysis of user profiles to determine the most significant objects of a subset in VK social network. Sci. Technol. Educ. 10, 41–45 (2017)Google Scholar
  4. 4.
    Argamon, Sh., Koppel, M., Pennebaker, J., Schler, A.: Automatically profiling the author of an anonymous text. Commun. ACM – Inspiring Women Comput. 52, 119–123 (2009).
  5. 5.
    Baumes, J., Goldberg, M., Magdon-Ismail, M., Wallace, W.A.: Discovering hidden groups in communication networks. In: Chen, H., Moore, R., Zeng, D.D., Leavitt, J. (eds.) ISI 2004. LNCS, vol. 3073, pp. 378–389. Springer, Heidelberg (2004). Scholar
  6. 6.
    Bodrunova, S., Blekanov, I., Kukarkin, M.: Topics in the Russian Twitter and relations between their interpretability and sentiment. In: Sixth International Conference on Social Networks Analysis, Management and Security, pp. 549–554 (2019).
  7. 7.
    Buchanan, M.: Nexus: Small Worlds and the Groundbreaking Science of Networks, 235 p. W.W. Norton & Company (2002)Google Scholar
  8. 8.
    Erofeeva, A., Mitrofanova, O.: Automatic topic labeling in topic models of the corpora of Russian texts. Struct. Appl. Linguist. 12, 122–147 (2016)Google Scholar
  9. 9.
    He, K., Li, Y., Soundarajan, S., Hopcroft, J.: Hidden community detection in social networks. Inf. Sci. 425, 92–106 (2018). Scholar
  10. 10.
    Ipatov, Yu., Kalagin, I., Krevetsky, A., Sokolov, B.: Analysis of dynamic characteristics of complex graph structures. In: News of higher educational institutions. Instrum. Eng. 6, 511–516 (2018).
  11. 11.
    Khoroshevsky, V., Efimenko, I.: Semantic technologies in scientometrics: tasks, problems, solutions and prospects. In: Cognitive-Semiotic Aspects of Modeling in Humanities, pp. 221–267 (2017)Google Scholar
  12. 12.
    Kolomeychenko, M., Chepovsky, A., Chepovsky, A.: Algorithm for detecting communities in social networks. Fundam. Appl. Math. 19, 21–32 (2014)Google Scholar
  13. 13.
    Koltsov, S., Pashakhin, S., Dokuka, S.: A full-cycle methodology for news topic modeling and user feedback research. In: Staab, S., Koltsova, O., Ignatov, D.I. (eds.) SocInfo 2018. LNCS, vol. 11185, pp. 308–321. Springer, Cham (2018). Scholar
  14. 14.
    Kriukova, A., Erofeeva, A., Mitrofanova, O., Sukharev, K.: Explicit semantic analysis as a means for topic labelling. In: Ustalov, D., Filchenkov, A., Pivovarova, L., Žižka, J. (eds.) AINL 2018. CCIS, vol. 930, pp. 110–116. Springer, Cham (2018). Scholar
  15. 15.
    Magatti, D., Calegari, S., Ciucci, D., Stella, F.: Automatic labeling of topics. In: Intelligent Systems Design and Applications, pp. 1227–1232 (2009).
  16. 16.
    Mei, Q., Shen, X., Zhai, C.: Automatic labeling of multinomial topic models. In: SIGKDD, pp. 490–499 (2007).
  17. 17.
    Mityagin, S., Yakushev, A., Bukhanovsky, A.: Research on Internet social networks for detecting the related interests of drug addicted people. Engineering 6, 59–64 (2012)Google Scholar
  18. 18.
    Nikolenko, S., Koltcov, S., Koltsova, O.: Topic modelling for qualitative studies. J. Inf. Sci. 43, 88–102 (2015). Scholar
  19. 19.
    Nitkin, D., Yudina, M.: Programme for extracting information on relationships of users of VK social network. Dyn. Syst. Mech. Mach. 4, 32–35 (2016)Google Scholar
  20. 20.
    Panicheva, P., Mirzagitova, A., Ledovaya, Y.: Semantic feature aggregation for gender identification in Russian Facebook. In: Filchenkov, A., Pivovarova, L., Žižka, J. (eds.) AINL 2017. CCIS, vol. 789, pp. 3–15. Springer, Cham (2018). Scholar
  21. 21.
    Smirnova, O.: Risk estimation for accounts exposed to suicide-themed quests. In: Modern Education Technologies and IT-Education, vol. 13, pp. 53–60 (2017)Google Scholar
  22. 22.
    Stevens, K., Kegelmeyer, P., Andrzejewski D., Buttler D.: Exploring topic coherence over many models and many topics. In: Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, pp. 952–961 (2012)Google Scholar
  23. 23.
    Mitrofanova, O., Sampetova, V., Mamaev, I., Moskvina, A., Sukharev, K.: Topic modelling of the russian corpus of pikabu posts: author-topic distribution and topic labelling. In: Proceedings of the International Conference « Internet and Modern Society» (IMS 2020), International Workshop «Computational Linguistics» (CompLing-2020) (2020, in press)Google Scholar
  24. 24.
    Vorontsov, K., Frei, O., Apishev, M., Romov, P., Suvorova, M., Yanina, A.: Non-Bayesian additive regularization for multimodal topic modeling of large collections. In: Proceedings of the 2015 Workshop on Topic Models: Post-Processing and Applications, pp. 29–37 (2015).
  25. 25.
    Vorontsov, K., Potapenko, A.: Tutorial on probabilistic topic modeling: additive regularization for stochastic matrix factorization. In: Ignatov, D.I., Khachay, M.Y., Panchenko, A., Konstantinova, N., Yavorskiy, R.E. (eds.) AIST 2014. CCIS, vol. 436, pp. 29–46. Springer, Cham (2014). Scholar

Copyright information

© Springer Nature Switzerland AG 2020

Authors and Affiliations

  1. 1.Saint Petersburg State UniversitySaint PetersburgRussia

Personalised recommendations