Abstract
The urgent scientific problem of multifactor clustering using various methods of normalization and averaging is investigated. Metric calculation values to improve the quality of clustering. A literary review of scientific publications on the topic of clustering social graphs and identifying communities has been carried out. The shortcomings of modern research in the field of analysis of social networks are identified. The list of network analysis metrics recommended as basic for data pre-processing is presented. The algorithm of the hybrid method of multifactorial clustering is presented, which allows reducing the computational costs of data clustering. An algorithm execution procedure is described for selecting several centrality metrics. Various methods of averaging centrality metrics are presented. This approach can significantly increase the assessment of the quality of clustering. The developed hybrid method based on averaging and the Louvain multi-factor clustering algorithm allows us to reduce computational resources. The clusterization application problem in the online community ITMO.EXPERT of the VKontakte social network is considered.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Himelboim, I., Smith, M., Rainie, L., Shneiderman, B., Espina, C.: Classifying twitter topic-networks using social network analysis. J. Soc. Media + Soc. 3, 1–13 (2017). https://doi.org/10.1177/2056305117691545
Reihaneh, K., Takaffoli, M., Zaïane, O.: Analyzing participation of students in online courses using social network analysis techniques. In: 4th International Conference on Educational Data Mining, Netherlands, pp. 21–30. EDM Press (2011)
Cha, Y., Cho, J.: Social-network analysis using topic models. In: Proceedings of the 35th International ACM SIGIR Conference on Research and Development in Information Retrieval, New York, pp. 565–574. Association for Computing Machinery (2012)
Lei, T., Huan, L.: Graph mining applications to social network analysis. In: Aggarwal, C.C., Wang, H. (eds.) Managing and Mining Graph Data, pp. 487–513. Springer, Boston (2010). https://doi.org/10.1007/978-1-4419-6045-0_16
Oliveira, M., Gama, J.: An overview of social network analysis. WIREs Data Mining Knowl. Discov. 2, 99–115 (2012)
Wang, T., et al.: Understanding graph sampling algorithms for social network analysis. In: 31st International Conference on Distributed Computing Systems Workshops, Minneapolis, pp. 123–128. IEEE Press (2011)
Chen, Y., Hu, J., Zhao, H., Xiao, Y., Hui, P.: Measurement and analysis of the swarm social network with tens of millions of nodes. IEEE Access 6, 4547–4559 (2018)
Bhagat, S., Cormode, G., Muthukrishnan, S.: Node classification in social networks. In: Aggarwal, C. (ed.) Social Network Data Analytics, pp. 115–148. Springer, Boston (2011). https://doi.org/10.1007/978-1-4419-8462-3_5
Truong, Q.D., Truong, Q.B., Dkaki, T.: Graph methods for social network analysis. In: Vinh, P.C., Barolli, L. (eds.) ICTCC 2016. LNICSSITE, vol. 168, pp. 276–286. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46909-6_25
Cordeiro, M., Sarmento, R., Brazdil, P., Gama J.: Evolving networks and social network analysis methods and techniques. In: Višňovský, J., Radošinská, J. (eds.) Social Media and Journalism - Trends, Connections, Implications, London, pp. 101–134. IntechOpen (2018)
Taniarza, N., Adiwijaya, Maharani, W.: Social network analysis using k-Path centrality method. J. Phys.: Conf. Ser. 971, 1–9 (2018). https://doi.org/10.1088/1742-6596/971/1/012015
Niewiadomska-Szynkiewicz, E.: Application of social network analysis to the investigation of interpersonal connections. J. Telecommun. Inf. Technol. 2, 81–89 (2012)
Bothorel, C., Cruz, J., Magnani, M., Micenkova, B.: Clustering attributed graphs: models, measures and methods. Netw. Sci. 3, 408–444 (2015)
Cruz, J., Bothorel, C., Poulet, F.: Community detection and visualization in social networks: integrating structural and semantic information. ACM Trans. Intell. Syst. Technol. 5, 11:1–11:26 (2014)
Coscia, M., Giannotti, F., Pedreschi, D.: A classification for community discovery methods in complex networks. Stat. Anal. Data Mining 4, 512–546 (2011). https://doi.org/10.1002/sam.10133
Dang, A., Viennet, E.: Community detection based on structural and attribute similarities. In: 6th International Conference on Digital Society, Valencia, pp. 7–14. IARIA XPS Press (2012)
Hric, D., Darst, R., Fortunato, S.: Community detection in networks: structural clusters versus ground truth. Phys. Rev. E 90, 062805 (2014). https://doi.org/10.1103/PhysRevE.90.062805
Yang, J., McAuley, J., Leskovec, J.: Community detection in networks with node attributes. In: 13th IEEE International Conference on Data Mining, Dallas, pp. 1151–1156. IEEE Press (2013)
A Social Network Analysis of Articles on Social Network Analysis. https://arxiv.org/pdf/1810.09781.pdf/. Accessed 10 Jan 2020
Zhao, Y., Cai, S., Tang, M., Shang, M.: Coarse cluster enhancing collaborative recommendation for social network systems. Physica A: Stat. Mech. Appl. 483, 209–218 (2017)
Sun, L., Tao, T., Chen, F., Luo, Y.: An optimized clustering method with improved cluster center for social network based on gravitational search algorithm. In: Chen, F., Luo, Y. (eds.) Industrial IoT 2017. LNICST, vol. 202, pp. 61–71. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-60753-5_7
Sun, Y., Yin, S., Li, H., Teng, L., Karim, S.: GPOGC: gaussian pigeon-oriented graph clustering algorithm for social networks cluster. IEEE Access 7, 99254–99262 (2019)
Berlingerio, M., Coscia, M., Giannotti, F., Monreale, A., Pedreschi, D.: Foundations of multidimensional network analysis. In: International Conference on Advances in Social Networks Analysis and Mining, New York, pp. 485–489. IEEE Press (2011)
Brandes, U., Gaertler, M., Wagner, D.: Engineering graph clustering: models and experimental evaluation. J. Exp. Algorithmics 12, 1–26 (2008)
Fortunato, S.: Community detection in graphs. Phys. Rep. 486, 75–174 (2010)
Blondel, V., Guillaume, J., Lambiotte, R., Lefebvre, E.: Fast unfolding of communities in large networks. J. Stat. Mech.: Theory Exp. (2008)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Televnoy, A., Ivanov, S.E., Gorlushkina, N. (2020). Hybrid Method of Multiple Factor Data Clusterization. In: Alexandrov, D.A., Boukhanovsky, A.V., Chugunov, A.V., Kabanov, Y., Koltsova, O., Musabirov, I. (eds) Digital Transformation and Global Society. DTGS 2020. Communications in Computer and Information Science, vol 1242. Springer, Cham. https://doi.org/10.1007/978-3-030-65218-0_11
Download citation
DOI: https://doi.org/10.1007/978-3-030-65218-0_11
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-65217-3
Online ISBN: 978-3-030-65218-0
eBook Packages: Computer ScienceComputer Science (R0)