Hybrid Method of Multiple Factor Data Clusterization

  • Conference paper
Digital Transformation and Global Society (DTGS 2020)

Abstract

This paper investigates multifactor clustering in which calculated metric values are normalized and averaged by various methods to improve clustering quality. It reviews the scientific literature on clustering social graphs and identifying communities, and identifies shortcomings of current research in social network analysis. A list of network analysis metrics recommended as a baseline for data pre-processing is presented. The algorithm of the hybrid multifactor clustering method, which reduces the computational cost of data clustering, is presented, together with the procedure for running it on several selected centrality metrics. Various methods of averaging centrality metrics are described. This approach can significantly improve the assessed quality of clustering. The developed hybrid method, based on averaging and the Louvain multi-factor clustering algorithm, reduces the computational resources required. The method is applied to clustering the ITMO.EXPERT online community on the VKontakte social network.
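The pre-processing idea in the abstract (compute several centrality metrics, normalize them, then average them into one score per node) can be illustrated with a minimal sketch. Everything specific here is an assumption rather than the authors' procedure: the toy graph, the choice of degree and closeness centrality, min-max normalization, and the arithmetic mean are all illustrative, since the abstract does not name the metrics or formulas used. The Louvain clustering step is omitted.

```python
from collections import deque
from statistics import fmean

# Hypothetical toy graph as undirected adjacency lists; not data from the paper.
GRAPH = {
    "a": ["b", "c", "d"],
    "b": ["a", "c"],
    "c": ["a", "b"],
    "d": ["a", "e"],
    "e": ["d"],
}


def degree_centrality(graph):
    """Number of neighbours divided by the maximum possible (n - 1)."""
    n = len(graph)
    return {v: len(nbrs) / (n - 1) for v, nbrs in graph.items()}


def closeness_centrality(graph):
    """(reachable nodes) / (sum of BFS distances); assumes a connected graph."""
    result = {}
    for source in graph:
        dist = {source: 0}
        queue = deque([source])
        while queue:
            u = queue.popleft()
            for w in graph[u]:
                if w not in dist:
                    dist[w] = dist[u] + 1
                    queue.append(w)
        total = sum(dist.values())
        result[source] = (len(dist) - 1) / total if total else 0.0
    return result


def min_max_normalize(scores):
    """Rescale scores to [0, 1]; a constant metric maps to all zeros."""
    lo, hi = min(scores.values()), max(scores.values())
    if hi == lo:
        return {v: 0.0 for v in scores}
    return {v: (s - lo) / (hi - lo) for v, s in scores.items()}


def average_metrics(metric_maps):
    """Arithmetic mean of several normalized metrics, node by node."""
    nodes = metric_maps[0].keys()
    return {v: fmean(m[v] for m in metric_maps) for v in nodes}


normalized = [
    min_max_normalize(degree_centrality(GRAPH)),
    min_max_normalize(closeness_centrality(GRAPH)),
]
combined = average_metrics(normalized)

for node, score in sorted(combined.items(), key=lambda kv: -kv[1]):
    print(f"{node}: {score:.3f}")
```

The combined score could then serve as a single node attribute or weight for a community detection step such as Louvain. Other averaging schemes (for example, a geometric or harmonic mean) would only require swapping the body of `average_metrics`.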

Corresponding author

Correspondence to Andrey Televnoy.

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Cite this paper

Televnoy, A., Ivanov, S.E., Gorlushkina, N. (2020). Hybrid Method of Multiple Factor Data Clusterization. In: Alexandrov, D.A., Boukhanovsky, A.V., Chugunov, A.V., Kabanov, Y., Koltsova, O., Musabirov, I. (eds) Digital Transformation and Global Society. DTGS 2020. Communications in Computer and Information Science, vol 1242. Springer, Cham. https://doi.org/10.1007/978-3-030-65218-0_11

  • DOI: https://doi.org/10.1007/978-3-030-65218-0_11

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-65217-3

  • Online ISBN: 978-3-030-65218-0

  • eBook Packages: Computer Science, Computer Science (R0)
