
Analyzing the Similarity-Based Clusterability of the Vertices in a Complex Network

  • Conference paper
16th International Conference on Information Technology-New Generations (ITNG 2019)

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 800))

Abstract

We propose an approach to quantify the extent to which the vertices in a complex network are clusterable on the basis of the similarity of their values for two or more node-level metrics. We use the Hopkins Statistic to assess clusterability and use centrality metrics as the node-level metrics. Our approach is to construct a logical topology of the vertices, using the normalized values of the centrality metrics as coordinates, and to determine the Hopkins Statistic for this logical topology. Our hypothesis is that if two or more vertices in a complex network have similar values for the centrality metrics, then they should be clusterable into one or more clusters because of their proximity to each other in the normalized centrality-based coordinate system. In that case, the Hopkins Statistic (which ranges from 0 to 1) for the logical topology should be high; conversely, it should be low when the vertices have dissimilar centrality values. We evaluate the Hopkins Statistic for 47 real-world complex networks (of diverse degree distributions) with respect to the neighborhood-based and shortest path-based centrality metrics. We observe that the neighborhood centrality-based logical topologies of the vertices yield relatively larger Hopkins Statistic values (i.e., exhibit higher similarity-based clusterability) for a majority of the networks.
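
The procedure described in the abstract can be illustrated with a minimal Python sketch. The abstract does not give the exact formulation used in the paper, so the sketch assumes one common form of the Hopkins Statistic, H = sum(u) / (sum(u) + sum(w)), where w holds the nearest-neighbor distances of sampled vertices and u holds the nearest-vertex distances of uniformly random locations in the unit hypercube. The function names, the sample size, and the min-max normalization are illustrative assumptions, not the authors' implementation.

```python
import math
import random


def min_max_normalize(values):
    """Scale raw metric values to [0, 1]; a constant list maps to all 0.0."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


def euclidean(p, q):
    """Euclidean distance between two equal-length coordinate sequences."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(p, q)))


def nearest_distance(point, points, skip_index=None):
    """Distance from `point` to the closest entry of `points`.

    `skip_index` excludes one entry, so a vertex is not matched to itself.
    """
    best = math.inf
    for i, other in enumerate(points):
        if i != skip_index:
            best = min(best, euclidean(point, other))
    return best


def hopkins_statistic(points, sample_size, seed=0):
    """Hopkins Statistic of points lying in the unit hypercube (assumed form).

    points      : one coordinate tuple per vertex, each coordinate being a
                  normalized centrality value in [0, 1]
    sample_size : number of sampled vertices and of random locations;
                  must not exceed len(points)
    Values near 1 suggest clustered points; values near 0.5 suggest
    uniformly scattered points.
    """
    rng = random.Random(seed)
    dims = len(points[0])
    sampled = rng.sample(range(len(points)), sample_size)

    # w: sampled vertex -> nearest other vertex
    w = [nearest_distance(points[i], points, skip_index=i) for i in sampled]

    # u: uniformly random location -> nearest vertex
    u = [
        nearest_distance([rng.random() for _ in range(dims)], points)
        for _ in range(sample_size)
    ]

    total = sum(u) + sum(w)
    return sum(u) / total if total > 0 else 0.5
```

To apply the sketch, the coordinates would be built from two or more per-vertex metric lists, for example `list(zip(min_max_normalize(metric_a), min_max_normalize(metric_b)))`, where `metric_a` and `metric_b` are hypothetical lists of centrality values in the same vertex order. Vertices with similar centrality values then sit close together, which shrinks the w distances and pushes the statistic toward 1.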

Acknowledgements

This research was funded by NASA EPSCoR subaward NNX14AN38A from the University of Mississippi, NSF MRI Grant 13-38192, and NSF CNS Grant 14-56638.

Author information

Corresponding author

Correspondence to Natarajan Meghanathan.

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Cite this paper

Rahman, M.A., Meghanathan, N. (2019). Analyzing the Similarity-Based Clusterability of the Vertices in a Complex Network. In: Latifi, S. (eds) 16th International Conference on Information Technology-New Generations (ITNG 2019). Advances in Intelligent Systems and Computing, vol 800. Springer, Cham. https://doi.org/10.1007/978-3-030-14070-0_33

  • DOI: https://doi.org/10.1007/978-3-030-14070-0_33

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-14069-4

  • Online ISBN: 978-3-030-14070-0

  • eBook Packages: Engineering, Engineering (R0)
