Skip to main content

Clustering Social Networks Using Distance-Preserving Subgraphs

  • Chapter
  • First Online:

Part of the book series: Lecture Notes in Social Networks ((LNSN,volume 6))

Abstract

Cluster analysis describes the division of a dataset into subsets of related objects, which are usually disjoint. There is considerable variety among the different types of clustering algorithms. Some of these clustering algorithms represent the dataset as a graph, and use graph-based properties to generate the clusters. However, many graph properties have not been explored as the basis for a clustering algorithm. In graph theory, a subgraph of a graph is distance-preserving if the distances (lengths of shortest paths) between every pair of vertices in the subgraph are the same as the corresponding distances in the original graph. In this paper, we consider the question of finding proper distance-preserving subgraphs, and the problem of partitioning a simple graph into an arbitrary number of distance-preserving subgraphs for clustering purposes. We then present a clustering algorithm called DP-Cluster, based on the notion of distance-preserving subgraphs. We also introduce the concept of relaxation values to the distance-preserving subgraph finding heuristic embedded in DP-Cluster, and investigate this and other variations of the algorithm. One area of research that makes considerable use of graph theory is the analysis of social networks. For this reason we evaluate the performance of DP-Cluster on two real-world social network datasets.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD   109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Ankerst, M., Breunig, M., Kriegel, H., Sander, J.: OPTICS: ordering points to identify the clustering structure. ACM SIGMOD Rec. 28(2), 49–60 (1999)

    Article  Google Scholar 

  2. Bandelt, H., Mulder, H.: Distance-hereditary graphs. J. Comb. Theory B 41(2), 182–208 (1986)

    Article  MathSciNet  MATH  Google Scholar 

  3. Bellman, R.: On a routing problem. Q. Appl. Math. 16(1), 87–90 (1958)

    MATH  Google Scholar 

  4. Charikar, M., Chekuri, C., Feder, T., Motwani, R.: Incremental clustering and dynamic information retrieval. In: Proceedings of the Twenty-Ninth Annual ACM Symposium on the Theory of Computing, pp. 626–635. ACM, New York (1997)

    Google Scholar 

  5. Damiand, G., Habib, M., Paul, C.: A simple paradigm for graph recognition: application to cographs and distance hereditary graphs. Theor. Comput. Sci. 263(1–2), 99–111 (2001)

    Article  MathSciNet  MATH  Google Scholar 

  6. Dijkstra, E.: A note on two problems in connexion with graphs. Numer. Math. 1(1), 269–271 (1959)

    Article  MathSciNet  MATH  Google Scholar 

  7. Doreian, P., Batagelj, V., Ferligoj, A.: Positional analyses of sociometric data. Models and Methods in Social Network Analysis, pp. 77–97. Cambridge University Press, New York (2005)

    Google Scholar 

  8. Ester, M., Kriegel, H., Sander, J., Xu, X.: A density-based algorithm for discovering clusters in large spatial databases with noise. In: Proceedings of KDD, vol. 96, pp. 226–231. AAAI, Menlo Park (1996)

    Google Scholar 

  9. Flake, G., Tarjan, R., Tsioutsiouliklis, K.: Graph clustering and minimum cut trees. Int. Math. 1(4), 385–408 (2004)

    MathSciNet  MATH  Google Scholar 

  10. Floyd, R.: Algorithm 97: shortest path. Commun. ACM 5(6), 345 (1962)

    Article  Google Scholar 

  11. Getoor, L., Diehl, C.: Link mining: a survey. ACM SIGKDD Explor. Newsl. 7(2), 12 (2005)

    Google Scholar 

  12. Hammer, P., Maffray, F.: Completely separable graphs. Discret. Appl. Math. 27(1–2), 85–99 (1990)

    Article  MathSciNet  MATH  Google Scholar 

  13. Howorka, E.: A characterization of distance-hereditary graphs. Q. J. Math. Oxf. Ser. 2(28), 417–420 (1977)

    Article  MathSciNet  Google Scholar 

  14. Liu, K., Bhaduri, K., Das, K., Nguyen, P., Kargupta, H.: Client-side web mining for community formation in peer-to-peer environments. ACM SIGKDD Explor. Newsl. 8(2), 20 (2006)

    Article  Google Scholar 

  15. MacQueen, J.: Some methods for classification and analysis of multivariate observations. Proceedings of the 5th Berkeley Symposium on Mathematical Statistics and Probability, vol. 1, pp. 281–297. Defense Technical Information Center, Ft. Belvoir (1966)

    Google Scholar 

  16. Newman, M., Girvan, M.: Finding and evaluating community structure in networks. Phys. Rev. E 69(2), 26113 (2004)

    Article  Google Scholar 

  17. Nussbaum, R., Esfahanian, A., Tan, P.: Clustering social networks using distance-preserving subgraphs. In: 2010 International Conference on Advances in Social Networks Analysis and Mining (ASONAM), pp. 380–385. IEEE, Washington, DC (2010)

    Google Scholar 

  18. Plesnik, J.: A heuristic for the p-center problem in graphs. Discret. Appl. Math. 17(3), 263–268 (1987)

    Article  MathSciNet  MATH  Google Scholar 

  19. Scripps, J., Tan, P.: Constrained overlapping clusters: minimizing the negative effects of bridge-nodes. Stat. Anal. Data Min. 3(1), 20–37 (2010)

    MathSciNet  Google Scholar 

  20. Tantipathananandh, C., Berger-Wolf, T., Kempe, D.: A framework for community identification in dynamic social networks. In: Proceedings of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, p. 726. ACM, New York (2007)

    Google Scholar 

  21. Wasserman, S., Faust, K.: Social Network Analysis: Methods and Applications. Cambridge University Press, Cambridge (1994)

    Book  Google Scholar 

  22. Watts, D., Strogatz, S.: Collective dynamics of ‘small-world’ networks. Nature 393(6684), 440–442 (1998)

    Article  Google Scholar 

  23. Zhou, D., Councill, I., Zha, H., Giles, C.: Discovering temporal communities from social network documents. In: Proceedings of the 2007 Seventh IEEE International Conference on Data Mining, pp. 745–750. IEEE, Washington, DC (2007)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ronald Nussbaum .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag Wien

About this chapter

Cite this chapter

Nussbaum, R., Esfahanian, AH., Tan, PN. (2013). Clustering Social Networks Using Distance-Preserving Subgraphs. In: Özyer, T., Rokne, J., Wagner, G., Reuser, A. (eds) The Influence of Technology on Social Network Analysis and Mining. Lecture Notes in Social Networks, vol 6. Springer, Vienna. https://doi.org/10.1007/978-3-7091-1346-2_14

Download citation

  • DOI: https://doi.org/10.1007/978-3-7091-1346-2_14

  • Published:

  • Publisher Name: Springer, Vienna

  • Print ISBN: 978-3-7091-1345-5

  • Online ISBN: 978-3-7091-1346-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics