Clustering Social Networks Using Distance-Preserving Subgraphs

Nussbaum, Ronald; Esfahanian, Abdol-Hossein; Tan, Pang-Ning

doi:10.1007/978-3-7091-1346-2_14

Clustering Social Networks Using Distance-Preserving Subgraphs

Ronald Nussbaum⁵,
Abdol-Hossein Esfahanian⁵ &
Pang-Ning Tan⁵

Chapter
First Online: 21 December 2012

2668 Accesses
5 Citations

Part of the book series: Lecture Notes in Social Networks ((LNSN,volume 6))

Abstract

Cluster analysis describes the division of a dataset into subsets of related objects, which are usually disjoint. There is considerable variety among the different types of clustering algorithms. Some of these clustering algorithms represent the dataset as a graph, and use graph-based properties to generate the clusters. However, many graph properties have not been explored as the basis for a clustering algorithm. In graph theory, a subgraph of a graph is distance-preserving if the distances (lengths of shortest paths) between every pair of vertices in the subgraph are the same as the corresponding distances in the original graph. In this paper, we consider the question of finding proper distance-preserving subgraphs, and the problem of partitioning a simple graph into an arbitrary number of distance-preserving subgraphs for clustering purposes. We then present a clustering algorithm called DP-Cluster, based on the notion of distance-preserving subgraphs. We also introduce the concept of relaxation values to the distance-preserving subgraph finding heuristic embedded in DP-Cluster, and investigate this and other variations of the algorithm. One area of research that makes considerable use of graph theory is the analysis of social networks. For this reason we evaluate the performance of DP-Cluster on two real-world social network datasets.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Hardcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

Ankerst, M., Breunig, M., Kriegel, H., Sander, J.: OPTICS: ordering points to identify the clustering structure. ACM SIGMOD Rec. 28(2), 49–60 (1999)
Article Google Scholar
Bandelt, H., Mulder, H.: Distance-hereditary graphs. J. Comb. Theory B 41(2), 182–208 (1986)
Article MathSciNet MATH Google Scholar
Bellman, R.: On a routing problem. Q. Appl. Math. 16(1), 87–90 (1958)
MATH Google Scholar
Charikar, M., Chekuri, C., Feder, T., Motwani, R.: Incremental clustering and dynamic information retrieval. In: Proceedings of the Twenty-Ninth Annual ACM Symposium on the Theory of Computing, pp. 626–635. ACM, New York (1997)
Google Scholar
Damiand, G., Habib, M., Paul, C.: A simple paradigm for graph recognition: application to cographs and distance hereditary graphs. Theor. Comput. Sci. 263(1–2), 99–111 (2001)
Article MathSciNet MATH Google Scholar
Dijkstra, E.: A note on two problems in connexion with graphs. Numer. Math. 1(1), 269–271 (1959)
Article MathSciNet MATH Google Scholar
Doreian, P., Batagelj, V., Ferligoj, A.: Positional analyses of sociometric data. Models and Methods in Social Network Analysis, pp. 77–97. Cambridge University Press, New York (2005)
Google Scholar
Ester, M., Kriegel, H., Sander, J., Xu, X.: A density-based algorithm for discovering clusters in large spatial databases with noise. In: Proceedings of KDD, vol. 96, pp. 226–231. AAAI, Menlo Park (1996)
Google Scholar
Flake, G., Tarjan, R., Tsioutsiouliklis, K.: Graph clustering and minimum cut trees. Int. Math. 1(4), 385–408 (2004)
MathSciNet MATH Google Scholar
Floyd, R.: Algorithm 97: shortest path. Commun. ACM 5(6), 345 (1962)
Article Google Scholar
Getoor, L., Diehl, C.: Link mining: a survey. ACM SIGKDD Explor. Newsl. 7(2), 12 (2005)
Google Scholar
Hammer, P., Maffray, F.: Completely separable graphs. Discret. Appl. Math. 27(1–2), 85–99 (1990)
Article MathSciNet MATH Google Scholar
Howorka, E.: A characterization of distance-hereditary graphs. Q. J. Math. Oxf. Ser. 2(28), 417–420 (1977)
Article MathSciNet Google Scholar
Liu, K., Bhaduri, K., Das, K., Nguyen, P., Kargupta, H.: Client-side web mining for community formation in peer-to-peer environments. ACM SIGKDD Explor. Newsl. 8(2), 20 (2006)
Article Google Scholar
MacQueen, J.: Some methods for classification and analysis of multivariate observations. Proceedings of the 5th Berkeley Symposium on Mathematical Statistics and Probability, vol. 1, pp. 281–297. Defense Technical Information Center, Ft. Belvoir (1966)
Google Scholar
Newman, M., Girvan, M.: Finding and evaluating community structure in networks. Phys. Rev. E 69(2), 26113 (2004)
Article Google Scholar
Nussbaum, R., Esfahanian, A., Tan, P.: Clustering social networks using distance-preserving subgraphs. In: 2010 International Conference on Advances in Social Networks Analysis and Mining (ASONAM), pp. 380–385. IEEE, Washington, DC (2010)
Google Scholar
Plesnik, J.: A heuristic for the p-center problem in graphs. Discret. Appl. Math. 17(3), 263–268 (1987)
Article MathSciNet MATH Google Scholar
Scripps, J., Tan, P.: Constrained overlapping clusters: minimizing the negative effects of bridge-nodes. Stat. Anal. Data Min. 3(1), 20–37 (2010)
MathSciNet Google Scholar
Tantipathananandh, C., Berger-Wolf, T., Kempe, D.: A framework for community identification in dynamic social networks. In: Proceedings of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, p. 726. ACM, New York (2007)
Google Scholar
Wasserman, S., Faust, K.: Social Network Analysis: Methods and Applications. Cambridge University Press, Cambridge (1994)
Book Google Scholar
Watts, D., Strogatz, S.: Collective dynamics of ‘small-world’ networks. Nature 393(6684), 440–442 (1998)
Article Google Scholar
Zhou, D., Councill, I., Zha, H., Giles, C.: Discovering temporal communities from social network documents. In: Proceedings of the 2007 Seventh IEEE International Conference on Data Mining, pp. 745–750. IEEE, Washington, DC (2007)
Google Scholar

Download references

Author information

Authors and Affiliations

Michigan State University, East Lansing, MI, 48824-1226, USA
Ronald Nussbaum, Abdol-Hossein Esfahanian & Pang-Ning Tan

Authors

Ronald Nussbaum
View author publications
You can also search for this author in PubMed Google Scholar
Abdol-Hossein Esfahanian
View author publications
You can also search for this author in PubMed Google Scholar
Pang-Ning Tan
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Ronald Nussbaum .

Editor information

Editors and Affiliations

Department of Computer Engineering, TOBB University, Sogutozu Cad No. 43, Sogutozu Ankara, Turkey
Tansel Özyer
Computer Science, University of Calgary, University Dr. NW 2500, Calgary, T2N 1N4, Canada
Jon Rokne
IPSC, European Commission Joint Research Cent., Via Enrico Fermi 2749, Ispra, 21027, Italy
Gerhard Wagner
De Wetstraat 16, Leiden, 2332 XT, Netherlands
Arno H.P. Reuser

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Nussbaum, R., Esfahanian, AH., Tan, PN. (2013). Clustering Social Networks Using Distance-Preserving Subgraphs. In: Özyer, T., Rokne, J., Wagner, G., Reuser, A. (eds) The Influence of Technology on Social Network Analysis and Mining. Lecture Notes in Social Networks, vol 6. Springer, Vienna. https://doi.org/10.1007/978-3-7091-1346-2_14

Download citation

DOI: https://doi.org/10.1007/978-3-7091-1346-2_14
Published: 21 December 2012
Publisher Name: Springer, Vienna
Print ISBN: 978-3-7091-1345-5
Online ISBN: 978-3-7091-1346-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics