Abstract
The identification of social groups remains one of the main analytical themes in the analysis of social networks and, in more general terms, in the study of social organization. Traditional network approaches to group identification encounter a variety of problems when the data to be analyzed involve two-mode networks, i.e., relations between two distinct sets of objects with no reflexive relation allowed within each set. In this paper we propose a relatively novel approach to the recognition and identification of social groups in data generated by network-based processes in the context of two-mode networks. Our approach is based on a family of learning algorithms called Support Vector Machines (SVM). The analytical framework provided by SVM provides a flexible statistical environment to solve classification tasks, and to reframe regression and density estimation problems. We explore the relative merits of our approach to the analysis of social networks in the context of the well known “Southern women” (SW) data set collected by Davis Gardner and Gardner. We compare our results with those that have been produced by different analytical approaches. We show that our method, which acts as a data-independent preprocessing step, is able to reduce the complexity of the clustering problem enabling the application of simpler configurations of common algorithms.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
BENHUR A., HORH D., SIEGELMANN H.T. and VAPNIK V. (2001): Support Vector Clustering. Journal of Machine Learning, 2, 125–137.
BORGATTI S. P. and EVERETT M. G. (1997): Network analysis of 2-mode data. Social Networks, 19/3, 243–269.
BOORMAN S. and WHITE H. (1976): Social Structure from Multiple Networks II. Role Structures. American Journal of Sociology 81: 1384–1446.
BREIGER R. (1974): The Duality of Persons and Groups. Social Forces, 53, 181–90.
DAVIS A., GARDNER B. B. and GARDNER M. R. (1941): Deep South. Chicago: The University of Chicago Press.
DOREIAN P., BATAGELJ V. and FERLIGOJ A. (2004): Generalized blockmodeling of two-mode network data. Social Networks 26, 29–53.
DOREIAN P., BATAGELJ V. and FERLIGOJ A. (2005): Generalized blockmodeling. Cambridge University Press.
FREEMAN L. C. and WHITE D. R. (1994): Using Galois lattices to present network data. In P. Marsden (Ed.): Sociological Methodology (pp. 127–146). Cambridge: Blackwell.
FREEMAN L. (2003): Finding social groups: A meta-analysis of the southern women data. In R. Breiger, K. Carley and P. Pattison (Eds.): Dynamic Social Network Modeling and Analysis. The National Academies Press.
GREENACRE M. J. (1984): Theory and Applications of Correspondence Analysis. London: Academic Press.
PATTISON P. E. and BREIGER R. (2002): Lattices and Dimensional Representations: Matrix Decompositions and Ordering Structures. Social Networks, 24, 423–444.
SCHOLKOPF B., PLATT J.C., SHAWE-TAYLOR J., and SMOLA A.J. (2001): Estimating the support of a high-dimensional distribution. Neural Computation, 13(7):14431471.
TAX D.M.J. (2005): DDtools, the Data Description Toolbox for Matlab. version 1.4.1.
TAX D.M.J. and DUIN R.P.W. (2004): Support Vector Data Description. Journal of Machine Learning, 54/1, 45–46.
VAPNIK V. (1995): The Nature of Statistical Learning Theory. Springer Verlag.
WHITE H., BOORMAN S. and BREIGER R. (1976): Social Structure from Multiple Networks I. Blockmodels of Roles and Positions. American Journal of Sociology 81: 730–779.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin · Heidelberg
About this paper
Cite this paper
Roffilli, M., Lomi, A. (2006). Identifying and Classifying Social Groups: A Machine Learning Approach. In: Batagelj, V., Bock, HH., Ferligoj, A., Žiberna, A. (eds) Data Science and Classification. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Berlin, Heidelberg . https://doi.org/10.1007/3-540-34416-0_17
Download citation
DOI: https://doi.org/10.1007/3-540-34416-0_17
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-34415-5
Online ISBN: 978-3-540-34416-2
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)