Skip to main content

Access-Control Prediction in Social Network Sites: Examining the Role of Homophily

  • Conference paper
  • First Online:
Social Informatics (SocInfo 2018)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 11186))

Included in the following conference series:

  • 1130 Accesses

Abstract

Often, users of Social Network Sites (SNSs) like Facebook or Twitter have issues when controlling the access to their content. Access-control predictive models are used to recommend access-control configurations which are aligned with the users’ individual privacy preferences. One basic strategy for the prediction of access-control configurations is to generate access-control lists out of the emerging communities inside the user’s ego-network. That is, in a community-based fashion. Homophily, which is the tendency of individuals to bond with others who hold similar characteristics, can influence the network structure of SNSs and bias the users’ privacy preferences. Consequently, it can also impact the quality of the configurations generated by access-control predictive models that follow a community-based approach. In this work, we use a simulation model to evaluate the effect of homophily when predicting access-control lists in SNSs. We generate networks with different levels of homophily and analyse thereby its impact on access-control recommendations.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    The scripts can be found in the following repository: https://bit.ly/2Nth7HE.

  2. 2.

    http://www.pewresearch.org/fact-tank/2014/02/03/what-people-like-dislike-about-facebook/.

References

  1. Barabási, A.L.: Network Science. Cambridge University Press, Cambridge (2016)

    Google Scholar 

  2. Barabási, A.L., Albert, R.: Emergence of scaling in random networks. Science 286(5439), 509–512 (1999). https://doi.org/10.1126/science.286.5439.509, http://science.sciencemag.org/content/286/5439/509

  3. Blondel, V.D., Guillaume, J.L., Lambiotte, R., Lefebvre, E.: Fast unfolding of communities in large networks. J. Stat. Mech. Theory Exper. 2008(10), P10008 (2008)

    Article  Google Scholar 

  4. Boyd, D., Levy, K., Marwick, A.: The networked nature of algorithmic discrimination. In: Data and Discrimination: Collected Essays. Open Technology Institute, New America Washington, DC (2014)

    Google Scholar 

  5. Csardi, G., Nepusz, T.: The igraph software package for complex network research. InterJournal Complex Syst. 1695(5), 1–9 (2006). http://igraph.org

  6. Díaz Ferreyra, N.E., Meis, R., Heisel, M.: At your own risk: shaping privacy heuristics for online self-disclosure. In: Proceedings of the 16th Annual Conference on Privacy, Security and Trust, August 2018

    Google Scholar 

  7. Fang, L., LeFevre, K.: Privacy wizards for social networking sites. In: Proceedings of the 19th International Conference on World Wide Web, pp. 351–360, WWW 2010. ACM, New York (2010). https://doi.org/10.1145/1772690.1772727

  8. Gil, M., Pelechano, V., Fons, J., Albert, M.: Designing the human in the loop of self-adaptive systems. In: García, C.R., Caballero-Gil, P., Burmester, M., Quesada-Arencibia, A. (eds.) UCAmI 2016. LNCS, vol. 10069, pp. 437–449. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-48746-5_45

    Chapter  Google Scholar 

  9. Kim, K., Altmann, J., Hwang, J.: The impact of the subgroup structure on the evolution of networks: an economic model of network evolution. In: 2010 INFOCOM IEEE Conference on Computer Communications Workshops, pp. 1–9, March 2010. https://doi.org/10.1109/infocomw.2010.5466705

  10. Kim, K., Altmann, J.: Effect of homophily on network formation. Commun. Nonlinear Sci. Numer. Simul. 44, 482–494 (2017)

    Article  MathSciNet  Google Scholar 

  11. Leskovec, J., McAuley, J.: Learning to discover social circles in ego networks. In: Pereira, F., Burges, C.J.C., Bottou, L., Weinberger, K.Q. (eds.) Advances in Neural Information Processing Systems, vol. 25, pp. 539–547. Curran Associates, Inc. (2012)

    Google Scholar 

  12. Madden, M., et al.: Teens, social media, and privacy. Pew Res. Cent. 21, 2–86 (2013)

    Google Scholar 

  13. McPherson, M., Smith-Lovin, L., Cook, J.M.: Birds of a feather: homophily in social networks. Annu. Rev. Sociol. 27(1), 415–444 (2001)

    Article  Google Scholar 

  14. Misra, G., Such, J.M.: REACT: recommending access control decisions to social media users. In: Proceedings of the 2017 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining 2017, pp. 421–426, ASONAM 2017. ACM, New York (2017). https://doi.org/10.1145/3110025.3110073

  15. Misra, G., Such, J.M., Balogun, H.: Non-sharing communities? An empirical study of community detection for access control decisions. In: IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM), pp. 49–56 (2016)

    Google Scholar 

  16. Newman, M.E.J.: Mixing patterns in networks. Phys. Rev. E 67, 026126 (2003). https://doi.org/10.1103/PhysRevE.67.026126

    Article  MathSciNet  Google Scholar 

  17. Newman, M.E.J.: Finding community structure in networks using the eigenvectors of matrices. Phys. Rev. E 74, 036104 (2006). https://doi.org/10.1103/PhysRevE.74.036104

    Article  MathSciNet  Google Scholar 

  18. Thedchanamoorthy, G., Piraveenan, M., Kasthuriratna, D., Senanayake, U.: Node assortativity in complex networks: an alternative approach. Procedia Comput. Sci. 29, 2449–2461 (2014)

    Article  Google Scholar 

  19. Volkovich, Y., Scellato, S., Laniado, D., Mascolo, C., Kaltenbrunner, A.: The length of bridge ties: structural and geographic properties of online social interactions. In: Proceedings of the 6th International AAAI Conference on Weblogs and Social Media (2012)

    Google Scholar 

Download references

Acknowledgments

This work was supported by the Deutsche Forschungsgemeinschaft (DFG) under grant No. GRK 2167, Research Training Group “User-Centred Social Media”.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Nicolás E. Díaz Ferreyra .

Editor information

Editors and Affiliations

Appendices

Appendix

In this Appendix we introduce the theoretical foundations used for the definition and implementation of our simulation model of ego-networks.

A Network Evolution with Homophily

Up to now, scholars have proposed several evolution models for constructing scale-free networks. Among them, the prefferential attachment mechanism introduced by Barabassi and Albert [2] is one of the most prominent ones. However, this model does not consider attribute similarity when computing the linking probability between two nodes. Following, we introduce an approach for the simulation of ego-networks that takes homophily explicitly into account.

1.1 A.1 Group-Openness

Many empirical and theoretical studies have shown that people prefer to link to those people with whom they share certain characteristics [13]. Moreover, these studies have also shown that homophily can lead to the emergence of clusters inside a social network in which similar people are linked more densely with each other [9]. In order to study the role of homophily in the evolution of scale-free networks, Kim et al. [10] introduced group-openness characteristics to the preferential attachment mechanism of Barabasi and Albert [1]. In this approach, nodes which share a particular characteristic s belong to group s. Consequently, the group-openness factor \(\varLambda _s^t\) between two groups s and t is defined as:

(1)

where the homophily index \(\varLambda \) is a real number between 0 and 1 [9]. If \(\varLambda = 0\), nodes in group s do not link with nodes in group t (\(t \ne s\)) but link only with those nodes in the same group. This state describes completely closed groups, in which members prefer to link only with those who hold their same characteristics. Conversely, if \(\varLambda = 1\) homophily does not affect the linkage between nodes, independently of whether they belong to the same group or not. This state describes completely open groups that show neutrality when linking to others [10].

1.2 A.2 Preferential Attachment with Homophily

The preferential attachment mechanism introduced by Barabasi and Albert [1] describes the process by which new nodes prefer to link to the more connected nodes in a network (i.e. the hubs). Hence, the probability \(\varPi _i\) that a new node connects to node i is proportional to the degree \(k_i\) of node i:

(2)

Using the group-openness mechanism defined in Eq. 1, Kim et al. [10] introduced homophily to the preferential attachment model of Eq. 2. As result, the probability \(\varPi _i^{pq}\) that a new node of group q is linked to node i of group p is defined as:

(3)

where \(k_i^p\) is the degree of node i of group p, and \(\varLambda _p^q\) represents the homophily between the group of the new node q and the group of node i. As one can observe, in Eq. 2 the probability that a new node connects to an existing node i is normalized by the sum of degrees of all existing nodes in the network. This is also the case for Eq. 3, only that this time the group-openness factor of each node is considered for the normalization. In other words, if all groups are completely open (i.e. \(\varLambda = 1\)), Eq. 3 is identical to Eq. 2. On the other hand, if the groups are completely open (i.e. \(\varLambda = 0\)), Eq. 3 is reduced to Eq. 2 for each particular group [10].

In order to explain the evolution of nodes who are active inside a network for a long period of time, Kim et al. [9] introduced an additional rule which describes the creation of links between existing nodes. This is, the probability \(\varPi _{ij}^{pq}\) that node i of group p links to node j of group q is defined as:

$$\begin{aligned} \varPi _{ij}^{pq}&= \frac{k_{i}^{p}.k_{j}^{q}.\varLambda _{p}^{q}}{\sum _{l} \sum _{m>l}k_{l}^{\mu }.k_{m}^{\nu }.\varLambda _{\mu }^{\nu }} \end{aligned}$$
(4)

where \(k_i^p\) and \(k_j^q\) are the degrees of node i and of node j respectively [10]. Nodes i and j belong to groups p and q respectively, and \(\varLambda _p^q\) is the group-openness between these two groups. Like in Eqs. 2 and 3, Eq. 4 is normalized by the sum of all possible combinations of links between existing nodes in the network [10]. In this case, \(k_l^\mu \) and \(k_m^\nu \) are the degrees of nodes l and m which belong to groups \(\mu \) and \(\nu \), respectively. Likewise, \(\varLambda _\mu ^\nu \) refers to the group-openness between groups \(\mu \) and \(\nu \) [10].

B Simulation Model

The model introduced in Sect. A.2 of this Appendix generates a network in which nodes link with each other according to attribute similarity. Therefore, it assumes that the values of these attributes have been assigned to the nodes prior to the attachment phase. Following, we define the attributes used to characterise the nodes of our simulated networks and the distribution of their respective values. Likewise, we define the parameters used to set-up the simulation.

1.1 B.1 Node-Attributed Ego-Networks

In our model, the ego and its alters are characterized with the attributes gender, workplace and location where gender can take the values male or female, workplace the values Starbucks, Google or Ikea, and location the values Leeds or York. These attributes and their respective values are conditionally distributed following the probability tree of Fig. 4. According to this distribution, a node in the network is generated with 50% chance of being female and 50% chance of being male (i.e. \(P(male)=P(female)=0.5\)). Then, the values for workplace are assigned with 33% chance according to the gender value of the node. For instance, if \(gender=female\), then \(P(Starbucks|female)=P(Google|female)=P(Ikea|female)=0.33\). Likewise, the values for location are assigned with 50% chance given the gender and workplace values of the node. This means that in the case of a node whose gender and location attributes are female and Ikea, then \(P(York| female \cap Ikea)=P(Leeds| female \cap Ikea)=0.5\).

Fig. 4.
figure 4

Attributes probability distribution.

Each attribute value represents a group. Therefore, our model consists of 7 groups (i.e. male, female, Starbucks, Google, Ikea, Leeds and York) together with the corresponding group-openness factors between them. If we consider group-openness a symmetric relation between two groups s and t, then \(\varLambda _s^t=\varLambda _t^s\). This means that for 7 groups one must define \(C_{7,2}=\frac{7!}{2!(7-2)!}=21\) different group-openness factors. This information can be expressed through a group-openness matrix \(\varvec{\varLambda }_{7\,\times \,7}\) in which each cell represents a factor \(\varLambda _s^t\) as shown in Fig. 5. As one can observe, this matrix is symmetric and contains ones on its main diagonal. This is because \(\varLambda _s^t=\varLambda _t^s\) and, according to Eq. 1, the group-openness factor \(\varLambda _s^t\) is 1 when \(s=t\).

Fig. 5.
figure 5

Group-openness matrix.

Nodes are described in terms of one value per attribute and, consequently, belong to more than one group at the same time (e.g. a node whose gender is female, works in Ikea and lives in York, belongs to the groups female, Ikea and York respectively). Therefore, the total homophily factor between two nodes i and j depends on more than one group-openness factor. In other words, one should compute the homophily between i and j considering the group-openness factors of all possible combinations among the groups to which i and j belong. For instance, if i belongs to the group-set male, Starbucks and York, and j to the group-set female, Google and York, then one should consider \(\varLambda _{female}^{male}\), \(\varLambda _{Google}^{male}\), \(\varLambda _{York}^{male}\), \(\varLambda _{female}^{Starbucks}\), \(\varLambda _{Google}^{Starbucks}\), \(\varLambda _{York}^{Starbucks}\), \(\varLambda _{female}^{York}\), \(\varLambda _{Google}^{York}\), and \(\varLambda _{York}^{York}\). Consequently, the total homophily factor between two group-sets P and Q is defined as:

(5)

where P and Q are the groups to which nodes i and j belong, respectively. According to the definition above, the preferential attachment model described in Eqs. 3 and 4 can be re-defined. That is, the probability \(\varPi _i^{PQ}\) that a new node of group-set Q is linked to node i of group-set P is defined as:

(6)

where \(k_i^P\) is the degree of node i from group-set P, and \(\mathcal {H}{_P^Q}\) represents the total homophily factor between the group-set of the new node Q and the group-set of node i. Likewise, the probability \(\varPi _{ij}^{PQ}\) that node i of group-set P links to node j of group-set Q is defined as:

$$\begin{aligned} \varPi _{ij}^{PQ}&= \frac{k_{i}^{P}.k_{j}^{Q}.\mathcal {H}{_P^Q}}{\sum _{l} \sum _{m>l}k_{l}^{M}.k_{m}^{N}.\mathcal {H}{_M^N}} \end{aligned}$$
(7)

where \(k_i^P\) and \(k_j^Q\) are the degrees of node i and of node j respectively and \(\mathcal {H}{_P^Q}\) is the total homophily factor between group-sets P and Q.

1.2 B.2 Simulation Set-up

Our simulation model for ego-networks comprises Eqs. 6 and 7 together with the attribute probability distribution introduced in Appendix B. According to the Pew Research Center, the average size of an ego-network in Facebook was of 338 friends/nodes in 2014Footnote 2 (this number scaled up to 425 in a study focused on adolescents and online privacy in 2013 [12]). Therefore, we will consider an average ego-network consisting of 500 nodes and execute our simulation for \(F=500\) time units. The initial set-up for all simulations consists of an network of two nodes (\(N(0)=2\)) and one link (\(K=1\)). It is also assumed that only one node enters the network at time t (\(b=2\)) and generates only one link (\(\beta =1\)). On the other hand, the number of new links between existing nodes at time t is given by \(\lfloor N(t)\cdot \alpha \rfloor \) where \(0 \le \alpha < 1\) and N(t) is the number of links at t. In order to preserve the degree distribution in our simulated networks we adopt \(\alpha =0.001\) as suggested by Kim et al. [9].

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Díaz Ferreyra, N.E., Hecking, T., Ulrich Hoppe, H., Heisel, M. (2018). Access-Control Prediction in Social Network Sites: Examining the Role of Homophily. In: Staab, S., Koltsova, O., Ignatov, D. (eds) Social Informatics. SocInfo 2018. Lecture Notes in Computer Science(), vol 11186. Springer, Cham. https://doi.org/10.1007/978-3-030-01159-8_6

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-01159-8_6

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-01158-1

  • Online ISBN: 978-3-030-01159-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics