Abstract
Social link identification SIL, that is to identify accounts across different online social networks that belong to the same user, is an important task in social network applications. Most existing methods to solve this problem directly applied machine-learning classifiers on features extracted from user’s rich information. In practice, however, only some limited user information can be obtained because of privacy concerns. In addition, we observe the existing methods cannot handle huge amount of potential account pairs from different OSNs. In this paper, we propose an effective SIL method to address the above two challenges by expanding known anchor links (seed account pairs belonging to the same person). In particular, we leverage potentially useful information possessed by the existing anchor link, and then develop a local expansion model to identify new social links, which are taken as a generated anchor link to be used for iteratively identifying additional new social link. We evaluate our method on two most popular Chinese social networks. Experimental results show our proposed method achieves much better performance in terms of both the number of correct account pairs and efficiency.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Li, X.-L., Foo, C.S., Tew, K.L., Ng, S.-K.: Searching for rising stars in bibliography networks. In: Zhou, X., Yokota, H., Deng, K., Liu, Q. (eds.) DASFAA 2009. LNCS, vol. 5463, pp. 288–292. Springer, Heidelberg (2009)
Li, X.-L., Tan, A., Yu, P.S., Ng, S.-K.: ECODE: event-based community detection from social networks. In: Yu, J.X., Kim, M.H., Unland, R. (eds.) DASFAA 2011, Part I. LNCS, vol. 6587, pp. 22–37. Springer, Heidelberg (2011)
Carmagnola, F., Cena, F.: User identification for cross-system personalization. Inf. Sci. 179(1–2), 16–32 (2009)
Vosecky, J., Hong, D., Shen, V.Y.: User identification across multiple social networks. In: Proceedings of NDT 2009 (2009)
Zafarani, R., Liu, H.: Connecting corresponding identities across communities. In: Proceedings of ICWSM 2009 (2009)
Narayanan, A., Shmatikov, V.: De-anonymizing social networks. In: Proceedings of S&P (2009)
Liu, S., Wang, S., Zhu, F., Zhang, J., Krishnan, R.: HYDRA: large-scale social identity linkage via heterogeneous behavior modeling. In: Proceedings of SIGMOD 2014 (2014)
Zafarani, R., Liu, H.: Connecting users across social media sites: a behavioral-modeling approach. In: Proceedings of KDD 2013 (2013)
Jain, P., Kumaraguru, P.: Finding nemo: Searching and Resolving Identities of Users across Online Social Networks (2012). arXiv preprint arXiv:1212.6147
Jain, P., Kumaraguru, P., Joshi, A.: @i seek ‘fb.me’: identifying users across multiple online social networks. In: Proceedings of WWW 2013 (2013)
Malhotra, A., Totti, L., Meira, W., Kumaraguru, P., Almeida, V.: Studying user footprints in different online social networks. In: Proceedings of Advances in Social Networks Analysis and Mining, 2012 (2012)
Nunes, A., Calado, P., Martins, B.: Resolving user identities over social networks through supervised learning and rich similarity features. In: Proceedings of SAC 2012 (2012)
Zhang, H., Kan, M.-Y., Liu, Y., Ma, S.: Online social network profile linkage. In: Jaafar, A., Mohamad Ali, N., Mohd Noah, S.A., Smeaton, A.F., Bruza, P., Bakar, Z.A., Jamil, N., Sembok, T.M.T. (eds.) AIRS 2014. LNCS, vol. 8870, pp. 197–208. Springer, Heidelberg (2014)
Liu, J., Zhang, F., Song, X., Song, Y.I., Lin, C.Y., Hon, H.W.: What’s in a Name?: an unsupervised approach to link users across communities. In: Proceedings of WSDM 2013 (2013)
Goga, O.: Matching User Accounts across Online Social Networks: Methods and Applications. Ph.D. thesis, University Pierre and Marie CURIE (2014)
Iofciu, T., Fankhauser, P., Abel, F., Bischoff, K.: Identifying users across social tagging systems. In: Proceedings of ICWSM 2011 (2011)
Kong, X., Zhang, J., Yu, P.S.: Inferring anchor links across multiple heterogeneous social networks. In: Proceedings of CIKM 2013 (2013)
Anwar, T., Abulaish, M.: An MCL-based text mining approach for namesake disambiguation on the web. In: Proceedings of ICWI 2012 (2012)
Carmagnola, F., Osborne, F., Torre, I.: User data discovery and aggregation: the CS-UDD algorithm. Inf. Sci. 270(20), 41–72 (2014)
Goga, O., Lei, H., Krishnan, S., Friedland, G., Sommer, R., Teixeira, R.: Exploiting innocuous activity for correlating users across sites. In: Proceedings of WWW 2013 (2013)
Zhang, J.W., Yu, P.S., Zhou, Z.H.: Meta-path based multi-network collective link prediction. In: Proceedings of KDD 2014 (2014)
Li, J., Wang, G.A., Chen, H.: Identity matching using personal and social identity features. Inf. Syst. Front. 13(1), 101–113 (2011)
Iofciu, T., Fankhauser, P., Abel, F., Bischoff, K.: Identifying users across social tagging systems. In: Proceedings of AAAI Conference on Weblogs and Social Media, 2011 (2011)
Chen, Y., Zhuang, C., Cao, Q., Hui, P.: Understanding cross-site linking in online social networks. In: Proceedings of SNA-KDD 2014 (2014)
Gusfield, D.: Algorithms on Strings, Trees and Sequences: Computer Science and Computational Biology. Cambridge University Press, New York (1999)
Rapoport, A.: Spread of information through a population with socio-structural bias i: assumption of transitivity. Bull. Math. Biophys. 15(4), 523–533 (1953)
Zhao, X., Sala, A., Zheng, H., Zhao, B.: Efficient shortest paths on massive social graphs. In: Proceedings of CollaborateCom 2011 (2011)
Acknowledgments
This work was partially supported by grants from the National Natural Science Foundation of China (Grant No. U1533104, U1333109, 61301245, 61305107).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Zhang, Y., Wang, L., Li, X., Xiao, C. (2016). Social Identity Link Across Incomplete Social Information Sources Using Anchor Link Expansion. In: Bailey, J., Khan, L., Washio, T., Dobbie, G., Huang, J., Wang, R. (eds) Advances in Knowledge Discovery and Data Mining. PAKDD 2016. Lecture Notes in Computer Science(), vol 9651. Springer, Cham. https://doi.org/10.1007/978-3-319-31753-3_32
Download citation
DOI: https://doi.org/10.1007/978-3-319-31753-3_32
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-31752-6
Online ISBN: 978-3-319-31753-3
eBook Packages: Computer ScienceComputer Science (R0)