Abstract
User identity linkage (UIL) aims to link identities belonging to the same individual across various platforms. While numerous methods have been proposed for paired or multiple platforms, UIL is still a non-trivial task due to the following challenges. (1) How to alleviate the sparsity and incompleteness of user information from different platforms? (2) How can UIL approaches achieve high effectiveness while still maintaining low complexity in multi-platform scenarios? In light of these challenges, we propose MMUIL (enhancing multi-platform user identity linkage with multi-information), a novel model excelling in high effectiveness while still maintaining low complexity. The model consists of a Multi-Information Embedding (MIE) module and a Partially Shared Adversarial Learning (PSAL) module. Specifically, for the first challenge, MIE simultaneously considers the token sequence semantics in usernames and the structural information of multi-type networks (i.e., homogeneous and heterogeneous networks). To address the second challenge, the adversarial learning-based PSAL decreases the complexity with shared partial parameters (i.e., shared generators). Meanwhile, to enhance the model’s effectiveness, PSAL exploits an attention mechanism to mitigate the disadvantages of shared partial parameters, such as partial information loss and noise introduction, while integrating the above multi-information intensively. The extensive experiments conducted on two real-world datasets demonstrate that our proposed model MMUIL significantly outperforms the state-of-the-art methods.
Similar content being viewed by others
References
Zhou J, Fan J (2019) TransLink: user identity linkage across heterogeneous social networks via translating embeddings. In: IEEE INFOCOM, pp 2116–2124
Zafarani R, Liu H (2009) Connecting corresponding identities across communities. In: Proceedings of the international AAAI conference on web and social media, pp 354–357
Li C, Wang S, Wang H, Liang Y, Yu PS, Li Z, Wang W (2019) Partially shared adversarial learning for semi-supervised multi-platform user identity linkage. In: Proceedings of the 28th ACM international conference on information and knowledge management, pp 249–258
Chen B, Chen X (2022) MAUIL: multilevel attribute embedding for semisupervised user identity linkage. Inf Sci 593(2022):527–545
Mu X, Zhu F, Lim E, Xiao J, Wang J, Zhou Z (2016) User identity linkage by latent user space modelling. In: Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp 1775–1784
Li Y, Peng Y, Zhang Z, Yin H, Xu Q (2019) Matching user accounts across social networks based on username and display name. World Wide Web J 22(3):1075–1097
Riederer CJ, Kim Y, Chaintreau A, Korula N, Lattanzi S (2016) Linking users across domains with location data: Theory and validation. In: Proceedings of the 25th international conference on world wide web, pp 707–719
Nie Y, Jia Y, Li S, Zhu X, Li A, Zhou B (2016) Identifying users across social networks based on dynamic core interests. Neurocomputing 210(2016):107–115
Chen W, Wang W, Yin H, Fang J, Zhao L (2020) User account linkage across multiple platforms with location data. J Comput Sci Technol 35(4):751–768
Xie W, Mu X, Lee RK, Zhu F, Lim E (2018) Unsupervised user identity linkage via factoid embedding. In: 2018 IEEE international conference on data mining, pp 1338–1343
Zhou F, Liu L, Zhang K, Trajcevski G, Wu J, Zhong T (2018) DeepLink: a deep learning approach for user identity linkage. In: IEEE INFOCOM, pp 1313–1321
Zhou X, Liang X, Zhang H, Ma Y (2016) Cross-platform identification of anonymous identical users in multiple social media networks. IEEE Trans Knowl Data Eng 28(2):411–424
Liu L, Cheung WK, Li X, Liao L (2016) Aligning users across social networks using network embedding. In: International joint conference on artificial intelligence, pp 1774–1780
Le QV, Mikolov T (2014) Distributed representations of sentences and documents. In: International conference on machine learning, pp 1188–1196
Perozzi B, Al-Rfou R, Skiena S (2014) DeepWalk: online learning of social representations. In: Proceedings of the 20th ACM SIGKDD international conference on knowledge discovery and data mining, pp 701–710
Mikolov T, Chen K, Corrado G, Dean J (2013) Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781
Zhao J, Wang X, Shi C, Liu Z, Ye Y (2020) Network schema preserving heterogeneous information network embedding. In: International joint conference on artificial intelligence, pp 1366–1372
Guo M, Liu Z, Mu T, Hu S (2021) Beyond self-attention: External attention using two linear layers for visual tasks. CoRR arXiv: 2105.02358
Arjovsky M, Chintala S, Bottou L (2017) Wasserstein GAN. CoRR arXiv:1701.07875
Hong C, Yu J, Zhang J, Jin X, Lee K-H (2018) Multimodal face-pose estimation with multitask manifold deep learning. IEEE Trans Ind Inf 15(7):3952–3961
Yu J, Tao D, Wang M, Rui Y (2014) Learning to rank using user clicks and visual features for image retrieval. IEEE Trans Cybernet 45(4):767–779
Hong C, Yu J, Wan J, Tao D, Wang M (2015) Multimodal deep autoencoder for human pose recovery. IEEE Trans Image Process 24(12):5659–5670
Yu J, Tan M, Zhang H, Rui Y, Tao D (2019) Hierarchical deep click feature prediction for fine-grained image recognition. IEEE Trans Pattern Anal Mach Intell 44(2):563–578
Hong C, Yu J, Tao D, Wang M (2014) Image-based three-dimensional human pose recovery by multiview locality-sensitive sparse retrieval. IEEE Trans Ind Electron 62(6):3742–3751
Wang M, Chen W, Xu J, Zhao P, Zhao L (2020) User profile linkage across multiple social platforms. In: Web information systems engineering, pp 125–140
Ren Y, Meng L, Zhang J (2020) Scalable heterogeneous social network alignment through synergistic graph partition. In: Proceedings of the 31st ACM conference on hypertext and social media, pp 261–270
Man T, Shen H, Liu S, Jin X, Cheng X (2016) Predict anchor links across social networks via an embedding approach. In: International joint conference on artificial intelligence, pp 1823–1829
Gao H, Wang Y, Shao J, Shen H, Cheng X (2021) UGCLink: user identity linkage by modeling user generated contents with knowledge distillation. In: 2021 IEEE international conference on big data, pp 607–613
Chen X, Song X, Peng G, Feng S, Nie L (2021) Adversarial-enhanced hybrid graph network for user identity linkage. In: Proceedings of the 44th international ACM SIGIR conference on research and development in information retrieval, pp 1084–1093
Xue H, Sun B, Mao W, Lin J, Zhang Y, Liu X, Yang X, Chen Z (2022) Spatial density-based user identity linkage across social networks. In: 2022 IEEE international conference on big data, pp 656–664
Ma X, Ding F, Peng K, Yang Y, Wang C (2023) CP-Link: exploiting continuous spatio-temporal check-in patterns for user identity linkage. IEEE Trans Mob Comput 22(8):4594–4606
Shao J, Wang Y, Gao H, Shi B, Shen H, Cheng X (2023) AsyLink: user identity linkage from text to geo-location via sparse labeled data. Neurocomputing 515(2023):174–184
Liu J, Zhang F, Song X, Song Y-I, Lin C-Y, Hon H-W (2013) What’s in a name?: An unsupervised approach to link users across communities. In: Proceedings of the 6th ACM international conference on web search and data mining, pp 495–504
Zafarani R, Liu H (2013) Connecting users across social media sites: a behavioral-modeling approach. In: Proceedings of the 19th ACM SIGKDD international conference on knowledge discovery and data mining, pp 41–49
Zhang H, Kan M, Liu Y, Ma S (2014) Online social network profile linkage based on cost-sensitive feature acquisition. In: Proceedings of the 3rd national conference on social media processing, pp 117–128
Zhang Y, Tang J, Yang Z, Pei J, Yu PS (2015) COSNET: connecting heterogeneous social networks with local and global consistency. In: Proceedings of the 21th ACM SIGKDD international conference on knowledge discovery and data mining, pp 1485–1494
Kong X, Zhang J, Yu PS (2013) Inferring anchor links across multiple heterogeneous social networks. In: Conference on information and knowledge management, pp 179–188
Gao M, Lim E, Lo D, Zhu F, Prasetyo PK, Zhou A (2015) CNL: Collective network linkage across heterogeneous social platforms. In: 2015 IEEE international conference on data mining, pp 757–762
Tu C, Liu H, Liu Z, Sun M (2017) CANE: context-aware network embedding for relation modeling. In: Proceedings of the 55th annual meeting of the association for computational linguistics, pp 1722–1731
Zheng C, Pan L, Wu P (2022) JORA: weakly supervised user identity linkage via jointly learning to represent and align. IEEE Trans Neural Netw Learn Syst 1(2022):1–12
Long M, Chen S, Du X, Wang J (2023) DegUIL: degree-aware graph neural networks for long-tailed user identity linkage. arXiv preprint arXiv:2308.05322
Acknowledgements
This work is supported by the National Natural Science Foundation of China No. 62272332 and the Major Program of the Natural Science Foundation of Jiangsu Higher Education Institutions of China No. 22KJA520006.
Author information
Authors and Affiliations
Contributions
Qian Zhou and Yihan Hei wrote the main manuscript text, with contributions from other reviewers who validated the content. The manuscript was reviewed by all authors.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Zhou, Q., Hei, Y., Chen, W. et al. MMUIL: enhancing multi-platform user identity linkage with multi-information. Knowl Inf Syst (2024). https://doi.org/10.1007/s10115-024-02088-5
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1007/s10115-024-02088-5