Skip to main content
Log in

MMUIL: enhancing multi-platform user identity linkage with multi-information

  • Regular Paper
  • Published:
Knowledge and Information Systems Aims and scope Submit manuscript

Abstract

User identity linkage (UIL) aims to link identities belonging to the same individual across various platforms. While numerous methods have been proposed for paired or multiple platforms, UIL is still a non-trivial task due to the following challenges. (1) How to alleviate the sparsity and incompleteness of user information from different platforms? (2) How can UIL approaches achieve high effectiveness while still maintaining low complexity in multi-platform scenarios? In light of these challenges, we propose MMUIL (enhancing multi-platform user identity linkage with multi-information), a novel model excelling in high effectiveness while still maintaining low complexity. The model consists of a Multi-Information Embedding (MIE) module and a Partially Shared Adversarial Learning (PSAL) module. Specifically, for the first challenge, MIE simultaneously considers the token sequence semantics in usernames and the structural information of multi-type networks (i.e., homogeneous and heterogeneous networks). To address the second challenge, the adversarial learning-based PSAL decreases the complexity with shared partial parameters (i.e., shared generators). Meanwhile, to enhance the model’s effectiveness, PSAL exploits an attention mechanism to mitigate the disadvantages of shared partial parameters, such as partial information loss and noise introduction, while integrating the above multi-information intensively. The extensive experiments conducted on two real-world datasets demonstrate that our proposed model MMUIL significantly outperforms the state-of-the-art methods.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Algorithm 1
Fig. 4
Fig. 5

Similar content being viewed by others

Notes

  1. https://github.com/wxqzhou/MMUIL.

  2. http://dblp.uni-trier.de.

  3. http://dblp.uni-trier.de/xml/.

References

  1. Zhou J, Fan J (2019) TransLink: user identity linkage across heterogeneous social networks via translating embeddings. In: IEEE INFOCOM, pp 2116–2124

  2. Zafarani R, Liu H (2009) Connecting corresponding identities across communities. In: Proceedings of the international AAAI conference on web and social media, pp 354–357

  3. Li C, Wang S, Wang H, Liang Y, Yu PS, Li Z, Wang W (2019) Partially shared adversarial learning for semi-supervised multi-platform user identity linkage. In: Proceedings of the 28th ACM international conference on information and knowledge management, pp 249–258

  4. Chen B, Chen X (2022) MAUIL: multilevel attribute embedding for semisupervised user identity linkage. Inf Sci 593(2022):527–545

    Article  Google Scholar 

  5. Mu X, Zhu F, Lim E, Xiao J, Wang J, Zhou Z (2016) User identity linkage by latent user space modelling. In: Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp 1775–1784

  6. Li Y, Peng Y, Zhang Z, Yin H, Xu Q (2019) Matching user accounts across social networks based on username and display name. World Wide Web J 22(3):1075–1097

    Article  Google Scholar 

  7. Riederer CJ, Kim Y, Chaintreau A, Korula N, Lattanzi S (2016) Linking users across domains with location data: Theory and validation. In: Proceedings of the 25th international conference on world wide web, pp 707–719

  8. Nie Y, Jia Y, Li S, Zhu X, Li A, Zhou B (2016) Identifying users across social networks based on dynamic core interests. Neurocomputing 210(2016):107–115

    Article  Google Scholar 

  9. Chen W, Wang W, Yin H, Fang J, Zhao L (2020) User account linkage across multiple platforms with location data. J Comput Sci Technol 35(4):751–768

    Article  Google Scholar 

  10. Xie W, Mu X, Lee RK, Zhu F, Lim E (2018) Unsupervised user identity linkage via factoid embedding. In: 2018 IEEE international conference on data mining, pp 1338–1343

  11. Zhou F, Liu L, Zhang K, Trajcevski G, Wu J, Zhong T (2018) DeepLink: a deep learning approach for user identity linkage. In: IEEE INFOCOM, pp 1313–1321

  12. Zhou X, Liang X, Zhang H, Ma Y (2016) Cross-platform identification of anonymous identical users in multiple social media networks. IEEE Trans Knowl Data Eng 28(2):411–424

    Article  Google Scholar 

  13. Liu L, Cheung WK, Li X, Liao L (2016) Aligning users across social networks using network embedding. In: International joint conference on artificial intelligence, pp 1774–1780

  14. Le QV, Mikolov T (2014) Distributed representations of sentences and documents. In: International conference on machine learning, pp 1188–1196

  15. Perozzi B, Al-Rfou R, Skiena S (2014) DeepWalk: online learning of social representations. In: Proceedings of the 20th ACM SIGKDD international conference on knowledge discovery and data mining, pp 701–710

  16. Mikolov T, Chen K, Corrado G, Dean J (2013) Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781

  17. Zhao J, Wang X, Shi C, Liu Z, Ye Y (2020) Network schema preserving heterogeneous information network embedding. In: International joint conference on artificial intelligence, pp 1366–1372

  18. Guo M, Liu Z, Mu T, Hu S (2021) Beyond self-attention: External attention using two linear layers for visual tasks. CoRR arXiv: 2105.02358

  19. Arjovsky M, Chintala S, Bottou L (2017) Wasserstein GAN. CoRR arXiv:1701.07875

  20. Hong C, Yu J, Zhang J, Jin X, Lee K-H (2018) Multimodal face-pose estimation with multitask manifold deep learning. IEEE Trans Ind Inf 15(7):3952–3961

    Article  Google Scholar 

  21. Yu J, Tao D, Wang M, Rui Y (2014) Learning to rank using user clicks and visual features for image retrieval. IEEE Trans Cybernet 45(4):767–779

    Article  Google Scholar 

  22. Hong C, Yu J, Wan J, Tao D, Wang M (2015) Multimodal deep autoencoder for human pose recovery. IEEE Trans Image Process 24(12):5659–5670

    Article  MathSciNet  Google Scholar 

  23. Yu J, Tan M, Zhang H, Rui Y, Tao D (2019) Hierarchical deep click feature prediction for fine-grained image recognition. IEEE Trans Pattern Anal Mach Intell 44(2):563–578

    Article  Google Scholar 

  24. Hong C, Yu J, Tao D, Wang M (2014) Image-based three-dimensional human pose recovery by multiview locality-sensitive sparse retrieval. IEEE Trans Ind Electron 62(6):3742–3751

    Google Scholar 

  25. Wang M, Chen W, Xu J, Zhao P, Zhao L (2020) User profile linkage across multiple social platforms. In: Web information systems engineering, pp 125–140

  26. Ren Y, Meng L, Zhang J (2020) Scalable heterogeneous social network alignment through synergistic graph partition. In: Proceedings of the 31st ACM conference on hypertext and social media, pp 261–270

  27. Man T, Shen H, Liu S, Jin X, Cheng X (2016) Predict anchor links across social networks via an embedding approach. In: International joint conference on artificial intelligence, pp 1823–1829

  28. Gao H, Wang Y, Shao J, Shen H, Cheng X (2021) UGCLink: user identity linkage by modeling user generated contents with knowledge distillation. In: 2021 IEEE international conference on big data, pp 607–613

  29. Chen X, Song X, Peng G, Feng S, Nie L (2021) Adversarial-enhanced hybrid graph network for user identity linkage. In: Proceedings of the 44th international ACM SIGIR conference on research and development in information retrieval, pp 1084–1093

  30. Xue H, Sun B, Mao W, Lin J, Zhang Y, Liu X, Yang X, Chen Z (2022) Spatial density-based user identity linkage across social networks. In: 2022 IEEE international conference on big data, pp 656–664

  31. Ma X, Ding F, Peng K, Yang Y, Wang C (2023) CP-Link: exploiting continuous spatio-temporal check-in patterns for user identity linkage. IEEE Trans Mob Comput 22(8):4594–4606

    Article  Google Scholar 

  32. Shao J, Wang Y, Gao H, Shi B, Shen H, Cheng X (2023) AsyLink: user identity linkage from text to geo-location via sparse labeled data. Neurocomputing 515(2023):174–184

    Article  Google Scholar 

  33. Liu J, Zhang F, Song X, Song Y-I, Lin C-Y, Hon H-W (2013) What’s in a name?: An unsupervised approach to link users across communities. In: Proceedings of the 6th ACM international conference on web search and data mining, pp 495–504

  34. Zafarani R, Liu H (2013) Connecting users across social media sites: a behavioral-modeling approach. In: Proceedings of the 19th ACM SIGKDD international conference on knowledge discovery and data mining, pp 41–49

  35. Zhang H, Kan M, Liu Y, Ma S (2014) Online social network profile linkage based on cost-sensitive feature acquisition. In: Proceedings of the 3rd national conference on social media processing, pp 117–128

  36. Zhang Y, Tang J, Yang Z, Pei J, Yu PS (2015) COSNET: connecting heterogeneous social networks with local and global consistency. In: Proceedings of the 21th ACM SIGKDD international conference on knowledge discovery and data mining, pp 1485–1494

  37. Kong X, Zhang J, Yu PS (2013) Inferring anchor links across multiple heterogeneous social networks. In: Conference on information and knowledge management, pp 179–188

  38. Gao M, Lim E, Lo D, Zhu F, Prasetyo PK, Zhou A (2015) CNL: Collective network linkage across heterogeneous social platforms. In: 2015 IEEE international conference on data mining, pp 757–762

  39. Tu C, Liu H, Liu Z, Sun M (2017) CANE: context-aware network embedding for relation modeling. In: Proceedings of the 55th annual meeting of the association for computational linguistics, pp 1722–1731

  40. Zheng C, Pan L, Wu P (2022) JORA: weakly supervised user identity linkage via jointly learning to represent and align. IEEE Trans Neural Netw Learn Syst 1(2022):1–12

    Google Scholar 

  41. Long M, Chen S, Du X, Wang J (2023) DegUIL: degree-aware graph neural networks for long-tailed user identity linkage. arXiv preprint arXiv:2308.05322

Download references

Acknowledgements

This work is supported by the National Natural Science Foundation of China No. 62272332 and the Major Program of the Natural Science Foundation of Jiangsu Higher Education Institutions of China No. 22KJA520006.

Author information

Authors and Affiliations

Authors

Contributions

Qian Zhou and Yihan Hei wrote the main manuscript text, with contributions from other reviewers who validated the content. The manuscript was reviewed by all authors.

Corresponding authors

Correspondence to Wei Chen or Lei Zhao.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Zhou, Q., Hei, Y., Chen, W. et al. MMUIL: enhancing multi-platform user identity linkage with multi-information. Knowl Inf Syst (2024). https://doi.org/10.1007/s10115-024-02088-5

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s10115-024-02088-5

Keywords

Navigation