Abstract
Sources of complementary information are connected when we link user accounts belonging to the same user across different platforms or devices. The expanded information promotes the development of a wide range of applications, such as cross-platform prediction, cross-platform recommendation, and advertisement. Due to the significance of user account linkage and the widespread popularization of GPS-enabled mobile devices, there are increasing research studies on linking user account with spatio-temporal data across location-aware social networks. Being different from most existing studies in this domain that only focus on the effectiveness, we propose a novel framework entitled HFUL (A Hybrid Framework for User Account Linkage across Location-Aware Social Networks), where efficiency, effectiveness, scalability, robustness, and application of user account linkage are considered. Specifically, to improve the efficiency, we develop a comprehensive index structure from the spatio-temporal perspective, and design novel pruning strategies to reduce the search space. To improve the effectiveness, a kernel density estimation-based method has been proposed to alleviate the data sparsity problem in measuring users’ similarities. Additionally, we investigate the application of HFUL in terms of user prediction, time prediction, and location prediction. The extensive experiments conducted on three real-world datasets demonstrate the superiority of HFUL in terms of effectiveness, efficiency, scalability, robustness, and application compared with the state-of-the-art methods.
Similar content being viewed by others
References
Gao, H., Liu, H.: Data analysis on location-based social networks, in Mobile social networking, 2014, pp. 165–194
Pham, H., Shahabi, C., Liu, Y.: Ebm: an entropy-based model to infer social strength from spatiotemporal data, in SIGMOD, 2013, pp. 265–276
Lichman, M., Smyth, P.: Modeling human location data with mixtures of kernel densities, in KDD, 2014, pp. 35–44
Riederer, C., Kim, Y., Chaintreau, A., Korula, N., Lattanzi, S.: Linking users across domains with location data: Theory and validation, in WWW, 2016, pp. 707–719
Noulas, A., Scellato, S., Mascolo, C., Pontil, M.: An empirical study of geographic user activity patterns in foursquare. ICWSM 11, 70–573 (2011)
Wang, W., Yin, H., Sadiq, S., Chen, L., Xie, M., Zhou, X.: Spore: a sequential personalized spatial item recommender system, in ICDE, 2016, pp. 954–965
Zhang, J., Kong, X., Yu, P. S.: Transferring heterogeneous links across location-based social networks, in WSDM, 2014, pp. 303–312
Li, Z., Ding, B., Han, J., Kays, R., Nye, P.: Mining periodic behaviors for moving objects, in KDD, 2010, pp. 1099–1108
Yuan, Q., Cong, G., Ma, Z., Sun, A., Magnenat-Thalmann, N.: Who, where, when and what: discover spatio-temporal topics for twitter users, in KDD, 2013, pp. 605–613
Wand, M.P.: Fast computation of multivariate kernel estimators. J. Comput. Graph. Stat. 3(4), 433–445 (1994)
Lopez-Novoa, U., Sáenz, J., Mendiburu, A., Miguel-Alonso, J.: An efficient implementation of kernel density estimation for multi-core and many-core architectures. J. High Perform. Comput. Appl. 29(3), 331–347 (2015)
Lopez-Novoa, U., Mendiburu, A., Miguel-Alonso, J.: Kernel density estimation in accelerators. J. Supercomput. 72(2), 545–566 (2016)
Chen, W., Yin, H., Wang, W., Zhao, L., Hua, W., Zhou, X.: Exploiting spatio-temporal user behaviors for user linkage, in CIKM, 2017
Chen, W., Yin, H., Wang, W., Zhao, L., Zhou, X.: Effective and efficient user account linkage across location based social networks, in ICDE, 2018, pp. 1085–1096
Shu, K., Wang, S., Tang, J., Zafarani, R., Liu, H.: User identity linkage across online social networks: a review. SIGKDD Explor. 18(2), 5–17 (2017)
Huynh, T. T., Tong, V. V., Nguyen, T. T., Yin, H., Weidlich, M., Hung, N. Q. V.: Adaptive network alignment with unsupervised and multi-order convolutional networks, in ICDE, 2020, pp. 85–96
Zhang, Y., Yin, H., Huang, Z., Du, X., Yang, G., Lian, D.: Discrete deep learning for fast content-aware recommendation, in WSDM, 2018, pp. 717–726
Wang, Y., Feng, C., Chen, L., Yin, H., Guo, C., Chu, Y.: User identity linkage across social networks via linked heterogeneous network embedding. World Wide Web 22(6), 2611–2632 (2019)
Chen, H., Yin, H., Sun, X., Chen, T., Gabrys, B., Musial, K.: Multi-level graph convolutional networks for cross-platform anchor link prediction, in KDD, 2020, pp. 1503–1511
Zafarani, R., Liu, H.: Connecting corresponding identities across communities. ICWSM 9, 354–357 (2009)
Vosecky, J., Hong, D., Shen, V.Y.: User identification across social networks using the web profile and friend network. J. Web Appl. 2(1), 23–34 (2010)
Iofciu, T., Fankhauser, P., Abel, F., Bischoff, K.: Identifying users across social tagging systems. in ICWSM, 2011
Liu, J., Zhang, F., Song, X., Song, Y.-I., Lin, C.-Y., Hon, H.-W.: What’s in a name?: an unsupervised approach to link users across communities, in WSDM, 2013, pp. 495–504
R. Zafarani and H. Liu, Connecting users across social media sites: a behavioral-modeling approach, in KDD, 2013, pp. 41–49
Peled, O., Fire, M., Rokach, L., Elovici, Y.: Entity matching in online social networks, in Social Computing, 2013, pp. 339–344
Liu, S., Wang, S., Zhu, F., Zhang, J., Krishnan, R.: Hydra: Large-scale social identity linkage via heterogeneous behavior modeling, in KDD, 2014, pp. 51–62
Shen, Y., Jin, H.: Controllable information sharing for user accounts linkage across multiple online social networks, in CIKM, 2014, pp. 381–390
Mu, X., Zhu, F., Lim, E.-P., Xiao, J., Wang, J., Zhou, Z.-H.: User identity linkage by latent user space modelling, in KDD, 2016, pp. 1775–1784
Zhou, F., Liu, L., Zhang, K., Trajcevski, G., Wu, J., Zhong, T.: Deeplink: a deep learning approach for user identity linkage, in INFOCOM, 2018, pp. 1313–1321
Xie, W., Mu, X., Lee, R. K.-W., Zhu, F., Lim, E.-P.: Unsupervised user identity linkage via factoid embedding, in ICDM, 2018, pp. 1338–1343
Liu, L., Zhang, Y., Fu, S., Zhong, F., Hu, J., Zhang, P.: Abne: an attention-based network embedding for user alignment across social networks. IEEE Access 7, 595–605 (2019)
Zhou, J., Fan, J.: Translink: user identity linkage across heterogeneous social networks via translating embeddings, in INFOCOM, 2019, pp. 2116–2124
Fu, S., Wang, G., Xia, S., Liu, L.: Deep multi-granularity graph embedding for user identity linkage across social networks. Knowl. Based Syst. 193, 105301105301 (2020)
Han, X., Wang, L., Xu, L., Zhang, S.: Social media account linkage using user-generated geo-location data, in ISI, 2016, pp. 157–162
Gao, X., Ji, W., Li, Y., Deng, Y., Dong, W.: User identification with spatio-temporal awareness across social networks, in CIKM, 2018, pp. 1831–1834
Jin, F., Hua, W., Xu, J., Zhou, X.: Moving object linking based on historical trace, in ICDE, 2019, pp. 1058–1069
Zhang, W., Lai, X., Wang, J.: Social link inference via multiview matching network from spatiotemporal trajectories, IEEE Transactions on Neural Networks and Learning Systems, pp. 1–12, 2020
Scott, D.W., Sheather, S.J.: Kernel density estimation with binned data. Commun. Stat.-Theory Methods 14(6), 1353–1359 (1985)
Silverman, B. W.: Density estimation for statistics and data analysis, 1986, vol. 26
Zhang, J.-D., Chow, C.-Y.: igslr: personalized geo-social location recommendation: a kernel density estimation approach, in GIS, 2013, pp. 334–343
Hulden, M., Silfverberg, M., Francom, J.: Kernel density estimation for text-based geolocation. in AAAI, 2015, pp. 145–150
Backurs, A., Indyk, P., Wagner, T.: Space and time efficient kernel density estimation in high dimensions, in NeurIPS, 2019, pp. 15 773–15 782
Hohl, A., Chen, P.: Spatiotemporal simulation: local ripley’s K function parameterizes adaptive kernel density estimation, in SIGSPATIAL, 2019, pp. 16–23
Hasan, S., Zhan, X., Ukkusuri, S. V.: Understanding urban human activity and mobility patterns using large-scale location-based data from online social media, In Proceedings of the 2nd ACM SIGKDD international workshop on urban computing, 2013
Zhang, P., Deng, M., Van de Weghe, N.: Clustering spatio-temporal trajectories based on kernel density estimation, in ICCSA, 2014, pp. 298–311
Romano, B., Jiang, Z.: Visualizing traffic accident hotspots based on spatial-temporal network kernel density estimation, in SIGSPATIAL, 2017, pp. 1–4
Wang, Z., Liu, L., Zhou, H., Lan, M.: How is the confidentiality of crime locations affected by parameters in kernel density estimation? Int. J. Geo-Inf. 8(12), 544 (2019)
Zhou, Z., Lan, R., Rui, Y., Zhou, J., Dong, L., Cheng, R., Cai, X.: A new acoustic emission source location method using tri-variate kernel density estimator. IEEE Access 7, 379–388 (2019)
Coleman, B., Shrivastava, A.: Sub-linear races sketches for approximate kernel density estimation on streaming data, in WWW, 2020, pp. 1739–1749
Rodriguez, A., Laio, A.: Clustering by fast search and find of density peaks. Science 344(6191), 1492–1496 (2014)
Albanese, A., Pal, S.K., Petrosino, A.: Rough sets, kernel set, and spatiotemporal outlier detection. TKDE 26(1), 194–207 (2014)
Duggimpudi, M.B., Abbady, S., Chen, J., Raghavan, V.V.: Spatio-temporal outlier detection algorithms based on computing behavioral outlierness factor. Data Knowl. Eng. 122, 1–24 (2019)
Begum, N., Ulanova, L., Wang, J., Keogh, E. J.: Accelerating dynamic time warping clustering with a novel admissible pruning strategy, in KDD, 2015, pp. 49–58
Cao, W., Wu, Z., Wang, D., Li, J., Wu, H.: Automatic user identification method across heterogeneous mobility data sources, in ICDE, 2016, pp. 978–989
Y. Theodoridis, J. R. O. Silva, and M. A. Nascimento, On the generation of spatiotemporal datasets, in SSD, vol. 1651, 1999, pp. 147–164
Acknowledgements
This work is supported by Australian Research Council Future Fellowship (Grant No. FT210100624) and Discovery Project (Grant No. DP190101985). It is partially supported by the National Natural Science Foundation of China under Grant No. 61902270 and No. 62072125, and the Major Program of the Natural Science Foundation of Jiangsu Higher Education Institutions of China under Grant No. 19KJA610002.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Below is the link to the electronic supplementary material.
Rights and permissions
About this article
Cite this article
Chen, W., Wang, W., Yin, H. et al. HFUL: a hybrid framework for user account linkage across location-aware social networks. The VLDB Journal 32, 1–22 (2023). https://doi.org/10.1007/s00778-022-00730-8
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00778-022-00730-8