Link Prediction in Social Networks: An Edge Creation History-Retrieval Based Method that Combines Topological and Contextual Data

Cavalcante, Argus A. B.; Justel, Claudia M.; Goldschmidt, Ronaldo R.

doi:10.1007/978-3-030-61380-8_26

Argus A. B. Cavalcante¹⁰,
Claudia M. Justel¹⁰ &
Ronaldo R. Goldschmidt¹⁰

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 12320))

Included in the following conference series:

Brazilian Conference on Intelligent Systems

864 Accesses

Abstract

Link prediction is an online social network (OSN) analysis task whose objective is to identify pairs of non-connected nodes with a high probability of getting connected in the near future. Recently, proposed link prediction methods consider topological data from OSN past states (i.e., snapshots that depict the network structure at certain moments in the past). Although past states-based methods retrieve information that describes how the network’s topology was at the events of link emergence (i.e., moments when the existing edges were created), they do not take into account contextual data concerning those events. Hence, they take the chance to disregard information about the circumstances that may have influenced the appearance of old edges, and that could be useful to predict the creation of new ones. To remedy this issue, this work extends a past states-based method to retrieve both topological and contextual data from the events of edge emergence and combine them to predict links. The extended method presented promising results on experimental data. Overall, it overcame the original method in five different scenarios from five co-authorship OSN frequently used for link prediction method evaluation.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 99.00; Price excludes VAT (USA)

Softcover Book: USD 129.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
A G graph is said: (a) homogeneous if, and only if, G has only one type of node and one type of edge; and (b) has attributes if, and only if, G contains attributes in its nodes and/or in its edges.
2.
Prototype code is available at: https://gitlab.com/arguscavalcante/link_pred.

References

Adamic, L.A., Adar, E.: Friends and neighbors on the web. Soc. Netw. 25(3), 211–230 (2003)
Article Google Scholar
Ahmed, N.M., Chen, L.: An efficient algorithm for link prediction in temporal uncertain social networks. Inf. Sci. 331(C), 120–136 (2016)
Article MathSciNet Google Scholar
Cavalcante, A.A., Muniz, C.P., Goldschmidt, R.R.: Context-based time score: an effective similarity function for link prediction in social networks. In: Proceedings of the 24th Brazilian Symposium on Multimedia and the Web, WebMedia 2018, pp. 339–346. Association for Computing Machinery, New York (2018)
Google Scholar
Cheng, J., Adamic, L., Dow, P.A., Kleinberg, J.M., Leskovec, J.: Can cascades be predicted? In: Proceedings of the 23rd International Conference on World Wide Web, WWW 2014, pp. 925–936. ACM, New York (2014)
Google Scholar
Damasceno, F.F., Veras, M.B., Mesquita, D.P., Gomes, J.P., de Brito, C.E.: Shrinkage k-means: a clustering algorithm based on the James-Stein estimator. In: 2016 5th Brazilian Conference on Intelligent Systems (BRACIS). IEEE (2016)
Google Scholar
Florentino, E., Cavalcante, A., Goldschmidt, R.: Um método baseado na evolução dos dados topológicos para a predição de links em redes sociais. In: SBSI 2019 (2019)
Google Scholar
Florentino, E., Cavalcante, A., Goldschmidt, R.: An edge creation history retrieval based method to predict links in social networks. Knowl.-Based Syst. 205, 106268 (2020)
Article Google Scholar
Fortunato, S.: Community detection in graphs. Phys. Rep. 486(3) (2010)
Google Scholar
Harris, Z.S.: Distributional structure. Word 10(2–3), 146–162 (1954)
Article Google Scholar
Hasan, M., Chaoji, V., Salem, S., Zaki, M.: Link prediction using supervised learning. In: Proceedings of SDM 2006 Workshop on Link Analysis, Counterterrorism and Security, January 2006
Google Scholar
Hasan, M.A., Zaki, M.J.: A Survey of Link Prediction in Social Networks, pp. 243–275. Springer US, Boston (2011)
Google Scholar
Kohavi, R.: A study of cross-validation and bootstrap for accuracy estimation and model selection. In: Proceedings of the 14th International Joint Conference on Artificial Intelligence, IJCAI 1995, vol. 2, pp. 1137–1143. Morgan Kaufmann Publishers Inc., San Francisco (1995)
Google Scholar
Kumar, R., Novak, J., Tomkins, A.: Structure and evolution of online social networks, pp. 337–357. Springer, New York (2010)
Google Scholar
Laishram, R., Mehrotra, K., Mohan, C.K.: Link prediction in social networks with edge aging. In: 2016 IEEE 28th International Conference on Tools with Artificial Intelligence (ICTAI), pp. 606–613, November 2016
Google Scholar
Le, Q., Mikolov, T.: Distributed representations of sentences and documents. In: Proceedings of the 31st International Conference on International Conference on Machine Learning, ICML 2014, vol. 32, pp. 3111–3119. JMLR.org (2014)
Google Scholar
Liben-Nowell, D., Kleinberg, J.: The link-prediction problem for social networks. J. Am. Soc. Inf. Sci. Technol. 58(7), 1019–1031 (2007)
Article Google Scholar
Lü, L., Zhou, T.: Link prediction in complex networks: a survey. Phys. A 390(6), 69:1–69:33 (2011)
Google Scholar
Mislove, A., Marcon, M., Gummadi, K.P., Druschel, P., Bhattacharjee, B.: Measurement and analysis of online social networks. In: Proceedings of the 7th ACM SIGCOMM Conference on Internet Measurement, IMC 2007, pp. 29–42. ACM, New York (2007)
Google Scholar
Munasinghe, L., Ichise, R.: Time score: a new feature for link prediction in social networks. IEICE Trans. Inf. Syst. E95.D, 821–828 (2012)
Article Google Scholar
Muniz, C.P., Goldschmidt, R., Choren, R.: Combining contextual, temporal and topological information for unsupervised link prediction in social networks. Knowl.-Based Syst. 156, 129–137 (2018)
Article Google Scholar
Negi, S., Chaudhury, S.: Link prediction in heterogeneous social networks. In: Proceedings of the 25th ACM International on Conference on Information and Knowledge Management, CIKM 2016, pp. 609–617. ACM, New York (2016)
Google Scholar
Nettleton, D.F.: Data mining of social networks represented as graphs. Comput. Sci. Rev. 7(Suppl. C), 1–34 (2013)
Article Google Scholar
Newman, M.: Clustering and preferential attachment in growing networks. Phys. Rev. E 64, 025102 (2001)
Article Google Scholar
Pecli, A., Cavalcanti, M.C., Goldschmidt, R.: Automatic feature selection for supervised learning in link prediction applications: a comparative study. Knowl. Inf. Syst. 56(1), 85–121 (2017). https://doi.org/10.1007/s10115-017-1121-6
Article Google Scholar
Popescul, A., Popescul, R., Ungar, L.H.: Statistical relational learning for link prediction (2003)
Google Scholar
Rummele, N., Ichise, R., Werthner, H.: Exploring supervised methods for temporal link prediction in heterogeneous social networks. In: Proceedings of the 24th International Conference on World Wide Web, WWW 2015 Companion, pp. 1363–1368. ACM, New York (2015)
Google Scholar
Salton, G.: Automatic Text Processing: The Transformation, Analysis, and Retrieval of Information by Computer. Addison-Wesley Longman Publishing Co., Inc., Boston (1989)
Google Scholar
Shalforoushan, S.H., Jalali, M.: Link prediction in social networks using Bayesian networks. In: 2015 The International Symposium on Artificial Intelligence and Signal Processing (AISP), pp. 246–250, March 2015
Google Scholar
Sharma, P.K., Rathore, S., Park, J.H.: Multilevel learning based modeling for link prediction and users’ consumption preference in online social networks. Future Gener. Comput. Syst. (2017)
Google Scholar
Sparck Jones, K.: A statistical interpretation of term specificity and its application in retrieval. In: Document Retrieval Systems, pp. 132–142. Taylor Graham Publishing, London (1988)
Google Scholar
Tasnadi, E., Berend, G.: Supervised prediction of social network links using implicit sources of information. In: Proceedings of the 24th International Conference on World Wide Web, WWW 2015 Companion, pp. 1117–1122. ACM, New York (2015)
Google Scholar
Wilcoxon, F.: Individual Comparisons by Ranking Methods. Bobbs-Merrill Reprint Series in the Social Sciences, S541. Bobbs-Merrill Company Incorporated (1945)
Google Scholar
Wu, J., Zhang, G., Ren, Y.: A balanced modularity maximization link prediction model in social networks. Inf. Process. Manage. 53(1), 295–307 (2017)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Departamento de Engenharia de Computação, Instituto Militar de Engenharia, Praça Gen. Tibúrcio, 80 - Urca, Rio de Janeiro, RJ, 22290-270, Brazil
Argus A. B. Cavalcante, Claudia M. Justel & Ronaldo R. Goldschmidt

Authors

Argus A. B. Cavalcante
View author publications
You can also search for this author in PubMed Google Scholar
Claudia M. Justel
View author publications
You can also search for this author in PubMed Google Scholar
Ronaldo R. Goldschmidt
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Argus A. B. Cavalcante .

Editor information

Editors and Affiliations

Federal University of São Carlos, São Carlos, Brazil
Ricardo Cerri
Federal University of ABC, Santo Andre, Brazil
Ronaldo C. Prati

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Cavalcante, A.A.B., Justel, C.M., Goldschmidt, R.R. (2020). Link Prediction in Social Networks: An Edge Creation History-Retrieval Based Method that Combines Topological and Contextual Data. In: Cerri, R., Prati, R.C. (eds) Intelligent Systems. BRACIS 2020. Lecture Notes in Computer Science(), vol 12320. Springer, Cham. https://doi.org/10.1007/978-3-030-61380-8_26

Download citation

DOI: https://doi.org/10.1007/978-3-030-61380-8_26
Published: 13 October 2020
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-61379-2
Online ISBN: 978-3-030-61380-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics