Skip to main content

Link Prediction in Social Networks: An Edge Creation History-Retrieval Based Method that Combines Topological and Contextual Data

  • Conference paper
  • First Online:
Intelligent Systems (BRACIS 2020)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 12320))

Included in the following conference series:

  • 864 Accesses

Abstract

Link prediction is an online social network (OSN) analysis task whose objective is to identify pairs of non-connected nodes with a high probability of getting connected in the near future. Recently, proposed link prediction methods consider topological data from OSN past states (i.e., snapshots that depict the network structure at certain moments in the past). Although past states-based methods retrieve information that describes how the network’s topology was at the events of link emergence (i.e., moments when the existing edges were created), they do not take into account contextual data concerning those events. Hence, they take the chance to disregard information about the circumstances that may have influenced the appearance of old edges, and that could be useful to predict the creation of new ones. To remedy this issue, this work extends a past states-based method to retrieve both topological and contextual data from the events of edge emergence and combine them to predict links. The extended method presented promising results on experimental data. Overall, it overcame the original method in five different scenarios from five co-authorship OSN frequently used for link prediction method evaluation.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 99.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 129.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    A G graph is said: (a) homogeneous if, and only if, G has only one type of node and one type of edge; and (b) has attributes if, and only if, G contains attributes in its nodes and/or in its edges.

  2. 2.

    Prototype code is available at: https://gitlab.com/arguscavalcante/link_pred.

References

  1. Adamic, L.A., Adar, E.: Friends and neighbors on the web. Soc. Netw. 25(3), 211–230 (2003)

    Article  Google Scholar 

  2. Ahmed, N.M., Chen, L.: An efficient algorithm for link prediction in temporal uncertain social networks. Inf. Sci. 331(C), 120–136 (2016)

    Article  MathSciNet  Google Scholar 

  3. Cavalcante, A.A., Muniz, C.P., Goldschmidt, R.R.: Context-based time score: an effective similarity function for link prediction in social networks. In: Proceedings of the 24th Brazilian Symposium on Multimedia and the Web, WebMedia 2018, pp. 339–346. Association for Computing Machinery, New York (2018)

    Google Scholar 

  4. Cheng, J., Adamic, L., Dow, P.A., Kleinberg, J.M., Leskovec, J.: Can cascades be predicted? In: Proceedings of the 23rd International Conference on World Wide Web, WWW 2014, pp. 925–936. ACM, New York (2014)

    Google Scholar 

  5. Damasceno, F.F., Veras, M.B., Mesquita, D.P., Gomes, J.P., de Brito, C.E.: Shrinkage k-means: a clustering algorithm based on the James-Stein estimator. In: 2016 5th Brazilian Conference on Intelligent Systems (BRACIS). IEEE (2016)

    Google Scholar 

  6. Florentino, E., Cavalcante, A., Goldschmidt, R.: Um método baseado na evolução dos dados topológicos para a predição de links em redes sociais. In: SBSI 2019 (2019)

    Google Scholar 

  7. Florentino, E., Cavalcante, A., Goldschmidt, R.: An edge creation history retrieval based method to predict links in social networks. Knowl.-Based Syst. 205, 106268 (2020)

    Article  Google Scholar 

  8. Fortunato, S.: Community detection in graphs. Phys. Rep. 486(3) (2010)

    Google Scholar 

  9. Harris, Z.S.: Distributional structure. Word 10(2–3), 146–162 (1954)

    Article  Google Scholar 

  10. Hasan, M., Chaoji, V., Salem, S., Zaki, M.: Link prediction using supervised learning. In: Proceedings of SDM 2006 Workshop on Link Analysis, Counterterrorism and Security, January 2006

    Google Scholar 

  11. Hasan, M.A., Zaki, M.J.: A Survey of Link Prediction in Social Networks, pp. 243–275. Springer US, Boston (2011)

    Google Scholar 

  12. Kohavi, R.: A study of cross-validation and bootstrap for accuracy estimation and model selection. In: Proceedings of the 14th International Joint Conference on Artificial Intelligence, IJCAI 1995, vol. 2, pp. 1137–1143. Morgan Kaufmann Publishers Inc., San Francisco (1995)

    Google Scholar 

  13. Kumar, R., Novak, J., Tomkins, A.: Structure and evolution of online social networks, pp. 337–357. Springer, New York (2010)

    Google Scholar 

  14. Laishram, R., Mehrotra, K., Mohan, C.K.: Link prediction in social networks with edge aging. In: 2016 IEEE 28th International Conference on Tools with Artificial Intelligence (ICTAI), pp. 606–613, November 2016

    Google Scholar 

  15. Le, Q., Mikolov, T.: Distributed representations of sentences and documents. In: Proceedings of the 31st International Conference on International Conference on Machine Learning, ICML 2014, vol. 32, pp. 3111–3119. JMLR.org (2014)

    Google Scholar 

  16. Liben-Nowell, D., Kleinberg, J.: The link-prediction problem for social networks. J. Am. Soc. Inf. Sci. Technol. 58(7), 1019–1031 (2007)

    Article  Google Scholar 

  17. Lü, L., Zhou, T.: Link prediction in complex networks: a survey. Phys. A 390(6), 69:1–69:33 (2011)

    Google Scholar 

  18. Mislove, A., Marcon, M., Gummadi, K.P., Druschel, P., Bhattacharjee, B.: Measurement and analysis of online social networks. In: Proceedings of the 7th ACM SIGCOMM Conference on Internet Measurement, IMC 2007, pp. 29–42. ACM, New York (2007)

    Google Scholar 

  19. Munasinghe, L., Ichise, R.: Time score: a new feature for link prediction in social networks. IEICE Trans. Inf. Syst. E95.D, 821–828 (2012)

    Article  Google Scholar 

  20. Muniz, C.P., Goldschmidt, R., Choren, R.: Combining contextual, temporal and topological information for unsupervised link prediction in social networks. Knowl.-Based Syst. 156, 129–137 (2018)

    Article  Google Scholar 

  21. Negi, S., Chaudhury, S.: Link prediction in heterogeneous social networks. In: Proceedings of the 25th ACM International on Conference on Information and Knowledge Management, CIKM 2016, pp. 609–617. ACM, New York (2016)

    Google Scholar 

  22. Nettleton, D.F.: Data mining of social networks represented as graphs. Comput. Sci. Rev. 7(Suppl. C), 1–34 (2013)

    Article  Google Scholar 

  23. Newman, M.: Clustering and preferential attachment in growing networks. Phys. Rev. E 64, 025102 (2001)

    Article  Google Scholar 

  24. Pecli, A., Cavalcanti, M.C., Goldschmidt, R.: Automatic feature selection for supervised learning in link prediction applications: a comparative study. Knowl. Inf. Syst. 56(1), 85–121 (2017). https://doi.org/10.1007/s10115-017-1121-6

    Article  Google Scholar 

  25. Popescul, A., Popescul, R., Ungar, L.H.: Statistical relational learning for link prediction (2003)

    Google Scholar 

  26. Rummele, N., Ichise, R., Werthner, H.: Exploring supervised methods for temporal link prediction in heterogeneous social networks. In: Proceedings of the 24th International Conference on World Wide Web, WWW 2015 Companion, pp. 1363–1368. ACM, New York (2015)

    Google Scholar 

  27. Salton, G.: Automatic Text Processing: The Transformation, Analysis, and Retrieval of Information by Computer. Addison-Wesley Longman Publishing Co., Inc., Boston (1989)

    Google Scholar 

  28. Shalforoushan, S.H., Jalali, M.: Link prediction in social networks using Bayesian networks. In: 2015 The International Symposium on Artificial Intelligence and Signal Processing (AISP), pp. 246–250, March 2015

    Google Scholar 

  29. Sharma, P.K., Rathore, S., Park, J.H.: Multilevel learning based modeling for link prediction and users’ consumption preference in online social networks. Future Gener. Comput. Syst. (2017)

    Google Scholar 

  30. Sparck Jones, K.: A statistical interpretation of term specificity and its application in retrieval. In: Document Retrieval Systems, pp. 132–142. Taylor Graham Publishing, London (1988)

    Google Scholar 

  31. Tasnadi, E., Berend, G.: Supervised prediction of social network links using implicit sources of information. In: Proceedings of the 24th International Conference on World Wide Web, WWW 2015 Companion, pp. 1117–1122. ACM, New York (2015)

    Google Scholar 

  32. Wilcoxon, F.: Individual Comparisons by Ranking Methods. Bobbs-Merrill Reprint Series in the Social Sciences, S541. Bobbs-Merrill Company Incorporated (1945)

    Google Scholar 

  33. Wu, J., Zhang, G., Ren, Y.: A balanced modularity maximization link prediction model in social networks. Inf. Process. Manage. 53(1), 295–307 (2017)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Argus A. B. Cavalcante .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Cavalcante, A.A.B., Justel, C.M., Goldschmidt, R.R. (2020). Link Prediction in Social Networks: An Edge Creation History-Retrieval Based Method that Combines Topological and Contextual Data. In: Cerri, R., Prati, R.C. (eds) Intelligent Systems. BRACIS 2020. Lecture Notes in Computer Science(), vol 12320. Springer, Cham. https://doi.org/10.1007/978-3-030-61380-8_26

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-61380-8_26

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-61379-2

  • Online ISBN: 978-3-030-61380-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics