Encyclopedia of Machine Learning and Data Mining

2017 Edition
| Editors: Claude Sammut, Geoffrey I. Webb

Link Prediction

  • Galileo NamataEmail author
  • Lise Getoor
Reference work entry
DOI: https://doi.org/10.1007/978-1-4899-7687-1_486



Many datasets can naturally be represented as graph where nodes represent instances and links represent relationships between those instances. A fundamental problem with these types of data is that the link information in the graph may be of dubious quality; links may incorrectly exist between unrelated nodes and links may be missing between two related nodes. The goal of link prediction is to predict the existence of incorrect or missing links between the nodes of the graph.


Inferring the existences of edges between nodes in a graph has traditionally been referred to as link prediction (Liben-Nowell and Kleinberg 2003a; Taskar et al. 2003). Link prediction is a challenging problem that has been studied in various guises in different domains. For example, in social network analysis, there is work on predicting friendship links (Zheleva et al. 2008), event participation links (i.e., coauthorship O’Madadhain...

This is a preview of subscription content, log in to check access.

Recommended Reading

  1. Albert R, DasGupta B, Dondi R, Kachalo S, Sontag E, Zelikovsky A et al (2007) A novel method for signal transduction network inference from indirect experimental evidence. J Comput Biol 14:407–419MathSciNetCrossRefGoogle Scholar
  2. Balasubramanyan R, Carvalho VR, Cohen W (2009) Cutonce recipient recommendation and leak detection in action. In: Workshop on enhanced messaging, ChicagoGoogle Scholar
  3. Carvalho VR, Cohen WW (2007) Preventing information leaks in email. In: SIAM conference on data mining, MinneapolisCrossRefGoogle Scholar
  4. Chaiwanarom P, Lursinsap C (2008) Link completion using prediction by partial matching. In: International symposium on communications and information technologies, VientianeCrossRefGoogle Scholar
  5. Clauset A, Moore C, Newman MEJ (2008) Hierarchical structure and the prediction of missing links in networks. Nature 453:98CrossRefGoogle Scholar
  6. Deng M, Mehta S, Sun F, Chen T (2002) Inferring domain-domain interactions from protein-protein interactions. Genome Res 12(10):1540–1548CrossRefGoogle Scholar
  7. Diehl C, Namata GM, Getoor L (2007) Relationship identification for social network discovery. In: Proceedings of the 22nd national conference on artificial intelligence, VancouverGoogle Scholar
  8. Farrell S, Campbell C, Myagmar S (2005) Relescope: an experiment in accelerating relationships. In: Extended abstracts on human factors in computing systems, PortlandCrossRefGoogle Scholar
  9. Getoor L, Friedman N, Koller D, Taskar B (2003) Learning probabilistic models of link structure. Mach Learn 3:679–707MathSciNetzbMATHGoogle Scholar
  10. Goldenberg A, Kubica J, Komarek P, Moore A, Schneider J (2003) A comparison of statistical and machine learning algorithms on the task of link completion. In: Conference on knowledge discovery and data mining, workshop on link analysis for detecting complex behavior, Washington, DCGoogle Scholar
  11. Huang Z, Lin DKJ (2008) The time-series link prediction problem with applications in communication surveillance. Inf J Comput 21:286–303CrossRefGoogle Scholar
  12. Huang Z, Zeng DD (2006) A link prediction approach to anomalous email detection. In: IEEE international conference on systems, man, and cybernetics, TaipeiCrossRefGoogle Scholar
  13. Liben-Nowell D, Kleinberg J (2003a) The link prediction problem for social networks. In: International conference on information and knowledge management, New OrleansGoogle Scholar
  14. Liben-Nowell and Kleinberg (2003b)Google Scholar
  15. Milne D, Witten IH (2008) Learning to link with wikipedia. In: Proceedings of the 17th ACM conference on information and knowledge management, Napa ValleyGoogle Scholar
  16. O’Madadhain J, Hutchins J, Smyth P (2005) Prediction and ranking algorithms for event-based network data. SIGKDD Explor Newsl 7(2):23–30CrossRefGoogle Scholar
  17. Popescul A, Ungar LH (2003) Statistical relational learning for link prediction. In: International joint conferences on artificial intelligence workshop on learning statistical models from relational dataGoogle Scholar
  18. Rattigan MJ, Jensen D (2005a) The case for anomalous link discovery. SIGKDD Explor Newsl 7:41–47Google Scholar
  19. Rattigan and Jensen (2005b)Google Scholar
  20. Richardson M, Domingos P (2006) Markov logic networks. Mach Learn 62:107–136CrossRefGoogle Scholar
  21. Spring N, Wetherall D, Anderson T (2004) Reverse engineering the internet. SIGCOMM Comput Commun Rev 34(1):3–8CrossRefGoogle Scholar
  22. Sprinzak E, Altuvia Y, Margalit H (2006) Characterization and prediction of protein-protein interactions within and between complexes. Proc Natl Acad Sci 103(40):14718–14723CrossRefGoogle Scholar
  23. Szilagyi A, Grimm V, Arakaki AK, Skolnick J (2005a) Prediction of physical protein-protein interactions. Phys Biol 2(2):S1–S16Google Scholar
  24. Szilagyi et al. (2005b)Google Scholar
  25. Taskar B, Wong M-F, Abbeel P, Koller D (2003) Link prediction in relational data. In: Advances in neural information processing systems, VancouverGoogle Scholar
  26. Yu H, Paccanaro A, Trifonov V, Gerstein M (2006) Predicting interactions in protein networks by completing defective cliques. Bioinformatics 22(7):823–829CrossRefGoogle Scholar
  27. Zheleva E, Getoor L, Golbeck J, Kuter U (2008) Using friendship ties and family circles for link prediction. In: 2nd ACM SIGKDD workshop on social network mining and analysis, Las VegasGoogle Scholar
  28. Zhu J (2003) Mining web site link structure for adaptive web site navigation and search. Ph.D. thesis, University of Ulster at JordanstownGoogle Scholar

Copyright information

© Springer Science+Business Media New York 2017

Authors and Affiliations

  1. 1.University of MarylandCollege ParkUSA