Skip to main content
Log in

Estimating node indirect interaction duration to enhance link prediction

  • Original Article
  • Published:
Social Network Analysis and Mining Aims and scope Submit manuscript

Abstract

Link prediction is the problem of inferring new relationships among nodes in a network that are likely to occur in the near future. Classical approaches mainly consider neighborhood structure similarity when linking nodes. However, we may also want to take into account if the two nodes are already indirectly interacting and if they will benefit from the link by having an active interaction over the time. For instance, it is better to link two nodes u and v if we know that these two nodes will interact in the social network even in the future, rather than suggesting \(v'\), which will never interact with u. In this paper, we deal with a variant of the link prediction problem: Given a pair of indirectly interacting nodes, predict whether or not they will form a link in the future. We propose a solution to this problem that leverages the predicted duration of their interaction and propose two supervised learning approaches to predict how long will two nodes interact in a network. Given a set of network-based predictors, the basic approach consists of learning a binary classifier to predict whether or not an observed indirect interaction will last in the future. The second and more fine-grained approach consists of estimating how long the interaction will last by modeling the problem via survival analysis or as a regression task. Once the duration is estimated, new links are predicted according to their descending order. Experimental results on Facebook Network and Wall Interaction and Wikipedia Clickstream datasets show that our more fine-grained approach performs the best and beats a link prediction model that does not consider the interaction duration and is based only on network properties.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2

Similar content being viewed by others

Notes

  1. https://thewikigame.com

  2. https://meta.wikimedia.org/wiki/Research:Wikipedia_clickstream.

  3. For the case of classification, we considered the predicted probability of having a link in the future.

References

  • Adafre SF, de Rijke M (2005) Discovering missing links in Wikipedia. In: LinkKDD ’05, pp 90–97

  • Ameri S, Fard MJ, Chinnam RB, Reddy CK (2016) Survival analysis based framework for early prediction of student dropouts. In: CIKM, pp 903–912

  • Backstrom L, Leskovec J (2011) Supervised random walks: predicting and recommending links in social networks. In: WSDM, pp 635–644

  • Brin S, Page L (1998) The anatomy of a large-scale hypertextual web search engine. Comput Netw 30(1–7):107–117

    Google Scholar 

  • Cacheda F, Barbieri N, Blanco R (2017) Click through rate prediction for local search results. In: WSDM, pp 171–180

  • Dave VS, Hasan MA, Reddy CK (2017) How fast will you get a response? Predicting interval time for reciprocal link creation. In: ICWSM, pp 508–511

  • Davis J, Goadrich M (2006) The relationship between precision-recall and roc curves. In: Proceedings of the 23rd international conference on machine learning. ACM, pp 233–240

  • Dhakal N, Spezzano F, Xu D (2017) Predicting friendship strength for privacy preserving: a case study on Facebook. In: ASONAM, pp 1096–1103

  • Gilbert E, Karahalios K (2009) Predicting tie strength with social media. In: CHI, pp 211–220

  • Goyal P, Ferrara E (2018) Graph embedding techniques, applications, and performance: a survey. Knowledge-Based Syst 151:78–94

    Article  Google Scholar 

  • Grover A, Leskovec J (2016) Node2vec: scalable feature learning for networks. In: KDD, pp 855–864

  • Gundala LA, Spezzano F (2018) A framework for predicting links between indirectly interacting nodes. In: IEEE/ACM 2018 international conference on advances in social networks analysis and mining, ASONAM 2018, Barcelona, Spain, August 28–31, 2018, pp 544–551

  • Hasan MA, Zaki MJ (2011) A survey of link prediction in social networks. In: Aggarwal C (ed) Social network data analytics. Springer, Boston, MA, pp 243–275

    Chapter  Google Scholar 

  • Jones JJ, Settle JE, Bond RM, Fariss CJ, Marlow C, Fowler JH (2013) Inferring tie strength from online directed behavior. PloS One 8(1):e52168

    Article  Google Scholar 

  • Kahanda I, Neville J (2009) Using transactional information to predict link strength in online social networks. In: 3rd international AAAI conference on weblogs and social media, pp 74–81

  • Kamath K, Sharma A, Wang D, Yin Z (2014) Realgraph: user interaction prediction at twitter. In: User Engagement Optimization Workshop @ KDD

  • Kumar S, Spezzano F, Subrahmanian VS, Faloutsos C (2016) Edge weight prediction in weighted signed networks. In: ICDM, pp 221–230

  • Lei C, Ruan J (2012) A novel link prediction algorithm for reconstructing protein–protein interaction networks by topological similarity. Bioinformatics 29(3):355–364

    Article  Google Scholar 

  • Liben-Nowell D, Kleinberg J (2007) The link-prediction problem for social networks. J Assoc Inf Sci Technol 58(7):1019–1031

    Article  Google Scholar 

  • Li Y, Rakesh V, Reddy CK (2016) Project success prediction in crowdfunding environments. In: WSDM, pp 247–256

  • Menon AK, Elkan C (2011) Link prediction via matrix factorization. In: ECML PKDD, pp 437–452

  • Murtaugh PA, Burns LD, Schuster J (1999) Predictiong the retention of university students. Res High Educ 40(3):355–371

    Article  Google Scholar 

  • Noraset T, Bhagavatula C, Downey D (2014) Adding high-precision links to Wikipedia. In: EMNLP ’14, pp 651–656

  • Paranjape A, West R, Zia L, Leskovec J (2016) Improving website hyperlink structure using server logs. In: WSDM ’16, pp 615–624

  • Pavlov M, Ichise R (2007) Finding experts by link prediction in co-authorship networks. In: Proceedings of the 2nd international conference on finding experts on the web with semantics, vol 290, pp 42–55

  • Pavlov M, Ichise R (2007) Finding experts by link prediction in co-authorship networks. In: Proceedings of the 2nd international conference on finding experts on the web with semantics, vol 290. FEWS’07, pp 42–55

  • Perozzi B, Al-Rfou R, Skiena S (2014) Deepwalk: online learning of social representations. In: KDD, pp 701–710

  • Qian Y, Adali S (2014) Foundations of trust and distrust in networks: extended structural balance theory. ACM Trans Web (TWEB) 8(3):13

    Google Scholar 

  • Rakesh V, Lee W, Reddy CK (2016) Probabilistic group recommendation model for crowd funding domains. In: WSDM, pp 257–266

  • Richardson M, Dominowska E, Ragno R (2007) Predicting clicks: estimating the click-through rate for new ads. In: WWW, pp 521–530

  • Tang L, Liu H (2011) Leveraging social media networks for classification. Data Min Knowl Discov 23(3):447–478

    Article  MathSciNet  Google Scholar 

  • Tang J, Qu M, Wang M, Zhang M, Yan J, Mei Q (2015) Line: large-scale information network embedding. In: WWW, pp 1067–1077

  • Viswanath B, Mislove A, Cha M, Gummadi KP (2009) On the evolution of user interaction in Facebook. In: WOSN, pp 37–42

  • Wang P, Li Y, Reddy CK (2017) Machine learning for survival analysis: a survey. CoRR arXiv:abs/1708.04649

  • West R, Paranjape A, Leskovec J (2015) Mining missing hyperlinks from human navigation traces: a case study of Wikipedia. In: WWW ’15, pp 1242–1252

  • West R, Paskov HS, Leskovec J, Potts C (2014) Exploiting social network structure for person-to-person sentiment analysis. Trans Assoc Comput Linguist 2:297–310

    Article  Google Scholar 

  • West R, Pineau J, Precup D (2009) Wikispeedia: an online game for inferring semantic distances between concepts. In: IJCAI’09, pp 1598–1603

  • West R, Precup D, Pineau J (2009) Completing Wikipedia’s hyperlink structure through dimensionality reduction. In: CIKM ’09, pp 1097–1106

  • Wilson C, Sala A, Puttaswamy KP, Zhao BY (2012) Beyond social graphs: user interactions in online social networks and their implications. ACM Trans Web (TWEB) 6(4):17

    Google Scholar 

  • Xiang R, Neville J, Rogati M (2010) Modeling relationship strength in online social networks. In: WWW, pp 981–990

  • Zignani M, Gaito S, Rossi GP (2016) Predicting the link strength of “newborn” links. In: WWW Companion, pp 147–148

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Francesca Spezzano.

Additional information

This paper is an extended version of the conference paper “Laxmi Amulya Gundala and Francesca Spezzano, A Framework for Predicting Links Between Indirectly Interacting Nodes. In Proceedings of the IEEE/ACM 2018 International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2018), pp 544–551” (Gundala and Spezzano 2018).

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Gundala, L.A., Spezzano, F. Estimating node indirect interaction duration to enhance link prediction. Soc. Netw. Anal. Min. 9, 17 (2019). https://doi.org/10.1007/s13278-019-0561-2

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s13278-019-0561-2

Keywords

Navigation