User Recommendation in Low Degree Networks with a Learning-Based Approach

  • Marcelo G. Armentano (email author)
  • Ariel Monteserin
  • Franco Berdun
  • Emilio Bongiorno
  • Luis María Coussirat
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 11288)


User recommendation plays an important role in microblogging systems, since users connect to these networks to share and consume content. Finding relevant users to follow is therefore an active topic in the study of social networks. Microblogging networks have a large number of users, but each user connects with only a limited number of others, so the follower graph has a low degree. One of the main obstacles to a learning-based approach to user recommendation in low-degree networks is extreme class imbalance. In this article, we propose a balancing scheme to address this problem, and we evaluate different classification algorithms that use classical link prediction metrics as features. We found that the learning-based approach outperformed individual metrics for user recommendation in the evaluated dataset. We also found that the proposed balancing approach led to better results, enabling better identification of existing connections between users.
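
To make the class imbalance concrete, the following is a minimal sketch and not the method evaluated in the paper. It assumes an undirected toy graph (a real follower graph is directed), uses three classical link prediction metrics as features (common neighbors, Jaccard coefficient, Adamic-Adar), and substitutes plain random undersampling of non-links for the balancing scheme, whose details are not given here. All function names and the toy edge list are hypothetical.

```python
import math
import random
from itertools import combinations


def build_adjacency(edges):
    """Build an undirected adjacency map {node: set(neighbors)} from edge pairs."""
    adj = {}
    for u, v in edges:
        adj.setdefault(u, set()).add(v)
        adj.setdefault(v, set()).add(u)
    return adj


def pair_features(adj, u, v):
    """Return (common neighbors, Jaccard coefficient, Adamic-Adar) for a node pair."""
    # Drop u and v from each other's neighborhoods so that a direct
    # u-v edge does not leak into the features.
    nu = adj[u] - {v}
    nv = adj[v] - {u}
    common = nu & nv
    union = nu | nv
    jaccard = len(common) / len(union) if union else 0.0
    # Every common neighbor has degree >= 2, so log(degree) is positive.
    adamic_adar = sum(1.0 / math.log(len(adj[w])) for w in common)
    return len(common), jaccard, adamic_adar


def labelled_pairs(adj):
    """Split all unordered node pairs into existing links and non-links."""
    positives, negatives = [], []
    for u, v in combinations(sorted(adj), 2):
        (positives if v in adj[u] else negatives).append((u, v))
    return positives, negatives


def undersample_negatives(positives, negatives, ratio=1.0, seed=0):
    """Assumed balancing step: keep about ratio * len(positives) random non-links."""
    rng = random.Random(seed)
    k = min(len(negatives), round(ratio * len(positives)))
    return rng.sample(negatives, k)


# Hypothetical toy graph: 6 nodes and 6 links out of 15 possible pairs.
edges = [("a", "b"), ("b", "c"), ("c", "d"), ("d", "e"), ("e", "f"), ("a", "c")]
adj = build_adjacency(edges)
positives, negatives = labelled_pairs(adj)
sampled = undersample_negatives(positives, negatives)

# Feature rows with labels, ready to pass to any classifier.
rows = [(pair_features(adj, u, v), 1) for u, v in positives]
rows += [(pair_features(adj, u, v), 0) for u, v in sampled]
```

In a graph with n users of bounded degree, the number of links grows roughly linearly in n while the number of candidate pairs grows quadratically, so non-links dominate the training set unless some balancing step of this kind is applied.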


Keywords: User recommendation, Online social networks, Link prediction



This work was partially supported by research project PICT-2014-2750.



Copyright information

© Springer Nature Switzerland AG 2018

Authors and Affiliations

  • Marcelo G. Armentano (1), email author
  • Ariel Monteserin (1)
  • Franco Berdun (1)
  • Emilio Bongiorno (2)
  • Luis María Coussirat (2)

  1. ISISTAN Research Institute (CONICET-UNICEN), Tandil, Argentina
  2. Facultad de Ciencias Exactas, UNICEN, Tandil, Argentina
