Advertisement

A Programming Interface and Platform Support for Developing Recommendation Algorithms on Large-Scale Social Networks

  • Alejandro Corbellini
  • Daniela Godoy
  • Cristian Mateos
  • Alejandro Zunino
  • Silvia Schiaffino
Part of the Lecture Notes in Computer Science book series (LNCS, volume 8658)

Abstract

Friend recommendation algorithms in large-scale social networks such as Facebook or Twitter usually require the exploration of huge user graphs. In current solutions for parallelizing graph algorithms, the burden of dealing with distributed concerns falls on algorithm developers. In this paper, a simple yet powerful programming interface (API) to implement distributed graph traversal algorithms is presented. A case study on implementing a followee recommendation algorithm for Twitter using the API is described. This case study not only illustrates the simplicity offered by the API for developing algorithms, but also how different aspects of the distributed solutions can be treated and experimented without altering the algorithm code. Experiments evaluating the performance of different job scheduling strategies illustrate the flexibility or our approach.

Keywords

Schedule Strategy Information Seeker Recommendation Algorithm Graph Database Adjacency Graph 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Armentano, M., Godoy, D., Amandi, A.: Topology-based recommendation of users in micro-blogging communities. Journal of Computer Science and Technology 27(3), 624–634 (2012)CrossRefGoogle Scholar
  2. 2.
    Aurelius. Titan (2014), http://thinkaurelius.github.io/titan/ (accessed April 14, 2014)
  3. 3.
    Cattell, R.: Scalable SQL and NoSQL data stores. ACM SIGMOD Record 39(4), 12–27 (2011)CrossRefGoogle Scholar
  4. 4.
    Gupta, P., Goel, A., Lin, J., Sharma, A., Wang, D., Zadeh, R.: WTF: The who to follow service at Twitter. In: Proceedings of the 22th International World Wide Web Conference (WWW 2013), Rio de Janeiro, Brazil (2013)Google Scholar
  5. 5.
    Java, A., Song, X., Finin, T., Tseng, B.: Why we Twitter: Understanding microblogging usage and communities. In: Proceedings of the 9th WebKDD and 1st SNA-KDD 2007 Workshop on Web Mining and Social Network Analysis, San Jose, CA, USA, pp. 56–65 (2007)Google Scholar
  6. 6.
    Kleinberg, J.M.: Authoritative sources in a hyperlinked environment. Journal of the ACM 46(5), 604–632 (1999)CrossRefzbMATHMathSciNetGoogle Scholar
  7. 7.
    Krepska, E., Kielmann, T., Fokkink, W., Bal, H.: HipG: Parallel processing of large-scale graphs. ACM SIGOPS Operating Systems Review 45(2), 3–13 (2011)CrossRefGoogle Scholar
  8. 8.
    Krishnamurthy, B., Gill, P., Arlitt, M.: A few chirps about Twitter. In: Proceedings of the 1st Workshop on Online Social Networks (WOSP 2008), Seattle, USA, pp. 19–24 (2008)Google Scholar
  9. 9.
    Kwak, H., Lee, C., Park, H., Moon, S.: What is Twitter, a social network or a news media? In: Proceedings of the 19th International Conference on World Wide Web (WWW 2010), Raleigh, NC, USA, pp. 591–600 (2010)Google Scholar
  10. 10.
    Lempel, R., Moran, S.: SALSA: the stochastic approach for link-structure analysis. ACM Transactions on Information Systems 19(2), 131–160 (2001)CrossRefGoogle Scholar
  11. 11.
    Malewicz, G., Austern, M.H., Bik, A.J.C., Dehnert, J.C., Horn, I., Leiser, N., Czajkowski, G.: Pregel: A system for large-scale graph processing. In: Proceedings of the 2010 International Conference on Management of Data (SIGMOD 2010), Indianapolis, IN, USA, pp. 135–146 (2010)Google Scholar
  12. 12.
    Inc. Neo Technology. Neo4J (2013), http://www.neo4j.org/ (accessed August 5, 2013)
  13. 13.
    Page, L., Brin, S., Motwani, R., Winograd, T.: The PageRank citation ranking: bringing order to the web, pp. 1–17 (1999)Google Scholar
  14. 14.
    Shao, B., Wang, H., Li, Y.: The Trinity Graph Engine. Technical Report MSR-TR-2012-30, Microsoft Research (March 2012)Google Scholar
  15. 15.
    da Silva, S.T.F., Oliveira, J., Borges, M.R.S.: Contextual analysis of the victims’ social network for people recommendation on the emergency scenario. In: Herskovic, V., Hoppe, H.U., Jansen, M., Ziegler, J. (eds.) CRIWG 2012. LNCS, vol. 7493, pp. 200–207. Springer, Heidelberg (2012)CrossRefGoogle Scholar
  16. 16.
    Strauch, C., Sites, U.L.S., Kriha, W.: NoSQL databases. Lecture Notes, Stuttgart Media University (2011)Google Scholar
  17. 17.
    Twitter Inc. FlockDB (2013), https://github.com/twitter/flockdb (accessed August 5, 2013)
  18. 18.
    Yang, S., Yan, X., Zong, B., Khan, A.: Towards effective partition management for large graphs. In: Proceedings of the 2012 International Conference on Management of Data (SIGMOD 2012), Scottsdale, AZ, USA, pp. 517–528 (2012)Google Scholar

Copyright information

© Springer International Publishing Switzerland 2014

Authors and Affiliations

  • Alejandro Corbellini
    • 1
  • Daniela Godoy
    • 1
  • Cristian Mateos
    • 1
  • Alejandro Zunino
    • 1
  • Silvia Schiaffino
    • 1
  1. 1.ISISTAN Research Institute - Consejo Nacional de Investigaciones Cientí́ficas y Técnicas (CONICET)Univ. Nacional del Centro de la Provincia de Bs. As. (UNICEN)TandilArgentina

Personalised recommendations