Neighborhood-Based Tag Prediction

  • Adriana Budura
  • Sebastian Michel
  • Philippe Cudré-Mauroux
  • Karl Aberer
Part of the Lecture Notes in Computer Science book series (LNCS, volume 5554)


We consider the problem of tag prediction in collaborative tagging systems where users share and annotate resources on the Web. We put forward HAMLET, a novel approach to automatically propagate tags along the edges of a graph which relates similar documents. We identify the core principles underlying tag propagation for which we derive suitable scoring models combined in one overall ranking formula. Leveraging these scores, we present an efficient top-k tag selection algorithm that infers additional tags by carefully inspecting neighbors in the document graph. Experiments using real-world data demonstrate the viability of our approach in large-scale environments where tags are scarce.


Document Graph Place Semantic Initial Document Index List Social Annotation 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


  1. 1.
    Angelova, R., Weikum, G.: Graph-based text classification: learn from your neighbors. In: SIGIR (2006)Google Scholar
  2. 2.
    Bao, S., Xue, G., Wu, X., Yu, Y., Fei, B., Su, Z.: Optimizing web search using social annotations. In: WWW (2007)Google Scholar
  3. 3.
    Chirita, P.A., Costache, S., Nejdl, W., Handschuh, S.: P-tag: large scale automatic generation of personalized annotation tags for the web. In: WWW (2007)Google Scholar
  4. 4.
    Davison, B.D.: Topical locality in the web. In: SIGIR (2000)Google Scholar
  5. 5.
    Fagin, R.: Combining fuzzy information from multiple systems. J. Comput. Syst. Sci. 58(1) (1999)Google Scholar
  6. 6.
    Fagin, R., Lotem, A., Naor, M.: Optimal aggregation algorithms for middleware. J. Comput. Syst. Sci. 66(4) (2003)Google Scholar
  7. 7.
    Golder, S., Huberman, B.A.: Usage patterns of collaborative tagging systems. Journal of Information Science 32(2) (2006)Google Scholar
  8. 8.
    Halpin, H., Robu, V., Shepherd, H.: The complex dynamics of collaborative tagging. In: WWW (2007)Google Scholar
  9. 9.
    Heymann, P., Koutrika, G., Garcia-Molina, H.: Can social bookmarking improve web search? In: WSDM (2008)Google Scholar
  10. 10.
    Heymann, P., Ramage, D., Garcia-Molina, H.: Social tag prediction. In: SIGIR (2008)Google Scholar
  11. 11.
    Hotho, A., Jäschke, R., Schmitz, C., Stumme, G.: Information retrieval in folksonomies: Search and ranking. In: Sure, Y., Domingue, J. (eds.) ESWC 2006. LNCS, vol. 4011, pp. 411–426. Springer, Heidelberg (2006)CrossRefGoogle Scholar
  12. 12.
    Järvelin, K., Kekäläinen, J.: Cumulated gain-based evaluation of IR techniques. ACM Trans. Inf. Syst. 20(4) (2002)Google Scholar
  13. 13.
    Jäschke, R., Marinho, L.B., Hotho, A., Schmidt-Thieme, L., Stumme, G.: Tag recommendations in folksonomies. In: Kok, J.N., Koronacki, J., Lopez de Mantaras, R., Matwin, S., Mladenič, D., Skowron, A. (eds.) PKDD 2007. LNCS, vol. 4702, pp. 506–514. Springer, Heidelberg (2007)CrossRefGoogle Scholar
  14. 14.
    Kinsella, S., Budura, A., Skobeltsyn, G., Michel, S., Breslin, J.G., Aberer, K.: From Web 1.0 to Web 2.0 and back - how did your grandma use to tag? In: WIDM (2008)Google Scholar
  15. 15.
    Kleinberg, J.M., Tardos, É.: Approximation algorithms for classification problems with pairwise relationships: Metric labeling and markov random fields. In: FOCS (1999)Google Scholar
  16. 16.
    Marlow, C., Naaman, M., Boyd, D., Davis, M.: Ht06, tagging paper, taxonomy, flickr, academic article, to read. In: HYPERTEXT (2006)Google Scholar
  17. 17.
    Mika, P.: Ontologies Are Us: A Unified Model of Social Networks and Semantics. In: Gil, Y., Motta, E., Benjamins, V.R., Musen, M.A. (eds.) ISWC 2005. LNCS, vol. 3729, pp. 522–536. Springer, Heidelberg (2005)CrossRefGoogle Scholar
  18. 18.
    Rattenbury, T., Good, N., Naaman, M.: Towards automatic extraction of event and place semantics from flickr tags. In: SIGIR (2007)Google Scholar
  19. 19.
    Schenkel, R., Crecelius, T., Kacimi, M., Michel, S., Neumann, T., Parreira, J.X., Weikum, G.: Efficient top-k querying over social-tagging networks. In: SIGIR (2008)Google Scholar
  20. 20.
    Schmitz, P.: Inducing ontology from flickr tags. In: Workshop on Collaborative Tagging at WWW (2006)Google Scholar
  21. 21.
    Sigurbjörnsson, B., van Zwol, R.: Flickr tag recommendation based on collective knowledge. In: WWW (2008)Google Scholar
  22. 22.
    Song, Y., Zhuang, Z., Li, H., Zhao, Q., Li, J., Lee, W., Giles, C.: Real-time automatic tag recommendation. In: SIGIR (2008)Google Scholar
  23. 23.
    Wu, X., Zhang, L., Yu, Y.: Exploring social annotations for the semantic web. In: WWW (2006)Google Scholar
  24. 24.
    Yanbe, Y., Jatowt, A., Nakamura, S., Tanaka, K.: Towards improving web search by utilizing social bookmarks. In: Baresi, L., Fraternali, P., Houben, G.-J. (eds.) ICWE 2007. LNCS, vol. 4607, pp. 343–357. Springer, Heidelberg (2007)CrossRefGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2009

Authors and Affiliations

  • Adriana Budura
    • 1
  • Sebastian Michel
    • 1
  • Philippe Cudré-Mauroux
    • 2
  • Karl Aberer
    • 1
  1. 1.Ecole Polytechnique Fédérale de Lausanne (EPFL)Switzerland
  2. 2.MITUSA

Personalised recommendations