Skip to main content

Advertisement

SpringerLink
Log in
Menu
Find a journal Publish with us
Search
Cart
Book cover

International Conference on Web Engineering

ICWE 2012: Web Engineering pp 61–75Cite as

  1. Home
  2. Web Engineering
  3. Conference paper
Methodologies for Improved Tag Cloud Generation with Clustering

Methodologies for Improved Tag Cloud Generation with Clustering

  • Martin Leginus19,
  • Peter Dolog19,
  • Ricardo Lage19 &
  • …
  • Frederico Durao19 
  • Conference paper
  • 2046 Accesses

  • 2 Citations

Part of the Lecture Notes in Computer Science book series (LNISA,volume 7387)

Abstract

Tag clouds are useful means for navigation in the social web systems. Usually the systems implement the tag cloud generation based on tag popularity which is not always the best method. In this paper we propose methodologies on how to combine clustering into the tag cloud generation to improve coverage and overlap. We study several clustering algorithms to generate tag clouds. We show that by extending cloud generation based on tag popularity with clustering we slightly improve coverage. We also show that if the cloud is generated by clustering independently of the tag popularity baseline we minimize overlap and increase coverage. In the first case we therefore provide more items for a user to explore. In the second case we provide more diverse items for a user to explore. We experiment with the methodologies on two different datasets: Delicious and Bibsonomy. The methodologies perform slightly better on bibsonomy due to its specific focus. The best performing is the hierarchical clustering.

Keywords

  • Cluster Technique
  • Cloud Generation
  • Complete Linkage Hierarchical Cluster
  • Chained Coverage
  • Agglomerative Hierarchical Cluster Technique

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Download conference paper PDF

References

  1. Bai, B., Weston, J., Grangier, D., Collobert, R., Sadamasa, K., Qi, Y., Chapelle, O., Weinberger, K.: Learning to rank with (a lot of) word features. Information Retrieval 13(3), 291–314 (2010)

    CrossRef  Google Scholar 

  2. Bateman, S., Gutwin, C., Nacenta, M.: Seeing things in the clouds: the effect of visual features on tag cloud selections. In: Proceedings of the Nineteenth ACM Conference on Hypertext and Hypermedia, HT 2008, pp. 193–202. ACM, New York (2008)

    CrossRef  Google Scholar 

  3. Bateman, S., Gutwin, C., Nacenta, M.: Seeing things in the clouds: the effect of visual features on tag cloud selections. In: Proceedings of the Nineteenth ACM Conference on Hypertext and Hypermedia, HT 2008, pp. 193–202. ACM, New York (2008)

    CrossRef  Google Scholar 

  4. Durao, F., Dolog, P., Leginus, M., Lage, R.: SimSpectrum: A Similarity Based Spectral Clustering Approach to Generate a Tag Cloud. In: Harth, A., Koch, N. (eds.) ICWE 2011. LNCS, vol. 7059, pp. 145–154. Springer, Heidelberg (2012)

    CrossRef  Google Scholar 

  5. Echarte, F., Astrain, J.J., Córdoba, A., Villadangos, J.: Pattern Matching Techniques to Identify Syntactic Variations of Tags in Folksonomies. In: Lytras, M.D., Damiani, E., Tennyson, R.D. (eds.) WSKS 2008. LNCS (LNAI), vol. 5288, pp. 557–564. Springer, Heidelberg (2008)

    CrossRef  Google Scholar 

  6. Halvey, M.J., Keane, M.T.: An assessment of tag presentation techniques. In: Proceedings of the 16th International Conference on World Wide Web, WWW 2007, pp. 1313–1314. ACM, New York (2007)

    CrossRef  Google Scholar 

  7. Hassan-Montero, Y., Herrero-Solana, V.: Improving tag-clouds as visual information retrieval interfaces. In: INSCIT 2006 Conference, Merída (2006)

    Google Scholar 

  8. Hassan-Montero, Y., Herrero-Solana, V.: Improving tag-clouds as visual information retrieval interfaces. In: International Conference on Multidisciplinary Information Sciences and Technologies, Citeseer, pp. 25–28 (2006)

    Google Scholar 

  9. Huang, A.: Similarity measures for text document clustering. In: Proceedings of the Sixth New Zealand Computer Science Research Student Conference (NZCSRSC 2008), Christchurch, New Zealand, pp. 49–56 (2008)

    Google Scholar 

  10. Johnson, S.: Hierarchical clustering schemes. Psychometrika 32(3), 241–254 (1967)

    CrossRef  Google Scholar 

  11. Kaser, O., Lemire, D.: Tag-cloud drawing: Algorithms for cloud visualization. CoRR, abs/cs/0703109 (2007)

    Google Scholar 

  12. Knautz, K., Soubusta, S., Stock, W.G.: Tag clusters as information retrieval interfaces. In: HICSS, pp. 1–10 (2010)

    Google Scholar 

  13. Knowledge and U. o. K. Data Engineering Group: Benchmark folksonomy data from bibsonomy, version of January 1 (2010)

    Google Scholar 

  14. Kuo, B.Y.-L., Hentrich, T., Good, B.M., Wilkinson, M.D.: Tag clouds for summarizing web search results. In: Proceedings of the 16th International Conference on World Wide Web, WWW 2007, pp. 1203–1204. ACM, New York (2007)

    CrossRef  Google Scholar 

  15. Kuo, B.Y.-L., Hentrich, T., Good, B.M., Wilkinson, M.D.: Tag clouds for summarizing web search results. In: Proceedings of the 16th International Conference on World Wide Web, WWW 2007, pp. 1203–1204. ACM, New York (2007)

    CrossRef  Google Scholar 

  16. Leginus, M., Zemaitis, V.: Speeding up tensor based recommenders with clustered tag space and improving quality of recommendations with non-negative tensor factorization. Master’s thesis, Aalborg University (2011)

    Google Scholar 

  17. Ramage, D., Heymann, P., Manning, C., Garcia-Molina, H.: Clustering the tagged web. In: Proceedings of the Second ACM International Conference on Web Search and Data Mining, pp. 54–63. ACM (2009)

    Google Scholar 

  18. Rivadeneira, A.W., Gruen, D.M., Muller, M.J., Millen, D.R.: Getting our head in the clouds: toward evaluation studies of tagclouds. In: Proceedings of the SIGCHI Conference on Human factors in Computing Systems, CHI 2007, pp. 995–998. ACM, New York (2007)

    CrossRef  Google Scholar 

  19. Schrammel, J., Leitner, M., Tscheligi, M.: Semantically structured tag clouds: an empirical evaluation of clustered presentation approaches. In: Proceedings of the 27th International Conference on Human Factors in Computing Systems, CHI 2009, pp. 2037–2040. ACM, New York (2009)

    CrossRef  Google Scholar 

  20. Shepitsen, A., Gemmell, J., Mobasher, B., Burke, R.: Personalized recommendation in social tagging systems using hierarchical clustering. In: Proceedings of the 2008 ACM Conference on Recommender Systems, RecSys 2008, pp. 259–266. ACM, New York (2008)

    CrossRef  Google Scholar 

  21. Sinclair, J., Cardew-Hall, M.: The folksonomy tag cloud: when is it useful? J. Inf. Sci. 34, 15–29 (2008)

    CrossRef  Google Scholar 

  22. van Dam, J., Vandic, D., Hogenboom, F., Frasincar, F.: Searching and browsing tag spaces using the semantic tag clustering search framework. In: 2010 IEEE Fourth International Conference on Semantic Computing (ICSC), pp. 436–439. IEEE (2010)

    Google Scholar 

  23. Venetis, P., Koutrika, G., Garcia-Molina, H.: On the selection of tags for tag clouds. In: Proceedings of the Fourth ACM International Conference on Web Search and Data Mining, WSDM 2011, pp. 835–844 (2011)

    Google Scholar 

Download references

Author information

Authors and Affiliations

  1. Department of Computer Science, Aalborg University, Selma Lagerlofs Vej 300, Denmark

    Martin Leginus, Peter Dolog, Ricardo Lage & Frederico Durao

Authors
  1. Martin Leginus
    View author publications

    You can also search for this author in PubMed Google Scholar

  2. Peter Dolog
    View author publications

    You can also search for this author in PubMed Google Scholar

  3. Ricardo Lage
    View author publications

    You can also search for this author in PubMed Google Scholar

  4. Frederico Durao
    View author publications

    You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

  1. Dipartimento di Elettronica e Informazione, Politecnico di Milano, Via Ponzio 34/5, 20133, Milano, Italy

    Marco Brambilla

  2. Department of Computer Science, Tokyo Institute of Technology, 2-12-1 Oookayama, 152-8552, Tokyo, Japan

    Takehiro Tokuda

  3. Institut für Informatik, Freie Universität Berlin, Königin-Luise-Strasse 24-26, 14195, Berlin, Germany

    Robert Tolksdorf

Rights and permissions

Reprints and Permissions

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Leginus, M., Dolog, P., Lage, R., Durao, F. (2012). Methodologies for Improved Tag Cloud Generation with Clustering. In: Brambilla, M., Tokuda, T., Tolksdorf, R. (eds) Web Engineering. ICWE 2012. Lecture Notes in Computer Science, vol 7387. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-31753-8_5

Download citation

  • .RIS
  • .ENW
  • .BIB
  • DOI: https://doi.org/10.1007/978-3-642-31753-8_5

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-31752-1

  • Online ISBN: 978-3-642-31753-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Share this paper

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

Search

Navigation

  • Find a journal
  • Publish with us

Discover content

  • Journals A-Z
  • Books A-Z

Publish with us

  • Publish your research
  • Open access publishing

Products and services

  • Our products
  • Librarians
  • Societies
  • Partners and advertisers

Our imprints

  • Springer
  • Nature Portfolio
  • BMC
  • Palgrave Macmillan
  • Apress
  • Your US state privacy rights
  • Accessibility statement
  • Terms and conditions
  • Privacy policy
  • Help and support

167.114.118.210

Not affiliated

Springer Nature

© 2023 Springer Nature