
Abstract

Since their appearance, tag clouds have been widely used on the Internet. The main purposes of these textual visualizations are information retrieval, content representation, and text browsing. Despite their widespread use and the large body of research carried out on them, the main metrics available in the literature evaluate the quality of a tag cloud based only on query results. There are no adequate metrics for the case in which the tag cloud is extracted from text and used to represent information content. In this work, three new metrics are proposed for evaluating tag clouds whose main function is to represent information content: coverage, overlap, and disparity. A fourth metric, balance, is also proposed, together with a way to calculate it using OWA operators.
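The abstract does not state how balance is computed, so the following is only a minimal sketch of the generic ordered weighted averaging (OWA) operator on which that calculation is said to rely. It is not the authors' balance formula. The function name `owa` and the sample scores are illustrative assumptions; the operator itself sorts its arguments in descending order and takes a weighted sum with non-negative weights that add up to 1.

```python
from typing import Sequence


def owa(values: Sequence[float], weights: Sequence[float]) -> float:
    """Generic OWA aggregation (hypothetical helper, not taken from the paper).

    The weights are attached to positions in the descending ordering of
    the values, not to the values themselves.
    """
    if len(values) != len(weights):
        raise ValueError("values and weights must have the same length")
    if any(w < 0 for w in weights) or abs(sum(weights) - 1.0) > 1e-9:
        raise ValueError("weights must be non-negative and sum to 1")
    ordered = sorted(values, reverse=True)  # largest value first
    return sum(w * v for w, v in zip(weights, ordered))


# Illustrative scores only; they are not results from the paper.
scores = [0.2, 0.9, 0.5]
as_max = owa(scores, [1.0, 0.0, 0.0])   # all weight on the largest value
as_min = owa(scores, [0.0, 0.0, 1.0])   # all weight on the smallest value
as_mean = owa(scores, [1 / 3] * 3)      # uniform weights give the mean
```

Choosing the weight vector moves the aggregation between the minimum and the maximum of the inputs, with the arithmetic mean as the uniform-weight case. This flexibility is the usual reason for using an OWA operator to combine several scores into one.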

Notes

  1. https://www.filmaffinity.com/, last accessed March 2018.

Acknowledgements

This work has been partially supported by the “Plan Andaluz de Investigación, Junta de Andalucía” (Spain) under research project P10-TIC6019.

Author information

Correspondence to Úrsula Torres-Parejo.

Copyright information

© 2018 Springer International Publishing AG, part of Springer Nature

About this paper

Cite this paper

Torres-Parejo, Ú., Campaña, J.R., Vila, MA., Delgado, M. (2018). Metrics for Tag Cloud Evaluation. In: Medina, J., et al. Information Processing and Management of Uncertainty in Knowledge-Based Systems. Theory and Foundations. IPMU 2018. Communications in Computer and Information Science, vol 853. Springer, Cham. https://doi.org/10.1007/978-3-319-91473-2_25

  • DOI: https://doi.org/10.1007/978-3-319-91473-2_25

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-91472-5

  • Online ISBN: 978-3-319-91473-2

  • eBook Packages: Computer Science, Computer Science (R0)