Abstract
In this article we discuss how social tagging can be used to improve the methodology used for clustering evaluation. We analyze the impact of the integration of tags in the clustering process and its effectiveness. Following the semiotic theory, the own nature of tags allows the reflection of which ones should be considered depending on the interpretant (community of users, or tag writer). Using a case with the community of users as the interpretant, our novel clustering algorithm (k-C), which is based on community detection on a network of tags, was compared with the standard k-means algorithm. The results indicate that the k-C algorithm created more effective clusters.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Huang, A.W., Chuang, T.: Social tagging, online communication, and Peircean semiotics: a conceptual framework. J. Inf. Sci. 35, 340–357 (2009)
Cunha, E., Figueira, Á., Mealha, Ó.: Clustering documents using tagging communities and semantic proximity. In: 8th Iberian Conference on Information Systems and Technologies (CISTI), Lisboa, Portugal, vol. I, pp. 591–596 (2013)
Cunha, E., Figueira, Á., Mealha, Ó.: Clustering and classifying text documents - a revisit to tagging integration methods. In: 5th International Conference on Knowledge Discovery and information Retrieval (KDIR 2013), Vila Moura, Portugal, pp. 160–168 (2013)
Fortunato, S., Castellano, C.: Community structure in graphs. In: Encyclopedia of Complexity and Systems Science, pp. 1141–1163 (2009)
Girvan, M., Newman, M.E.J.: Community structure in social and biological networks. In: Proceedings of the National Academy of Science, no. 12, pp. 7821–7826 (2002)
Wakita, K., Tsurumi, T.: Finding community structure in mega-scale social networks: [extended abstract]. In: Proceedings of the 16th International Conference on World Wide Web, Banff, Alberta, Canada, pp. 1275–1276. ACM (2007)
Newman, M.E.J., Girvan, M.: Finding and evaluating community structure in networks. Phys. Rev. E 69(2), 026113 (2004)
Clauset, A., Newman, M.E.J., Moore, C.: Finding community structure in very large networks. Phys. Rev. E 70, 066111 (2004)
MacQueen, J.B.: Some methods for classification and analysis of multivariate. In: Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability, pp. 281–297. University of California Press (1967)
Arthur, D., Vassilvitskii, S.: k-means++: the advantages of careful seeding. In: Proceedings of the Eighteenth Annual ACM-SIAM Symposium on Discrete Algorithms, Society for Industrial and Applied Mathematics, New Orleans, Louisiana, pp. 1027–1035 (2007)
Feldman, R., Sanger, J.: The Text Mining Handbook Advanced Approaches in Analyzing Unstructured Data, 1st edn., p. 410. Cambridge University Press, Cambridge (2007)
Zhong, S.: Efficient online spherical k-means clustering. In: Prokhorov, D. (eds.) Proceeding of the IEEE International Joint Conference on Neural Networks (IJCNN 2005) (2005)
Demsar, J.: Statistical comparisons of classifiers over multiple data sets. J. Mach. Learn. Res. 7, 1–30 (2006)
Acknowledgements
This work is financed by National Funds through the Portuguese funding agency, FCT - Fundação para a Ciência e a Tecnologia within project: UID/EEA/50014.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Cunha, E., Figueira, Á. (2020). Contribution of Social Tagging to Clustering Effectiveness Using as Interpretant the User’s Community. In: Rocha, Á., Adeli, H., Reis, L., Costanzo, S., Orovic, I., Moreira, F. (eds) Trends and Innovations in Information Systems and Technologies. WorldCIST 2020. Advances in Intelligent Systems and Computing, vol 1159. Springer, Cham. https://doi.org/10.1007/978-3-030-45688-7_19
Download citation
DOI: https://doi.org/10.1007/978-3-030-45688-7_19
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-45687-0
Online ISBN: 978-3-030-45688-7
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)