Abstract
Building taxonomies for Web content manually is costly and timeconsuming. An alternative is to allow users to create folksonomies: collective social classifications. However, folksonomies have inconsistent structures and their use for searching and browsing is limited. Approaches have been proposed for acquiring implicit hierarchical structures from folksonomies, but these approaches suffer from the “generality-popularity” problem, in that they assume that popularity is a proxy for generality (that high level taxonomic terms will occur more often than low level ones). In this paper we test this assumption, and propose an improved approach (based on the Heymann-Benz algorithm) for tackling this problem by direction checking relations against a corpus of text. Our results show that popularity works as a proxy for generality in at most 77 of cases, but that this can be improved to 81% using our approach. This improvement will translate to higher quality tag hierarchy structures.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Vander Wal, T.: Folksonomy Coinage and Definition (2007), http://vanderwal.net/folksonomy.html
O’Reilly, T.: What is web 2.0: design patterns and business models for the next generation of software (2005), http://oreilly.com/web2/archive/what-is-web-20.html (June 20, 2013)
Gupta, M., Li, R., Yin, Z., Han, J.: An Overview of Social Tagging and Applications. In: Aggarwal, C. (ed.) Social Network Data Analytics, pp. 447–497. Springer, New York (2011)
Strohmaier, M., Helic, D., Benz, D., Körner, C., Kern, R.: Evaluation of Folksonomy Induction Algorithms. ACM Transactions on Intelligent Systems and Technology 3(4), Article 74 (2012)
Mathes, A.: Folksonomies-cooperative classification and communication through shared metadata. Computer Mediated Communication 47(10) (2004), http://adammathes.com/academic/computer-mediated-communication/folksonomies.pdf
Golder, S., Huberman, B.: Usage patterns of collaborative tagging systems. Journal of Information Science 32(2), 198–208 (2006)
Guy, M., Tonkin, E.: Tidying up tags. D-Lib Magazine 12(1) (January 2006), ISSN 1082-9873
Heymann, P., Garcia-Molinay, H.: Collaborative Creation of Communal Hierarchical Taxonomies in Social Tagging Systems. InfoLab Technical Report, Stanford (2006)
Solskinnsbakk, G., Gulla, J.: A Hybrid Approach to Constructing Tag Hierarchies. In: International conference on: On the move to meaningful internet systems: Part II, Hersonissos, Crete, Greece, pp. 975–982 (2010)
Benz, D., Hotho, A., Stutzer, S.: Semantics made by you and me: Self-emerging ontologies cancapture the diversity of shared knowledge. In: 2nd Web Science Conference (WebSci 2010), Raleigh, NC, USA (2010)
Begelman, G., Keller, P., Smadja, F.: Automated tag clustering: Improving search and exploration in the tag space. In: Collaborative Web Tagging Workshop at WWW 2006, Edinburgh, Scotland, pp.15-33 (2006)
Lin, H., Davis, J.: Computational and crowdsourcing methods for extracting ontological structure from folksonomy. In: 7th Extended Semantic Web Conference (ESWC 2010), Heraklion, Greece, pp.472-477 (2010)
Schmitz, P.: Inducing ontology from flickr tags. In: Collaborative Web Tagging Workshop at WWW 2006, Edinburgh, Scotland (2006)
Angeletou, S., Sabou, M., Specia, L., Motta, E.: Bridging the gap between folksonomies and the semantic web: An experience report. In : 4th European Semantic Web Conference (ESWC 2007), Innsbruck, Austria, pp.30-43 (2007)
Laniado, D., Eynard, D., Colombetti, M.: Using WordNet to turn a folksonomy into a hierarchy of concepts. In: 4th italian semantic web workshop: Semantic web application and perspectives, Bari, Italy, pp. 192–201 (2007)
Morrison, P.: Tagging and searching: Search retrieval effectiveness of folksonomies on the world wide web. Information Processing and Management 44(4), 1562–1579 (2008)
Berners-Lee, T., Hendler, J., Lassila, O.: The Semantic Web. Scientific American 284(5), 28–37 (2001)
Park, Y., Byrd, R., Boguraev, B.: Towards Ontologies On Demand. In: Workshop on Semantic Web Technologies for Searching and Retrieving Scientific Data (ISWC-03), Florida, USA (2003)
Mika, P.: Ontologies are us: A unified model of social networks and semantics. Web Semantics: Science, Services and Agents on the World Wide Web 5(1), 5–15 (2007)
Kiu, C.-C., Tsui, E.: TaxoFolk: a hybrid taxonomy–folksonomy classification for enhanced knowledge navigation. Knowledge Management Research & Practice 8(1), 24–32 (2010)
Zheng, H., Wu, X., Yu, Y.: Enriching WordNet with Folksonomies. In: Washio, T., Suzuki, E., Ting, K.M., Inokuchi, A. (eds.) PAKDD 2008. LNCS (LNAI), vol. 5012, pp. 1075–1080. Springer, Heidelberg (2008)
Van Damme, C., Hepp, M., Siorpaes, K.: Folksontology: An integrated approach for turning folksonomies into ontologies. In: ESWC Workshop Bridging the Gap between Semantic Web and Web 2, pp. 57–70. Innsbruck, Austria (2007)
Solskinnsbakk, G., Gulla, J.: Mining tag similarity in folksonomies. In: 3rd international workshop on Search and mining user-generated contents (SMUC 2011), Glasgow, Scotland, pp. 53–60 (2011)
Hearst, M.: Automatic acquisition of hyponyms from large text corpora. In: 14th Conference on Computational Linguistics, Morristown, NJ, USA, pp. 539–545 (1992)
Cimiano, P., Hotho, A., Staab, S.: Learning concept hierarchies from text corpora using formal concept analysis. Journal of Artificial Intelligence Research 24(1), 305–339 (2005)
Harris, Z.: Mathematical structures of language. John Wiley and Son (1968)
Faure, D., Nedellec, C.: A corpus-based conceptual clustering method for verb frames and ontology. In: The LREC Workshop on Adapting Lexical and Corpus Resources to Sublanguages and Applications, pp. 5–12 (1998)
Berland, M., Charniak, E.: Finding parts in very large corpora. In: the 37th Annual Meeting of the Association for Computational Linguistics (ACL), Stroudsburg, PA, USA, pp. 57–64 (1999)
Snow, R., Jurafsky, D., Ng., A.: Learning syntactic patterns for automatic hypernym discovery. In: The Eighteenth Annual Conference on Neural Information Processing Systems (NIPS 2004), Vancouver, Canada, vol. 17 (2004)
Hearst, M.: Automated discovery of wordnet relations. In: Fellbaum, C. (ed.) WordNet: An Electronic Lexical Database, MIT Press, Cambridge (1998)
Wu, H., Zubair, M., Maly, K.: Harvesting social knowledge from folksonomies. In: 17th Conference on Hypertext and Hypermedia, Odense, Denmark, pp. 111–114 (2006)
Hoser, B., Hotho, A., Jäschke, R., Schmitz, C., Stumme, G.: Semantic network analysis of ontologies. In: Sure, Y., Domingue, J. (eds.) ESWC 2006. LNCS, vol. 4011, pp. 514–529. Springer, Heidelberg (2006)
Schmitz, C., Hotho, A., Jäschke, R., Stumme, G.: Mining association rules in folksonomies. In: 10th IFCS Conference: Studies in Classification, Data Analysis and Knowledge Organization, Ljubljana, Slovenia, pp. 261–270 (2006)
Sanderson, M., Croft, B.: Deriving concept hierarchies from text. In: 22nd ACM Conference of the Special Interest Group in Information Retrieval, Berkeley, California, USA, pp. 206–213 (1999)
Schwarzkopf, E., Heckmann, D., Dengler, D., Kröner, A.: Mining the structure of tag spaces for user modeling. In: Workshop on Data Mining for User Modeling at the 11th International Conference on User Modeling, Corfu, Griechenland, pp. 63–75 (2007)
Hamasaki, M., Matsuo, Y., Nishimura, T., Takeda, H.: Ontology extraction using social network. In: International Workshop on the Semantic Web for Collaborative Knowledge Acquisition, Hyderabad, India (2007)
Plangprasopchok, A., Lerman, K., Getoor, L.: From saplings to a tree: Integrating structured metadata via relational affinity propagation. In: Proceedings of the AAAI Workshop on Statistical Relational AI, Menlo Park, CA, USA (2010)
Frey, B., Dueck, D.: Clustering by passing messages between data points. Science 315(5814), 972–976 (2007)
Angeletou, S., Sabou, M., Motta, E.: Semantically Enriching Folksonomies with FLOR. In: 1st International Workshop on Collective Semantics: Collective Intelligence & the Semantic Web (CISWeb 2008), Tenerife, Spain (2008)
Cantador, I., Szomszor, M., Alani, H., Fernández, M., Castells, P.: Enriching ontological user profiles with tagging history for multi-domain recommendations. In: 1st International Workshop on Collective Semantics: Collective Intelligence & the Semantic Web (CISWeb 2008), Tenerife, Spain (2008)
Tesconi, M., Ronzano, F., Marchetti, A., Minutoli, S.: Semantify del.icio.us: Automatically Turn your Tags into Senses. In: Social Data on the Web Workshop at the 7th International Semantic Web Conference, Karlsruhe, Germany (2008)
Garcia, A., Szomszor, M., Alani, H., Corcho, O.: Preliminary results in tag disambiguation using dbpedia. In: 1st International Workshop in Collective Knowledge Capturing and Representation, California, USA (2009)
Specia, L., Motta, E.: Integrating Folksonomies with the Semantic Web. In: 4th European Conference on The Semantic Web: Research and Applications, Innsbruck, Austria, pp. 624–639 (2007)
Giannakidou, E., Koutsonikola, V., Vakali, A., Kompatsiaris, Y.: Co-clustering tags and social data sources. In: 9th International Conference on Web-Age Information Management, Zhangjiajie, China, pp. 317–324 (2008)
Lin, H., Davis, J., Zhou, Y.: An integrated approach to extracting ontological structures from folksonomies. In: 6th European Semantic Web Conference, Heraklion, Greece, pp. 654–668 (2009)
Cimiano, P.: Ontology Learning and Population from Text: Algorithms, Evaluation and Applications, vol. 27. Springer (2006)
Plangprasopchok, A., Lerman, K., Getoor, L.: Growing a Tree in the Forest: Constructing Folksonomies by Integrating Structured Metadata. In: 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, pp. 949–958 (2010)
Plangprasopchok, A., Lerman, K.: Constructing Folksonomies from User-Specified Relations on Flickr. In: 18th International World Wide Web Conference, Madrid, Spain, pp. 781–790 (2009)
Almoqhim, F., Millard, D.E., Shadbolt, N.: An approach to building high-quality tag hierarchies from crowdsourced taxonomic tag pairs. In: Jatowt, A., Lim, E.-P., Ding, Y., Miura, A., Tezuka, T., Dias, G., Tanaka, K., Flanagin, A., Dai, B.T. (eds.) SocInfo 2013. LNCS, vol. 8238, pp. 129–138. Springer, Heidelberg (2013)
Porter, M.: An algorithm for suffix stripping. Program 14(3), 130–137 (1980)
Miller, G.: WordNet: a lexical database for English. Communications of the ACM 38(11), 39–41 (1995)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this chapter
Cite this chapter
Almoqhim, F., Millard, D.E., Shadbolt, N. (2014). Improving on Popularity as a Proxy for Generality When Building Tag Hierarchies from Folksonomies. In: Aiello, L.M., McFarland, D. (eds) Social Informatics. SocInfo 2014. Lecture Notes in Computer Science, vol 8851. Springer, Cham. https://doi.org/10.1007/978-3-319-13734-6_7
Download citation
DOI: https://doi.org/10.1007/978-3-319-13734-6_7
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-13733-9
Online ISBN: 978-3-319-13734-6
eBook Packages: Computer ScienceComputer Science (R0)