Instanced-Based Mapping between Thesauri and Folksonomies

Wartena, Christian; Brussee, Rogier

doi:10.1007/978-3-540-88564-1_23

Christian Wartena⁸ &
Rogier Brussee⁸

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 5318))

Included in the following conference series:

International Semantic Web Conference

2192 Accesses
7 Citations

Abstract

The emergence of web based systems in which users can annotate items, raises the question of the semantic interoperability between vocabularies originating from collaborative annotation processes, often called folksonomies, and keywords assigned in a more traditional way. If collections are annotated according to two systems, e.g. with tags and keywords, the annotated data can be used for instance based mapping between the vocabularies. The basis for this kind of matching is an appropriate similarity measure between concepts, based on their distribution as annotations. In this paper we propose a new similarity measure that can take advantage of some special properties of user generated metadata. We have evaluated this measure with a set of articles from Wikipedia which are both classified according to the topic structure of Wikipedia and annotated by users of the bookmarking service del.icio.us. The results using the new measure are significantly better than those obtained using standard similarity measures proposed for this task in the literature, i.e., it correlates better with human judgments. We argue that the measure also has benefits for instance based mapping of more traditionally developed vocabularies.

Download to read the full chapter text

Chapter PDF

Semantic Models for Interactive Systems: The Case of Tagging and Folksonomies

Towards a Web of Semantic Tags

Exploiting Wikipedia for Evaluating Semantic Relatedness Mechanisms

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

Golder, S.A., Huberman, B.A.: The structure of collaborative tagging systems. CoRR abs/cs/0508082 (2005)
Google Scholar
Noll, M.G., Meinel, C.: Authors vs. readers: a comparative study of document metadata and content in the www. In: King, P.R., Simske, S.J. (eds.) ACM Symposium on Document Engineering, pp. 177–186. ACM, New York (2007)
Google Scholar
Lux, M., Granitzer, M., Kern, R.: Aspects of broad folksonomies. In: DEXA Workshops, pp. 283–287. IEEE Computer Society, Los Alamitos (2007)
Google Scholar
Halpin, H., Robu, V., Shepherd, H.: The complex dynamics of collaborative tagging. In: WWW, pp. 211–220 (2007)
Google Scholar
Hotho, A., Jäschke, R., Schmitz, C., Stumme, G.: BibSonomy: A Social Bookmark and Publication Sharing System. In: Proceedings of the Conceptual Structures Tool Interoperability Workshop at the 14th International Conference on Conceptual Structures, pp. 87–102 (2006)
Google Scholar
Euzenat, J., Shvaiko, P.: Ontology matching. Springer, Heidelberg (DE) (2007)
MATH Google Scholar
Isaac, A., van der Meij, L., Schlobach, S., Wang, S.: An empirical study of instance-based ontology matching. In: Aberer, K., Choi, K.-S., Noy, N., Allemang, D., Lee, K.-I., Nixon, L., Golbeck, J., Mika, P., Maynard, D., Mizoguchi, R., Schreiber, G., Cudré-Mauroux, P. (eds.) ISWC 2007. LNCS, vol. 4825, pp. 253–266. Springer, Heidelberg (2007)
Chapter Google Scholar
Stumme, G., Maedche, A.: FCA-Merge: Bottom-up merging of ontologies. In: 7th Intl. Conf. on Artificial Intelligence (IJCAI 2001), pp. 225–230 (2001)
Google Scholar
Ponzetto, S.P., Strube, M.: Deriving a large-scale taxonomy from Wikipedia. In: AAAI, pp. 1440–1445. AAAI Press, Menlo Park (2007)
Google Scholar
Huijsen, W.O., Wartena, C., Brussee, R.: Learning ontologies from wikipedia for semantic annotation of texts. In: Proceedings of the 13th Knowledge Management Forum, Milano (November 2008) (to appear)
Google Scholar
Manning, C.D., Schütze, H.: Foundations of Statistical Natural Language Processing. The MIT Press, Cambridge, Massachusetts (1999)
MATH Google Scholar
Wartena, C., Brussee, R.: Topic detection by clustering keywords. In: DEXA Workshops. IEEE Computer Society, Los Alamitos (to appear, 2008)
Google Scholar
Landauer, T., Foltz, P., Laham, D.: Introduction to latent semantic analysis. Discourse Processes 25, 259–284 (1998)
Article Google Scholar
Li, H., Yamanishi, K.: Topic analysis using a finite mixture model. Inf. Process. Manage. 39(4), 521–541 (2003)
Article MATH Google Scholar
Fuglede, B., Topsoe, F.: Jensen-shannon divergence and hilbert space embedding. In: Proc. of the Internat. Symposium on Information Theory, p. 31 (2004)
Google Scholar
Melenhorst, M., Grootveld, M., Veenstra, M.: Tag-based information retrieval of educational videos. EBU Technical Review Q2 (2008), http://www.ebu.ch/en/technical/trev/trev_2008-Q2_social-tagging.pdf
Malaisé, V., Gazendam, L., Brugman, H.: Disambiguating automatic semantic annotation based on a thesaurus structure. In: Actes de la 14e conférence sur le Traitement Automatique des Langues Naturelles, pp. 197–206 (2007)
Google Scholar

Download references

Author information

Authors and Affiliations

Telematica Instituut, P.O. Box 589, 7500 AN, Enschede, The Netherlands
Christian Wartena & Rogier Brussee

Authors

Christian Wartena
View author publications
You can also search for this author in PubMed Google Scholar
Rogier Brussee
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Science and Engineering, Wright State University, Colonel Glenn Way 3640, 454350001, Dayton, USA
Amit Sheth
Institut für Informatik, Universität Koblenz-Landau, Universitätsstr. 1, 56016, Koblenz, Germany
Steffen Staab
BBN Technologies, 48103, Ann Arbor, USA
Mike Dean
DoCoMo Communications Laboratories Europe GmbH, 80687, Munich, Germany
Massimo Paolucci
Department of Computer Science, University of Sheffield, Regent Court, 211 Portobello Street, S1 4DP, Sheffield, UK
Diana Maynard
CSEE Department, UMBC, 1000 Hilltop Circle, MD 21250, Baltimore, USA
Timothy Finin
Department of Computer Science and Engineering, Wright State University, 3640 Colonel Glenn Highway, OH 45435, Dayton, USA
Krishnaprasad Thirunarayan

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Wartena, C., Brussee, R. (2008). Instanced-Based Mapping between Thesauri and Folksonomies. In: Sheth, A., et al. The Semantic Web - ISWC 2008. ISWC 2008. Lecture Notes in Computer Science, vol 5318. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-88564-1_23

Download citation

DOI: https://doi.org/10.1007/978-3-540-88564-1_23
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-88563-4
Online ISBN: 978-3-540-88564-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Instanced-Based Mapping between Thesauri and Folksonomies

Abstract

Chapter PDF

Similar content being viewed by others

Semantic Models for Interactive Systems: The Case of Tagging and Folksonomies

Towards a Web of Semantic Tags

Exploiting Wikipedia for Evaluating Semantic Relatedness Mechanisms

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Instanced-Based Mapping between Thesauri and Folksonomies

Abstract

Chapter PDF

Similar content being viewed by others

Semantic Models for Interactive Systems: The Case of Tagging and Folksonomies

Towards a Web of Semantic Tags

Exploiting Wikipedia for Evaluating Semantic Relatedness Mechanisms

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation