Abstract
Hierarchical classifications, thesauri, and informal taxonomies are likely the most valuable input for creating, at reasonable cost, non-toy ontologies in many domains. They contain, readily available, a wealth of category definitions plus a hierarchy, and they reflect some degree of community consensus. However, their transformation into useful ontologies is not as straightforward as it appears. In this paper, we show that (1) it often depends on the context of usage whether an informal hierarchical categorization schema is a classification, a thesaurus, or a taxonomy, and (2) present a novel methodology for automatically deriving consistent RDF-S and OWL ontologies from such schemas. Finally, we (3) demonstrate the usefulness of this approach by transforming the two e-business categorization standards eCl@ss and UNSPSC into ontologies that overcome the limitations of earlier prototypes. Our approach allows for the script-based creation of meaningful ontology classes for a particular context while preserving the original hierarchy, even if the latter is not a real subsumption hierarchy in this particular context. Human intervention in the transformation is limited to checking some conceptual properties and identifying frequent anomalies, and the only input required is an informal categorization plus a notion of the target context. In particular, the approach does not require instance data, as ontology learning approaches would usually do.
Chapter PDF
Similar content being viewed by others
Keywords
References
United Nations Development Programme,United Nations Standard Products and Services Code (UNSPSC), available at http://www.unspsc.org/ (retrieved March 15, 2007)
eClass e.V., eCl@ss: Standardized Material and Service Classification, available at http://www.eclass-online.com/ (retrieved March 15, 2007)
Hepp, M., Leukel, J., Schmitz, V.: A Quantitative Analysis of Product Categorization Standards: Content, Coverage, and Maintenance of eCl@ss, UNSPSC, eOTD, and the RosettaNet Technical Dictionary. Knowledge and Information Systems (forthcoming)
McGuinness, D.L.: UNSPSC Ontology in DAML+OIL, available at http://www.ksl.stanford.edu/projects/DAML/UNSPSC.daml (retrieved March 15, 2007)
Klein, M.: DAML+OIL and RDF Schema representation of UNSPSC, available at http://www.cs.vu.nl/~mcaklein/unspsc/ (retrieved March 15, 2007)
Hepp, M.: Products and Services Ontologies: A Methodology for Deriving OWL Ontologies from Industrial Categorization Standards. Int’l Journal on Semantic Web & Information Systems (IJSWIS) 2, 72–99 (2006)
Hepp, M.: Representing the Hierarchy of Industrial Taxonomies in OWL: The gen/tax Approach. In: Proceedings of the ISWC Workshop Semantic Web Case Studies and Best Practices for eBusiness (SWCASE05), Galway, Irland (2005)
van Assem, M., Menken, M.R., Schreiber, G., Wielemaker, J., Wielinga, B.J.: A Method for Converting Thesauri to RDF/OWL. In: McIlraith, S.A., Plexousakis, D., van Harmelen, F. (eds.) ISWC 2004. LNCS, vol. 3298, pp. 17–31. Springer, Heidelberg (2004)
Giunchiglia, F., Marchese, M., Zaihrayeu, I.: Towards a Theory of Formal Classification. In: Proceedings of the AAAI-05 Workshop on Contexts and Ontologies: Theory, Practice and Applications (C&O-2005), Pittsburgh, Pennsylvania, USA (2005)
Kiryakov, A., Ognyanov, D., Manov, D.: OWLIM - a Pragmatic Semantic Repository for OWL. In: Proceedings of the International Workshop on Scalable Semantic Web Knowledge Base Systems (SSWS 2005), New York City, USA (2005)
Brachman, R.J.: What IS-A Is and Isn’t: An Analysis of Taxonomic Links in Semantic Networks. IEEE Computer 16, 30–36 (1983)
Rector, A.L., Wroe, C., Rogers, J., Roberts, A.: Untangling Taxonomies and Relationships: Personal and Practical Problems in Loosely Coupled Development of Large Ontologies. In: Proceedings of the K-CAP’01, Victoria, British Columbia, Canada (2001)
Kashyap, V., Borgida, A.: Representing the UMLS Semantic Network using OWL. In: Fensel, D., Sycara, K.P., Mylopoulos, J. (eds.) ISWC 2003. LNCS, vol. 2870, pp. 1–16. Springer, Heidelberg (2003)
Zhang, D., Lee, W.S.: Learning to integrate web taxonomies. Journal of Web Semantics 2, 131–151 (2004)
Wielinga, B.J., Schreiber, A.T., Sandberg, J.A.C.: From Thesaurus to Ontology. In: Proceedings of the First International Conference on Knowledge Capture (K-CAP 2001), Victoria, British Columbia, Canada (2001)
Wielinga, B.J., Wielemaker, J., Schreiber, G., van Assem, M.: Methods for Porting Resources to the Semantic Web. In: Bussler, C.J., Davies, J., Fensel, D., Studer, R. (eds.) ESWS 2004. LNCS, vol. 3053, pp. 299–311. Springer, Heidelberg (2004)
Giunchiglia, F., Marchese, M., Zaihrayeu, I.: Encoding Classifications into Lightweight Ontologies. In: Sure, Y., Domingue, J. (eds.) ESWC 2006. LNCS, vol. 4011, pp. 80–94. Springer, Heidelberg (2006)
Bizer, C., Wolk, J.: RDF Version of the eClass 4.1 Product Classification Schema, available at http://www.wiwiss.fu-berlin.de/suhl/bizer/ecommerce/eClass-4.1.rdf (retrieved March 15, 2007)
Hepp, M.: eCl@ssOWL. The Products and Services Ontology, available at http://www.heppnetz.de/eclassowl/ (retrieved March 15, 2007)
Fernández-López, M., Gómez-Pérez, A.: Overview and analysis of methodologies for building ontologies. The Knowledge Engineering Review 17, 129–156 (2002)
de Bruijn, J.: Using Ontologies. Enabling Knowledge Sharing and Reuse on the Semantic Web. DERI Technical Report DERI-2003-10-29, pp. 1–49 (October 2003)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer Berlin Heidelberg
About this paper
Cite this paper
Hepp, M., de Bruijn, J. (2007). GenTax: A Generic Methodology for Deriving OWL and RDF-S Ontologies from Hierarchical Classifications, Thesauri, and Inconsistent Taxonomies. In: Franconi, E., Kifer, M., May, W. (eds) The Semantic Web: Research and Applications. ESWC 2007. Lecture Notes in Computer Science, vol 4519. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-72667-8_11
Download citation
DOI: https://doi.org/10.1007/978-3-540-72667-8_11
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-72666-1
Online ISBN: 978-3-540-72667-8
eBook Packages: Computer ScienceComputer Science (R0)