Journal on Data Semantics, Volume 3, Issue 1, pp 47–73

Assessing and Improving the Quality of SKOS Vocabularies

Original Article

DOI: 10.1007/s13740-013-0026-0

Cite this article as:
Suominen, O. & Mader, C. J Data Semant (2014) 3: 47. doi:10.1007/s13740-013-0026-0

Abstract

Controlled vocabularies are increasingly made available on the Web of Data using the Simple Knowledge Organization System (SKOS) ontology. Assessment of vocabulary quality is important for determining the suitability of vocabularies for reuse in applications and for improving vocabulary development processes. We define 26 quality issues, i.e., computable functions that expose potential quality problems. In an analysis of a representative set of 24 SKOS vocabularies, we found all of them to contain structural errors and/or other quality problems. We propose a set of correction heuristics which we have used to automatically correct a significant proportion of the identified problems. Our reference implementations of these methods, the quality assessment tool qSKOS and the quality improvement tool Skosify, are available for reuse as open-source software.

Keywords

Controlled vocabularies, Linked Data, Semantic Web, Quality assessment, Data quality

Copyright information

© Springer-Verlag Berlin Heidelberg 2013

Authors and Affiliations

  1. Semantic Computing Research Group, Department of Media Technology, Aalto University, Espoo, Finland
  2. Multimedia Information Systems Group, Faculty of Computer Science, University of Vienna, Vienna, Austria
