Linked Open Data: Uncertainty in Equivalence of Properties
Linked Open Data (LOD) is a graph-based repository of data that uses a data representation format called the Resource Description Framework (RDF). The basic unit of RDF data is a subject-property-object triple. Seen as a network of interconnected pieces of data, LOD creates an environment suitable for developing methods that enable learning processes relying on data integration. Applying frequency-based approaches to data integration leads to the identification of pieces of information that are consistent and frequently used. An essential element of such methods is the ability to identify similar pieces of data. In practice, multiple sources of information use different vocabularies to represent the relations (properties) that exist between data, which poses a challenge for data integration methods.
In this paper, we propose a simple approach to determine degrees of equivalence between relations (properties) defined by different LOD vocabularies. We process the numbers of occurrences of matching pairs of RDF triples to determine intervals representing lower and upper levels of property equivalence. As a result, we obtain a graph of equivalent properties in which the interval-based strengths of edges represent degrees of similarity between properties. A case study illustrating the details of the approach and a validation experiment are included.
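To make the idea of counting matching triple pairs concrete, the following is a minimal sketch and not the formulas used in the paper, which the abstract does not state. It assumes that subjects and objects have already been aligned to shared identifiers across the two sources, that two triples "match" when they share the same subject and object, and that the lower and upper levels can be approximated by dividing the match count by the larger and the smaller of the two properties' pair counts. The function name `equivalence_intervals` and the toy triples are hypothetical.

```python
from collections import defaultdict
from itertools import product


def equivalence_intervals(triples_a, triples_b):
    """Return {(p, q): (lower, upper)} for property p in source A and q in source B.

    Assumption (not from the paper): lower = matches / max(|p|, |q|) and
    upper = matches / min(|p|, |q|), where |p| is the number of distinct
    (subject, object) pairs connected by property p.
    """
    # Group the (subject, object) pairs of each source by property.
    pairs_a = defaultdict(set)
    for s, p, o in triples_a:
        pairs_a[p].add((s, o))
    pairs_b = defaultdict(set)
    for s, p, o in triples_b:
        pairs_b[p].add((s, o))

    intervals = {}
    for p, q in product(pairs_a, pairs_b):
        # A match is a (subject, object) pair connected by p in A and by q in B.
        matches = len(pairs_a[p] & pairs_b[q])
        if matches == 0:
            continue  # no evidence of equivalence, so no edge in the graph
        size_p, size_q = len(pairs_a[p]), len(pairs_b[q])
        intervals[(p, q)] = (matches / max(size_p, size_q),
                             matches / min(size_p, size_q))
    return intervals


# Made-up triples, for illustration only.
triples_a = [
    ("Berlin", "dbo:country", "Germany"),
    ("Paris", "dbo:country", "France"),
    ("Rome", "dbo:country", "Italy"),
]
triples_b = [
    ("Berlin", "wdt:P17", "Germany"),
    ("Paris", "wdt:P17", "France"),
    ("Berlin", "wdt:P131", "Germany"),
]

intervals = equivalence_intervals(triples_a, triples_b)
for (p, q), (lower, upper) in sorted(intervals.items()):
    print(f"{p} ~ {q}: [{lower:.2f}, {upper:.2f}]")
```

Each key of the returned dictionary can be read as an edge of the property graph, and its interval as the edge strength. Under these assumptions a wide interval signals that the two properties overlap but differ considerably in how often they are used.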
Keywords: RDF data, Property equivalence, Possibility theory