Minimizing conservativity violations in ontology alignments: algorithms and evaluation

Solimando, Alessandro; Jiménez-Ruiz, Ernesto; Guerrini, Giovanna

doi:10.1007/s10115-016-0983-3

Minimizing conservativity violations in ontology alignments: algorithms and evaluation

Regular Paper
Published: 27 August 2016

Volume 51, pages 775–819, (2017)
Cite this article

Knowledge and Information Systems Aims and scope Submit manuscript

Alessandro Solimando ORCID: orcid.org/0000-0001-6748-7755¹,
Ernesto Jiménez-Ruiz² &
Giovanna Guerrini¹

502 Accesses
26 Citations
1 Altmetric
Explore all metrics

Abstract

In order to enable interoperability between ontology-based systems, ontology matching techniques have been proposed. However, when the generated mappings lead to undesired logical consequences, their usefulness may be diminished. In this paper, we present an approach to detect and minimize the violations of the so-called conservativity principle where novel subsumption entailments between named concepts in one of the input ontologies are considered as unwanted. The practical applicability of the proposed approach is experimentally demonstrated on the datasets from the Ontology Alignment Evaluation Initiative.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Detecting and Correcting Conservativity Principle Violations in Ontology-to-Ontology Mappings

Opening the Black Box of Ontology Matching

An Extensible Linear Approach for Holistic Ontology Matching

Notes

http://oaei.ontologymatching.org/.
Alignments from UMLS are extracted according to the method defined in [36].
This paper was presented in a workshop without formal proceedings.
We exclude disjointness from the semantic relations given that most of the available systems do not compute this relation. Negative constraints are typically harder to identify and assess than positive ones [18].
This DL is less expressive than \(\mathcal {SROIQ}\), the underlying DL of OWL 2.
We assume that \(\mathsf {diff}_{\Sigma }^{\approx }({\mathcal {O}}_i, {\mathcal {O}}^{\emptyset }_{{\mathcal {O}}_1,{\mathcal {O}}_2})=\emptyset \).
Note that in this paper, we only target the mappings in the repair process and we consider the input ontologies as immutable. Other approaches like Pesquita et al. [62] question the automatic generation of repairs and suggest to update the ontologies, when necessary, to avoid violations.
In our setting, A is required to be antireflexive as we disallow self-arcs.
http://www.optique-project.eu.
http://factpages.npd.no/factpages/.
Optique uses OWL 2 QL ontologies for query rewriting, while the query formulation may be based on much richer OWL 2 ontologies. The axioms that fall outside the OWL 2 QL profile are either approximated or not considered for the rewriting.
Although several proposals for graph formalisms for representing DL ontologies exist in the literature (e.g., [58]), we provided a simplified variant specifically tailored to capture equivalence violations.
We use the syntax of Lparse 1.0, a parser for logic programs used as a front-end by different logic programming solvers, more details at http://www.tcs.hut.fi/Software/smodels/.
Although alternative frameworks could have been employed (e.g., Constraint Logic Programming), we have adopted ASP as it is known to be well suited for graph-related problems and to produce compact and easy-to-understand solutions [10].
In [73], this algorithm is referred to as CycleBreaker.
For sake of space, the algorithm, which directly follows from Definition 4.2, is omitted. The interested reader can find it in [73], Algorithm 12.
Note that, as for the case of EqRepair, we do not compute the classification of \({\mathcal {O}}_1 \cup {\mathcal {O}}_2 \cup \mathcal {M}\) since this will prevent our extension of D&G to identify and record the mappings involved in an unsatisfiability (i.e., a subsumption violation).
In scenarios where the confidence of the mapping is missing (e.g., in reference or manually created mapping sets) or unreliable, our mapping repair technique computes fresh confidence values based on the locality principle [36].
The complete source code of the proposed algorithms and the performed experiments is available at https://github.com/asolimando/logmap-conservativity/.
The test environment consisted of a desktop computer equipped with 32GB DDR 3 RAM at 1333MHz, and an AMD Fusion FX 4350 (quad-core, each running at 4.2GHz) as CPU. The dataset is stored on a 128GB SSD, where the operating system Ubuntu (12.04, 64-bit version) is also installed. Our prototype can run with less than 8GB for the majority of the considered tests; we, however, allocate 26GB of RAM for the JVM in order to minimize the influence of the garbage collector on the recorded temporal measurements.
Note that the reference mappings of the OAEI campaign are already coherent w.r.t. the test case ontologies, and thus, the consistency repair step was not necessary.
The interested reader please refer to Section 4.6.4 in [73] for an overview of these approaches.
http://www.cancer.gov/cancertopics/cancerlibrary/terminologyresources.
Each agent is equipped with a local (private) OWL 2 ontology.

References

Agrawal R, Borgida A, Jagadish HV (1989) Efficient management of transitive relationships in large data and knowledge bases. In: ACM SIGMOD Conference on Management of Data, pp 253–262
Arnold P, Rahm E (2013) Semantic enrichment of ontology mappings: a linguistic-based approach. In: Advances in Databases and Information System - East European Conference (ADBIS), pp 42–55
Baader F, Horrocks I, Sattler U (2008) Chapter 3 description logics. In: Bruce Porter Frank van Harmelen, Vladimir Lifschitz (eds) Handbook of Knowledge Representation, vo 3 of Foundations of Artificial Intelligence, Elsevier, pp 135–179
Beisswanger E, Hahn U (2012) Towards valid and reusable reference alignments: ten basic quality checks for ontology alignments and their application to three different reference data sets. J Biomed Semant 3(Suppl 1):S4
Article Google Scholar
Bodenreider O (2004) The unified medical language system (UMLS): integrating biomedical terminology. Nucleic Acids Res 32:267–270
Article Google Scholar
Borgida A, Serafini L (2003) Distributed description logics: assimilating information from peer sources. J Data Semant 1:153–184
MATH Google Scholar
Christophides V, Plexousakis D, Scholl M, Tourtounis S (2003) On labeling schemes for the semantic web. In: International World Wide Web Conference (WWW), pp 544–555
Cuenca Grau B, Horrocks I, Kazakov Y, Sattler U (2008) Modular reuse of ontologies. J Artif Intell Res (JAIR) 31:273–318
MathSciNet MATH Google Scholar
David J, Euzenat J, Scharffe F, Trojahn C (2011) The alignment API 4.0. Semant Web J 2(1):3–10
Google Scholar
Dovier A, Formisano A, Pontelli E (2007) An experimental comparison of constraint logic programming and answer set programming. AAAI 7:1622–1625
MATH Google Scholar
Dowling WF, Gallier JH (1984) Linear-time algorithms for testing the satisfiability of propositional Horn formulae. J Log Program 1(3):267–284
Article MathSciNet MATH Google Scholar
DuyHoa N, Bellahsene Z (2013) YAM++ results for OAEI 2013. In: Ontology Matching Workshop (OM), pp 211–218
Euzenat J (2015) Revision in networks of ontologies. Artif Intell 228:195–216
Article MathSciNet MATH Google Scholar
Euzenat J, Meilicke C, Stuckenschmidt H, Shvaiko P, Trojahn C (2011) Ontology alignment evaluation initiative: six years of experience. J Data Semant 15:158–192
Article Google Scholar
Euzenat J, Shvaiko P (2010) Ontology matching. Springer, Heidelberg
MATH Google Scholar
Even G, Naor JS, Schieber B, Sudan M (1998) Approximating minimum feedback sets and multicuts in directed graphs. Algorithmica 20(2):151–174
Article MathSciNet MATH Google Scholar
Faria D, Pesquita C, Santos E, Palmonari M, Cruz Isabel F, Couto FM (2013) The agreementmakerlight ontology matching system. In: OTM Conferences, pp 527–541
Ferré S, Rudolph S (2012) Advocatus diaboli–exploratory enrichment of ontologies with negative constraints. In: International Conference on Knowledge Engineering (EKAW), pp 42–56
Fleischhacker D, Völker J (2011) Inductive learning of disjointness axioms. In: OTM Conferences, pp 680–697
Galinier P, Lemamou E, Bouzidi MW (2013) Applying local search to the feedback Vertex set problem. J Heur 1–22
Gallo G, Urbani G (1989) Algorithms for testing the satisfiability of propositional formulae. J Log Program 7(1):45–61
Article MathSciNet MATH Google Scholar
Giese M, Soylu A, Vega-Gorgojo G, Waaler A, Haase P, Jiménez-Ruiz E, Lanti D, Rezk M, Xiao G, Özçep Ö, Rosati R (2015) Optique: zooming in on big data. IEEE Comput 48(3):60–67
Article Google Scholar
Glimm B, Horrocks I, Motik B, Stoilos G, Wang Z (2014) Hermit: an OWL 2 reasoner. J Autom Reason 53(3):245–269
Article MATH Google Scholar
Golbeck J, Fragoso G, Hartel FW, Hendler JA, Oberthaler J, Parsia B (2003) The national cancer institute’s thésaurus and ontology. J Web Semant 1(1):75–80
Article Google Scholar
Gonçalves RS, Parsia B, Sattler U (2012) Concept-based semantic difference in expressive description logics. In: International Semantic Web Conference (ISWC), Springer, pp 99–115
Grau BC, Horrocks I, Kazakov Y, Sattler U (2007) Just the right amount: extracting modules from ontologies. In: International Conference on World Wide Web (WWW), pp 717–726. ACM
Grau BC, Horrocks I, Motik B, Parsia B, Patel-Schneider PF, Sattler U (2008) OWL 2: the next step for OWL. J Web Semant 6(4):309–322
Article Google Scholar
Horridge M (2011) Justification based explanation in ontologies. PhD thesis, University of Manchester
Horridge M, Parsia B, Sattler U (2008) Laconic and precise justifications in OWL. In: International Semantic Web Conference (ISWC), pp 323–338
Horrocks I, Kutz O, Sattler U (2006) The even more Irresistible SROIQ. In: International Conference on Principles of Knowledge Representation and Reasoning (KR), pp 57–67
Ivanova V, Lambrix P (2013) A unified approach for aligning taxonomies and debugging taxonomies and their alignments. In: European Semantic Web Conference (ESWC), Springer, pp 1–15
Jean-Mary Yves R, Patrick Shironoshita E, Kabuka Mansur R (2009) Ontology matching with semantic verification. J Web Semant 7(3):235–251
Article Google Scholar
Jiménez-Ruiz E, Grau BC (2011) LogMap: logic-based and scalable ontology matching. In: International Semantic Web Conference (ISWC), pp 273–288,
Jiménez-Ruiz E, Grau BC, Horrocks I (2012) On the feasibility of using OWL 2 DL reasoners for ontology matching problems. In: OWL Reasoner Evaluation Workshop (ORE)
Jiménez-Ruiz E, Grau BC, Horrocks I, Berlanga R (2009) Ontology integration using mappings: towards getting the right logical consequences. In: European Semantic Web Conference (ESWC), pp 173–187
Jiménez-Ruiz E, Grau BC, Horrocks I, Berlanga R (2011) Logic-based assessment of the compatibility of UMLS ontology sources. J Biomed Semant 2(Suppl 1):S2
Article Google Scholar
Jiménez-Ruiz E, Grau BC, Zhou Y, Horrocks I (2012) Large-scale interactive ontology matching: algorithms and implementation. In: European Conference on Artificial Intelligence (ECAI), pp 444–449
Jiménez-Ruiz E, Kharlamov E, Zheleznyakov D, Horrocks I, Pinkel C, Skjæveland MG, Thorstensen E, Mora J (2015) BootOX: practical mapping of RDBs to OWL 2. In: International Semantic Web Conference (ISWC), pp 113–132
Jiménez-Ruiz E, Meilicke C, Grau BC, Horrocks I (2013) Evaluating mapping repair systems with large biomedical ontologies. In: Description Logics (DL), pp 246–257
Jiménez-Ruiz E, Payne TR, Solimando A, Tamma V (2015) Avoiding alignment-based conservativity violations through dialogue. In: International Workshop on OWL: Experiences and Directions (OWLED)
Jiménez-Ruiz E, Payne TR, Solimando A, Tamma VAM (2016) Limiting logical violations in ontology alignment through negotiation. In: Principles of Knowledge Representation and Reasoning: Proceedings of the Fifteenth International Conference (KR), pp 217–226
Kalyanpur A, Parsia B, Horridge M, Sirin E (2007) Finding all justifications of OWL DL entailments. In: International Semantic Web Conference (ISWC), pp 267–280
Kazakov Y, Krötzsch M, Simancik F (2014) The incredible ELK - from polynomial procedures to efficient reasoning with \({\cal EL}\) ontologies. J Autom Reason 53(1):1–61
Article MathSciNet MATH Google Scholar
Kharlamov E, Hovland D, Jiménez-Ruiz E, Lanti D, Lie H, Pinkel C, Rezk M, Skjæveland MG, Zheleznyakov E, Horrocks I (2015) Ontology based access to exploration data at statoil. In: International Semantic Web Conference (ISWC), pp 93–112
Kharlamov E, Solomakhina N, Özçep ÖL, Zheleznyakov D, Hubauer T, Lamparter S, Roshchin M, Soylu A, Watson S (2014) How semantic technologies can enhance data access at siemens energy. In: International Semantic Web Conference (ISWC), pp 601–619
Konev B, Walther D, Wolter F (2008) The logical difference problem for description logic terminologies. In: International Joint Conference on Automated Reasoning (IJCAR), pp 259–274
Kontchakov R, Wolter F, Zakharyaschev M (20084) Can you tell the difference between DL-Lite ontologies?. In: International Conference on Principles of Knowledge Representation and Reasoning (KR)
Lambrix P, Dragisic Z, Ivanova V (2013) Get my pizza right: repairing missing is-a relations in \({\cal ALC}\) ontologies. In: Semantic Technology, Springer, pp 17–32
Lambrix P, Liu Q (2013) Debugging the missing is-a structure within taxonomies networked by partial reference alignments. Data Knowl Eng (DKE) 86:179–205
Article Google Scholar
Lambrix P, Wei-Kleiner F, Dragisic Z, Ivanova V (2013) Repairing missing is-a structure in ontologies is an abductive reasoning problem. In: International Workshop on Debugging Ontologies and Ontology Mappings (WoDOOM), pp 33
Lutz C, Walther D, Wolter F (2007) Conservative extensions in expressive description logics. Int Jt Conf Artif Intell (IJCAI) 7:453–458
Google Scholar
Lutz C, Wolter F (2010) Deciding inseparability and conservative extensions in the description logic EL. J Symb Comput 45(2):194–228
Article MATH Google Scholar
Mascardi V, Ancona D, Barbieri M, Bordini RH, Ricci A (2014) CooL-AgentSpeak: Endowing Agentspeak-DL agents with plan exchange and ontology services. Web Intell Agent Syst 12(1):83–107
Google Scholar
Meilicke C (2011) Alignments incoherency in ontology matching. PhD thesis, University of Mannheim
Meilicke C, Stuckenschmidt H, Tamilin A (2009) Reasoning support for mapping revision. J Log Comput 19(5):807–829
Article MathSciNet MATH Google Scholar
Meilicke C, Völker J, Stuckenschmidt H (2008) Learning disjointness for debugging mappings between lightweight ontologies. In: International Conference on Knowledge Engineering (EKAW), pp 93–108
Melnik S, Garcia-Molina H, Rahm E (2002) Similarity flooding: a versatile graph matching algorithm and its application to schema matching. In: IEEE International Conference on Data Engineering (ICDE), pp 117–128
Motik B, Grau BC, Horrocks I, Sattler U (2009) Representing ontologies using description logics, description graphs, and rules. Artif Intell J 173(14):1275–1309
Article MathSciNet MATH Google Scholar
Nebot V, Berlanga R (2009) Efficient retrieval of ontology fragments using an interval labeling scheme. Inf Sci J 179(24):4151–4173
Article Google Scholar
Payne TR, Tamma V (2014) A Dialectical approach to selectively reusing ontological correspondences. In: Knowledge Engineering and Knowledge Management (EKAW), Springer, pp 397–412
Payne TR, Tamma V (2014) Negotiating over ontological correspondences with asymmetric and incomplete knowledge. In: International Conference on Autonomous Agents and Multi-Agent Systems, (AAMAS), pp 517–524
Pesquita C, Faria D, Santos E, Couto FM (2013) To repair or not to repair: reconciling correctness and coherence in ontology reference alignments. In: Ontology Matching Workshop (OM), pp 13–24
Raymond R (1987) A theory of diagnosis from first principles. Artif Intell J 32(1):57–59
Article MathSciNet MATH Google Scholar
Rodriguez-Muro M, Rezk M (2015) Efficient SPARQL-to-SQL with R2RML mappings. J Web Semant 33:141–169
Article Google Scholar
Rosse C, Mejino JLV Jr (2003) A reference ontology for biomedical informatics: the foundational model of anatomy. J Biomed Inform 36(6):478–500
Article Google Scholar
Santos E, Faria D, Pesquita C, Couto F (2013) Ontology alignment repair through modularization and confidence-based heuristics. arXiv:1307.5322 preprint
Schlobach S (2005) Debugging and semantic clarification by pinpointing. In: European Semantic Web Conference (ESWC), Springer, pp 226–240
Schlobach S, Cornet R (2003) Non-standard reasoning services for the debugging of description logic terminologies. In: International Joint Conference on Artificial Intelligence (IJCAI), pp 355–362
Schulz S, Cornet R, Spackman KA (2011) Consolidating SNOMED CT’s ontological commitment. Appl Ontol 6(1):1–11
Google Scholar
Sesen MB, Bañares-Alcántara R, Fox J, Kadir T, Brady JM (2012) Lung Cancer Assistant: an ontology-driven, online decision support prototype. In: International Workshop on OWL: Experiences and Directions (OWLED)
Shvaiko P, Euzenat J (2012) Ontology matching: state of the art and future challenges. Trans Knowl Data Eng (TKDE) 25(1):158–176
Article Google Scholar
Skjæveland MG, Lian EH, Horrocks I (2013) Publishing the Norwegian Petroleum directorate’s FactPages as semantic eeb data. In: International Semantic Web Conference (ISWC), pp 162–177
Solimando A (2015) Change Management in the Traditional and Semantic Web. PhD thesis, University of Genoa. https://github.com/asolimando/logmap-conservativity/raw/master/SolimandoA-thesis.pdf
Solimando A, Jiménez-Ruiz E, Guerrini G (2014) A multi-strategy approach for detecting and correcting conservativity principle violations in ontology alignments. In: International Workshop on OWL: Experiences and Directions (OWLED), pp 13–24
Solimando A, Jiménez-Ruiz E, Guerrini G (2014) Detecting and correcting conservativity principle violations in ontology-to-ontology mappings. In: International Semantic Web Conference (ISWC), pp 1–16
Solimando A, Jiménez-Ruiz E, Guerrini G (2015) On the feasibility of using OWL 2 reasoners in ontology alignment repair problems. In: OWL Reasoner Evaluation Workshop (ORE), pp 60–67
Solimando A, Jiménez-Ruiz E, Guerrini G (2016) Pushing the limits of OWL 2 reasoners in ontology alignment repair problems. Intell Artif 10:1–18
Article Google Scholar
Solimando A, Jiménez-Ruiz E, Pinkel C (2014) Evaluating ontology alignment systems in query answering tasks. In: International Semantic Web Posters & Demonstrations Track (ISWC), pp 301–304
Soylu A, Giese M, Jimenez-Ruiz E, Vega-Gorgojo G, Horrocks I (2015) Experiencing OptiqueVQS: a multi-paradigm and ontology-based visual query system for end users. Univers Access Inf Soc 15(1):129–152
Article Google Scholar
Suntisrivaraporn B, Qi G, Ji Q, Haase P (2008) A modularization-based approach to finding all justifications for OWL DL entailments. In: Asian Semantic Web Conference (ASWC), pp 1–15
Tarjan R (1972) Depth-first search and linear graph algorithms. SIAM J Comput 1(2):146–160
Article MathSciNet MATH Google Scholar
Völker J, Vrandecic D, Sure Y, Hotho A (2007) Learning disjointness. In: European Semantic Web Conference (ESWC), pp 175–189
Šváb O, Svátek V, Berka P, Rak D, Tomášek P (2005) Ontofarm: towards an experimental collection of parallel ontologies. In: International Semantic Web Conference (ISWC). Poster Session
W3C as Hitzler P, Krötzsch M, Parsia B, Patel-Schneider PF, Rudolph S (2009) OWL 2 Web Ontology language primer. http://www.w3.org/TR/owl2-primer/
Wang P, Xu B (2012) Debugging ontology mappings: a static approach. Comput Inform 27(1):21–36
MathSciNet Google Scholar
Zhang S, Mork P, Bodenreider O (2004) Lessons learned from aligning two representations of anatomy. In: International Conference on Principles of Knowledge Representation and Reasoning (KR)
Zimmermann A, Euzenat J (2006) Three semantics for distributed systems and their relations with alignment composition. In: International Semantic Web Conference (ISWC). Springer, pp 16–29

Download references

Acknowledgments

Ernesto Jiménez-Ruiz was funded by the European Commission under FP7 Grant Agreement 318338, “Optique”, and the EPSRC projects Score! ED3 and DBOnto. We also thank the invaluable help provided by Bernardo Cuenca and Ian Horrocks. We are also very grateful for the support of the Optique colleagues that facilitated our understanding of the domain, especially Dag Hovland, Evgeny Kharlamov, Dmitry Zheleznyakov, Martin Giese and Martin G. Skjæveland. Finally, we would also like to thank the anonymous reviewers of this paper.

Author information

Authors and Affiliations

DIBRIS, Informatica, Bioingegneria, Robotica e Ingegneria dei Sistemi, University of Genova, Via Dodecaneso 35, 16146, Genova, Italy
Alessandro Solimando & Giovanna Guerrini
Department of Computer Science, University of Oxford, Oxford, UK
Ernesto Jiménez-Ruiz

Authors

Alessandro Solimando
View author publications
You can also search for this author in PubMed Google Scholar
Ernesto Jiménez-Ruiz
View author publications
You can also search for this author in PubMed Google Scholar
Giovanna Guerrini
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Alessandro Solimando.

Appendix

This section investigates the computational complexity of diagnosis computation (“Diagnosis Computation Complexity” in section “Appendix”) and its decomposability into subproblems (“Decomposability of equivalence violations diagnosis computation” in section “Appendix”).

1.1 Diagnosis computation complexity

With the aim of proving that MAP-WFES is NP-hard, Proposition 10.1 introduces a polynomial reduction from WFES to MAP-WFES, denoted as WFES \(\preceq \) MAP-WFES. The intuitive idea behind the reduction is the following. Each arc \((t,v,c)\) of the original graph is “split” into two arcs \((t,u,c)\) and \((u,v,c)\), with u a fresh node. All the nodes t, v are associated with one of the input ontology, while the fresh nodes are associated with the other. In this way, all the arcs are mappings (i.e., all of them are potentially removable, exactly as the original arcs). It is easy to see that the reduction preserves the solution weight and that a 1–1 correspondence exists between cycles in the two graphs. In addition, we remark that MAP-WFES does not break cycles traversing only vertices of one of the input ontologies. No such cycles can exist because all the arcs are mappings, as discussed above. A reduction example is given in Example 10.1, followed by the definition of the reduction in Proposition 10.1.

Example 10.1

In Fig. 11, graphs G (left) and \(G'\) (right) are shown. WFES \(\preceq \) MAP-WFES coincides with \(G'\). The solution to \(G'\) is equal to \(\Delta = \{ (c,cd,1), (b,bg,0.1), (gf,f,0.4), (a,af,0.2) \} \), with a total weight of 1.7. \(D= \{ (b,g,0.1), (g,f,0.4),(a,f,0.2), (c,d,1) \} \) is the corresponding solution to the instance of the WFES problem represented by G and can be easily verified that is both minimal (having weight 1.7) and correct.

Proposition 10.1

WFES \(\preceq \) MAP-WFES. A polynomial reduction from the WFES problem to the MAP-WFES problem exists. Let \(G=(V,A)\) be a digraph. The reduction consists in constructing a digraph \(G'=(V',A')\) such that a subset of edges, namely \(\Delta \subseteq A\), is a solution to MAP-WFES iff the corresponding set of arcs, namely D, is a solution to WFES on G. The reduction is as follows:

1.
for each \((x,y,c) \in A\), we create a fresh vertex \(v_{xy}\), we add it to \(V_2'\), and we create a pair of arcs \((x,v_{xy},c), (v_{xy},y,c)\) that are added to \(A'\) and \(\mathcal {M}\),
2.
\(V_1' = V\) and \(V' = V_1' \cup V_2'\).

A set of arcs, namely \(\Delta \subseteq A'\), is a solution to the MAP-WFES problem on digraph \(G'\) iff the corresponding feedback edge set D is a solution to the WFES G, where \(G'\) is computed from G, and for each arc of the form \((x,v_{xy},c)\) or \((v_{xy},y,c)\) in \(\Delta \), we have a corresponding arc \((x,y,c)\) in D.

Proof

In order to prove the correctness of the reduction, we need to show that, if G \(\preceq \) \(G'\), with G an instance of the WFES problem and \(G'\) an instance of the MAP-WFES problem, a set of arcs \(\Delta \) is a (minimal) solution to \(G'\) iff the corresponding set of arcs D is a (minimal) solution to G. As discussed in [16], the proposed reduction is polynomial and it preserves graph connectivity and the weight of the solutions, by preserving a 1–1 correspondence between cycles of G and those of \(G'\), due to the 1–1 correspondency between the arcs of A and those of \(A'\).

\(\Rightarrow \): If D is a solution to WFES on G, \(\Delta \) is a solution to MAP-WFES on \(G'\). Suppose that \(\Delta \) is not a solution. This requires that either at least a cycle \(\kappa '\) exists in digraph \((V',A'{\setminus }\Delta )\) or that a diagnosis \(\Delta '\) exists such that \(w(\Delta ') < w(\Delta )\). For the first case, given the 1–1 correspondence between cycles of G and \(G'\), this implies that a corresponding cycle in G exists as well, thus contradicting that D is a solution to the instance of the WFES problem represented by G. For the latter case, given the 1–1 correspondence between arcs of G and \(G'\), this implies that a solution \(D'\) corresponding to \(\Delta '\) exists. By the weight preservation property of the reduction, \(w(D') < w(D)\) holds, contradicting that D is a (minimal) solution to the instance of the WFES problem represented by G.

\(\Leftarrow \): If \(\Delta \) is a solution to MAP-WFES on \(G'\), D is a solution to WFES on G. Suppose that D is not. This requires that either a cycle \(\kappa \) of G exists in digraph \((V,A {\setminus } D)\) or that a solution \(D'\) exists such that \(w(D') < w(D)\). The first case requires the existence of a cycle \(\kappa '\) of \(G'\) (corresponding to \(\kappa \)) that is left unbroken. In turn, this either violates that \(\Delta \) is a solution, or that, for some \(i \in \{1,2\} \), \(\kappa '\) exclusively traverses elements of \(V'_i\). This situation is excluded by construction of \(G'\), because no arcs between vertices of the same subset \(V'_i\) of \(V'\) exist. For the latter case, this implies that a diagnosis \(\Delta '\) corresponding to the (minimal) solution \(D'\) does not exist. This requires that at least an element of \(D'\) cannot belong to a diagnosis (i.e., it cannot be removed). By construction of \(G'\), we have that \(\mathcal {M} = A'\). Given that only elements of \(A' {\setminus } \mathcal {M}\) cannot belong to a diagnosis for the MAP-WFES problem, this results in a contradiction, thus proving the correctness of the reduction. \(\square \)

From the results of Proposition 10.1, it follows that MAP-WFES is NP-hard, as detailed in Proposition 10.2.

Proposition 10.2

MAP-WFES is NP-hard.

Proof

The proof follows from the polynomial reduction from the WFES problem, that is NP-hard [16], to MAP-WFES. \(\square \)

1.2 Decomposability of equivalence violations diagnosis computation

Proposition 10.3 relates unsafe cycles and problematic SCCs, showing that each unsafe cycle results in a problematic SCC.

Proposition 10.3

A SCC is problematic iff it (totally) contains at least one unsafe cycle.

Proof

\(\Rightarrow \): Consider a problematic SCC S, with projections \(\Pi _1\) and \(\Pi _2\) on the input ontologies. From problematic SCC definition (Definition 4.6), at least one of the projections of S, say \(\Pi _1\), is not a local SCC. We therefore also know that \(\Pi _1\) is not a SCC; otherwise, it would also be a local SCC. Suppose that \(\Pi _1\) is a subset of a SCC \(\Pi '\). This implies that all the elements of \(\Pi '\) belongs to S as well, and therefore, \(\Pi '\) and \(\Pi _1\) are identical, but this contradicts the assumption that \(\Pi _1\) is not a SCC.

\(\Leftarrow :\) from the definition of cycle and SCC, each cycle \(\kappa \) is contained in a SCC S (i.e., \(\kappa \subseteq S\)). Let \(\kappa \) be an unsafe cycle, and let also \(\kappa _1\) (resp. \(\kappa _2\)) be the subset of vertices of \(\kappa \) belonging to an input ontology \({\mathcal {O}}_1\) (resp. \({\mathcal {O}}_2\)). By definition of unsafe cycle (Definition 4.3), at least one of this subsets, say \(\kappa _1\), is not contained in any local SCC. But given that \(\kappa \subseteq S\), \(\kappa _1 \subseteq \Pi _{{\mathcal {O}}_1}(S)\) holds. Therefore, \(\Pi _{{\mathcal {O}}_1}(S)\) is not contained in any local SCC either and, by Definition 4.6, S is a problematic SCC. \(\square \)

Proposition 10.3 guarantees completeness for a detection technique for violations to the conservativity principle on a graph representation of an aligned ontology, based on problematic SCCs. Given that all the violations result in unsafe cycles and that they totally belong to a single problematic SCC, completeness for a repair technique breaking all the unsafe cycles follows.

Notice also that a (unsafe) cycle always belongs to one and only one (problematic) SCC (as expressed by Proposition 10.4), while a problematic SCC may contain more than one cycle. Therefore, a technique detecting problematic SCCs may be more efficient than one directly addressing unsafe cycles.

Proposition 10.4

Safe cycle never traverse multiple SCCs of the same input ontology.

Proof

By Definition 4.3, all the vertices belonging to a projection \(\Pi \) of a safe cycle \(\kappa ^s\) need to be traversed by a cycle \(\kappa '\) in the input ontology these vertices belong to. The claim is that cycle \(\kappa '\) identifies either a SCC of the aligned ontology or a subset of a SCC. Assume that vertices of \(\Pi \) belong to at least two SCCs \(S_1, S_2\), that is, \(\Pi \cap S_1 \ne \emptyset \) and \(\Pi \cap S_2 \ne \emptyset \). Being traversed by a cycle, all the vertices of \(\Pi \) are mutually reachable. Then, from transitivity of reachability, it follows that all the vertices in \(S_1 \cup S_2\) are mutually reachable. This contradicts the hypothesis that \(S_1\) and \(S_2\) are two distinct SCCs, thus proving the proposition. Such argument be can straightforwardly generalized to more than two SCCs. \(\square \)

Proposition 10.5 proves the correctness of our approach and the optimality of the computed (global) diagnosis.

Proposition 10.5

Computing a (global) diagnosis for a graph G, representing an aligned ontology, can be reduced to computing the (local) diagnoses for the problematic SCCs of G. The (minimal) global diagnosis is the union of the (minimal) local diagnoses.

Proof

From Proposition 10.3, it follows that: (i) all and only problematic SCCs contain unsafe cycles, (ii) an unsafe cycle does not traverse vertices of more than one SCC (i.e., the unsafe cycles of distinct SCCs are totally disjoint). From (i), completeness follows (it is sufficient to compute a diagnosis for each problematic SCCs to remove all the unsafe cycles in the aligned ontology). (ii) ensures the independence of SCCs, and this guarantees minimality and correctness for local diagnoses computed in isolation. Finally, it is immediate to see that the minimality property is preserved by the union of local diagnoses, and this concludes the proposition. \(\square \)

Proposition 10.5 thus guarantees that the global diagnosis computed as the union of the diagnoses of the problematic SCCs is both minimal and correct (that is, it breaks all the unsafe cycles).

Rights and permissions

Reprints and permissions

About this article

Cite this article

Solimando, A., Jiménez-Ruiz, E. & Guerrini, G. Minimizing conservativity violations in ontology alignments: algorithms and evaluation. Knowl Inf Syst 51, 775–819 (2017). https://doi.org/10.1007/s10115-016-0983-3

Download citation

Received: 20 January 2016
Revised: 28 July 2016
Accepted: 17 August 2016
Published: 27 August 2016
Issue Date: June 2017
DOI: https://doi.org/10.1007/s10115-016-0983-3

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Minimizing conservativity violations in ontology alignments: algorithms and evaluation

Abstract

Access this article

Similar content being viewed by others

Detecting and Correcting Conservativity Principle Violations in Ontology-to-Ontology Mappings

Opening the Black Box of Ontology Matching

An Extensible Linear Approach for Holistic Ontology Matching

Notes

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Appendix

1.1 Diagnosis computation complexity

Example 10.1

Proposition 10.1

Proof

Proposition 10.2

Proof

1.2 Decomposability of equivalence violations diagnosis computation

Proposition 10.3

Proof

Proposition 10.4

Proof

Proposition 10.5

Proof

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Minimizing conservativity violations in ontology alignments: algorithms and evaluation

Abstract

Access this article

Similar content being viewed by others

Detecting and Correcting Conservativity Principle Violations in Ontology-to-Ontology Mappings

Opening the Black Box of Ontology Matching

An Extensible Linear Approach for Holistic Ontology Matching

Notes

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Appendix

Appendix

1.1 Diagnosis computation complexity

Example 10.1

Proposition 10.1

Proof

Proposition 10.2

Proof

1.2 Decomposability of equivalence violations diagnosis computation

Proposition 10.3

Proof

Proposition 10.4

Proof

Proposition 10.5

Proof

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation