Skip to main content
Log in

Minimizing conservativity violations in ontology alignments: algorithms and evaluation

  • Regular Paper
  • Published:
Knowledge and Information Systems Aims and scope Submit manuscript

Abstract

In order to enable interoperability between ontology-based systems, ontology matching techniques have been proposed. However, when the generated mappings lead to undesired logical consequences, their usefulness may be diminished. In this paper, we present an approach to detect and minimize the violations of the so-called conservativity principle where novel subsumption entailments between named concepts in one of the input ontologies are considered as unwanted. The practical applicability of the proposed approach is experimentally demonstrated on the datasets from the Ontology Alignment Evaluation Initiative.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10

Similar content being viewed by others

Notes

  1. http://oaei.ontologymatching.org/.

  2. Alignments from UMLS are extracted according to the method defined in [36].

  3. This paper was presented in a workshop without formal proceedings.

  4. We exclude disjointness from the semantic relations given that most of the available systems do not compute this relation. Negative constraints are typically harder to identify and assess than positive ones [18].

  5. This DL is less expressive than \(\mathcal {SROIQ}\), the underlying DL of OWL 2.

  6. We assume that \(\mathsf {diff}_{\Sigma }^{\approx }({\mathcal {O}}_i, {\mathcal {O}}^{\emptyset }_{{\mathcal {O}}_1,{\mathcal {O}}_2})=\emptyset \).

  7. Note that in this paper, we only target the mappings in the repair process and we consider the input ontologies as immutable. Other approaches like Pesquita et al. [62] question the automatic generation of repairs and suggest to update the ontologies, when necessary, to avoid violations.

  8. In our setting, A is required to be antireflexive as we disallow self-arcs.

  9. http://www.optique-project.eu.

  10. http://factpages.npd.no/factpages/.

  11. Optique uses OWL 2 QL ontologies for query rewriting, while the query formulation may be based on much richer OWL 2 ontologies. The axioms that fall outside the OWL 2 QL profile are either approximated or not considered for the rewriting.

  12. Although several proposals for graph formalisms for representing DL ontologies exist in the literature (e.g., [58]), we provided a simplified variant specifically tailored to capture equivalence violations.

  13. We use the syntax of Lparse 1.0, a parser for logic programs used as a front-end by different logic programming solvers, more details at http://www.tcs.hut.fi/Software/smodels/.

  14. Although alternative frameworks could have been employed (e.g., Constraint Logic Programming), we have adopted ASP as it is known to be well suited for graph-related problems and to produce compact and easy-to-understand solutions [10].

  15. In [73], this algorithm is referred to as CycleBreaker.

  16. For sake of space, the algorithm, which directly follows from Definition 4.2, is omitted. The interested reader can find it in [73], Algorithm 12.

  17. Note that, as for the case of EqRepair, we do not compute the classification of \({\mathcal {O}}_1 \cup {\mathcal {O}}_2 \cup \mathcal {M}\) since this will prevent our extension of D&G to identify and record the mappings involved in an unsatisfiability (i.e., a subsumption violation).

  18. In scenarios where the confidence of the mapping is missing (e.g., in reference or manually created mapping sets) or unreliable, our mapping repair technique computes fresh confidence values based on the locality principle [36].

  19. The complete source code of the proposed algorithms and the performed experiments is available at https://github.com/asolimando/logmap-conservativity/.

  20. The test environment consisted of a desktop computer equipped with 32GB DDR 3 RAM at 1333MHz, and an AMD Fusion FX 4350 (quad-core, each running at 4.2GHz) as CPU. The dataset is stored on a 128GB SSD, where the operating system Ubuntu (12.04, 64-bit version) is also installed. Our prototype can run with less than 8GB for the majority of the considered tests; we, however, allocate 26GB of RAM for the JVM in order to minimize the influence of the garbage collector on the recorded temporal measurements.

  21. Note that the reference mappings of the OAEI campaign are already coherent w.r.t. the test case ontologies, and thus, the consistency repair step was not necessary.

  22. The interested reader please refer to Section 4.6.4 in [73] for an overview of these approaches.

  23. http://www.cancer.gov/cancertopics/cancerlibrary/terminologyresources.

  24. Each agent is equipped with a local (private) OWL 2 ontology.

References

  1. Agrawal R, Borgida A, Jagadish HV (1989) Efficient management of transitive relationships in large data and knowledge bases. In: ACM SIGMOD Conference on Management of Data, pp 253–262

  2. Arnold P, Rahm E (2013) Semantic enrichment of ontology mappings: a linguistic-based approach. In: Advances in Databases and Information System - East European Conference (ADBIS), pp 42–55

  3. Baader F, Horrocks I, Sattler U (2008) Chapter 3 description logics. In: Bruce Porter Frank van Harmelen, Vladimir Lifschitz (eds) Handbook of Knowledge Representation, vo 3 of Foundations of Artificial Intelligence, Elsevier, pp 135–179

  4. Beisswanger E, Hahn U (2012) Towards valid and reusable reference alignments: ten basic quality checks for ontology alignments and their application to three different reference data sets. J Biomed Semant 3(Suppl 1):S4

    Article  Google Scholar 

  5. Bodenreider O (2004) The unified medical language system (UMLS): integrating biomedical terminology. Nucleic Acids Res 32:267–270

    Article  Google Scholar 

  6. Borgida A, Serafini L (2003) Distributed description logics: assimilating information from peer sources. J Data Semant 1:153–184

    MATH  Google Scholar 

  7. Christophides V, Plexousakis D, Scholl M, Tourtounis S (2003) On labeling schemes for the semantic web. In: International World Wide Web Conference (WWW), pp 544–555

  8. Cuenca Grau B, Horrocks I, Kazakov Y, Sattler U (2008) Modular reuse of ontologies. J Artif Intell Res (JAIR) 31:273–318

    MathSciNet  MATH  Google Scholar 

  9. David J, Euzenat J, Scharffe F, Trojahn C (2011) The alignment API 4.0. Semant Web J 2(1):3–10

    Google Scholar 

  10. Dovier A, Formisano A, Pontelli E (2007) An experimental comparison of constraint logic programming and answer set programming. AAAI 7:1622–1625

    MATH  Google Scholar 

  11. Dowling WF, Gallier JH (1984) Linear-time algorithms for testing the satisfiability of propositional Horn formulae. J Log Program 1(3):267–284

    Article  MathSciNet  MATH  Google Scholar 

  12. DuyHoa N, Bellahsene Z (2013) YAM++ results for OAEI 2013. In: Ontology Matching Workshop (OM), pp 211–218

  13. Euzenat J (2015) Revision in networks of ontologies. Artif Intell 228:195–216

    Article  MathSciNet  MATH  Google Scholar 

  14. Euzenat J, Meilicke C, Stuckenschmidt H, Shvaiko P, Trojahn C (2011) Ontology alignment evaluation initiative: six years of experience. J Data Semant 15:158–192

    Article  Google Scholar 

  15. Euzenat J, Shvaiko P (2010) Ontology matching. Springer, Heidelberg

    MATH  Google Scholar 

  16. Even G, Naor JS, Schieber B, Sudan M (1998) Approximating minimum feedback sets and multicuts in directed graphs. Algorithmica 20(2):151–174

    Article  MathSciNet  MATH  Google Scholar 

  17. Faria D, Pesquita C, Santos E, Palmonari M, Cruz Isabel F, Couto FM (2013) The agreementmakerlight ontology matching system. In: OTM Conferences, pp 527–541

  18. Ferré S, Rudolph S (2012) Advocatus diaboli–exploratory enrichment of ontologies with negative constraints. In: International Conference on Knowledge Engineering (EKAW), pp 42–56

  19. Fleischhacker D, Völker J (2011) Inductive learning of disjointness axioms. In: OTM Conferences, pp 680–697

  20. Galinier P, Lemamou E, Bouzidi MW (2013) Applying local search to the feedback Vertex set problem. J Heur 1–22

  21. Gallo G, Urbani G (1989) Algorithms for testing the satisfiability of propositional formulae. J Log Program 7(1):45–61

    Article  MathSciNet  MATH  Google Scholar 

  22. Giese M, Soylu A, Vega-Gorgojo G, Waaler A, Haase P, Jiménez-Ruiz E, Lanti D, Rezk M, Xiao G, Özçep Ö, Rosati R (2015) Optique: zooming in on big data. IEEE Comput 48(3):60–67

    Article  Google Scholar 

  23. Glimm B, Horrocks I, Motik B, Stoilos G, Wang Z (2014) Hermit: an OWL 2 reasoner. J Autom Reason 53(3):245–269

    Article  MATH  Google Scholar 

  24. Golbeck J, Fragoso G, Hartel FW, Hendler JA, Oberthaler J, Parsia B (2003) The national cancer institute’s thésaurus and ontology. J Web Semant 1(1):75–80

    Article  Google Scholar 

  25. Gonçalves RS, Parsia B, Sattler U (2012) Concept-based semantic difference in expressive description logics. In: International Semantic Web Conference (ISWC), Springer, pp 99–115

  26. Grau BC, Horrocks I, Kazakov Y, Sattler U (2007) Just the right amount: extracting modules from ontologies. In: International Conference on World Wide Web (WWW), pp 717–726. ACM

  27. Grau BC, Horrocks I, Motik B, Parsia B, Patel-Schneider PF, Sattler U (2008) OWL 2: the next step for OWL. J Web Semant 6(4):309–322

    Article  Google Scholar 

  28. Horridge M (2011) Justification based explanation in ontologies. PhD thesis, University of Manchester

  29. Horridge M, Parsia B, Sattler U (2008) Laconic and precise justifications in OWL. In: International Semantic Web Conference (ISWC), pp 323–338

  30. Horrocks I, Kutz O, Sattler U (2006) The even more Irresistible SROIQ. In: International Conference on Principles of Knowledge Representation and Reasoning (KR), pp 57–67

  31. Ivanova V, Lambrix P (2013) A unified approach for aligning taxonomies and debugging taxonomies and their alignments. In: European Semantic Web Conference (ESWC), Springer, pp 1–15

  32. Jean-Mary Yves R, Patrick Shironoshita E, Kabuka Mansur R (2009) Ontology matching with semantic verification. J Web Semant 7(3):235–251

    Article  Google Scholar 

  33. Jiménez-Ruiz E, Grau BC (2011) LogMap: logic-based and scalable ontology matching. In: International Semantic Web Conference (ISWC), pp 273–288,

  34. Jiménez-Ruiz E, Grau BC, Horrocks I (2012) On the feasibility of using OWL 2 DL reasoners for ontology matching problems. In: OWL Reasoner Evaluation Workshop (ORE)

  35. Jiménez-Ruiz E, Grau BC, Horrocks I, Berlanga R (2009) Ontology integration using mappings: towards getting the right logical consequences. In: European Semantic Web Conference (ESWC), pp 173–187

  36. Jiménez-Ruiz E, Grau BC, Horrocks I, Berlanga R (2011) Logic-based assessment of the compatibility of UMLS ontology sources. J Biomed Semant 2(Suppl 1):S2

    Article  Google Scholar 

  37. Jiménez-Ruiz E, Grau BC, Zhou Y, Horrocks I (2012) Large-scale interactive ontology matching: algorithms and implementation. In: European Conference on Artificial Intelligence (ECAI), pp 444–449

  38. Jiménez-Ruiz E, Kharlamov E, Zheleznyakov D, Horrocks I, Pinkel C, Skjæveland MG, Thorstensen E, Mora J (2015) BootOX: practical mapping of RDBs to OWL 2. In: International Semantic Web Conference (ISWC), pp 113–132

  39. Jiménez-Ruiz E, Meilicke C, Grau BC, Horrocks I (2013) Evaluating mapping repair systems with large biomedical ontologies. In: Description Logics (DL), pp 246–257

  40. Jiménez-Ruiz E, Payne TR, Solimando A, Tamma V (2015) Avoiding alignment-based conservativity violations through dialogue. In: International Workshop on OWL: Experiences and Directions (OWLED)

  41. Jiménez-Ruiz E, Payne TR, Solimando A, Tamma VAM (2016) Limiting logical violations in ontology alignment through negotiation. In: Principles of Knowledge Representation and Reasoning: Proceedings of the Fifteenth International Conference (KR), pp 217–226

  42. Kalyanpur A, Parsia B, Horridge M, Sirin E (2007) Finding all justifications of OWL DL entailments. In: International Semantic Web Conference (ISWC), pp 267–280

  43. Kazakov Y, Krötzsch M, Simancik F (2014) The incredible ELK - from polynomial procedures to efficient reasoning with \({\cal EL}\) ontologies. J Autom Reason 53(1):1–61

    Article  MathSciNet  MATH  Google Scholar 

  44. Kharlamov E, Hovland D, Jiménez-Ruiz E, Lanti D, Lie H, Pinkel C, Rezk M, Skjæveland MG, Zheleznyakov E, Horrocks I (2015) Ontology based access to exploration data at statoil. In: International Semantic Web Conference (ISWC), pp 93–112

  45. Kharlamov E, Solomakhina N, Özçep ÖL, Zheleznyakov D, Hubauer T, Lamparter S, Roshchin M, Soylu A, Watson S (2014) How semantic technologies can enhance data access at siemens energy. In: International Semantic Web Conference (ISWC), pp 601–619

  46. Konev B, Walther D, Wolter F (2008) The logical difference problem for description logic terminologies. In: International Joint Conference on Automated Reasoning (IJCAR), pp 259–274

  47. Kontchakov R, Wolter F, Zakharyaschev M (20084) Can you tell the difference between DL-Lite ontologies?. In: International Conference on Principles of Knowledge Representation and Reasoning (KR)

  48. Lambrix P, Dragisic Z, Ivanova V (2013) Get my pizza right: repairing missing is-a relations in \({\cal ALC}\) ontologies. In: Semantic Technology, Springer, pp 17–32

  49. Lambrix P, Liu Q (2013) Debugging the missing is-a structure within taxonomies networked by partial reference alignments. Data Knowl Eng (DKE) 86:179–205

    Article  Google Scholar 

  50. Lambrix P, Wei-Kleiner F, Dragisic Z, Ivanova V (2013) Repairing missing is-a structure in ontologies is an abductive reasoning problem. In: International Workshop on Debugging Ontologies and Ontology Mappings (WoDOOM), pp 33

  51. Lutz C, Walther D, Wolter F (2007) Conservative extensions in expressive description logics. Int Jt Conf Artif Intell (IJCAI) 7:453–458

    Google Scholar 

  52. Lutz C, Wolter F (2010) Deciding inseparability and conservative extensions in the description logic EL. J Symb Comput 45(2):194–228

    Article  MATH  Google Scholar 

  53. Mascardi V, Ancona D, Barbieri M, Bordini RH, Ricci A (2014) CooL-AgentSpeak: Endowing Agentspeak-DL agents with plan exchange and ontology services. Web Intell Agent Syst 12(1):83–107

    Google Scholar 

  54. Meilicke C (2011) Alignments incoherency in ontology matching. PhD thesis, University of Mannheim

  55. Meilicke C, Stuckenschmidt H, Tamilin A (2009) Reasoning support for mapping revision. J Log Comput 19(5):807–829

    Article  MathSciNet  MATH  Google Scholar 

  56. Meilicke C, Völker J, Stuckenschmidt H (2008) Learning disjointness for debugging mappings between lightweight ontologies. In: International Conference on Knowledge Engineering (EKAW), pp 93–108

  57. Melnik S, Garcia-Molina H, Rahm E (2002) Similarity flooding: a versatile graph matching algorithm and its application to schema matching. In: IEEE International Conference on Data Engineering (ICDE), pp 117–128

  58. Motik B, Grau BC, Horrocks I, Sattler U (2009) Representing ontologies using description logics, description graphs, and rules. Artif Intell J 173(14):1275–1309

    Article  MathSciNet  MATH  Google Scholar 

  59. Nebot V, Berlanga R (2009) Efficient retrieval of ontology fragments using an interval labeling scheme. Inf Sci J 179(24):4151–4173

    Article  Google Scholar 

  60. Payne TR, Tamma V (2014) A Dialectical approach to selectively reusing ontological correspondences. In: Knowledge Engineering and Knowledge Management (EKAW), Springer, pp 397–412

  61. Payne TR, Tamma V (2014) Negotiating over ontological correspondences with asymmetric and incomplete knowledge. In: International Conference on Autonomous Agents and Multi-Agent Systems, (AAMAS), pp 517–524

  62. Pesquita C, Faria D, Santos E, Couto FM (2013) To repair or not to repair: reconciling correctness and coherence in ontology reference alignments. In: Ontology Matching Workshop (OM), pp 13–24

  63. Raymond R (1987) A theory of diagnosis from first principles. Artif Intell J 32(1):57–59

    Article  MathSciNet  MATH  Google Scholar 

  64. Rodriguez-Muro M, Rezk M (2015) Efficient SPARQL-to-SQL with R2RML mappings. J Web Semant 33:141–169

    Article  Google Scholar 

  65. Rosse C, Mejino JLV Jr (2003) A reference ontology for biomedical informatics: the foundational model of anatomy. J Biomed Inform 36(6):478–500

    Article  Google Scholar 

  66. Santos E, Faria D, Pesquita C, Couto F (2013) Ontology alignment repair through modularization and confidence-based heuristics. arXiv:1307.5322 preprint

  67. Schlobach S (2005) Debugging and semantic clarification by pinpointing. In: European Semantic Web Conference (ESWC), Springer, pp 226–240

  68. Schlobach S, Cornet R (2003) Non-standard reasoning services for the debugging of description logic terminologies. In: International Joint Conference on Artificial Intelligence (IJCAI), pp 355–362

  69. Schulz S, Cornet R, Spackman KA (2011) Consolidating SNOMED CT’s ontological commitment. Appl Ontol 6(1):1–11

    Google Scholar 

  70. Sesen MB, Bañares-Alcántara R, Fox J, Kadir T, Brady JM (2012) Lung Cancer Assistant: an ontology-driven, online decision support prototype. In: International Workshop on OWL: Experiences and Directions (OWLED)

  71. Shvaiko P, Euzenat J (2012) Ontology matching: state of the art and future challenges. Trans Knowl Data Eng (TKDE) 25(1):158–176

    Article  Google Scholar 

  72. Skjæveland MG, Lian EH, Horrocks I (2013) Publishing the Norwegian Petroleum directorate’s FactPages as semantic eeb data. In: International Semantic Web Conference (ISWC), pp 162–177

  73. Solimando A (2015) Change Management in the Traditional and Semantic Web. PhD thesis, University of Genoa. https://github.com/asolimando/logmap-conservativity/raw/master/SolimandoA-thesis.pdf

  74. Solimando A, Jiménez-Ruiz E, Guerrini G (2014) A multi-strategy approach for detecting and correcting conservativity principle violations in ontology alignments. In: International Workshop on OWL: Experiences and Directions (OWLED), pp 13–24

  75. Solimando A, Jiménez-Ruiz E, Guerrini G (2014) Detecting and correcting conservativity principle violations in ontology-to-ontology mappings. In: International Semantic Web Conference (ISWC), pp 1–16

  76. Solimando A, Jiménez-Ruiz E, Guerrini G (2015) On the feasibility of using OWL 2 reasoners in ontology alignment repair problems. In: OWL Reasoner Evaluation Workshop (ORE), pp 60–67

  77. Solimando A, Jiménez-Ruiz E, Guerrini G (2016) Pushing the limits of OWL 2 reasoners in ontology alignment repair problems. Intell Artif 10:1–18

    Article  Google Scholar 

  78. Solimando A, Jiménez-Ruiz E, Pinkel C (2014) Evaluating ontology alignment systems in query answering tasks. In: International Semantic Web Posters & Demonstrations Track (ISWC), pp 301–304

  79. Soylu A, Giese M, Jimenez-Ruiz E, Vega-Gorgojo G, Horrocks I (2015) Experiencing OptiqueVQS: a multi-paradigm and ontology-based visual query system for end users. Univers Access Inf Soc 15(1):129–152

    Article  Google Scholar 

  80. Suntisrivaraporn B, Qi G, Ji Q, Haase P (2008) A modularization-based approach to finding all justifications for OWL DL entailments. In: Asian Semantic Web Conference (ASWC), pp 1–15

  81. Tarjan R (1972) Depth-first search and linear graph algorithms. SIAM J Comput 1(2):146–160

    Article  MathSciNet  MATH  Google Scholar 

  82. Völker J, Vrandecic D, Sure Y, Hotho A (2007) Learning disjointness. In: European Semantic Web Conference (ESWC), pp 175–189

  83. Šváb O, Svátek V, Berka P, Rak D, Tomášek P (2005) Ontofarm: towards an experimental collection of parallel ontologies. In: International Semantic Web Conference (ISWC). Poster Session

  84. W3C as Hitzler P, Krötzsch M, Parsia B, Patel-Schneider PF, Rudolph S (2009) OWL 2 Web Ontology language primer. http://www.w3.org/TR/owl2-primer/

  85. Wang P, Xu B (2012) Debugging ontology mappings: a static approach. Comput Inform 27(1):21–36

    MathSciNet  Google Scholar 

  86. Zhang S, Mork P, Bodenreider O (2004) Lessons learned from aligning two representations of anatomy. In: International Conference on Principles of Knowledge Representation and Reasoning (KR)

  87. Zimmermann A, Euzenat J (2006) Three semantics for distributed systems and their relations with alignment composition. In: International Semantic Web Conference (ISWC). Springer, pp 16–29

Download references

Acknowledgments

Ernesto Jiménez-Ruiz was funded by the European Commission under FP7 Grant Agreement 318338, “Optique”, and the EPSRC projects Score! ED3 and DBOnto. We also thank the invaluable help provided by Bernardo Cuenca and Ian Horrocks. We are also very grateful for the support of the Optique colleagues that facilitated our understanding of the domain, especially Dag Hovland, Evgeny Kharlamov, Dmitry Zheleznyakov, Martin Giese and Martin G. Skjæveland. Finally, we would also like to thank the anonymous reviewers of this paper.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Alessandro Solimando.

Appendix

Appendix

This section investigates the computational complexity of diagnosis computation (“Diagnosis Computation Complexity” in section “Appendix”) and its decomposability into subproblems (“Decomposability of equivalence violations diagnosis computation” in section “Appendix”).

1.1 Diagnosis computation complexity

With the aim of proving that MAP-WFES is NP-hard, Proposition 10.1 introduces a polynomial reduction from WFES to MAP-WFES, denoted as WFES \(\preceq \) MAP-WFES. The intuitive idea behind the reduction is the following. Each arc \((t,v,c)\) of the original graph is “split” into two arcs \((t,u,c)\) and \((u,v,c)\), with u a fresh node. All the nodes tv are associated with one of the input ontology, while the fresh nodes are associated with the other. In this way, all the arcs are mappings (i.e., all of them are potentially removable, exactly as the original arcs). It is easy to see that the reduction preserves the solution weight and that a 1–1 correspondence exists between cycles in the two graphs. In addition, we remark that MAP-WFES does not break cycles traversing only vertices of one of the input ontologies. No such cycles can exist because all the arcs are mappings, as discussed above. A reduction example is given in Example 10.1, followed by the definition of the reduction in Proposition 10.1.

Fig. 11
figure 11

Input graph G for WFES (left) reduced to a corresponding graph \(G'\), input for MAP-WFES (right). In graph \(G'\), all the round-shaped vertices belong to \(V_1'\), while square-shaped vertices belong to \(V_2'\). a Graph G (reduction input) b Graph \(G'\) (reduction output)

Example 10.1

In Fig. 11, graphs G (left) and \(G'\) (right) are shown. WFES \(\preceq \) MAP-WFES coincides with \(G'\). The solution to \(G'\) is equal to \(\Delta = \{ (c,cd,1), (b,bg,0.1), (gf,f,0.4), (a,af,0.2) \} \), with a total weight of 1.7. \(D= \{ (b,g,0.1), (g,f,0.4),(a,f,0.2), (c,d,1) \} \) is the corresponding solution to the instance of the WFES problem represented by G and can be easily verified that is both minimal (having weight 1.7) and correct.

Proposition 10.1

WFES \(\preceq \) MAP-WFES. A polynomial reduction from the WFES problem to the MAP-WFES problem exists. Let \(G=(V,A)\) be a digraph. The reduction consists in constructing a digraph \(G'=(V',A')\) such that a subset of edges, namely \(\Delta \subseteq A\), is a solution to MAP-WFES iff the corresponding set of arcs, namely D, is a solution to WFES on G. The reduction is as follows:

  1. 1.

    for each \((x,y,c) \in A\), we create a fresh vertex \(v_{xy}\), we add it to \(V_2'\), and we create a pair of arcs \((x,v_{xy},c), (v_{xy},y,c)\) that are added to \(A'\) and \(\mathcal {M}\),

  2. 2.

    \(V_1' = V\) and \(V' = V_1' \cup V_2'\).

A set of arcs, namely \(\Delta \subseteq A'\), is a solution to the MAP-WFES problem on digraph \(G'\) iff the corresponding feedback edge set D is a solution to the WFES G, where \(G'\) is computed from G, and for each arc of the form \((x,v_{xy},c)\) or \((v_{xy},y,c)\) in \(\Delta \), we have a corresponding arc \((x,y,c)\) in D.

Proof

In order to prove the correctness of the reduction, we need to show that, if G \(\preceq \) \(G'\), with G an instance of the WFES problem and \(G'\) an instance of the MAP-WFES problem, a set of arcs \(\Delta \) is a (minimal) solution to \(G'\) iff the corresponding set of arcs D is a (minimal) solution to G. As discussed in  [16], the proposed reduction is polynomial and it preserves graph connectivity and the weight of the solutions, by preserving a 1–1 correspondence between cycles of G and those of \(G'\), due to the 1–1 correspondency between the arcs of A and those of \(A'\).

\(\Rightarrow \): If D is a solution to WFES on G, \(\Delta \) is a solution to MAP-WFES on \(G'\). Suppose that \(\Delta \) is not a solution. This requires that either at least a cycle \(\kappa '\) exists in digraph \((V',A'{\setminus }\Delta )\) or that a diagnosis \(\Delta '\) exists such that \(w(\Delta ') < w(\Delta )\). For the first case, given the 1–1 correspondence between cycles of G and \(G'\), this implies that a corresponding cycle in G exists as well, thus contradicting that D is a solution to the instance of the WFES problem represented by G. For the latter case, given the 1–1 correspondence between arcs of G and \(G'\), this implies that a solution \(D'\) corresponding to \(\Delta '\) exists. By the weight preservation property of the reduction, \(w(D') < w(D)\) holds, contradicting that D is a (minimal) solution to the instance of the WFES problem represented by G.

\(\Leftarrow \): If \(\Delta \) is a solution to MAP-WFES on \(G'\), D is a solution to WFES on G. Suppose that D is not. This requires that either a cycle \(\kappa \) of G exists in digraph \((V,A {\setminus } D)\) or that a solution \(D'\) exists such that \(w(D') < w(D)\). The first case requires the existence of a cycle \(\kappa '\) of \(G'\) (corresponding to \(\kappa \)) that is left unbroken. In turn, this either violates that \(\Delta \) is a solution, or that, for some \(i \in \{1,2\} \), \(\kappa '\) exclusively traverses elements of \(V'_i\). This situation is excluded by construction of \(G'\), because no arcs between vertices of the same subset \(V'_i\) of \(V'\) exist. For the latter case, this implies that a diagnosis \(\Delta '\) corresponding to the (minimal) solution \(D'\) does not exist. This requires that at least an element of \(D'\) cannot belong to a diagnosis (i.e., it cannot be removed). By construction of \(G'\), we have that \(\mathcal {M} = A'\). Given that only elements of \(A' {\setminus } \mathcal {M}\) cannot belong to a diagnosis for the MAP-WFES problem, this results in a contradiction, thus proving the correctness of the reduction. \(\square \)

From the results of Proposition 10.1, it follows that MAP-WFES is NP-hard, as detailed in Proposition 10.2.

Proposition 10.2

MAP-WFES is NP-hard.

Proof

The proof follows from the polynomial reduction from the WFES problem, that is NP-hard  [16], to MAP-WFES. \(\square \)

1.2 Decomposability of equivalence violations diagnosis computation

Proposition 10.3 relates unsafe cycles and problematic SCCs, showing that each unsafe cycle results in a problematic SCC.

Proposition 10.3

A SCC is problematic iff it (totally) contains at least one unsafe cycle.

Proof

\(\Rightarrow \): Consider a problematic SCC S, with projections \(\Pi _1\) and \(\Pi _2\) on the input ontologies. From problematic SCC definition (Definition 4.6), at least one of the projections of S, say \(\Pi _1\), is not a local SCC. We therefore also know that \(\Pi _1\) is not a SCC; otherwise, it would also be a local SCC. Suppose that \(\Pi _1\) is a subset of a SCC \(\Pi '\). This implies that all the elements of \(\Pi '\) belongs to S as well, and therefore, \(\Pi '\) and \(\Pi _1\) are identical, but this contradicts the assumption that \(\Pi _1\) is not a SCC.

\(\Leftarrow :\) from the definition of cycle and SCC, each cycle \(\kappa \) is contained in a SCC S (i.e., \(\kappa \subseteq S\)). Let \(\kappa \) be an unsafe cycle, and let also \(\kappa _1\) (resp. \(\kappa _2\)) be the subset of vertices of \(\kappa \) belonging to an input ontology \({\mathcal {O}}_1\) (resp. \({\mathcal {O}}_2\)). By definition of unsafe cycle (Definition 4.3), at least one of this subsets, say \(\kappa _1\), is not contained in any local SCC. But given that \(\kappa \subseteq S\), \(\kappa _1 \subseteq \Pi _{{\mathcal {O}}_1}(S)\) holds. Therefore, \(\Pi _{{\mathcal {O}}_1}(S)\) is not contained in any local SCC either and, by Definition 4.6, S is a problematic SCC. \(\square \)

Proposition 10.3 guarantees completeness for a detection technique for violations to the conservativity principle on a graph representation of an aligned ontology, based on problematic SCCs. Given that all the violations result in unsafe cycles and that they totally belong to a single problematic SCC, completeness for a repair technique breaking all the unsafe cycles follows.

Notice also that a (unsafe) cycle always belongs to one and only one (problematic) SCC (as expressed by Proposition 10.4), while a problematic SCC may contain more than one cycle. Therefore, a technique detecting problematic SCCs may be more efficient than one directly addressing unsafe cycles.

Proposition 10.4

Safe cycle never traverse multiple SCCs of the same input ontology.

Proof

By Definition 4.3, all the vertices belonging to a projection \(\Pi \) of a safe cycle \(\kappa ^s\) need to be traversed by a cycle \(\kappa '\) in the input ontology these vertices belong to. The claim is that cycle \(\kappa '\) identifies either a SCC of the aligned ontology or a subset of a SCC. Assume that vertices of \(\Pi \) belong to at least two SCCs \(S_1, S_2\), that is, \(\Pi \cap S_1 \ne \emptyset \) and \(\Pi \cap S_2 \ne \emptyset \). Being traversed by a cycle, all the vertices of \(\Pi \) are mutually reachable. Then, from transitivity of reachability, it follows that all the vertices in \(S_1 \cup S_2\) are mutually reachable. This contradicts the hypothesis that \(S_1\) and \(S_2\) are two distinct SCCs, thus proving the proposition. Such argument be can straightforwardly generalized to more than two SCCs. \(\square \)

Proposition 10.5 proves the correctness of our approach and the optimality of the computed (global) diagnosis.

Proposition 10.5

Computing a (global) diagnosis for a graph G, representing an aligned ontology, can be reduced to computing the (local) diagnoses for the problematic SCCs of G. The (minimal) global diagnosis is the union of the (minimal) local diagnoses.

Proof

From Proposition 10.3, it follows that: (i) all and only problematic SCCs contain unsafe cycles, (ii) an unsafe cycle does not traverse vertices of more than one SCC (i.e., the unsafe cycles of distinct SCCs are totally disjoint). From (i), completeness follows (it is sufficient to compute a diagnosis for each problematic SCCs to remove all the unsafe cycles in the aligned ontology). (ii) ensures the independence of SCCs, and this guarantees minimality and correctness for local diagnoses computed in isolation. Finally, it is immediate to see that the minimality property is preserved by the union of local diagnoses, and this concludes the proposition. \(\square \)

Proposition 10.5 thus guarantees that the global diagnosis computed as the union of the diagnoses of the problematic SCCs is both minimal and correct (that is, it breaks all the unsafe cycles).

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Solimando, A., Jiménez-Ruiz, E. & Guerrini, G. Minimizing conservativity violations in ontology alignments: algorithms and evaluation. Knowl Inf Syst 51, 775–819 (2017). https://doi.org/10.1007/s10115-016-0983-3

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10115-016-0983-3

Keywords

Navigation