Skip to main content
Log in

Efficient Projection of Ontologies

  • Original Article
  • Published:
Journal on Data Semantics

Abstract

In collaborative and interoperable information systems it is frequently necessary to export a subset of an ontology, in particular to export all instances and axioms subsumed by a shared domain ontology. Technically this can be realized by projecting the Abox axioms of the private instances to the vocabulary of the shared ontology using a reasoner. We show that the direct application of this method results in a very redundant export. Therefore, we propose methods to generate a much smaller equivalent export in an efficient way. These methods are based on a deep analysis of properties of axiom sets that can be used to efficiently filter redundancies during the export process and provide methods to further minimize redundancies in the result. We evaluate the performance and the degree of minimality that can be reached with our approach with standard benchmarking ontologies with large Aboxes and a use case of the bio-medical domain.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4

Similar content being viewed by others

Notes

  1. In contrast to OWL2 [25] we do currently not address the export of negative role assertions since they are not used by any of the relevant ontologies for our scenario.

  2. http://owlapi.sourceforge.net.

  3. http://clarkparsia.com/pellet.

  4. http://swat.cse.lehigh.edu/projects/lubm/.

  5. http://www.cs.put.poznan.pl/alawrynowicz/semintec.htm.

  6. https://files.ifi.uzh.ch/ddis/oldweb/ddis/fileadmin/ont/nli/restaurant.owl.

  7. https://dkm.fbk.eu/index.php/ICD-10_Ontology.

References

  1. Aho AV, Garey MR, Ullman JD (1972) The transitive reduction of a directed graph. SIAM J Comput 1(2):131–137

    Article  MATH  MathSciNet  Google Scholar 

  2. Asslaber M, Abuja PM, Stark K, Eder J, Gottweis H, Trauner M, Samonigg H, Mischinger HJ, Schippinger W, Berghold A et al (2007) The genome austria tissue bank (gatib). Pathobiology 74(4):251–258

    Article  Google Scholar 

  3. Bail S, Parsia B, Sattler U (2011) Extracting finite sets of entailments from owl ontologies. In: Rosati R, Rudolph S, Zakharyaschev M (eds) Description logics, CEUR workshop proceedings, vol 745. CEUR-WS.org. http://dblp.uni-trier.de/db/conf/dlog/dlog2011.html#BailPS11

  4. Bao J, Voutsadakis G, Slutzki G, Honavar V (2009) Modular ontologies. In: Package-based description logics. Springer, Berlin, pp 349–371. doi:10.1007/978-3-642-01907-4_16

  5. Borgida A, Serafini L (2003) Distributed description logics: assimilating information from peer sources. In: Spaccapietra S, March S, Aberer K (eds) Journal on data semantics I, lecture notes in computer science, vol 2800. Springer, Berlin, pp 153–184. doi:10.1007/978-3-540-39733-5_7

  6. Bouquet P, Giunchiglia F, Harmelen F, Serafini L, Stuckenschmidt H (2003)C-owl: contextualizing ontologies. In: Fensel D, Sycara K, Mylopoulos J (eds) The semantic web—ISWC 2003. Lecture notes in computer science, vol 2870. Springer, Berlin, pp 164–179. doi:10.1007/978-3-540-39718-2_11

  7. Buneman P, Chapman A, Cheney J (2006) Provenance management in curated databases. In: Proceedings of the ACM international conference on management of data (SIGMOD), pp 539–550. ACM Press, New York. doi:10.1145/1142473.1142534. http://portal.acm.org/citation.cfm?id=1142534

  8. Cuenca Grau B, Horrocks I, Kazakov Y, Sattler U (2008) Modular reuse of ontologies: theory and practice. J Artif Intell Res 31:273–318. http://www.jair.org/media/2375/live-2375-3703-jair.pdf

  9. Davidson SB, Khanna S, Roy S, Stoyanovich J, Tannen V, Chen Y (2011) On provenance and privacy. In: Milo T (ed) ICDT, pp 3–10. ACM, New York

  10. Davidson SB, Milo T, Roy S (2012) A propagation model for provenance views of public/private workflows. CoRR abs/1212.2251. http://dblp.uni-trier.de/db/journals/corr/corr1212.html#abs-1212-2251

  11. Dou D, McDermott D, Qi P (2005) Journal on data semantics ii. Chap. Ontology translation on the semantic web. Springer, Berlin, pp 35–57. http://dl.acm.org/citation.cfm?id=2137611.2137614

  12. Eder J, Dabringer C, Schicho M, Stark K (2009) Information systems for federated biobanks. T Large Scale Data Knowl Centered Syst 1:156–190. http://dblp.uni-trier.de/db/journals/tlsdkcs/tlsdkcs1.html#EderDSS09

  13. Eder J, Gottweis H, Zatloukal K (2012) IT solutions for privacy protection in biobanking. Public Health Genomics 15(5):254–262

    Article  Google Scholar 

  14. Grau BC, Horrocks I, Kazakov Y, Sattler U (2007) Just the right amount: extracting modules from ontologies. In: Williamson CL, Zurko ME, Patel-Schneider PF, Shenoy PJ (eds) WWW, pp 717–726. ACM, New York

  15. Grau BC, Parsia B, Sirin E (2006) Combining OWL ontologies using e-connections. Web Semant Sci Serv Agents World Wide Web 4(1):40–59. doi:10.1016/j.websem.2005.09.010. http://www.sciencedirect.com/science/article/pii/S1570826805000314

  16. Gupta A, Condit C, Qian X (2010) Biodb: an ontology-enhanced information system for heterogeneous biological information. Data Knowl Eng 69(11):1084–1102. http://dblp.uni-trier.de/db/journals/dke/dke69.html#GuptaCQ10

  17. Hernandez T, Kambhampati S (2004) Integration of biological sources: current systems and challenges ahead. SIGMOD Rec 33(3):51–60. doi:10.1145/1031570.1031583

    Article  Google Scholar 

  18. Kalyanpur A, Parsia B, Horridge M, Sirin E (2007) Finding all justifications of owl dl entailments. In: ISWC/ASWC, pp 267–280

  19. Klarman S, Endriss U, Schlobach S (2011) Abox abduction in the description logic alc. J Autom Reason 46(1):43–80. doi:10.1007/s10817-010-9168-z

    Article  MATH  MathSciNet  Google Scholar 

  20. Konev B, Lutz C, Walther D, Wolter F (2008) Semantic modularity and module extraction in description logics. In: Proceedings of ECAI 2008, pp 55–59. IOS Press, Amsterdam. http://dl.acm.org/citation.cfm?id=1567281.1567298

  21. Krötzsch M, Simancik F, Horrocks I (2012) A description logic primer. CoRR abs/1201.4089

  22. Mascardi V, Cordì V, Rosso P (2007) A comparison of upper ontologies. In: Baldoni M, Boccalatte A, Paoli FD, Martelli M, Mascardi V (eds) WOA. Seneca Edizioni Torino, pp 55–64

  23. Noy N, Musen M (2004) Specifying ontology views by traversal. In: McIlraith EA (ed) The semantic web—ISWC 2004. Lecture notes in computer science, vol 3298. Springer, Berlin, pp 713–725. doi:10.1007/978-3-540-30475-3_49

  24. Noy NF, Musen MA (2003) The PROMPT suite: interactive tools for ontology merging and mapping. Int J Hum Comput Stud 59(6):983–1024. doi:10.1016/j.ijhcs.2003.08.002. http://www.sciencedirect.com/science/article/pii/S107158190300137X

  25. OWL Working Group W (2009) OWL 2 web ontology language: document overview. W3C Recommendation. http://www.w3.org/TR/owl2-overview/

  26. Pan JZ, Du J, Qi G, Shen YD (2012) Towards practical abox abduction in large description logic ontologies. Int J Semant Web Inf Syst 8(2):1–33. doi:10.4018/jswis.2012040101

    Article  Google Scholar 

  27. Ram S, Liu J (2009) A new perspective on semantics of data provenance. In: Freire J, Missier P, Sahoo SS (eds) SWPM, CEUR workshop proceedings, vol 526. CEUR-WS.org. http://dblp.uni-trier.de/db/conf/semweb/swpm2009.html#RamL08

  28. Schlicht A, Stuckenschmidt H (2007) Criteria-based partitioning of large ontologies. In: Proceedings of the 4th international conference on knowledge capture, K-CAP ’07. ACM, New York, pp 171–172. doi:10.1145/1298406.1298439

  29. Seidenberg J (2009) Web ontology segmentation: extraction, transformation, evaluation. In: Stuckenschmidt H, Parent C, Spaccapietra S (eds) Modular ontologies. Lecture notes in computer science, vol 5445. Springer, New York, pp 211–243

  30. Serafini L, Tamilin A (2007) Instance migration in heterogeneous ontology environments. In: Aberer KEA (ed) The semantic web. Lecture notes in computer science, vol 4825. Springer, Berlin, pp 452–465. doi:10.1007/978-3-540-76298-0_33

  31. Sheth AP, Larson JA (1990) Federated database systems for managing distributed, heterogeneous, and autonomous databases. ACM Comput Surv 22(3):183–236. doi:10.1145/96602.96604

  32. Simmhan Y, Plale B, Gannon D (2005) A survey of data provenance in e-science. SIGMOD Rec 34(3):31–36

    Article  Google Scholar 

  33. Stark K, Eder J, Zatloukal K (2007) Achieving k-anonymity in datamarts used for gene expressions exploitation. J Integr Bioinform 4(1):57

    Google Scholar 

  34. Stuckenschmidt H, Parent C, Spaccapietra S (eds) (2009) Modular ontologies: concepts, theories and techniques for knowledge modularization. Lecture notes in computer science, vol 5445. Springer, Berlin. doi:10.1007/978-3-642-01907-4. http://ki.informatik.uni-mannheim.de/research/publications.html

  35. Teppo L, Pukkala E, Lehtonen M (1994) Data quality and quality control of a population-based cancer registry. Experience in finland. Acta Oncol (Stockh, Swed) 33(4):365–369. http://www.ncbi.nlm.nih.gov/pubmed/8018367

  36. Wichmann HE, Kuhn KA, Waldenberger M, Schmelcher D, Schuffenhauer S, Meitinger T, Wurst SH, Lamla G, Fortier I, Burton PR et al (2011) Comprehensive catalog of European biobanks. Nat Biotechnol 29(9):795–797

    Article  Google Scholar 

  37. Yuille M, van Ommen GJB, Bréchot C, Cambon-Thomsen A, Dagher G, Landegren U, Litton JE, Pasterk M, Peltonen L, Taussig M, Wichmann HE, Zatloukal K (2008) Biobanking for Europe. Brief Bioinform 9(1):14–24. http://dblp.uni-trier.de/db/journals/bib/bib9.html#YuilleOBCDLLPPTWZ08

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Julius Köpke.

Additional information

The work reported here was partially supported by the Austrian Ministry of Science and Research within the project BBMRI.AT.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Köpke, J., Eder, J. & Schicho, M. Efficient Projection of Ontologies. J Data Semant 4, 103–116 (2015). https://doi.org/10.1007/s13740-014-0043-7

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s13740-014-0043-7

Keywords

Navigation