Advertisement

Journal on Data Semantics

, Volume 4, Issue 2, pp 103–116 | Cite as

Efficient Projection of Ontologies

  • Julius KöpkeEmail author
  • Johann Eder
  • Michaela Schicho
Original Article
  • 163 Downloads

Abstract

In collaborative and interoperable information systems it is frequently necessary to export a subset of an ontology, in particular to export all instances and axioms subsumed by a shared domain ontology. Technically this can be realized by projecting the Abox axioms of the private instances to the vocabulary of the shared ontology using a reasoner. We show that the direct application of this method results in a very redundant export. Therefore, we propose methods to generate a much smaller equivalent export in an efficient way. These methods are based on a deep analysis of properties of axiom sets that can be used to efficiently filter redundancies during the export process and provide methods to further minimize redundancies in the result. We evaluate the performance and the degree of minimality that can be reached with our approach with standard benchmarking ontologies with large Aboxes and a use case of the bio-medical domain.

Keywords

OWL Ontology Abox export  Abox minimizing Instance export Instance migration 

References

  1. 1.
    Aho AV, Garey MR, Ullman JD (1972) The transitive reduction of a directed graph. SIAM J Comput 1(2):131–137CrossRefzbMATHMathSciNetGoogle Scholar
  2. 2.
    Asslaber M, Abuja PM, Stark K, Eder J, Gottweis H, Trauner M, Samonigg H, Mischinger HJ, Schippinger W, Berghold A et al (2007) The genome austria tissue bank (gatib). Pathobiology 74(4):251–258CrossRefGoogle Scholar
  3. 3.
    Bail S, Parsia B, Sattler U (2011) Extracting finite sets of entailments from owl ontologies. In: Rosati R, Rudolph S, Zakharyaschev M (eds) Description logics, CEUR workshop proceedings, vol 745. CEUR-WS.org. http://dblp.uni-trier.de/db/conf/dlog/dlog2011.html#BailPS11
  4. 4.
    Bao J, Voutsadakis G, Slutzki G, Honavar V (2009) Modular ontologies. In: Package-based description logics. Springer, Berlin, pp 349–371. doi: 10.1007/978-3-642-01907-4_16
  5. 5.
    Borgida A, Serafini L (2003) Distributed description logics: assimilating information from peer sources. In: Spaccapietra S, March S, Aberer K (eds) Journal on data semantics I, lecture notes in computer science, vol 2800. Springer, Berlin, pp 153–184. doi: 10.1007/978-3-540-39733-5_7
  6. 6.
    Bouquet P, Giunchiglia F, Harmelen F, Serafini L, Stuckenschmidt H (2003)C-owl: contextualizing ontologies. In: Fensel D, Sycara K, Mylopoulos J (eds) The semantic web—ISWC 2003. Lecture notes in computer science, vol 2870. Springer, Berlin, pp 164–179. doi: 10.1007/978-3-540-39718-2_11
  7. 7.
    Buneman P, Chapman A, Cheney J (2006) Provenance management in curated databases. In: Proceedings of the ACM international conference on management of data (SIGMOD), pp 539–550. ACM Press, New York. doi: 10.1145/1142473.1142534. http://portal.acm.org/citation.cfm?id=1142534
  8. 8.
    Cuenca Grau B, Horrocks I, Kazakov Y, Sattler U (2008) Modular reuse of ontologies: theory and practice. J Artif Intell Res 31:273–318. http://www.jair.org/media/2375/live-2375-3703-jair.pdf
  9. 9.
    Davidson SB, Khanna S, Roy S, Stoyanovich J, Tannen V, Chen Y (2011) On provenance and privacy. In: Milo T (ed) ICDT, pp 3–10. ACM, New YorkGoogle Scholar
  10. 10.
    Davidson SB, Milo T, Roy S (2012) A propagation model for provenance views of public/private workflows. CoRR abs/1212.2251. http://dblp.uni-trier.de/db/journals/corr/corr1212.html#abs-1212-2251
  11. 11.
    Dou D, McDermott D, Qi P (2005) Journal on data semantics ii. Chap. Ontology translation on the semantic web. Springer, Berlin, pp 35–57. http://dl.acm.org/citation.cfm?id=2137611.2137614
  12. 12.
    Eder J, Dabringer C, Schicho M, Stark K (2009) Information systems for federated biobanks. T Large Scale Data Knowl Centered Syst 1:156–190. http://dblp.uni-trier.de/db/journals/tlsdkcs/tlsdkcs1.html#EderDSS09
  13. 13.
    Eder J, Gottweis H, Zatloukal K (2012) IT solutions for privacy protection in biobanking. Public Health Genomics 15(5):254–262CrossRefGoogle Scholar
  14. 14.
    Grau BC, Horrocks I, Kazakov Y, Sattler U (2007) Just the right amount: extracting modules from ontologies. In: Williamson CL, Zurko ME, Patel-Schneider PF, Shenoy PJ (eds) WWW, pp 717–726. ACM, New YorkGoogle Scholar
  15. 15.
    Grau BC, Parsia B, Sirin E (2006) Combining OWL ontologies using e-connections. Web Semant Sci Serv Agents World Wide Web 4(1):40–59. doi: 10.1016/j.websem.2005.09.010. http://www.sciencedirect.com/science/article/pii/S1570826805000314
  16. 16.
    Gupta A, Condit C, Qian X (2010) Biodb: an ontology-enhanced information system for heterogeneous biological information. Data Knowl Eng 69(11):1084–1102. http://dblp.uni-trier.de/db/journals/dke/dke69.html#GuptaCQ10
  17. 17.
    Hernandez T, Kambhampati S (2004) Integration of biological sources: current systems and challenges ahead. SIGMOD Rec 33(3):51–60. doi: 10.1145/1031570.1031583 CrossRefGoogle Scholar
  18. 18.
    Kalyanpur A, Parsia B, Horridge M, Sirin E (2007) Finding all justifications of owl dl entailments. In: ISWC/ASWC, pp 267–280Google Scholar
  19. 19.
    Klarman S, Endriss U, Schlobach S (2011) Abox abduction in the description logic alc. J Autom Reason 46(1):43–80. doi: 10.1007/s10817-010-9168-z CrossRefzbMATHMathSciNetGoogle Scholar
  20. 20.
    Konev B, Lutz C, Walther D, Wolter F (2008) Semantic modularity and module extraction in description logics. In: Proceedings of ECAI 2008, pp 55–59. IOS Press, Amsterdam. http://dl.acm.org/citation.cfm?id=1567281.1567298
  21. 21.
    Krötzsch M, Simancik F, Horrocks I (2012) A description logic primer. CoRR abs/1201.4089Google Scholar
  22. 22.
    Mascardi V, Cordì V, Rosso P (2007) A comparison of upper ontologies. In: Baldoni M, Boccalatte A, Paoli FD, Martelli M, Mascardi V (eds) WOA. Seneca Edizioni Torino, pp 55–64Google Scholar
  23. 23.
    Noy N, Musen M (2004) Specifying ontology views by traversal. In: McIlraith EA (ed) The semantic web—ISWC 2004. Lecture notes in computer science, vol 3298. Springer, Berlin, pp 713–725. doi: 10.1007/978-3-540-30475-3_49
  24. 24.
    Noy NF, Musen MA (2003) The PROMPT suite: interactive tools for ontology merging and mapping. Int J Hum Comput Stud 59(6):983–1024. doi: 10.1016/j.ijhcs.2003.08.002. http://www.sciencedirect.com/science/article/pii/S107158190300137X
  25. 25.
    OWL Working Group W (2009) OWL 2 web ontology language: document overview. W3C Recommendation. http://www.w3.org/TR/owl2-overview/
  26. 26.
    Pan JZ, Du J, Qi G, Shen YD (2012) Towards practical abox abduction in large description logic ontologies. Int J Semant Web Inf Syst 8(2):1–33. doi: 10.4018/jswis.2012040101 CrossRefGoogle Scholar
  27. 27.
    Ram S, Liu J (2009) A new perspective on semantics of data provenance. In: Freire J, Missier P, Sahoo SS (eds) SWPM, CEUR workshop proceedings, vol 526. CEUR-WS.org. http://dblp.uni-trier.de/db/conf/semweb/swpm2009.html#RamL08
  28. 28.
    Schlicht A, Stuckenschmidt H (2007) Criteria-based partitioning of large ontologies. In: Proceedings of the 4th international conference on knowledge capture, K-CAP ’07. ACM, New York, pp 171–172. doi: 10.1145/1298406.1298439
  29. 29.
    Seidenberg J (2009) Web ontology segmentation: extraction, transformation, evaluation. In: Stuckenschmidt H, Parent C, Spaccapietra S (eds) Modular ontologies. Lecture notes in computer science, vol 5445. Springer, New York, pp 211–243Google Scholar
  30. 30.
    Serafini L, Tamilin A (2007) Instance migration in heterogeneous ontology environments. In: Aberer KEA (ed) The semantic web. Lecture notes in computer science, vol 4825. Springer, Berlin, pp 452–465. doi: 10.1007/978-3-540-76298-0_33
  31. 31.
    Sheth AP, Larson JA (1990) Federated database systems for managing distributed, heterogeneous, and autonomous databases. ACM Comput Surv 22(3):183–236. doi: 10.1145/96602.96604
  32. 32.
    Simmhan Y, Plale B, Gannon D (2005) A survey of data provenance in e-science. SIGMOD Rec 34(3):31–36CrossRefGoogle Scholar
  33. 33.
    Stark K, Eder J, Zatloukal K (2007) Achieving k-anonymity in datamarts used for gene expressions exploitation. J Integr Bioinform 4(1):57Google Scholar
  34. 34.
    Stuckenschmidt H, Parent C, Spaccapietra S (eds) (2009) Modular ontologies: concepts, theories and techniques for knowledge modularization. Lecture notes in computer science, vol 5445. Springer, Berlin. doi: 10.1007/978-3-642-01907-4. http://ki.informatik.uni-mannheim.de/research/publications.html
  35. 35.
    Teppo L, Pukkala E, Lehtonen M (1994) Data quality and quality control of a population-based cancer registry. Experience in finland. Acta Oncol (Stockh, Swed) 33(4):365–369. http://www.ncbi.nlm.nih.gov/pubmed/8018367
  36. 36.
    Wichmann HE, Kuhn KA, Waldenberger M, Schmelcher D, Schuffenhauer S, Meitinger T, Wurst SH, Lamla G, Fortier I, Burton PR et al (2011) Comprehensive catalog of European biobanks. Nat Biotechnol 29(9):795–797CrossRefGoogle Scholar
  37. 37.
    Yuille M, van Ommen GJB, Bréchot C, Cambon-Thomsen A, Dagher G, Landegren U, Litton JE, Pasterk M, Peltonen L, Taussig M, Wichmann HE, Zatloukal K (2008) Biobanking for Europe. Brief Bioinform 9(1):14–24. http://dblp.uni-trier.de/db/journals/bib/bib9.html#YuilleOBCDLLPPTWZ08

Copyright information

© Springer-Verlag Berlin Heidelberg 2014

Authors and Affiliations

  1. 1.Department of Informatics SystemsAlpen-Adria Universität KlagenfurtKlagenfurtAustria

Personalised recommendations