Abstract
In Ontology-Based Data Access (OBDA), a knowledge base known as an ontology models both the problem domain and the underlying data sources. We are concerned with providing with tools for performing OBDA with relational and non-relational data sources. We developed an OBDA tool that is able to access H2 databases, CSV files and Excel spreadsheets allowing the user to explicitly formulate mappings, and populating an ontology that can be saved for later querying. In this paper, we present a language for specifying the schema of the data in a spreadsheet data application, which then can be used to access the contents of a set of Excel books with the ultimate goal of materializing its data as an OWL/RDF ontology. We characterize the syntax and semantics of the language, present a prototypical implementation and report on the performance tests showing that our implementation can handle a workload of Excel tables of the order of ten thousand records. We also show a case study in which the ontology of an idealized university library can be defined using the our tool integrating both relational and spreadsheet data.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
- 2.
- 3.
- 4.
- 5.
- 6.
- 7.
References
Xiao, G., et al.: Ontology-based data access - a survey. In: Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence (IJCAI-18), pp. 5511–5519 (2018)
Gómez, S.A., Fillottrani, P.R.: Materialization of OWL ontologies from relational databases - a practical approach. In: Pesado, P., Arroyo, M. (eds.) Computer Science - CACIC 2019 selected papers, pp. 285–301. Springer International Publishing, Cham (2020). https://doi.org/10.1007/978-3-030-48325-8_19
Baader, F., Horrocks, I., Lutz, C., Sattler, U.: An Introduction to Description Logic. Cambridge University Press, Cambridge (2017)
Bao, J., Kendall, E.F., McGuinness, D.L., Patel-Schneider, P.F.: OWL 2 Web Ontology Language Quick Reference Guide (Second Edition) W3C Recommendation, 11 December 2012 (2012)
Gómez, S.A., Fillottrani, P.: A language for the specification of the schema of spreadsheets for the materialization of ontologies. In Mon, A., et al. (eds.) XXVI Congreso Argentino de Ciencias de la Computación (CACIC 2020), pp. 546–555, October 2020
Gómez, S.A., Fillottrani, P.R.: Towards a framework for ontology-based data access: materialization of OWL ontologies from relational databases. In Pesado, P., Aciti, C., (eds.) X Workshop en Innovación en Sistemas de Software (WISS 2018), XXIV Congreso Argentino de Ciencias de la Computación CACIC 2018, pp. 857–866 (2018)
Matentzoglu, N., Palmisano, I.: An Introduction to the OWL API. Technical report, The University of Manchester (2016)
Minh, N.H.: How to Read Excel Files in Java using Apache POI (2019)
Dilroshan, T.C.: Identification of problems faced by university libraries in the process of automation: with special reference to the libraries of moratuwa and colombo universities. Sri Lanka J. Librarianship Inf. Manage. 1(2), 82–98 (2009)
Malhan, I.: Challenges and problems of library and information education in India: an emerging knowledge society and the developing nations of Asia. Libr. Philos. Pract. 670 (2011). https://digitalcommons.unl.edu/libphilprac/670/
Mishra, A., Thakur, S., Singh, T.: Library automation: issues, challenges and remedies author. Times Int. J. Res. (Issue January 2015), 9–16 (2015). https://www.academia.edu/12808629/LIBRARY_AUTOMATION_ISSUES_CHALLENGES_AND_REMEDIES
Pothumani, S., Sridhar, J.: Solving problems of library management system. Int. J. Innov. Res. Comput. Commun. Eng. 3(7), 6466–6469 (2015). https://doi.org/10.15680/ijircce.2015.0307167
Raval, A.: Problems of library automation. Int. J. Res. Educ. 2(2) (2013). http://www.raijmr.com/ijre/wp-content/uploads/2017/11/IJRE_2013_vol02_issue_02_01.pdf
Harris, S., Seaborne, A.: SPARQL 1.1 Query Language for RDF W3C recommendation, 21 March 2013 (2013). https://www.w3.org/TR/rdf-sparql-query/
Langegger, A., Wöß, W.: XLWrap – querying and integrating arbitrary spreadsheets with SPARQL. In: Bernstein, A., Karger, D.R., Heath, T., Feigenbaum, L., Maynard, D., Motta, E., Thirunarayan, K. (eds.) ISWC 2009. LNCS, vol. 5823, pp. 359–374. Springer, Heidelberg (2009). https://doi.org/10.1007/978-3-642-04930-9_23
Terrazas, B.V., Gomez-Perez, A., Calbimonte, J.P.: NOR2O: a library for transforming non-ontological resources to ontologies. In: ESWC 2010 (2010)
Acknowledgments
This research is funded by Secretaría General de Ciencia y Técnica, Universidad Nacional del Sur, Argentina and by Comisión de Investigaciones Científicas de la Provincia de Buenos Aires (CIC-PBA).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Gómez, S.A., Fillottrani, P.R. (2021). Specification of the Schema of Spreadsheets for the Materialization of Ontologies from Integrated Data Sources. In: Pesado, P., Eterovic, J. (eds) Computer Science – CACIC 2020. CACIC 2020. Communications in Computer and Information Science, vol 1409. Springer, Cham. https://doi.org/10.1007/978-3-030-75836-3_17
Download citation
DOI: https://doi.org/10.1007/978-3-030-75836-3_17
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-75835-6
Online ISBN: 978-3-030-75836-3
eBook Packages: Computer ScienceComputer Science (R0)