Specification of the Schema of Spreadsheets for the Materialization of Ontologies from Integrated Data Sources

  • Conference paper
Computer Science – CACIC 2020 (CACIC 2020)

Part of the book series: Communications in Computer and Information Science (CCIS, volume 1409)

Abstract

In Ontology-Based Data Access (OBDA), a knowledge base known as an ontology models both the problem domain and the underlying data sources. We are concerned with providing tools for performing OBDA over relational and non-relational data sources. We developed an OBDA tool that accesses H2 databases, CSV files, and Excel spreadsheets, lets the user formulate mappings explicitly, and populates an ontology that can be saved for later querying. In this paper, we present a language for specifying the schema of the data in a spreadsheet application; such a specification can then be used to access the contents of a set of Excel workbooks, with the ultimate goal of materializing their data as an OWL/RDF ontology. We characterize the syntax and semantics of the language, present a prototype implementation, and report performance tests showing that our implementation can handle a workload of Excel tables on the order of ten thousand records. We also present a case study in which the ontology of an idealized university library is defined with our tool, integrating both relational and spreadsheet data.
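To make the idea of materialization concrete, the following is a minimal sketch of how tabular records could be turned into RDF triples once a schema is known. It does not reproduce the schema language of the paper or the authors' implementation: the `SCHEMA` dictionary, the `materialize` function, the `http://example.org/library#` namespace, and the sample records are all assumptions made for illustration, and CSV text stands in for an Excel sheet so that the example depends only on the Python standard library.

```python
import csv
import io

# Hypothetical schema description (an assumption, not the paper's language):
# which class the rows belong to, which column identifies an individual,
# and which columns map to which data properties.
SCHEMA = {
    "class": "Book",
    "key": "isbn",
    "properties": {"title": "hasTitle", "year": "hasYear"},
}

BASE = "http://example.org/library#"  # assumed namespace
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"


def materialize(rows, schema, base=BASE):
    """Turn tabular records into N-Triples lines according to the schema."""
    triples = []
    for row in rows:
        # One individual per row, named after the class and the key column.
        subject = f"<{base}{schema['class']}_{row[schema['key']]}>"
        triples.append(f"{subject} <{RDF_TYPE}> <{base}{schema['class']}> .")
        for column, prop in schema["properties"].items():
            value = row.get(column) or ""
            if value == "":
                continue  # empty cells produce no assertion
            escaped = value.replace("\\", "\\\\").replace('"', '\\"')
            triples.append(f'{subject} <{base}{prop}> "{escaped}" .')
    return triples


# Made-up sample records; the second one has an empty "year" cell.
SAMPLE = "isbn,title,year\n111,Logic,2017\n222,Databases,\n"
rows = list(csv.DictReader(io.StringIO(SAMPLE)))
triples = materialize(rows, SCHEMA)
for line in triples:
    print(line)
```

Each row yields one class-membership assertion plus one data-property assertion per non-empty mapped cell. A real tool would also need typed literals, IRI-safe encoding of key values, and object properties linking individuals across sheets or tables, none of which this sketch attempts.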

Notes

  1. https://github.com/waqarini/excel2rdf

  2. https://sourceforge.net/projects/rdbtoonto/

  3. https://github.com/marcelotto/spread2rdf

  4. https://www.w3.org/2001/sw/wiki/OpenAnzo

  5. http://www.cambridgesemantics.com

  6. https://www.topquadrant.com/knowledge-assets/faq/tbc/

  7. https://github.com/Data2Semantics/TabLinker/wiki

Acknowledgments

This research is funded by Secretaría General de Ciencia y Técnica, Universidad Nacional del Sur, Argentina and by Comisión de Investigaciones Científicas de la Provincia de Buenos Aires (CIC-PBA).

Author information

Corresponding author

Correspondence to Sergio Alejandro Gómez.

Copyright information

© 2021 Springer Nature Switzerland AG

About this paper

Cite this paper

Gómez, S.A., Fillottrani, P.R. (2021). Specification of the Schema of Spreadsheets for the Materialization of Ontologies from Integrated Data Sources. In: Pesado, P., Eterovic, J. (eds) Computer Science – CACIC 2020. CACIC 2020. Communications in Computer and Information Science, vol 1409. Springer, Cham. https://doi.org/10.1007/978-3-030-75836-3_17

  • DOI: https://doi.org/10.1007/978-3-030-75836-3_17

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-75835-6

  • Online ISBN: 978-3-030-75836-3

  • eBook Packages: Computer Science, Computer Science (R0)
