Skip to main content
Log in

Crowd Sourced Semantic Enrichment (CroSSE) for knowledge driven querying of digital resources

  • Published:
Journal of Intelligent Information Systems Aims and scope Submit manuscript

Abstract

Today, most information sources provide factual, objective knowledge, but they fail to capture personalized contextual knowledge which could be used to enrich the available factual data and contribute to their interpretation, in the context of the knowledge of the user who queries the system. This would require a knowledge framework which can accommodate both objective data and semantic enrichments that capture user provided knowledge associated to the factual data in the database. Unfortunately, most conventional DBMSs lack the flexibilities necessary (a) to prevent the data and metadata, evolve quickly with changing application requirements and (b) to capture user-provided and/or crowdsourced data and knowledge for more effective decision support. In this paper, we present CrowdSourced Semantic Enrichment (CroSSE) knowledge framework which allows traditional databases and semantic enrichment modules to coexist. CroSSE provides a novel Semantically Enriched SQL (SESQL) language to enrich SQL queries with information from a knowledge base containing semantic annotations. We describe CroSSE and SESQL with examples taken from our SmartGround EU project.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7

Similar content being viewed by others

Notes

  1. In Europe, there are from 150.000 to 500.000 very variable landfills. In 2008, 49% of the almost 3 billion tons of total waste generated in the EU-27 was disposed in dedicated landfills.

  2. http://www.smart-ground.eu/index.php

  3. The SmartGround platform, which implement many of the functionalities of the CroSSE, is available, for registered users, at http://smartground.atosresearch.eu/home.

  4. In case of multiple enrichments by select, each resulting knowledge table is combined with the others by means of an additional join in the FROM clause of the query.

  5. For readability, we used human-readable, descriptive names to the variables in the SPARQL queries. In reality, in automatically generated queries, these names are not descriptive.

  6. In case of multiple enrichments, each additional condition is combined in the WHERE clause of the rewritten query (see example 10).

References

  • Minc. (2017). A social platform for fostering educational interactions.

  • Adali, S., Candan, K., Papakonstantinou, Y., Subrahmanian, V. (1996). Query processing in the sims information mediator. In SIGMOD (pp. 137–148).

  • Arens, Y., Knoblock, C., Hsu, C. (1996). Query processing in the sims information mediator. In AAAI.

  • Beckett, D. (ed.) (2004). RDF/XML Syntax Specification (Revised),. W3C Recommendation.

  • Caldarola, E.G., Picariello, A., Rinaldi, A.M. (2015). An approach to ontology integration for ontology reuse in knowledge based digital ecosystems. In MEDES.

  • Calì, A., Gottlob, G., Pieris, A. (2010). Advanced processing for ontological queries. Proc. VLDB Endow., 3(1-2), 554–565.

    Article  Google Scholar 

  • Candan, K.S., Cao, H., Qi, Y., Sapino, M.L. (2008). System support for exploration and expert feedback in resolving conflicts during integration of metadata. VLDB Journal, 17(6), 22–119.

    Article  Google Scholar 

  • Cavallo, G., Di Mauro, F., Pasteris, P., Sapino, M.L., Candan, K.S. (2018). Contextually-enriched querying of integrated data sources. In ICDE18 Workshops.

  • Davulcu, H., Freire, J., Kifer, M., Ramakrishnan, I. (1999). A layered architecture for querying dynamic web content. In SIGMOD.

  • Davulcu, H., Kifer, M., Yang, G., Ramakrishnan, I. (2000). Design and implementation of the physical layer in webbases: the xrover experience. In DOOD.

    Chapter  Google Scholar 

  • Di Mauro, F., Pasteris, P., Sapino, M.L., Candan, K.S., Dino, G.A., Rossetti, P. (2016). Crowdsourced semantic enrichment for participatory e-government. In Proceedings of the 8th International Conference on Management of Digital EcoSystems, MEDES.

  • Di Pinto, F., Lembo, D., Lenzerini, M., Mancin, R., Poggi, A., Rosati, R., Ruzzi, M., Savo, D.F. (2013). Optimizing query rewriting in ontology-based data access. In Proceedings of the 16th international conference on extending database technology, EDBT (pp. 561–572).

  • Garcia-Molina, H., Papakonstantinou, Y., Quass, D., Rajararnan, A., Sagiv, Y., Ullman, J., Vassalos, V., Widom, J. (1997). The tsimmis approach to mediation: Data models and languages. JIIS, pp. 2.

  • Gottlob, G., Orsi, G., Pieris, A. (2011). Ontological queries: rewriting and optimization. In 2011 IEEE 27th International Conference on Data Engineering (pp. 2–13).

  • Kambhampati, S., Lambrecht, E., Nambiar, U., Nie, Z., Senthil, G. (2004). Optimizing recursive information gathering plans in emerac. JIIS, 2(2), 119–153.

    MATH  Google Scholar 

  • Kandogan, E., Roth, M., Schwarz, P.M., Hui, J., Terrizzano, I.G., Christodoulakis, C., Miller, R.J. (2015). Labbook: Metadata-driven social collaborative data analysis. In International Conference on Big Data.

  • Levy, A. (1998). The information manifold approach to data integration. IEEE Intelligent Systems, 13, 12–16.

    Google Scholar 

  • Lim, L., Wang, H., Wang, M. (2013). Semantic queries by example. Proceedings of the 16th International Conference on Extending Database Technology (EDBT 2013).

  • Maedche, A., Staab, S., Studer, R., Sure, Y., Volz, R. (2002). Seal – tying up information integration and web site management by ontologies. IEEE Data Engin. Bulletin, 25(1), 10–17.

    Google Scholar 

  • Mortensen, J.M., Musen, M. A., Noy, N. F. (2013a). Developing crowdsourced ontology engineering tasks: an iterative process. In Proceedings of the 1st international workshop on crowdsourcing the semantic web, CrowdSem (pp. 79–88).

  • Mortensen, J.M., Alexander, P.R., Musen, M.A., Noy, N.F. (2013b). Crowdsourcing ontology verification. In Proceedings of the 4th International Conference on Biomedical Ontology, ICBO (pp. 40–45).

  • Munir, K., & Anjum, M.S. (2017). The use of ontologies for effective knowledge modelling and information retrieval. Applied Computing and Informatics, 14(2), 116–126.

    Article  Google Scholar 

  • Munir, K., Odeh, M., Mcclatchey, R. (2012). Ontology-driven relational query formulation using the semantic and assertional capabilities of OWL-DL. Knowledge-Based Systems, 35, 144–159.

    Article  Google Scholar 

  • Suárez-Figueroa, M.C., Gómez-Pérez, A., Motta, E., Gangemi, A. (2012). Ontology engineering in a networked world, chapter 2. Berlin: Springer.

    Book  Google Scholar 

  • Taylor, N.E., & Ives, Z.G. (2006). Reconciling while tolerating disagreement in collaborative data sharing. In Proceedings SIGMOD06.

  • Tekli, J., Chbeir, R., Traina, A.J., Traina, C., Yetongnon, K., Ibanez, C.R., Assad, M.A., Kallas, C. (2018). Full-fledged semantic indexing and querying model designed for seamless integration in legacy rdbms. Data and Knowledge Engineering, 117, 133–173.

    Article  Google Scholar 

  • Xiao, G., Calvanese, D., Kontchakov, R., Lembo, D., Poggi, A., Rosati, R., Zakharyaschev, M. (2018). Ontology-based data access: a survey. In Proceedings IJCAI-18.

  • Xu, L., & Embley, D.W. (2002). Combining the best of global-as-view and local-as-view for data integration. In ISTA (pp. 123–136).

  • Ziegler, P., & Dittrich, K.R. (2007). Data integration - problems, approaches, and perspectives. In J. Krogstie, A.L. Opdahl, S. Brinkkemper (Eds.), Conceptual modelling in information systems engineering, chapter 3 (pp. 39–58). Berlin: Springer.

Download references

Acknowledgements

The research is partially supported by EU grants #641988 and #690817 and NSF grant #1633381. We thank project partners, especially our colleagues from the Earth Sciences Department at the University of Torino, P. Rossetti, G. Dino, and G. Biglia.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Maria Luisa Sapino.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Cavallo, G., Di Mauro, F., Pasteris, P. et al. Crowd Sourced Semantic Enrichment (CroSSE) for knowledge driven querying of digital resources. J Intell Inf Syst 53, 453–480 (2019). https://doi.org/10.1007/s10844-019-00559-8

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10844-019-00559-8

Keywords

Navigation