MatWare: Constructing and Exploiting Domain Specific Warehouses by Aggregating Semantic Data

  • Yannis Tzitzikas
  • Nikos Minadakis
  • Yannis Marketakis
  • Pavlos Fafalios
  • Carlo Allocca
  • Michalis Mountantonakis
  • Ioanna Zidianaki
Part of the Lecture Notes in Computer Science book series (LNCS, volume 8465)

Abstract

In many applications one has to fetch and assemble pieces of information coming from more than one web source, such as SPARQL endpoints. In this paper we describe the corresponding requirements and challenges, based on our experience, and then present a process and a tool that we have developed, called MatWare, for constructing such semantic warehouses. We focus on domain-specific warehouses, where emphasis is placed on scope control, connectivity assessment, provenance, and freshness. MatWare (Materialized Warehouse) automates the construction (and reconstruction) of such warehouses and offers methods for tackling the aforementioned requirements. Finally, we report our experiences from using it to build, maintain and evolve an operational semantic warehouse for the marine domain, which is currently in use by several applications ranging from e-infrastructure services to smartphone applications.
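To make the idea of assembling data from several sources while keeping provenance more concrete, here is a minimal sketch in Python. It is not MatWare's implementation or API. The source names, the `ex:` resources, and the helper functions `build_warehouse`, `resources` and `common_resources` are all hypothetical, and the "sources" are in-memory lists of triples standing in for data that would normally be fetched from SPARQL endpoints. The count of shared resources is only a naive stand-in for a connectivity assessment, not the metrics used by the authors.

```python
from collections import defaultdict

# Hypothetical in-memory "sources"; a real warehouse would fetch these
# triples from SPARQL endpoints or other web sources.
SOURCES = {
    "sourceA": [
        ("ex:Thunnus_albacares", "rdf:type", "ex:Species"),
        ("ex:Thunnus_albacares", "ex:commonName", "Yellowfin tuna"),
    ],
    "sourceB": [
        ("ex:Thunnus_albacares", "ex:livesIn", "ex:Atlantic_Ocean"),
        ("ex:Atlantic_Ocean", "rdf:type", "ex:WaterArea"),
    ],
}


def build_warehouse(sources):
    """Map each triple to the set of source names that contributed it."""
    warehouse = defaultdict(set)
    for name, triples in sources.items():
        for triple in triples:
            warehouse[triple].add(name)  # provenance is kept per triple
    return dict(warehouse)


def resources(triples):
    """Collect subjects, plus objects that look like ex: resources (literals are skipped)."""
    found = set()
    for subject, _predicate, obj in triples:
        found.add(subject)
        if obj.startswith("ex:"):
            found.add(obj)
    return found


def common_resources(sources, a, b):
    """Resources mentioned by both sources: a naive indicator of how well they connect."""
    return resources(sources[a]) & resources(sources[b])


warehouse = build_warehouse(SOURCES)
shared = common_resources(SOURCES, "sourceA", "sourceB")
```

Keeping the provenance as a set per triple means that a triple contributed by several sources is stored once but can still be traced back to each of them, which is one simple way to support the provenance requirement mentioned in the abstract.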

Keywords

#eswc2014Tzitzikas 

Copyright information

© Springer International Publishing Switzerland 2014

Authors and Affiliations

  • Yannis Tzitzikas (1, 2)
  • Nikos Minadakis (1)
  • Yannis Marketakis (1)
  • Pavlos Fafalios (1, 2)
  • Carlo Allocca (1)
  • Michalis Mountantonakis (1, 2)
  • Ioanna Zidianaki (1, 2)
  1. Institute of Computer Science, FORTH-ICS, Greece
  2. Computer Science Department, University of Crete, Greece
