Skip to main content

A Cloud-Based, Geospatial Linked Data Management System

  • Chapter
  • First Online:
Transactions on Large-Scale Data- and Knowledge-Centered Systems XX

Part of the book series: Lecture Notes in Computer Science ((TLDKS,volume 9070))

Abstract

The Web has been evolving to a sink of disparate information sources which are totally isolated from each other. The technology of Linked Data (LD) promises to connect such information sources in order to enable their better exploitation by humans or automated programs. While various LD management systems have been proposed, only few of them are able to handle geospatial data which are becoming quite popular nowadays and lead to the creation of large geospatial footprints. However, none of the few systems that support Linked Open Geospatial Data is able to scale well to handle the increasing load from user queries. In addition, the publishing of geospatial LD also becomes quite advantageous due to complexity reasons. To this end, this article proposes a novel, cloud-based geospatial LD management system which can scale out or scale in according to the incoming load in order to serve the respective user requests with the appropriate service level. On top of this system lies a LD-as-a-service offering which abstracts away the user from any LD publishing complexities and provides all the appropriate functionality for enabling a full LD management. We also study and propose architectural solutions for the distributed update problem. The proposed system is evaluated under heavy load scenarios and the results show that the respective improvement in performance incurred is quite satisfactory and that the scaling actions are performed at the appropriate time points.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    http://virtuoso.openlinksw.com/.

  2. 2.

    http://inspire.jrc.ec.europa.eu/.

  3. 3.

    http://www.w3.org/TR/2013/REC-sparql11-query-20130321/.

  4. 4.

    https://cloud.google.com/bigquery/.

  5. 5.

    http://hbase.apache.org/.

  6. 6.

    www.dydra.com.

  7. 7.

    www.franz.com/agraph/allegrograph/.

  8. 8.

    www.systap.com/bigdata.htm.

  9. 9.

    www.ontotext.com/owlim&www.ontotext.com/owlim/geo-spatial.

  10. 10.

    http://virtuoso.openlinksw.com.

  11. 11.

    http://www.oracle.com/technetwork/database-options/spatialandgraph/overview/rdfsemantic-graph-1902016.html.

  12. 12.

    www.mpi-inf.mpg.de/yago-naga/yago/.

  13. 13.

    http://jena.apache.org/documentation/tdb.

  14. 14.

    xml \(\rightarrow \) http://www.w3.org/TR/rdf-sparql-XMLres/. json \(\rightarrow \) http://www.w3.org/TR/2013/REC-sparql11-results-json-20130321/. csv and tsv \(\rightarrow \) http://www.w3.org/TR/2013/REC-sparql11-results-csv-tsv-20130321/.

  15. 15.

    http://www.w3.org/2001/sw/RDFCore/ntriples/.

  16. 16.

    http://www.w3.org/TR/turtle/.

  17. 17.

    http://www.w3.org/TR/sparql11-update/.

  18. 18.

    http://www.w3.org/TR/r2rml/.

  19. 19.

    https://portal.ingeoclouds.eu/ingeoclouds-api/linkeddata/.

  20. 20.

    http://docs.openlinksw.com/virtuoso/rdfsparqlgeospat.html.

  21. 21.

    https://dev.opensahara.com/projects/useekm.

  22. 22.

    http://www.opengeospatial.org/standards/gml.

  23. 23.

    http://www.opengeospatial.org/standards/kml.

References

  1. Battle, R., Kolas, D.: Enabling the geospatial semantic web with parliament and geosparql. Semantic Web 3(4), 355–370 (2012)

    Google Scholar 

  2. Bugiotti, F., Goasdoué, F., Kaoudi, Z., Manolescu, I.: RDF data management in the amazon cloud. In: Proceedings of 2012 Joined EDBT/ICDT Workshops, pp. 61–72. ACM, Berlin (2012)

    Google Scholar 

  3. Fielding, R.T., Taylor, R.N.: Principled design of the modern web architecture. ACM Trans. Internet Technol. 2(2), 115–150 (2002). http://doi.acm.org/10.1145/514183.514185

    Article  Google Scholar 

  4. Franke, C., Morin, S., Chebotko, A., Abraham, J., Brazier, P.: Distributed semantic web data management in hbase and mysql cluster. In: Proceedings of the 2011 IEEE 4th International Conference on Cloud Computing, pp. 105–112. CLOUD 2011. IEEE Computer Society, Washington, DC (2011), http://dx.doi.org/10.1109/CLOUD.2011.19

  5. Guéret, C., Groth, P., Oren, E., Schlobach, S.: eRDF: A Scalable architecture for querying the Web of Data. http://bit.ly/eRDF_tr

  6. Guéret, C., Kotoulas, S., Groth, P.: TripleCloud: An infrastructure for exploratory querying over Web-Scale RDF Data. In: Proceedings of the 2011 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology (WI-IAT 2011), pp. 245–248. IEEE Computer Society, Washington, DC (2011)

    Google Scholar 

  7. Harth, A., Umbrich, J., Hogan, A., Decker, S.: YARS2: a federated repository for querying graph structured data from the web. In: Aberer, K., et al. (eds.) ASWC 2007 and ISWC 2007. LNCS, vol. 4825, pp. 211–224. Springer, Heidelberg (2007)

    Chapter  Google Scholar 

  8. Hoffart, J., Suchanek, F.M., Berberich, K., Weikum, G.: Yago2: a spatially and temporally enhanced knowledge base from wikipedia. Artif. Intell. 194, 28–61 (2013). http://dx.doi.org/10.1016/j.artint.2012.06.001

    Article  MATH  MathSciNet  Google Scholar 

  9. Husain, M.F., Khan, L., Kantarcioglu, M., Thuraisingham, B.M.: Data intensive query processing for large rdf graphs using cloud computing tools. In: IEEE CLOUD, pp. 1–10. IEEE (2010). http://dblp.uni-trier.de/db/conf/IEEEcloud/IEEEcloud2010.html#HusainKKT10

  10. Kritikos, K., Roussakis, Y., Kotzinos, D.: Linked open GeoData management in the cloud. In: 2nd International Workshop on Open Data (WOD 2013), Paris, France (2013)

    Google Scholar 

  11. Kyzirakos, K., Karpathiotakis, M., Koubarakis, M.: Strabon: a semantic geospatial DBMS. In: Cudré-Mauroux, P., et al. (eds.) ISWC 2012, Part I. LNCS, vol. 7649, pp. 295–311. Springer, Heidelberg (2012)

    Chapter  Google Scholar 

  12. Ladwig, G., Harth, A.: CumulusRDF: linked data management on nested key-value stores. In: Proceedings of the 7th International Workshop on Scalable Semantic Web Knowledge Base Systems (SSWS 2011) (2011)

    Google Scholar 

  13. Le-Phuoc, D., Parreira, J.X., Hausenblas, M., Han, Y., Hauswirth, M.: Live linked open sensor database. In: Proceedings of the 6th International Conference on Semantic Systems, I-SEMANTICS 2010, pp. 46:1–46:4. ACM, New York (2010). http://doi.acm.org/10.1145/1839707.1839763

  14. Mika, P., Tummarello, G.: Web semantics in the clouds. IEEE Intell. Syst. 23(5), 82–87 (2008). http://dx.doi.org/10.1109/MIS.2008.94

    Article  Google Scholar 

  15. Neumann, T., Weikum, G.: The rdf-3x engine for scalable management of rdf data. VLDB J. 19(1), 91–113 (2010)

    Article  Google Scholar 

  16. Newman, A., Li, Y.F., Hunter, J.: Scalable semantics - the silver lining of cloud computing. In: Proceedings of the 2008 Fourth IEEE International Conference on eScience, ESCIENCE 2008, pp. 111–118, IEEE Computer Society, Washington, DC (2008). http://dx.doi.org/10.1109/eScience.2008.23

  17. Papailiou, N., Konstantinou, I., Tsoumakos, D., Koziris, N.: H2rdf: Adaptive query processing on rdf data in the cloud. In: Proceedings of the 21st International Conference Companion on World Wide Web, WWW 2012 Companion, pp. 397–400. ACM, New York (2012). http://doi.acm.org/10.1145/2187980.2188058

  18. Ravindra, P., Deshpande, V.V., Anyanwu, K.: Towards scalable rdf graph analytics on mapreduce. In: Proceedings of the 2010 Workshop on Massive Data Analytics on the Cloud, MDAC 2010, pp. 5:1–5:6. ACM, New York (2010). http://doi.acm.org/10.1145/1779599.1779604

  19. Richardson, L., Ruby, S.: RESTful Web Services. O’Reilly Media, USA (2007)

    Google Scholar 

  20. Stein, R., Zacharias, V.: RDF on cloud number nine. In: 4th Workshop on New Forms of Reasoning for the Semantic Web: Scalable and Dynamic, pp. 11–23. CEUR (2010)

    Google Scholar 

  21. Sun, J., Jin, Q.: Scalable rdf store based on hbase and mapreduce. In: 3rd International Conference on Advanced Computer Theory and Engineering (ICACTE 2010), pp. 633–636. IEEE (2010)

    Google Scholar 

  22. Tanimura, Y., Matono, A., Lynden, S., Kojima, I.: Extensions to the pig data processing platform for scalable rdf data processing using hadoop. In: IEEE 30th International Conference on Data Engineering Workshops (ICDEW 2010), pp. 251–256. IEEE Computer Society, Los Alamitos (2010)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Kyriakos Kritikos .

Editor information

Editors and Affiliations

Appendix A - Experiment SPARQL Queries

Appendix A - Experiment SPARQL Queries

1st query

figure a

2nd query

figure b

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Kritikos, K., Rousakis, Y., Kotzinos, D. (2015). A Cloud-Based, Geospatial Linked Data Management System. In: Hameurlain, A., Küng, J., Wagner, R., Sakr, S., Wang, L., Zomaya, A. (eds) Transactions on Large-Scale Data- and Knowledge-Centered Systems XX. Lecture Notes in Computer Science(), vol 9070. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-46703-9_3

Download citation

  • DOI: https://doi.org/10.1007/978-3-662-46703-9_3

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-662-46702-2

  • Online ISBN: 978-3-662-46703-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics