Abstract
The Web has been evolving to a sink of disparate information sources which are totally isolated from each other. The technology of Linked Data (LD) promises to connect such information sources in order to enable their better exploitation by humans or automated programs. While various LD management systems have been proposed, only few of them are able to handle geospatial data which are becoming quite popular nowadays and lead to the creation of large geospatial footprints. However, none of the few systems that support Linked Open Geospatial Data is able to scale well to handle the increasing load from user queries. In addition, the publishing of geospatial LD also becomes quite advantageous due to complexity reasons. To this end, this article proposes a novel, cloud-based geospatial LD management system which can scale out or scale in according to the incoming load in order to serve the respective user requests with the appropriate service level. On top of this system lies a LD-as-a-service offering which abstracts away the user from any LD publishing complexities and provides all the appropriate functionality for enabling a full LD management. We also study and propose architectural solutions for the distributed update problem. The proposed system is evaluated under heavy load scenarios and the results show that the respective improvement in performance incurred is quite satisfactory and that the scaling actions are performed at the appropriate time points.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
- 2.
- 3.
- 4.
- 5.
- 6.
- 7.
- 8.
- 9.
- 10.
- 11.
- 12.
- 13.
- 14.
xml \(\rightarrow \) http://www.w3.org/TR/rdf-sparql-XMLres/. json \(\rightarrow \) http://www.w3.org/TR/2013/REC-sparql11-results-json-20130321/. csv and tsv \(\rightarrow \) http://www.w3.org/TR/2013/REC-sparql11-results-csv-tsv-20130321/.
- 15.
- 16.
- 17.
- 18.
- 19.
- 20.
- 21.
- 22.
- 23.
References
Battle, R., Kolas, D.: Enabling the geospatial semantic web with parliament and geosparql. Semantic Web 3(4), 355–370 (2012)
Bugiotti, F., Goasdoué, F., Kaoudi, Z., Manolescu, I.: RDF data management in the amazon cloud. In: Proceedings of 2012 Joined EDBT/ICDT Workshops, pp. 61–72. ACM, Berlin (2012)
Fielding, R.T., Taylor, R.N.: Principled design of the modern web architecture. ACM Trans. Internet Technol. 2(2), 115–150 (2002). http://doi.acm.org/10.1145/514183.514185
Franke, C., Morin, S., Chebotko, A., Abraham, J., Brazier, P.: Distributed semantic web data management in hbase and mysql cluster. In: Proceedings of the 2011 IEEE 4th International Conference on Cloud Computing, pp. 105–112. CLOUD 2011. IEEE Computer Society, Washington, DC (2011), http://dx.doi.org/10.1109/CLOUD.2011.19
Guéret, C., Groth, P., Oren, E., Schlobach, S.: eRDF: A Scalable architecture for querying the Web of Data. http://bit.ly/eRDF_tr
Guéret, C., Kotoulas, S., Groth, P.: TripleCloud: An infrastructure for exploratory querying over Web-Scale RDF Data. In: Proceedings of the 2011 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology (WI-IAT 2011), pp. 245–248. IEEE Computer Society, Washington, DC (2011)
Harth, A., Umbrich, J., Hogan, A., Decker, S.: YARS2: a federated repository for querying graph structured data from the web. In: Aberer, K., et al. (eds.) ASWC 2007 and ISWC 2007. LNCS, vol. 4825, pp. 211–224. Springer, Heidelberg (2007)
Hoffart, J., Suchanek, F.M., Berberich, K., Weikum, G.: Yago2: a spatially and temporally enhanced knowledge base from wikipedia. Artif. Intell. 194, 28–61 (2013). http://dx.doi.org/10.1016/j.artint.2012.06.001
Husain, M.F., Khan, L., Kantarcioglu, M., Thuraisingham, B.M.: Data intensive query processing for large rdf graphs using cloud computing tools. In: IEEE CLOUD, pp. 1–10. IEEE (2010). http://dblp.uni-trier.de/db/conf/IEEEcloud/IEEEcloud2010.html#HusainKKT10
Kritikos, K., Roussakis, Y., Kotzinos, D.: Linked open GeoData management in the cloud. In: 2nd International Workshop on Open Data (WOD 2013), Paris, France (2013)
Kyzirakos, K., Karpathiotakis, M., Koubarakis, M.: Strabon: a semantic geospatial DBMS. In: Cudré-Mauroux, P., et al. (eds.) ISWC 2012, Part I. LNCS, vol. 7649, pp. 295–311. Springer, Heidelberg (2012)
Ladwig, G., Harth, A.: CumulusRDF: linked data management on nested key-value stores. In: Proceedings of the 7th International Workshop on Scalable Semantic Web Knowledge Base Systems (SSWS 2011) (2011)
Le-Phuoc, D., Parreira, J.X., Hausenblas, M., Han, Y., Hauswirth, M.: Live linked open sensor database. In: Proceedings of the 6th International Conference on Semantic Systems, I-SEMANTICS 2010, pp. 46:1–46:4. ACM, New York (2010). http://doi.acm.org/10.1145/1839707.1839763
Mika, P., Tummarello, G.: Web semantics in the clouds. IEEE Intell. Syst. 23(5), 82–87 (2008). http://dx.doi.org/10.1109/MIS.2008.94
Neumann, T., Weikum, G.: The rdf-3x engine for scalable management of rdf data. VLDB J. 19(1), 91–113 (2010)
Newman, A., Li, Y.F., Hunter, J.: Scalable semantics - the silver lining of cloud computing. In: Proceedings of the 2008 Fourth IEEE International Conference on eScience, ESCIENCE 2008, pp. 111–118, IEEE Computer Society, Washington, DC (2008). http://dx.doi.org/10.1109/eScience.2008.23
Papailiou, N., Konstantinou, I., Tsoumakos, D., Koziris, N.: H2rdf: Adaptive query processing on rdf data in the cloud. In: Proceedings of the 21st International Conference Companion on World Wide Web, WWW 2012 Companion, pp. 397–400. ACM, New York (2012). http://doi.acm.org/10.1145/2187980.2188058
Ravindra, P., Deshpande, V.V., Anyanwu, K.: Towards scalable rdf graph analytics on mapreduce. In: Proceedings of the 2010 Workshop on Massive Data Analytics on the Cloud, MDAC 2010, pp. 5:1–5:6. ACM, New York (2010). http://doi.acm.org/10.1145/1779599.1779604
Richardson, L., Ruby, S.: RESTful Web Services. O’Reilly Media, USA (2007)
Stein, R., Zacharias, V.: RDF on cloud number nine. In: 4th Workshop on New Forms of Reasoning for the Semantic Web: Scalable and Dynamic, pp. 11–23. CEUR (2010)
Sun, J., Jin, Q.: Scalable rdf store based on hbase and mapreduce. In: 3rd International Conference on Advanced Computer Theory and Engineering (ICACTE 2010), pp. 633–636. IEEE (2010)
Tanimura, Y., Matono, A., Lynden, S., Kojima, I.: Extensions to the pig data processing platform for scalable rdf data processing using hadoop. In: IEEE 30th International Conference on Data Engineering Workshops (ICDEW 2010), pp. 251–256. IEEE Computer Society, Los Alamitos (2010)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Appendix A - Experiment SPARQL Queries
Appendix A - Experiment SPARQL Queries
1st query
2nd query
Rights and permissions
Copyright information
© 2015 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Kritikos, K., Rousakis, Y., Kotzinos, D. (2015). A Cloud-Based, Geospatial Linked Data Management System. In: Hameurlain, A., Küng, J., Wagner, R., Sakr, S., Wang, L., Zomaya, A. (eds) Transactions on Large-Scale Data- and Knowledge-Centered Systems XX. Lecture Notes in Computer Science(), vol 9070. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-46703-9_3
Download citation
DOI: https://doi.org/10.1007/978-3-662-46703-9_3
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-662-46702-2
Online ISBN: 978-3-662-46703-9
eBook Packages: Computer ScienceComputer Science (R0)