Skip to main content

A Rule-based Framework for Creating Instance Data from OpenStreetMap

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 9209))

Abstract

Reasoning engines for ontological and rule-based knowledge bases are becoming increasingly important in areas like the Semantic Web or information integration. It has been acknowledged however that judging the performance of such reasoners and their underlying algorithms is difficult due to the lack of publicly available datasets with large amounts of (real-life) instance data. In this paper we describe a framework and a toolbox for creating such datasets, which is based on extracting instances from the publicly available OpenStreetMap (OSM) geospatial database. To this end, we give a formalization of OSM and present a rule-based language to specify the rules to extract instance data from OSM data. The declarative nature of the approach in combination with external functions and parameters allows one to create several variants of the dataset via small modifications of the specification. We describe a highly flexible toolbox to extract instance data from a given OSM map and a given set of rules. We have employed our tools to create benchmarks that have already been fruitfully used in practice.

Supported by the Vienna Science and Technology Fund (WWTF) project ICT12-15, by the Austrian Science Fund (FWF) project P25207, and by the EU project Optique FP7-318338.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   34.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   44.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

  1. 1.

    http://www.openstreetmap.org.

  2. 2.

    E.g., visible in https://www.mapbox.com/osm-data-report/.

  3. 3.

    For clarity, we rename the expressions used in OSM.

  4. 4.

    See the detailed syntax, prerequisites, and tools on https://github.com/ghxiao/city-bench.

  5. 5.

    https://github.com/ghxiao/owl-toolkit.

  6. 6.

    http://www.dlvsystem.com/dlv/.

  7. 7.

    https://github.com/enaeseth/python-fp-growth.

  8. 8.

    https://github.com/ghxiao/city-bench/tree/master/benchmarks/rr2015.

  9. 9.

    Downloaded on the 1.10.14 from http://download.bbbike.org/osm/bbbike/.

  10. 10.

    http://www.kr.tuwien.ac.at/research/systems/dlvhex/.

References

  1. Abiteboul, S., Hull, R., Vianu, V.: Foundations of Databases. Addison-Wesley, Boston (1995)

    Google Scholar 

  2. Baader, F., Brand, S., Lutz, C.: Pushing the \(\cal {EL}\) envelope. In: Proceedings of IJCAI 2005, pp. 364–369. Morgan-Kaufmann Publishers (2005)

    Google Scholar 

  3. Borgelt, C.: Frequent item set mining. Data Min. Knowl. Disc. 2(6), 437–456 (2012). Wiley Interdisciplinary Reviews

    Article  Google Scholar 

  4. Calvanese, D., De Giacomo, G., Lembo, D., Lenzerini, M., Rosati, R.: Tractable reasoning and efficient query answering in description logics: the dl-lite family. J. Autom. Reasoning 39(3), 385–429 (2007)

    Article  Google Scholar 

  5. Eiter, T., Fink, M., Stepanova, D.: Computing repairs for inconsistent DL-programs over \({\cal EL}\) ontologies. In: Fermé, E., Leite, J. (eds.) JELIA 2014. LNCS, vol. 8761, pp. 426–441. Springer, Heidelberg (2014)

    Google Scholar 

  6. Eiter, T., Krennwallner, T., Schneider, P.: Lightweight spatial conjunctive query answering using keywords. In: Cimiano, P., Corcho, O., Presutti, V., Hollink, L., Rudolph, S. (eds.) ESWC 2013. LNCS, vol. 7882, pp. 243–258. Springer, Heidelberg (2013)

    Chapter  Google Scholar 

  7. Eiter, T., Schneider, P., Simkus, M., Xiao, G.: Using openstreetmap data to create benchmarks for description logic reasoners. In: Informal proceedings of ORE 2014, July 2014

    Google Scholar 

  8. Garbis, G., Kyzirakos, K., Koubarakis, M.: Geographica: a benchmark for geospatial rdf stores (Long Version). In: Alani, H., Kagal, L., Fokoue, A., Groth, P., Biemann, C., Parreira, J.X., Aroyo, L., Noy, N., Welty, C., Janowicz, K. (eds.) ISWC 2013, Part II. LNCS, vol. 8219, pp. 343–359. Springer, Heidelberg (2013)

    Chapter  Google Scholar 

  9. Gelfond, M., Lifschitz, V.: The stable model semantics for logic programming. In: Proceedings of ICLP/SLP 1988, vol. 88, pp. 1070–1080 (1988)

    Google Scholar 

  10. Guo, Y., Pan, Z., Heflin, J.: LUBM: a benchmark for OWL knowledge base systems. Web Semantics 3(2–3), 158–182 (2005)

    Article  Google Scholar 

  11. Imprialou, M., Stoilos, G., Cuenca Grau, B., Benchmarking ontology-based query rewriting systems. In: Proceedings of AAAI 2012 (2012)

    Google Scholar 

  12. Kolas, D: A benchmark for spatial semantic web systems. In: 4th International Workshop on Scalable Semantic Web Knowledge Base Systems (SSWS2008), October 2008

    Google Scholar 

  13. Kollia, I., Glimm, B.: Optimizing SPARQL query answering over OWL ontologies. J. Artif. Intell. Res. (JAIR) 48, 253–303 (2013)

    MathSciNet  Google Scholar 

  14. Kontchakov, R., Rezk, M., Rodríguez-Muro, M., Xiao, G., Zakharyaschev, M.: Answering SPARQL queries over databases under OWL 2 QL entailment regime. In: Mika, P., Tudorache, T., Bernstein, A., Welty, C., Knoblock, C., Vrandečić, D., Groth, P., Noy, N., Janowicz, K., Goble, C. (eds.) ISWC 2014, Part I. LNCS, vol. 8796, pp. 552–567. Springer, Heidelberg (2014)

    Google Scholar 

  15. Lanti, D., Rezk, M., Xiao, G., Calvanese, D.: The NPD benchmark: reality check for OBDA systems. In: Proceedings of EDBT 2015. ACM Press (2015)

    Google Scholar 

  16. Ma, L., Yang, Y., Qiu, Z., Xie, G.T., Pan, Y., Liu, S.: Towards a complete OWL ontology benchmark. In: Sure, Y., Domingue, J. (eds.) ESWC 2006. LNCS, vol. 4011, pp. 125–139. Springer, Heidelberg (2006)

    Chapter  Google Scholar 

  17. Mora, J., Corcho, O.: Towards a systematic benchmarking of ontology-based query rewriting systems. In: Alani, H., Kagal, L., Fokoue, A., Groth, P., Biemann, C., Parreira, J.X., Aroyo, L., Noy, N., Welty, C., Janowicz, K. (eds.) ISWC 2013, Part II. LNCS, vol. 8219, pp. 376–391. Springer, Heidelberg (2013)

    Chapter  Google Scholar 

  18. Pérez-Urbina, H., Horrocks, I., Motik, B.: Efficient query answering for OWL 2. In: Bernstein, A., Karger, D.R., Heath, T., Feigenbaum, L., Maynard, D., Motta, E., Thirunarayan, K. (eds.) ISWC 2009. LNCS, vol. 5823, pp. 489–504. Springer, Heidelberg (2009)

    Chapter  Google Scholar 

  19. Sirin, E., Parsia, P., Cuenca Grau, B., Kalyanpur, A., Katz, Y.: Pellet: a practical OWL-DL reasoner. J. Web Sem. 5(2), 51–53 (2007)

    Article  Google Scholar 

  20. Stoilos, G., Cuenca Grau, B., Horrocks, I.: How incomplete is your semantic web reasoner? In: Proceedings of AAAI 2010. AAAI Press (2010)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Patrik Schneider .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Eiter, T., Pan, J.Z., Schneider, P., Šimkus, M., Xiao, G. (2015). A Rule-based Framework for Creating Instance Data from OpenStreetMap . In: ten Cate, B., Mileo, A. (eds) Web Reasoning and Rule Systems. RR 2015. Lecture Notes in Computer Science(), vol 9209. Springer, Cham. https://doi.org/10.1007/978-3-319-22002-4_8

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-22002-4_8

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-22001-7

  • Online ISBN: 978-3-319-22002-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics