Advertisement

Moon Landing or Safari? A Study of Systematic Errors and Their Causes in Geographic Linked Data

  • Krzysztof JanowiczEmail author
  • Yingjie Hu
  • Grant McKenzie
  • Song Gao
  • Blake Regalia
  • Gengchen Mai
  • Rui Zhu
  • Benjamin Adams
  • Kerry Taylor
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 9927)

Abstract

While the adoption of Linked Data technologies has grown dramatically over the past few years, it has not come without its own set of growing challenges. The triplification of domain data into Linked Data has not only given rise to a leading role of places and positioning information for the dense interlinkage of data about actors, objects, and events, but also led to massive errors in the generation, transformation, and semantic annotation of data. In a global and densely interlinked graph of data, even seemingly minor error can have far reaching consequences as different datasets make statements about the same resources. In this work we present the first comprehensive study of systematic errors and their potential causes. We also discuss lessons learned and means to avoid some of the introduced pitfalls in the future.

Keywords

Link Data Language Version Cardinal Direction Open Geospatial Consortium Knowledge Graph 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Notes

Acknowledgements

The authors would like to acknowledge partial support by the National Science Foundation (NSF) under award 1440202 EarthCube Building Blocks: Collaborative Proposal: GeoLink Leveraging Semantics and Linked Data for Data Sharing and Discovery in the Geosciences, NSF award 1540849 EarthCube IA: Collaborative Proposal: Cross-Domain Observational Metadata Environmental Sensing Network (X-DOMES), and the USGS award on Linked Data for the National Map.

References

  1. 1.
    Adams, B., Janowicz, K.: Thematic signatures for cleansing and enriching place-related linked data. Int. J. Geogr. Inf. Sci. 29(4), 556–579 (2015)CrossRefGoogle Scholar
  2. 2.
    Beek, W., Rietveld, L., Bazoobandi, H.R., Wielemaker, J., Schlobach, S.: LOD Laundromat: a uniform way of publishing other people’s dirty data. In: Mika, P., et al. (eds.) ISWC 2014, Part I. LNCS, vol. 8796, pp. 213–228. Springer, Heidelberg (2014)Google Scholar
  3. 3.
    Ferrucci, D.A., Brown, E.W., Chu-Carroll, J., Fan, J., Gondek, D., Kalyanpur, A., Lally, A., Murdock, J.W., Nyberg, E., Prager, J.M., Welty, C.A.: Building Watson: an overview of the DeepQA project. AI Mag. 31(3), 59–79 (2010)Google Scholar
  4. 4.
    Fisher, P.F.: Models of uncertainty in spatial data. Geograph. Inf. Syst. 1, 191–205 (1999)Google Scholar
  5. 5.
    Hogan, A., Harth, A., Passant, A., Decker, S., Polleres, A.: Weaving the pedantic web. In: Proceedings of the WWW 2010 Workshop on Linked Data on the Web, LDOW 2010, Raleigh, USA, 27 April 2010 (2010)Google Scholar
  6. 6.
    Hogan, A., Hitzler, P., Janowicz, K.: Linked dataset description papers at the semantic web journal: a critical assessment. Semant. Web 7(2), 105–116 (2016)CrossRefGoogle Scholar
  7. 7.
    Janowicz, K.: Observation-driven geo-ontology engineering. Trans. GIS 16(3), 351–374 (2012)CrossRefGoogle Scholar
  8. 8.
    Janowicz, K., Hitzler, P.: The digital earth as knowledge engine. Semant. Web 3(3), 213–221 (2012)Google Scholar
  9. 9.
    Kontokostas, D., Westphal, P., Auer, S., Hellmann, S., Lehmann, J., Cornelissen, R., Zaveri, A.: Test-driven evaluation of linked data quality. In: Proceedings of the 23rd International Conference on World Wide Web, pp. 747–758. International World Wide Web Conferences Steering (2014)Google Scholar
  10. 10.
    Kuhn, W., Kauppinen, T., Janowicz, K.: Linked data - a paradigm shift for geographic information science. In: Duckham, M., Pebesma, E., Stewart, K., Frank, A.U. (eds.) GIScience 2014. LNCS, vol. 8728, pp. 173–186. Springer, Heidelberg (2014)Google Scholar
  11. 11.
    Lehmann, J., Isele, R., Jakob, M., Jentzsch, A., Kontokostas, D., Mendes, P.N., Hellmann, S., Morsey, M., van Kleef, P., Auer, S., Bizer, C.: DBpedia - a large-scale, multilingual knowledge base extracted from Wikipedia. Semant. Web 6(2), 167–195 (2015)Google Scholar
  12. 12.
    Perry, M., Herring, J.: OGC geosparql-a geographic query language for RDF data. Open Geospatial Consortium (2012)Google Scholar
  13. 13.
    Perry, M., Jain, P., Sheth, A.P.: SPARQL-ST: extending SPARQL to support spatiotemporal queries. In: Ashish, N., Sheth, A.P. (eds.) Geospatial Semantics and the Semantic Web - Foundations, Algorithms, and Applications. Semantic Web and Beyond: Computing for Human Experience, vol. 12, pp. 61–86. Springer, Heidelberg (2011)CrossRefGoogle Scholar
  14. 14.
    Presutti, V., Draicchio, F., Gangemi, A.: Knowledge extraction based on discourse representation theory and linguistic frames. In: ten Teije, A., Völker, J., Handschuh, S., Stuckenschmidt, H., d’Acquin, M., Nikolov, A., Aussenac-Gilles, N., Hernandez, N. (eds.) EKAW 2012. LNCS, vol. 7603, pp. 114–129. Springer, Heidelberg (2012)CrossRefGoogle Scholar
  15. 15.
    Rietveld, L., Verborgh, R., Beek, W., Vander Sande, M., Schlobach, S.: Linked data-as-a-service: the semantic web redeployed. In: Gandon, F., Sabou, M., Sack, H., d’Amato, C., Cudré-Mauroux, P., Zimmermann, A. (eds.) ESWC 2015. LNCS, vol. 9088, pp. 471–487. Springer, Heidelberg (2015)CrossRefGoogle Scholar
  16. 16.
    Williams, A.J., Harland, L., Groth, P., Pettifer, S., Chichester, C., Willighagen, E.L., Evelo, C.T., Blomberg, N., Ecker, G., Goble, C., Mons, B.: Open phacts: semantic interoperability for drug discovery. Drug Discov. Today 17(21), 1188–1198 (2012)CrossRefGoogle Scholar
  17. 17.
    Zaveri, A., Rula, A., Maurino, A., Pietrobon, R., Lehmann, J., Auer, S.: Quality assessment for linked data: a survey. Semant. Web 7(1), 63–93 (2015)CrossRefGoogle Scholar
  18. 18.
    Zhu, R., Hu, Y., Janowicz, K., McKenzie, G.: Spatial signatures for geographic feature types: examining gazetteer ontologies using spatial statistics. Trans. GIS 20(3), 333–355 (2016). doi: 10.1111/tgis.12232 CrossRefGoogle Scholar

Copyright information

© Springer International Publishing Switzerland 2016

Authors and Affiliations

  • Krzysztof Janowicz
    • 1
    Email author
  • Yingjie Hu
    • 1
  • Grant McKenzie
    • 2
  • Song Gao
    • 1
  • Blake Regalia
    • 1
  • Gengchen Mai
    • 1
  • Rui Zhu
    • 1
  • Benjamin Adams
    • 3
  • Kerry Taylor
    • 4
  1. 1.STKO LabUniversity of CaliforniaSanta BarbaraUSA
  2. 2.Department of Geographical SciencesUniversity of MarylandCollege ParkUSA
  3. 3.Centre for eResearchThe University of AucklandAucklandNew Zealand
  4. 4.Australian National UniversityCanberraAustralia

Personalised recommendations