GeoInformatica

, Volume 20, Issue 3, pp 503–527

Geographically weighted evidence combination approaches for combining discordant and inconsistent volunteered geographical information

  • Alexis Comber
  • Cidália Fonte
  • Giles Foody
  • Steffen Fritz
  • Paul Harris
  • Ana-Maria Olteanu-Raimond
  • Linda See
Article

DOI: 10.1007/s10707-016-0248-z

Cite this article as:
Comber, A., Fonte, C., Foody, G. et al. Geoinformatica (2016) 20: 503. doi:10.1007/s10707-016-0248-z

Abstract

There is much interest in being able to combine crowdsourced data. One of the critical issues in information sciences is how to combine data or information that are discordant or inconsistent in some way. Many previous approaches have taken a majority rules approach under the assumption that most people are correct most of the time. This paper analyses crowdsourced land cover data generated by the Geo-Wiki initiative in order to infer the land cover present at locations on a 50 km grid. It compares four evidence combination approaches (Dempster-Shafer, Bayes, Fuzzy Sets and Possibility) applied under a geographically weighted kernel with the geographically weighted average approach applied in many current Geo-Wiki analyses. A geographically weighted approach uses a moving kernel under which local analyses are undertaken. The contribution (or salience) of each data point to the analysis is weighted by its distance to the kernel centre, reflecting Tobler’s 1st law of geography. A series of analyses were undertaken using different kernel sizes (or bandwidths). Each of the geographically weighted evidence combination methods generated spatially distributed measures of belief in hypotheses associated with the presence of individual land cover classes at each location on the grid. These were compared with GlobCover, a global land cover product. The results from the geographically weighted average approach in general had higher correspondence with the reference data and this increased with bandwidth. However, for some classes other evidence combination approaches had higher correspondences possibly because of greater ambiguity over class conceptualisations and / or lower densities of crowdsourced data. The outputs also allowed the beliefs in each class to be mapped. The differences in the soft and the crisp maps are clearly associated with the logics of each evidence combination approach and of course the different questions that they ask of the data. The results show that discordant data can be combined (rather than being removed from analysis) and that data integrated in this way can be parameterised by different measures of belief uncertainty. The discussion highlights a number of critical areas for future research.

Keywords

Crowdsourcing Land cover Data quality VGI Data mining 

Copyright information

© Springer Science+Business Media New York 2016

Authors and Affiliations

  1. 1.School of GeographyUniversity of LeedsLeedsUK
  2. 2.Department of MathematicsUniversity of CoimbraCoimbraPortugal
  3. 3.Institute for Systems and Computers Engineering at CoimbraCoimbraPortugal
  4. 4.School of GeographyUniversity of NottinghamNottinghamUK
  5. 5.Ecosystems Services and Management ProgramInternational Institute for Applied Systems Analysis (IIASA)LaxenburgAustria
  6. 6.Sustainable Soil and Grassland Systems, Rothamsted ResearchOkehamptonUK
  7. 7.COGIT LaboratoryInstitut National de l’Information Géographique et Forestière Saint-MandéFrance

Personalised recommendations