Abstract
Due to the fast pace increasing availability and diversity of information sources in environmental sciences, there is a real need of sound statistical mapping techniques for using them jointly inside a unique theoretical framework. As these information sources may vary both with respect to their nature (continuous vs. categorical or qualitative), their spatial density as well as their intrinsic quality (soft vs. hard data), the design of such techniques is a challenging issue. In this paper, an efficient method for combining spatially non-exhaustive categorical and continuous data in a mapping context is proposed, based on the Bayesian maximum entropy paradigm. This approach relies first on the definition of a mixed random field, that can account for a stochastic link between categorical and continuous random fields through the use of a cross-covariance function. When incorporating general knowledge about the first- and second-order moments of these fields, it is shown that, under mild hypotheses, their joint distribution can be expressed as a mixture of conditional Gaussian prior distributions, with parameters estimation that can be obtained from entropy maximization. A posterior distribution that incorporates the various (soft or hard) continuous and categorical data at hand can then be obtained by a straightforward conditionalization step. The use and potential of the method is illustrated by the way of a simulated case study. A comparison with few common geostatistical methods in some limit cases also emphasizes their similarities and differences, both from the theoretical and practical viewpoints. As expected, adding categorical information may significantly improve the spatial prediction of a continuous variable, making this approach powerful and very promising.
Similar content being viewed by others
References
Bogaert P (2002) Spatial prediction of categorical variables: the Bayesian maximum entropy approach. Stoch Environ Res Risk Assess 16:425–448
Brus DJ, de Gruijter JJ, Marsman BA, Visschers R, Bregt AK, Breeuwsma A, Bouma J (1996) The performance of spatial interpolation methods and choropleth maps to estimate properties at points: a soil survey case study. Environmetrics 7:1–16
Christakos G (1990) A Bayesian maximum-entropy view to the spatial estimation problem. Math Geol 22:763–777
Christakos G (2000) Modern spatiotemporal geostatistics. Oxford University Press, New York
Christakos G, Li X (1998) Bayesian maximum entropy analysis and mapping: a farewell to kriging estimators? Math Geol 30:435–462
Christakos G, Bogaert P, Serre ML (2002) Temporal GIS. Advanced functions for field-based applications. Springer, Berlin Heidelberg New York
Cressie N (1990) The origins of kriging. Math Geol 22:239–252
Diggle P, Tawn J, Moyeed R (1998) Model-based geostatistics. J R Stat Soc: Appl Stat 47:299–350
Dobermann A, Ping JL (2004) Geostatistical integration of yield monitor data and remote sensing improves yield maps. Agron J 96:285–297
D’Or D, Bogaert P, Christakos G (2001) Application of the BME approach to soil texture mapping. Stoch Environ Res Risk Assess 15:87–100
Douaik A, Van Meirvenne M, Tóth T, Serre ML (2004) Space-time mapping of soil salinity using probabilistic bayesian maximum entropy. Stoch Environ Res Risk Assess 18:219–227
Goovaerts P (1997) Geostatistics for natural resources evaluation. Oxford University Press, New York
Jaynes ET (1982) On the rational of maximum-entropy methods. Proc IEEE 70:939–952
Journel AG (1983) Nonparametric estimation of spatial distributions. Math Geol 45:445–468
Lloyd CD, Atkinson PM (2004) Increased accuracy of geostatistical prediction of nitrogen dioxide in the United Kingdom with secondary data. Int J Appl Earth Obs Geoinform 5:293–305
Meul M, Van Meirvenne M (2003) Kriging soil texture under different types of nonstationarity. Geoderma 112:217–233
Savelieva E, Demyanova V, Kanevskia M, Serre ML, Christakos G (2005) BME-based uncertainty assessment of the Chernobyl fallout. Geoderma 128:312–324
Venables WN, Ripley BD (1994) Modern applied statistics with S-Plus. Springer, Berlin Heidelberg New York
Wang F (1990) Fuzzy supervised classification of remote sensing images. IEEE Trans Geosci Remote Sens 28:94–201
Acknowledgments
This work has been supported for the first author by a Belgian grant of the Fonds pour la formation à la Recherche dans l’Industrie et dans l’Agriculture (FRIA).
Author information
Authors and Affiliations
Corresponding author
Appendices
Appendix 1
As we assumed that each Y(x) is Bernouilli distributed, its discrete pdf is
and the equivalent continuous pdf is thus given by
In a similar way, as the joint discrete pdf for \(\left(Y_{k}({\bf x}_{i}), Y_{k \prime}({\bf x}_{j})\right)\) is given by
the equivalent continuous pdf is
whereas for the joint pdf of \((Y_{k} ({\bf x}_{i}),Z({\bf x}_{j}))\), we will express it as
Similar notations can be, of course, obtained for the joint pdf of any combinations of several continuous and/or binary variables.
Appendix 2
The general expression for a multivariate Gaussian distribution \(N(\varvec{\mu}_{\ell},\Sigma)\) over n+1 variables is given by
Expanding the quadratic term inside this exponential, we can write that
where the other factors are constant with respect to z. By identification with Eq. 33, we thus have the results
so that, as stated,
Rights and permissions
About this article
Cite this article
Wibrin, M., Bogaert, P. & Fasbender, D. Combining categorical and continuous spatial information within the Bayesian maximum entropy paradigm. Stoch Environ Res Ris Assess 20, 423–433 (2006). https://doi.org/10.1007/s00477-006-0035-8
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00477-006-0035-8