Designing environmental monitoring networks to measure extremes
- 120 Downloads
- 14 Citations
Abstract
This paper discusses challenges arising in the design of networks for monitoring extreme values over the domain of a random environmental space-time field {X ij } i = 1, . . . , I denoting site and j = 1, . . . denoting time (e.g. hour). The field of extremes for time span r over site domain i = 1, . . . ,I is given by \(\{Y_{i(r+1)}=\max_{j=k}^{k+n-1} X_{ij}\}\) for k = 1 + rn, r = 0, . . . ,. Such networks must not only measure extremes at the monitored sites but also enable their prediction at the non-monitored ones. Designing such a network poses special challenges that do not seem to have been generally recognized. One of these problems is the loss of spatial dependence between site responses in going from the environmental process to the field of extremes it generates. In particular we show empirically that the intersite covariance Cov(Y i(r+1),Y i′(r+1)) can generally decline toward zero as r increases, for site pairs i ≠ i′. Thus the measured extreme values may not predict the unmeasured ones very precisely. Consequently high levels of pollution exposure of a sensitive group (e.g. school children) located between monitored sites may be overlooked. This potential deficiency raises concerns about the adequacy of air pollution monitoring networks whose primary role is the detection of noncompliance with air quality standards based on extremes designed to protect human health. The need to monitor for noncompliance and thereby protect human health, points to other issues. How well do networks designed to monitor the field monitor their fields of extremes? What criterion should be used to select prospective monitoring sites when setting up or adding to a network? As the paper demonstrates by assessing an existing network, the answer to the first question is not well, at least in the case considered. To the second, the paper suggests a variety of plausible answers but shows through a simulation study, that they can lead to different optimum designs. The paper offers an approach that circumvents the dilemma posed by the answer to the second question. That approach models the field of extremes (suitably transformed) by a multivariate Gaussian-Inverse Wishart hierarchical Bayesian distribution. The adequacy of this model is empirically assessed in an application by finding the relative coverage frequency of the predictive credibility ellipsoid implied by its posterior distribution. The favorable results obtained suggest this posterior adequately describes that (transformed) field. Hence it can form the basis for designing an appropriate network. Its use is demonstrated by a hypothetical extension of an existing monitoring network. That foundation in turn enables a network to be designed of sufficient density (relative to cost) to serve its regulatory purpose.
Keywords
Extreme values Bayesian hierarchical models Space–time models Generalized inverted Wishart Multivariate extremes Spatial design Maxent Maximum entropyPreview
Unable to display preview. Download preview PDF.
References
- Angulo JM and Bueso MC (2001). Random perturbation methods applied to multivariate spatial sampling design. Environmetrics 12: 631–646 CrossRefGoogle Scholar
- Bueso MC, Angulo JM and Alonso FJ (1998). A state-space model apporach to optimum spatial sampling design bad on entropy. Environ Ecolo Stat 5: 29–44 CrossRefGoogle Scholar
- Bueso MC, Angulo JM, Qian G and Alonso FJ (1999a). Spatial sampling design based on stochastic complexity. J Mult Anal 71: 94–110 CrossRefGoogle Scholar
- Bueso MC, Angulo JM, Curz-Sanjuliàn J and García-Aróstegui JL (1999b). Optimal spatial sampling design in a multivariate framework. Math Geol 31: 507–525 CrossRefGoogle Scholar
- Caselton WF and Husain T (1980). Hydrologic networks: information transmission. Water Resour Plan Manage Div ASCE 106(WR2): 503–520 Google Scholar
- Caselton WF and Zidek JV (1984). Optimal monitoring network designs. Stat & Prob Lett 2: 223–227 CrossRefGoogle Scholar
- Caselton WF, Kan L, Zidek JV (1992) Quality data networks that minimize entropy. In: Guttorp P, Walden A (eds), Statistics in the environmental & earth sciences. Griffin, LondonGoogle Scholar
- Coles S and Pauli F (2002). Models and inference for uncertainty in extremal dependence. Biometrika 89: 183–196 CrossRefGoogle Scholar
- Fu A (2002) Case study: inference for extreme spatial random rainfall fields. MSc Thesis. Department of Statistics, University of British ColumbiaGoogle Scholar
- Fu AQ, Le ND, Zidek JV (2003) A statistical characterization of a simulated Canadian annual maximum rainfall field. TR 2003-17, Statistical and Mathematical Sciences Institute, RTP, NCGoogle Scholar
- Guttorp P, Le ND, Sampson PD and Zidek JV (1993). Using entropy in the redesign of an environmental monitoring network. In: Patil, GP, Rao, CR, and Ross, NP (eds) Multivariate environmental statistics, pp 175–202. North Holland/Elsevier Science, New York Google Scholar
- Heffernan JE and Tawn JA (2004). A conditional approach for multivariate extreme values (with discussion). J Roy Stat Soc Ser B 66: 497–546 CrossRefGoogle Scholar
- Kennedy WJ and Gentle JE (1980). Statistical computing. Marcel Dekker, Inc, New York Google Scholar
- Kibria GBM, Sun L, Zidek V and Le ND (2002). Bayesian spatial prediction of random space-time fields with application to mapping PM2.5 exposure. J Amer Stat Assoc 457: 101–112 Google Scholar
- Le ND, Zidek JV (1994) Network designs for monitoring multivariate random spatial fields. In: Vilaplana JP, Puri ML (eds), Recent Advances in Statistics and Probability. pp 191–206Google Scholar
- Le ND, Sun L and Zidek JV (2001). Spatial prediction and temporal backcasting for environmental fields having monotone data patterns. Can J Stat 29: 515–690 CrossRefGoogle Scholar
- Le ND and Zidek JV (2006). Statistical analysis of environmental space-time processes. Springer, New York Google Scholar
- Lindley DV (1956). On the measure of the information provided by an experiment. Ann Math Stat 27: 968–1005 Google Scholar
- Müller WG (2001). Collecting spatial data: optimum design of experiments for random fields. Physica-Verlag, Hiedelberg Google Scholar
- Sebastiani P and Wynn HP (2000). Maximum entropy sampling and optimal Bayesian experimental design. JR Stat Soc, Ser B 62: 145–157 CrossRefGoogle Scholar
- Shewry M and Wynn H (1987). Maximum entropy sampling. J Appl Stat 14: 165–207 CrossRefGoogle Scholar
- Smith RL (2004) Contribution to the discussion of Heffernan and Tawn (2004)Google Scholar
- Sun L, Zidek JV, Le ND and Özkaynak H (2000). Interpolating Vancouver’s daily ambient PM10 field. Environmetrics 11: 651–663 CrossRefGoogle Scholar
- Sun W (1998). Comparison of a co-kriging method with a Bayesian alternative. Environmetrics 9: 445–457 CrossRefGoogle Scholar
- Sun W, Le ND, Zidek JV and Burnett R (1998). Assessment of Bayesian multivariate interpolation approach for health impact studies. Environmetrics 9: 565–586 CrossRefGoogle Scholar
- Wu S and Zidek JV (1992). An entropy based review of selected NADP/NTN network sites for 1983–86. Atmos Environ 26: 2089–2103 Google Scholar
- Zidek JV, Sun W and Le ND (2000). Designing and integrating composite networks for monitoring multivariate gaussian pollution fields. Appl Stat 49: 63–79 Google Scholar
- Zidek JV, Meloche J, Shaddick G, Hatfield C, White R (2003) A computational model for estimating personal exposure to air pollutants with application to London’s PM10 in 1997. TR 2003-3, Statistical and Applied Mathematical Studies Institute, Research Triangle Park, NCGoogle Scholar