Abstract
The present study is aimed to evaluate levels of air pollution for the Barcelona Metropolitan Region. For this purpose, a newly developed approach called conformal predictors is considered, and, in particular, use is made of the ridge regression confidence machine (RRCM). The hallmark of this method is that it gives valid estimates, i.e. for a given level of significance of prediction, the probability of error does not exceed this level. Moreover, the chosen specification of the RRCM predictor does not place any requirements on data distribution, apart from being independent and identically distributed. A linear ridge regression conformal predictor has been applied to the data. It has allowed to obtain valid interval estimates of annual nitrogen dioxide concentrations with 95 % confidence. The model has provided good results, but to further increase the efficiency of prediction, the RBF kernel has been used. The data for this study have been provided by the XVPCA (Network for Monitoring and Forecasting of Air Pollution) of the Generalitat of Catalonia. The pollutant considered in this paper is nitrogen dioxide. Its values are represented by annual average concentrations within the period from 1998 to 2009. This paper also describes an application of ordinary kriging, and its results have been compared to those of ridge regression conformal predictor.
Article PDF
Similar content being viewed by others
Avoid common mistakes on your manuscript.
References
Barceló M., Saez M., Saurina C.: Spatial variability in mortality inequalities, socioeconomic deprivation, and air pollution in small areas of the Barcelona Metropolitan Region, Spain. Sci. Total Environ. 407(21), 5501–5523 (2009)
Cawley G., Talbot N.: Fast exact leave-one-out cross-validation of sparse least-squares support vector machines. Neural Netw. 17(10), 1467–1475 (2004)
Chilès, J., Delfiner, P.: Geostatistics: Modeling Spatial Uncertainty. Wiley Series in Probability and Statistics. Wiley, New York (2009)
Cyrys J., Hochadel M., Gehring U., Hoek G., Diegmann V., Brunekreef B., Heinrich J.: GIS-based estimation of exposure to particulate matter and NO2 in an urban area: stochastic versus dispersion modeling. Environ. Health Perspect. 113(8), 987–992 (2005)
Diggle, P., Ribeiro, P. Jr.: Model-Based Geostatistics. Springer, Berlin (2007)
Draper, N., Smith, H.: Applied Regression Analysis. Wiley, New York (1981)
Gilbert N., Goldberg M., Beckerman B., Brook J., Jerrett M.: Assessing spatial variability of ambient nitrogen dioxide in Montreál, Canada, with a land-use regression model. J. Air Waste Manag. Assoc. 55(8), 1059–1063 (2005)
Handcock M.S., Stein M.L.: A Bayesian analysis of kriging. Technometrics 35(4), 403–410 (1993)
Jerrett M., Arain A., Kanaroglou P., Beckerman B., Potoglou D., Sahsuvaroglu T., Morrison J., Giovis C.: A review and evaluation of intraurban air pollution exposure models. J. Expo. Anal. Environ. Epidemiol. 15, 185–204 (2005)
Lee P., Talbott E., Roberts J., Catov J., Sharma R., Ritz B.: Particulate air pollution exposure and C-reactive protein during early pregnancy. Epidemiology 22(4), 524–531 (2011)
Organization, W.H.: Air Quality Guidelines: Global Update 2005: Particulate Matter, Ozone, Nitrogen Dioxide and Sulfur Dioxide. EURO Nonserial Publication. World Health Organization (2006)
Pilz J., Spöck G.: Why do we need and how should we implement Bayesian kriging methods. Stoch. Environ. Res. Risk Assess. 22(5), 621–632 (2008)
Ribeiro, P. Jr., Diggle, P.: geoR: a package for geostatistical analysis. R-NEWS 1(2) (2001)
Samoli E., Aga E., Touloumi G., Nisiotis K., Forsberg B., Lefranc A., Pekkanen J., Wojtyniak B., Schindler C., Niciu E., Brunstein R., Dodic Fikfak M., Schwartz J., Katsouyanni K.: Short-term effects of nitrogen dioxide on mortality: an analysis within the APHEA project. Eur. Respir. J. 27(6), 1129–1138 (2006)
Schölkopf, B., Smola, A.: Learning With Kernels: Support Vector Machines, Regularization, Optimization, and Beyond. MIT Press, Cambridge (2002)
Spöck G., Kazianka, H., Pilz, J.: Bayesian trans-Gaussian kriging with log-log transformed skew data. In: Pilz, J. (ed.) Interfacing Geostatistics and GIS, pp. 29–43. Springer, Berlin (2009)
Switzer, P.: Kriging. In: Encyclopedia of Environmetrics. John Wiley & Sons, Ltd (2006)
Team, R.D.C.: R: A language and environment for statistical computing (2011). http://www.R-project.org
Vovk, V., Gammerman, A., Shafer, G.: Algorithmic Learning in a Random World. Springer, Berlin (2005)
Vovk V., Nouretdinov I., Gammerman A.: On-line predictive linear regression. Ann. Stat. 37(3), 1566–1590 (2009)
Wackernagel H.: Multivariate Geostatistics: An Introduction With Applications. Springer, Berlin (2003)
Webster R., Oliver M.: Geostatistics for Environmental Scientists. Wiley, New York (2007)
Zimmerman D., Pavlik C., Ruggles A., Armstrong M.: An experimental comparison of ordinary and universal kriging and inverse distance weighting. Math. Geol. 31, 375–390 (1999)
Zou B., Wilson J.G., Zhan F.B., Zeng Y.: Air pollution exposure assessment methods utilized in epidemiological studies. J. Environ. Monit. 11, 475–490 (2009)
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Ivina, O., Nouretdinov, I. & Gammerman, A. Valid predictions with confidence estimation in an air pollution problem. Prog Artif Intell 1, 235–243 (2012). https://doi.org/10.1007/s13748-012-0018-6
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s13748-012-0018-6