Abstract
The evaluation of physically based computer models for air quality applications is crucial to assist in control strategy selection. Selecting the wrong control strategy has costly economic and social consequences. The objective comparison of the means and variances of modeled air pollution concentrations with the ones obtained from observed field data is the common approach for the assessment of model performance. One drawback of this strategy is that it fails to calibrate properly the tails of the modeled air pollution distribution. Improving the ability of these numerical models to characterize high pollution events is of critical interest for air quality management.
In this work we introduce an innovative framework to assess model performance, not only based on the two first moments of the model outputs and field data, but also on their entire distributions. Our approach also compares the spatial dependence and variability in two data sources. More specifically, we estimate the spatial-quantile functions for both the model output and field data, and we apply a nonlinear monotonic regression approach to the quantile functions taking into account the spatial dependence to compare the density functions of numerical models and field data. We use a Bayesian approach for estimation and fitting to characterize uncertainties in data and statistical models.
We apply our methodology to assess the performance of the US Environmental Protection Agency (EPA) Community Multiscale Air Quality (CMAQ) model to characterize ozone ambient concentrations. Our approach shows a 50.23% reduction in the root mean square error (RMSE) compared to the default approach based on the first 2 moments of the model output and field data.
Similar content being viewed by others
References
Berrocal, V. J., Gelfand, A. E., and Holland, D. M. (2010), “A Spatio-Temporal Downscaler for Output from Numerical Models,” Journal of Agricultural, Biological, and Environmental Statistics, 15, 176–197.
Bondell, H. D., Reich, B. J., and Wang, H. (2010), “Non-crossing Quantil Regression Curve Estimation,” Biometrika, 97 (4), 825–838.
Byun, D., and Schere, K. L. (2006), “Review of the Governing Equations, Computational Algorithms, and Other Components of the Models-3 Community Multscale Air Quality (CMAQ) Modeling System,” Applied Mechanics Reviews, 59, 51–77.
Cai, B., and Dunson, D. B. (2007), “Bayesian Multivariate Isotonic Regression Splines,” Journal of the American Statistical Association, 102 (480), 1158–1171.
Chen, M. H., Muller, P., Sun, D., Ye, K., and Dey, D. (2010), Frontiers of Statistical Decision Making and Bayesian Analysis, New York: Springer, Ch. 5.2 pp. 168–184.
Dunson, D. B., and Taylor, J. A. (2005), “Approximate Bayesian Inference for Quantiles,” Journal of Nonparametric Statistics, 17 (3), 385–400.
Eder, B., and Yu, S. (2006), “A Performance Evaluation of the 2004 Release of Models-3 CMAQ,” Atmospheric Environment, 40, 4811–4824.
Fuentes, M., and Raftery, A. E. (2005), “Model Evaluation and Spatial Interpolation by Bayesian Combination of Observations with Outputs from Numerical Models,” Biometrics, 61 (1), 36–45.
Hosking, J. R. M., and Wallis, J. R. (1987), “Parameter and Quantile Estimation for the Generalized Pareto Distribution,” Technometrics, 29 (3), 339–349.
Kennedy, M., and O’Hagan, A. (2001), “Bayesian Calibration of Computer Models,” Journal of the Royal Statistical Society: Series B (Statistical Methodology), 63 (3), 425–464.
Koenker, R. (2005), Quantile Regression. Econometric Society Monograph Series, Cambridge: Cambridge University Press.
Kozumi, H., and Kobayashi, G. (2011), “Gibbs Sampling Methods for Bayesian Quantile Regression,” Journal of Statistical Computation and Simulation, 81 (11), 1–14.
Lavine, M. (1995), “On an Approximate Likelihood for Quantiles,” Biometrika, 82, 220–222.
Lim, C. Y., Stein, M., Ching, J. K., and Tang, R. (2010), “Statistical Properties of Differences Between Low and High Resolution CMAQ Runs with Matched Initial and Boundary Conditions,” Environmental Modelling & Software, 25 (1), 158–169.
Lu, Z. Q., and Clarkson, D. B. (1999), “Monotone Spline and Multidimensional Scaling,” in Proceedings of American Statistical Association, Section of Statistical Computing, pp. 185–190.
Paciorek, C. (2011, in press), “Combining Spatial Information Sources while Accounting for Systematic Errors in Proxies,” Journal of the Royal Statistical Society. Series C, Applied Statistics.
Ramsay, J. O. (1988), “Monotone Regression Splines in Action,” Statistical Science, 3 (4), 425–441.
Reich, B. J., Fuentes, M., and Dunson, D. B. (2011), “Bayesian Spatial Quantile Regression,” Journal of the American Statistical Association, 106 (493), 6–20.
Tokdar, S., and Kadane, J. (2011), “Simultaneous Linear Quantile Regression: A Semiparametric Bayesian Approach,” Bayesian Analysis, 6 (4), 1–22.
Wu, Y., and Liu, Y. (2009), “Stepwise Multiple Quantile Regression Estimation,” Statistics and Its Interface, 2, 299–310.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Zhou, J., Fuentes, M. & Davis, J. Calibration of Numerical Model Output Using Nonparametric Spatial Density Functions. JABES 16, 531–553 (2011). https://doi.org/10.1007/s13253-011-0076-4
Published:
Issue Date:
DOI: https://doi.org/10.1007/s13253-011-0076-4