Abstract
A new procedure is proposed for deriving variable bandwidths in univariate kernel density estimation, based upon likelihood cross-validation and an analysis of a Bayesian graphical model. The procedure admits bandwidth selection which is flexible in terms of the amount of smoothing required. In addition, the basic model can be extended to incorporate local smoothing of the density estimate. The method is shown to perform well in both theoretical and practical situations, and we compare our method with those of Abramson (The Annals of Statistics 10: 1217–1223) and Sain and Scott (Journal of the American Statistical Association 91: 1525–1534). In particular, we note that in certain cases, the Sain and Scott method performs poorly even with relatively large sample sizes.
We compare various bandwidth selection methods using standard mean integrated square error criteria to assess the quality of the density estimates. We study situations where the underlying density is assumed both known and unknown, and note that in practice, our method performs well when sample sizes are small. In addition, we also apply the methods to real data, and again we believe our methods perform at least as well as existing methods.
Similar content being viewed by others
References
Abramson I. 1982. On bandwidth variation in kernel estimates - a square root law. The Annals of Statistics 10: 1217–1223.
Best N.J., Cowles M.K., and Vines S.K. 1995. CODA: Convergence Diagnosis and Output Analysis Software for Gibbs Sampling Output, version 0.3. MRC Biostatistics Unit, Cambridge, UK.
Bowman A.W. and Foster P.J. 1993. Adaptive smoothing and densitybased tests of multivariate normality. Journal of the American Statistical Association 88: 529–537.
Brewer M.J. 1998. A modelling approach for bandwidth selection in kernel density estimation. In: Proceedings of COMPSTAT 1998, Physica Verlag, Heidelberg, pp. 203–208.
Brewer M.J., Aitken C.G.G., and Talbot M. 1996. A comparison of hybrid strategies for Gibbs sampling in mixed graphical models. Computational Statistics and Data Analysis 21: 343–365.
Cao R., Cuevas A., and Mantiega W.G. 1994. A comparative study of several smoothing methods in density estimation. Computational Statistics and Data Analysis 17: 153–176.
Duin R.P.W. 1976. On the choice of smoothing parameters for Parzen estimators of probability density functions. IEEE Transactions in Computing C-25: 1175–1179.
Habbema J.D.F., Hermans J., and van den Brock K. 1974. A stepwise discriminant analysis program using density estimation. In: Proceedings of COMPSTAT 1974, Physica Verlag, Heidelberg, pp. 101–110.
Jones M.C. 1993. Simple boundary correction for kernel density estimates. Statistics and Computing 3: 135–146.
Jones M.C., Marron J.S., and Sheather S.J. 1996. A brief survey of bandwidth selection for density estimation. Journal of the American Statistical Association 91: 401–407.
Mollié A. 1996. Bayesian mapping of disease. In: Gilks W.R., Richardson S., and Spiegelhalter D.J. (Eds.), Markov Chain Monte Carlo in Practice, Chapman and Hall, London, pp. 359–379.
Park B.-U. and Turlach B.A. 1992. Practical performance of several data-driven bandwidth selectors. Computational Statistics 7: 251–285.
Sain S.R. and Scott D.W. 1996. On locally adaptive density estimation. Journal of the American Statistical Association 91: 1525–1534.
Schuster E.F. and Gregory C.G. 1981. On the nonconsistency of maximum likelihood nonparametric density estimators. In: Eddy W.F. (Ed.), Computer Science and Statistics: Proceedings of the 13th Symposium on the Interface, Springer-Verlag, NewYork, pp. 295–298.
Sheather S.J. 1992. The performance of six popular bandwidth selection methods on some real data sets. Computational Statistics 7: 225–250.
Sheather S.J. and Jones M.C. 1991. A reliable data-based bandwidth selection method for kernel density estimation. Journal of the Royal Statistical Society B53: 683–690.
Silverman B.W. 1986. Density Estimation for Statistics and Data Analysis. Chapman and Hall, London.
Spiegelhalter D., Thomas A., Best N., and Gilks W. 1995. BUGS 0.5* Examples Volume 2 (version ii), MRC Biostatistics Unit, Cambridge, UK.
Terrell G.R. and Scott D.W. 1992. Variable kernel density estimation. Annals of Statistics 20: 1236–1265.
Wand M.P. and Jones M.C. 1995. Kernel Smoothing. Chapman and Hall, London.
Wermuth N. and Lauritzen S.L. 1990. On substantive research hypotheses, conditional independence graphs and graphical chain models. Journal of the Royal Statistical Society B52: 21–50.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Brewer, M.J. A Bayesian model for local smoothing in kernel density estimation. Statistics and Computing 10, 299–309 (2000). https://doi.org/10.1023/A:1008925425102
Issue Date:
DOI: https://doi.org/10.1023/A:1008925425102