Skip to main content
Log in

A Bayesian model for local smoothing in kernel density estimation

  • Published:
Statistics and Computing Aims and scope Submit manuscript

Abstract

A new procedure is proposed for deriving variable bandwidths in univariate kernel density estimation, based upon likelihood cross-validation and an analysis of a Bayesian graphical model. The procedure admits bandwidth selection which is flexible in terms of the amount of smoothing required. In addition, the basic model can be extended to incorporate local smoothing of the density estimate. The method is shown to perform well in both theoretical and practical situations, and we compare our method with those of Abramson (The Annals of Statistics 10: 1217–1223) and Sain and Scott (Journal of the American Statistical Association 91: 1525–1534). In particular, we note that in certain cases, the Sain and Scott method performs poorly even with relatively large sample sizes.

We compare various bandwidth selection methods using standard mean integrated square error criteria to assess the quality of the density estimates. We study situations where the underlying density is assumed both known and unknown, and note that in practice, our method performs well when sample sizes are small. In addition, we also apply the methods to real data, and again we believe our methods perform at least as well as existing methods.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Abramson I. 1982. On bandwidth variation in kernel estimates - a square root law. The Annals of Statistics 10: 1217–1223.

    Google Scholar 

  • Best N.J., Cowles M.K., and Vines S.K. 1995. CODA: Convergence Diagnosis and Output Analysis Software for Gibbs Sampling Output, version 0.3. MRC Biostatistics Unit, Cambridge, UK.

    Google Scholar 

  • Bowman A.W. and Foster P.J. 1993. Adaptive smoothing and densitybased tests of multivariate normality. Journal of the American Statistical Association 88: 529–537.

    Google Scholar 

  • Brewer M.J. 1998. A modelling approach for bandwidth selection in kernel density estimation. In: Proceedings of COMPSTAT 1998, Physica Verlag, Heidelberg, pp. 203–208.

    Google Scholar 

  • Brewer M.J., Aitken C.G.G., and Talbot M. 1996. A comparison of hybrid strategies for Gibbs sampling in mixed graphical models. Computational Statistics and Data Analysis 21: 343–365.

    Google Scholar 

  • Cao R., Cuevas A., and Mantiega W.G. 1994. A comparative study of several smoothing methods in density estimation. Computational Statistics and Data Analysis 17: 153–176.

    Google Scholar 

  • Duin R.P.W. 1976. On the choice of smoothing parameters for Parzen estimators of probability density functions. IEEE Transactions in Computing C-25: 1175–1179.

    Google Scholar 

  • Habbema J.D.F., Hermans J., and van den Brock K. 1974. A stepwise discriminant analysis program using density estimation. In: Proceedings of COMPSTAT 1974, Physica Verlag, Heidelberg, pp. 101–110.

    Google Scholar 

  • Jones M.C. 1993. Simple boundary correction for kernel density estimates. Statistics and Computing 3: 135–146.

    Google Scholar 

  • Jones M.C., Marron J.S., and Sheather S.J. 1996. A brief survey of bandwidth selection for density estimation. Journal of the American Statistical Association 91: 401–407.

    Google Scholar 

  • Mollié A. 1996. Bayesian mapping of disease. In: Gilks W.R., Richardson S., and Spiegelhalter D.J. (Eds.), Markov Chain Monte Carlo in Practice, Chapman and Hall, London, pp. 359–379.

    Google Scholar 

  • Park B.-U. and Turlach B.A. 1992. Practical performance of several data-driven bandwidth selectors. Computational Statistics 7: 251–285.

    Google Scholar 

  • Sain S.R. and Scott D.W. 1996. On locally adaptive density estimation. Journal of the American Statistical Association 91: 1525–1534.

    Google Scholar 

  • Schuster E.F. and Gregory C.G. 1981. On the nonconsistency of maximum likelihood nonparametric density estimators. In: Eddy W.F. (Ed.), Computer Science and Statistics: Proceedings of the 13th Symposium on the Interface, Springer-Verlag, NewYork, pp. 295–298.

    Google Scholar 

  • Sheather S.J. 1992. The performance of six popular bandwidth selection methods on some real data sets. Computational Statistics 7: 225–250.

    Google Scholar 

  • Sheather S.J. and Jones M.C. 1991. A reliable data-based bandwidth selection method for kernel density estimation. Journal of the Royal Statistical Society B53: 683–690.

    Google Scholar 

  • Silverman B.W. 1986. Density Estimation for Statistics and Data Analysis. Chapman and Hall, London.

    Google Scholar 

  • Spiegelhalter D., Thomas A., Best N., and Gilks W. 1995. BUGS 0.5* Examples Volume 2 (version ii), MRC Biostatistics Unit, Cambridge, UK.

    Google Scholar 

  • Terrell G.R. and Scott D.W. 1992. Variable kernel density estimation. Annals of Statistics 20: 1236–1265.

    Google Scholar 

  • Wand M.P. and Jones M.C. 1995. Kernel Smoothing. Chapman and Hall, London.

    Google Scholar 

  • Wermuth N. and Lauritzen S.L. 1990. On substantive research hypotheses, conditional independence graphs and graphical chain models. Journal of the Royal Statistical Society B52: 21–50.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

About this article

Cite this article

Brewer, M.J. A Bayesian model for local smoothing in kernel density estimation. Statistics and Computing 10, 299–309 (2000). https://doi.org/10.1023/A:1008925425102

Download citation

  • Issue Date:

  • DOI: https://doi.org/10.1023/A:1008925425102

Navigation