A Bayesian model for local smoothing in kernel density estimation

Brewer, Mark J.

doi:10.1023/A:1008925425102

A Bayesian model for local smoothing in kernel density estimation

Published: October 2000

Volume 10, pages 299–309, (2000)
Cite this article

Statistics and Computing Aims and scope Submit manuscript

Mark J. Brewer¹

484 Accesses
49 Citations
Explore all metrics

Abstract

A new procedure is proposed for deriving variable bandwidths in univariate kernel density estimation, based upon likelihood cross-validation and an analysis of a Bayesian graphical model. The procedure admits bandwidth selection which is flexible in terms of the amount of smoothing required. In addition, the basic model can be extended to incorporate local smoothing of the density estimate. The method is shown to perform well in both theoretical and practical situations, and we compare our method with those of Abramson (The Annals of Statistics 10: 1217–1223) and Sain and Scott (Journal of the American Statistical Association 91: 1525–1534). In particular, we note that in certain cases, the Sain and Scott method performs poorly even with relatively large sample sizes.

We compare various bandwidth selection methods using standard mean integrated square error criteria to assess the quality of the density estimates. We study situations where the underlying density is assumed both known and unknown, and note that in practice, our method performs well when sample sizes are small. In addition, we also apply the methods to real data, and again we believe our methods perform at least as well as existing methods.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

References

Abramson I. 1982. On bandwidth variation in kernel estimates - a square root law. The Annals of Statistics 10: 1217–1223.
Google Scholar
Best N.J., Cowles M.K., and Vines S.K. 1995. CODA: Convergence Diagnosis and Output Analysis Software for Gibbs Sampling Output, version 0.3. MRC Biostatistics Unit, Cambridge, UK.
Google Scholar
Bowman A.W. and Foster P.J. 1993. Adaptive smoothing and densitybased tests of multivariate normality. Journal of the American Statistical Association 88: 529–537.
Google Scholar
Brewer M.J. 1998. A modelling approach for bandwidth selection in kernel density estimation. In: Proceedings of COMPSTAT 1998, Physica Verlag, Heidelberg, pp. 203–208.
Google Scholar
Brewer M.J., Aitken C.G.G., and Talbot M. 1996. A comparison of hybrid strategies for Gibbs sampling in mixed graphical models. Computational Statistics and Data Analysis 21: 343–365.
Google Scholar
Cao R., Cuevas A., and Mantiega W.G. 1994. A comparative study of several smoothing methods in density estimation. Computational Statistics and Data Analysis 17: 153–176.
Google Scholar
Duin R.P.W. 1976. On the choice of smoothing parameters for Parzen estimators of probability density functions. IEEE Transactions in Computing C-25: 1175–1179.
Google Scholar
Habbema J.D.F., Hermans J., and van den Brock K. 1974. A stepwise discriminant analysis program using density estimation. In: Proceedings of COMPSTAT 1974, Physica Verlag, Heidelberg, pp. 101–110.
Google Scholar
Jones M.C. 1993. Simple boundary correction for kernel density estimates. Statistics and Computing 3: 135–146.
Google Scholar
Jones M.C., Marron J.S., and Sheather S.J. 1996. A brief survey of bandwidth selection for density estimation. Journal of the American Statistical Association 91: 401–407.
Google Scholar
Mollié A. 1996. Bayesian mapping of disease. In: Gilks W.R., Richardson S., and Spiegelhalter D.J. (Eds.), Markov Chain Monte Carlo in Practice, Chapman and Hall, London, pp. 359–379.
Google Scholar
Park B.-U. and Turlach B.A. 1992. Practical performance of several data-driven bandwidth selectors. Computational Statistics 7: 251–285.
Google Scholar
Sain S.R. and Scott D.W. 1996. On locally adaptive density estimation. Journal of the American Statistical Association 91: 1525–1534.
Google Scholar
Schuster E.F. and Gregory C.G. 1981. On the nonconsistency of maximum likelihood nonparametric density estimators. In: Eddy W.F. (Ed.), Computer Science and Statistics: Proceedings of the 13th Symposium on the Interface, Springer-Verlag, NewYork, pp. 295–298.
Google Scholar
Sheather S.J. 1992. The performance of six popular bandwidth selection methods on some real data sets. Computational Statistics 7: 225–250.
Google Scholar
Sheather S.J. and Jones M.C. 1991. A reliable data-based bandwidth selection method for kernel density estimation. Journal of the Royal Statistical Society B53: 683–690.
Google Scholar
Silverman B.W. 1986. Density Estimation for Statistics and Data Analysis. Chapman and Hall, London.
Google Scholar
Spiegelhalter D., Thomas A., Best N., and Gilks W. 1995. BUGS 0.5* Examples Volume 2 (version ii), MRC Biostatistics Unit, Cambridge, UK.
Google Scholar
Terrell G.R. and Scott D.W. 1992. Variable kernel density estimation. Annals of Statistics 20: 1236–1265.
Google Scholar
Wand M.P. and Jones M.C. 1995. Kernel Smoothing. Chapman and Hall, London.
Google Scholar
Wermuth N. and Lauritzen S.L. 1990. On substantive research hypotheses, conditional independence graphs and graphical chain models. Journal of the Royal Statistical Society B52: 21–50.
Google Scholar

Download references

Author information

Authors and Affiliations

School of Mathematical Sciences, University of Exeter, UK
Mark J. Brewer

Authors

Mark J. Brewer
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Reprints and permissions

About this article

Cite this article

Brewer, M.J. A Bayesian model for local smoothing in kernel density estimation. Statistics and Computing 10, 299–309 (2000). https://doi.org/10.1023/A:1008925425102

Download citation

Issue Date: October 2000
DOI: https://doi.org/10.1023/A:1008925425102

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A Bayesian model for local smoothing in kernel density estimation

Abstract

Access this article

Similar content being viewed by others

On automatic kernel density estimate-based tests for goodness-of-fit

Maximum likelihood method for bandwidth selection in kernel conditional density estimate

Kader—An R Package for Nonparametric Kernel Adjusted Density Estimation and Regression

References

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Navigation

A Bayesian model for local smoothing in kernel density estimation

Abstract

Access this article

Similar content being viewed by others

On automatic kernel density estimate-based tests for goodness-of-fit

Maximum likelihood method for bandwidth selection in kernel conditional density estimate

Kader—An R Package for Nonparametric Kernel Adjusted Density Estimation and Regression

References

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Share this article

Search

Navigation