Histogram-kernel error and its application for bin width selection in histograms

Wang, Xiu-xiang; Zhang, Jian-fang

doi:10.1007/s10255-007-7081-y

Histogram-kernel error and its application for bin width selection in histograms

Published: 23 April 2009

Volume 28, pages 607–624, (2012)
Cite this article

Acta Mathematicae Applicatae Sinica, English Series Aims and scope Submit manuscript

Xiu-xiang Wang¹ &
Jian-fang Zhang²

193 Accesses
3 Citations
Explore all metrics

Abstract

Histogram and kernel estimators are usually regarded as the two main classical data-based nonparametric tools to estimate the underlying density functions for some given data sets. In this paper we will integrate them and define a histogram-kernel error based on the integrated square error between histogram and binned kernel density estimator, and then exploit its asymptotic properties. Just as indicated in this paper, the histogram-kernel error only depends on the choice of bin width and the data for the given prior kernel densities. The asymptotic optimal bin width is derived by minimizing the mean histogram-kernel error. By comparing with Scott’s optimal bin width formula for a histogram, a new method is proposed to construct the data-based histogram without knowledge of the underlying density function. Monte Carlo study is used to verify the usefulness of our method for different kinds of density functions and sample sizes.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Multiplicative bias correction for generalized Birnbaum-Saunders kernel density estimators and application to nonnegative heavy tailed data

Article 17 July 2015

Comparative Analysis of Density Estimation Based Kernel Regression

A Kernel Goodness-of-fit Test for Maximum Likelihood Density Estimates of Normal Mixtures

References

Beer, C.F., Swanepoel, J.W.H. Simple and effective number-of-bins circumference selectors for a histogram. Statistics and Computing, 9: 27–35 (1999)
Article Google Scholar
Bowman, A.W. An alternative method of cross-validation for the smoothing of density estimates. Biometrika, 71: 353–360 (1984)
Article MathSciNet Google Scholar
Cencov, N.N. Estimation of an unknown distribution density from observations. Soviet Math., 3: 1159–1562 (1962)
Google Scholar
Daly, J.E. The construction of optimal histogram. Commun. Statist. Theory Meth., 17(9): 2921–2931 (1988)
Article MathSciNet MATH Google Scholar
Devroye, L. The double kernel method in density estimation”, Annales de L’Institut Henri Poincare, 25: 533–580 (1989)
MathSciNet MATH Google Scholar
Faraway, J.J., Jhun, M. Bootstrap choice of bandwidth for density estimation. Journal of Statistical Planning and Inference, 85: 1119–1122 (1990)
MathSciNet Google Scholar
Freedman, D., Diaconis, P. On the histogram as a density estimation: L ₂ theory. Zeitschrift für Wahrscheinlichkeitstheorie und verwandte Gebiete, 57: 453–476 (1981)
Article MathSciNet MATH Google Scholar
He, K., Meeden, G. Selecting the number of bins in a histogram: A decision theoretical approach. Journal of Statistical Planning and Inference, 61: 49–59 (1997)
Article MATH Google Scholar
Parzen, E. Nonparametric statistical data modeling (with discussion). Journal of the American Statistical Association, 74: 105–131 (1979)
MathSciNet MATH Google Scholar
Rosenblatt, M. Remarks on some nonparametric estimates of a density function. Annals of Mathematical Statistics, 27: 832–837 (1956)
Article MathSciNet MATH Google Scholar
Rudemo, M. Empirical choice of histogram and kernel density estimators. Scandinavian Journal of Statistics, 9: 65–78 (1982)
MathSciNet MATH Google Scholar
Sain, S.R., Scott, D.W. On locally adaptive density estimation. Journal of the American Statistical Association, 91: 1525–1534 (1996)
MathSciNet MATH Google Scholar
Scott, D.W. On Optimal and Data-Based Histograms. Biometrika, 66: 605–610 (1979)
Article MathSciNet MATH Google Scholar
Scott, D.W., Terrell, G.R. Biased and unbiased cross-validation in density estimation. Journal of the American Statistical Association, 82: 1131–1146 (1987)
MathSciNet MATH Google Scholar
Scott, D.W. Multivariate density estimation-Theory, practice and visualization. John Wiley & Sons, New York, 1992
Book MATH Google Scholar
Simonoff, J.S., Udina, F. Measuring the stability of histogram appearance when the anchor position is changed. Computational Statistics and Data Analysis, 23: 335–353 (1997)
Article MathSciNet MATH Google Scholar
Sturges, H.A. The choice of a class interval. Journal of the American Statistical Association, 21: 65–66 (1926)
Article Google Scholar
Taylor, C.C. Bootstrap choice of the smoothing parameter in kernel density estimation. Biometrika, 76: 705–712 (1989)
Article MathSciNet MATH Google Scholar
Terrell, G.R. The maximal smoothing principle in density estimation. Journal of the American Statistical Association, 85: 470–477 (1990)
MathSciNet Google Scholar
Terrell, G.R., Scott, D.W. Over-smoothed nonparamtric density estimates. Journal of the American Statistical Association, 80: 209–214 (1985)
MathSciNet Google Scholar
Wang, X.X. Nonparametric density estimation: Histogram and binned kernel density estimation theories and their applications. Master Thesis, Graduate University of Chinese Academy of Sciences, 2007

Download references

Author information

Authors and Affiliations

Department of Mathematics, Graduate University of Chinese Academy of Sciences, Beijing, 100049, China
Xiu-xiang Wang
College of Management, Graduate University of Chinese Academy of Sciences, Beijing, 100190, China
Jian-fang Zhang

Authors

Xiu-xiang Wang
View author publications
You can also search for this author in PubMed Google Scholar
Jian-fang Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Xiu-xiang Wang.

Additional information

Supported by the National Natural Science Foundation of China (No. 70371018, 70572074)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Wang, Xx., Zhang, Jf. Histogram-kernel error and its application for bin width selection in histograms. Acta Math. Appl. Sin. Engl. Ser. 28, 607–624 (2012). https://doi.org/10.1007/s10255-007-7081-y

Download citation

Received: 17 July 2007
Revised: 03 April 2008
Published: 23 April 2009
Issue Date: July 2012
DOI: https://doi.org/10.1007/s10255-007-7081-y

Keywords

2000 MR Subject Classification

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Histogram-kernel error and its application for bin width selection in histograms

Abstract

Access this article

Similar content being viewed by others

Multiplicative bias correction for generalized Birnbaum-Saunders kernel density estimators and application to nonnegative heavy tailed data

Comparative Analysis of Density Estimation Based Kernel Regression

A Kernel Goodness-of-fit Test for Maximum Likelihood Density Estimates of Normal Mixtures

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Keywords

2000 MR Subject Classification

Navigation

Histogram-kernel error and its application for bin width selection in histograms

Abstract

Access this article

Similar content being viewed by others

Multiplicative bias correction for generalized Birnbaum-Saunders kernel density estimators and application to nonnegative heavy tailed data

Comparative Analysis of Density Estimation Based Kernel Regression

A Kernel Goodness-of-fit Test for Maximum Likelihood Density Estimates of Normal Mixtures

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

2000 MR Subject Classification

Search

Navigation