Skip to main content
Log in

A cross-validation method for data with ties in kernel density estimation

  • Published:
Annals of the Institute of Statistical Mathematics Aims and scope Submit manuscript

Abstract

Limitation of the cross-validation method of bandwidth selection is well known when applied to data with ties. A method which resolves this problem and which is easy to understand and implement is proposed. We show that the proposed approach is viable in theory, by proving its asymptotic equivalence to the standard cross-validation method. The practical usefulness is shown in simulations and an application to a real data example.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Andrews D.F., Herzberg A.M. (1985). Data. Berlin Heidelberg New York, Springer

    MATH  Google Scholar 

  • Bowman A.W. (1984). An alternative method of cross-validation for the smoothing of density estimates. Biometrika 71, 353–360

    Article  MathSciNet  Google Scholar 

  • Brinkman N.D. (1981). Ethanol fuel – a single-cylinder engine study of efficiency and exhaust emissions. SAE Transactions 90, 1410–1424

    Google Scholar 

  • Chiu S.T. (1991). The effect of discretization error on bandwidth selection for kernel density estimation. Biometrika 78, 436–441

    Article  MathSciNet  Google Scholar 

  • Hall P. (1983). Large sample optimality of least squares cross-validation in density estimation. Annals of Statistics 11, 1156–1174

    MATH  MathSciNet  Google Scholar 

  • Hall P., Heyde C.C. (1980). Martingale Limit Theory and its Application. New York–London, Academic Press

    MATH  Google Scholar 

  • Hall P., Marron J.S. (1987). Extent to which least-squares cross-validation minimises integrated square error in nonparametric density estimation. Probability Theory and Related Fields 74, 567–581

    Article  MATH  MathSciNet  Google Scholar 

  • Jones M.C., Marron J.S., Sheather S.J. (1996). A brief survey of bandwidth selection for density estimation. Journal of the American Statistical Association 91, 401–407

    Article  MATH  MathSciNet  Google Scholar 

  • Loader C.R. (1999). Bandwidth selection: classical or plug-in?. Annals of Statistics 27, 415–438

    Article  MATH  MathSciNet  Google Scholar 

  • Machado, J. A. F., Santos Silva, J. M. C. (2002). Quantiles for Counts. No. CWP22/02 in CeMMAP, Centre for Microdata Methods and Practice, Institute for Fiscal Studies, Depertment of Economics, UCL.

  • Marron J.S., Wand M. P. (1992). Exact mean integrated squared error. Annals of Statistics 20, 712–736

    Article  MATH  MathSciNet  Google Scholar 

  • Rosenblatt M. (1956). Remarks on some nonparametric estimates of a density function. Annals of Mathematical Statistics 27, 832–837

    Article  MathSciNet  Google Scholar 

  • Ruppert D., Carroll R.J., Maca J.D. (1999). Nonparametric regression in the presence of measurement error. Biometrika 86, 541–554

    Article  MATH  MathSciNet  Google Scholar 

  • Silverman B.W. (1986). Density Estimation for Statistics and Data Analysis. In, Monographs on Statistics and Applied Probability. London, Chapman and Hall

  • Stone C.J. (1984). An asymptotically optimal window selection rule for kernel density estimates. Annals of Statistics 12, 1285–1297

    Article  MATH  MathSciNet  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Kamila Żychaluk.

About this article

Cite this article

Żychaluk, K., Patil, P.N. A cross-validation method for data with ties in kernel density estimation. AISM 60, 21–44 (2008). https://doi.org/10.1007/s10463-006-0077-1

Download citation

  • Received:

  • Revised:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10463-006-0077-1

Keywords

Navigation