Abstract
Limitation of the cross-validation method of bandwidth selection is well known when applied to data with ties. A method which resolves this problem and which is easy to understand and implement is proposed. We show that the proposed approach is viable in theory, by proving its asymptotic equivalence to the standard cross-validation method. The practical usefulness is shown in simulations and an application to a real data example.
Similar content being viewed by others
References
Andrews D.F., Herzberg A.M. (1985). Data. Berlin Heidelberg New York, Springer
Bowman A.W. (1984). An alternative method of cross-validation for the smoothing of density estimates. Biometrika 71, 353–360
Brinkman N.D. (1981). Ethanol fuel – a single-cylinder engine study of efficiency and exhaust emissions. SAE Transactions 90, 1410–1424
Chiu S.T. (1991). The effect of discretization error on bandwidth selection for kernel density estimation. Biometrika 78, 436–441
Hall P. (1983). Large sample optimality of least squares cross-validation in density estimation. Annals of Statistics 11, 1156–1174
Hall P., Heyde C.C. (1980). Martingale Limit Theory and its Application. New York–London, Academic Press
Hall P., Marron J.S. (1987). Extent to which least-squares cross-validation minimises integrated square error in nonparametric density estimation. Probability Theory and Related Fields 74, 567–581
Jones M.C., Marron J.S., Sheather S.J. (1996). A brief survey of bandwidth selection for density estimation. Journal of the American Statistical Association 91, 401–407
Loader C.R. (1999). Bandwidth selection: classical or plug-in?. Annals of Statistics 27, 415–438
Machado, J. A. F., Santos Silva, J. M. C. (2002). Quantiles for Counts. No. CWP22/02 in CeMMAP, Centre for Microdata Methods and Practice, Institute for Fiscal Studies, Depertment of Economics, UCL.
Marron J.S., Wand M. P. (1992). Exact mean integrated squared error. Annals of Statistics 20, 712–736
Rosenblatt M. (1956). Remarks on some nonparametric estimates of a density function. Annals of Mathematical Statistics 27, 832–837
Ruppert D., Carroll R.J., Maca J.D. (1999). Nonparametric regression in the presence of measurement error. Biometrika 86, 541–554
Silverman B.W. (1986). Density Estimation for Statistics and Data Analysis. In, Monographs on Statistics and Applied Probability. London, Chapman and Hall
Stone C.J. (1984). An asymptotically optimal window selection rule for kernel density estimates. Annals of Statistics 12, 1285–1297
Author information
Authors and Affiliations
Corresponding author
About this article
Cite this article
Żychaluk, K., Patil, P.N. A cross-validation method for data with ties in kernel density estimation. AISM 60, 21–44 (2008). https://doi.org/10.1007/s10463-006-0077-1
Received:
Revised:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10463-006-0077-1