Abstract
Based on a random sample of size \(n\) from an unknown \(d\)-dimensional density \(f\), the nonparametric estimations of a single integrated density partial derivative functional as well as a vector of such functionals are considered. These single and vector functionals are important in a number of contexts. The purpose of this paper is to derive the information bounds for such estimations and propose estimates that are asymptotically optimal. The proposed estimates are constructed in the frequency domain using the sample characteristic function. For every \(d\) and sufficiently smooth \(f\), it is shown that the proposed estimates are asymptotically normal, attain the optimal \(O_p(n^{-1/2})\) convergence rate and achieve the (conjectured) information bounds. In simulation studies the superior performances of the proposed estimates are clearly demonstrated.
Similar content being viewed by others
References
Aldershof, B. (1991). Estimation of integrated squared density derivatives. Mimeo Series, No. 2053, Institute of Statistics, University of North Carolina.
Bellman, R. E. (1961). Adaptive control processes. Princeton, NJ: Princeton University Press.
Bickel, P. J., Ritov, Y. (1988). Estimating integrated squared density derivatives: sharp best order of convergence estimates. Sankhyā Series A, 50, 381–393.
Cao, R., Francisco-Fernandez, M., Anand, A., Bastida, F., Gonzalez-Andujar, J. L. (2011). Computing statistical indices for hydrothermal times using weed emergence data. Journal of Agricultural Science, 149, 701–712.
Chacón, J. E. (2009). Data-driven choice of the smoothing parametrization for kernel density estimators. Canadian Journal of Statistics, 37, 249–265.
Chacón, J. E., Duong, T. (2010). Multivariate plug-in bandwidth selection with unconstrained pilot bandwidth matrices. Test, 19, 375–398.
Chacón, J. E., Duong, T. (2011). Unconstrained pilot selectors for smoothed cross validation. Australian & New Zealand Journal of Statistics, 53, 331–351.
Chacón, J. E., Duong, T. (2013). Data-driven density derivative estimation, with applications to nonparametric clustering and bump hunting. Electronic Journal of Statistics, 7, 499–532.
Chacón, J. E., Tenreiro, C. (2012). Exact and asymptotically optimal bandwidths for kernel estimation of density functionals. Methodology and Computing in Applied Probability, 14, 523–548.
Chacón, J. E., Duong, T., Wand, M. P. (2011). Asymptotics for general multivariate kernel density derivative estimators. Statistica Sinica, 21, 807–840.
Cheng, M.-Y. (1997). Boundary aware estimators of integrated density derivative products. Journal of the Royal Statistical Society: Series B, 59, 191–203.
Chiu, S. T. (1991). Bandwidth selection for kernel density estimation. Annals of Statistics, 19, 1883–1905.
Chiu, S. T. (1992). An automatic bandwidth selector for kernel density estimation. Biometrika, 79, 771–782.
Csörgő, S. (1981). Multivariate empirical characteristic functions. Zeitschrift för Wahrscheinlichkeitstheorie und Verwandte Gebiete, 55, 203–229.
Davis, K. B. (1975). Mean square error properties of density estimates. Annals of Statistics, 3, 1025–1030.
Davis, K. B. (1977). Mean integrated square error properties of density estimates. Annals of Statistics, 5, 530–535.
Delaigle, A., Gijbels, I. (2002). Estimation of integrated squared density derivatives from a contaminated sample. Journal of the Royal Statistical Society: Series B, 64, 869–886.
Devroye, L. (1988). Asymptotic performance bounds for the kernel estimates. Annals of Statistics, 16, 1162–1179.
Devroye, L. (1992). A note on the usefulness of superkernels in density estimation. Annals of Statistics, 20, 2037–2056.
Donoho, D. L., Nussbaum, M. (1990). Minimax quadratic estimation of a quadratic functional. Journal of Complexity, 6, 290–323.
Duong, T., Hazelton, M. L. (2003). Plug-in bandwidth matrices for bivariate kernel density estimation. Journal of Nonparametric Statistics, 15, 17–30.
Duong, T., Hazelton, M. L. (2005a). Convergence rates for unconstrained bandwidth matrix selectors in multivariate kernel density estimation. Journal of Multivariate Analysis, 93, 417–433.
Duong, T., Hazelton, M. L. (2005b). Cross-validation bandwidth matrices for multivariate kernel density estimation. Scandinavian Journal of Statistics, 32, 485–506.
Efromovich, S., Low, M. (1996a). On Bickel and Ritov’s conjecture about adaptive estimation of the integral of the square of density derivative. Annals of Statistics, 24, 682–686.
Efromovich, S., Low, M. (1996b). On optimal adaptive estimation of a quadratic functional. Annals of Statistics, 24, 1106–1125.
Fan, J. (1991). On the estimation of quadratic functionals. Annals of Statistics, 19, 1273–1294.
Fan, J., Marron, J. S. (1992). Best possible constant for bandwidth selection. Annals of Statistics, 20, 2057–2070.
Fisher, N. I., Mammen, E., Marron, J. S. (1994). Testing for multimodality. Computational Statistics & Data Analysis, 18, 499–512.
Giné, E., Mason, D. M. (2008). Uniform in bandwidth estimation of integral functionals of the density function. Scandinavian Journal of Statistics, 35, 739–761.
Goldstein, L., Messer, K. (1992). Optimal plug-in estimators for nonparametric functional estimation. Annals of Statistics, 20, 1306–1328.
Hall, P., Marron, J. S. (1987). Estimation of integrated squared density derivatives. Statistics & Probability Letters, 6, 109–115.
Hall, P., Marron, J. S. (1988). Choice of kernel order in density estimation. Annals of Statistics, 16, 161–173.
Hall, P., Marron, J. S. (1991a). Lower bounds for bandwidth selection in density estimation. Probability Theory and Related Fields, 90, 149–173.
Hall, P., Marron, J. S. (1991b). Local minima in cross-validation functions. Journal of the Royal Statistical Society: Series B, 53, 245–252.
Hewitt, E., Stromberg, K. (1969). Real and abstract analysis. New York: Springer.
Ibragimov, I. A., Khas’minskii, R. Z. (1982). Estimation of distribution density belonging to a class of entire functions. Theory of Probability and Its Applications, 27, 551–562.
Janssen, P., Marron, J. S., Veraverbeke, N., Sarle, W. (1995). Scale measures for bandwidth selection. Journal of Nonparametric Statistics, 5, 359–380.
Jones, M. C., Sheather, S. J. (1991). Using non-stochastic terms to advantage in kernel-based estimation of integrated squared density derivatives. Statistics & Probability Letters, 11, 511–514.
Koshevnik, Y. A., Levit, B. Y. (1976). On a non-parametric analogue of the information matrix. Theory of Probability and Its Applications, 21, 738–753.
Kotz, S., Balakrishnan, N., Johnson, N. L. (2000). Continuous multivariate distributions: V. 1: models and applications (2nd ed.). New York: Wiley.
Laurent, B. (1997). Estimation of integral functionals of a density and its derivatives. Bernoulli, 3, 181–211.
Martinez, H. V., Olivares, M. M. (1999). Estimation of quadratic functionals of a density. Statistics & Probability Letters, 42, 327–332.
Panaretos, V. M., Konis, K. (2012). Nonparametric construction of multivariate kernels. Journal of the American Statistical Association, 107, 1085–1095.
Park, B. U., Marron, J. S. (1990). Comparison of data-driven bandwidth selectors. Journal of the American Statistical Association, 85, 66–78.
Puri, M. L., Sen, P. K. (1971). Nonparametric methods in multivariate Analysis. New York: Wiley.
Rektorys, K. (1969). Survey of applicable mathematics. Cambridge, MA: M.I.T. Press.
Sain, S. R., Baggerly, K. A., Scott, D. W. (1994). Cross-Validation of Multivariate Densities. Journal of the American Statistical Association, 89, 807–817.
Scott, D. W. (1992). Multivariate dendity estimation: theory, practice, and visualization. New York: John Wiley.
Sheather, S. J., Jones, M. C. (1991). A reliable data-based bandwidth selection method for kernel density estimation. Journal of the Royal Statistical Society: Series B, 53, 683–690.
Silverman, B. W. (1986). Density estimation for statistics and data analysis. London: Chapman and Hall.
Stone, C. J. (1984). An asymptotically optimal window selection rule for kernel density estimates. Annals of Statistics, 12, 1285–1297.
Wand, M. P. (1992). Error Analysis for general multivariate kernel estimators. Journal of Nonparametric Statistics, 2, 1–15.
Wand, M. P., Jones, M. C. (1993). Comparison of smoothing parameterizations in bivariate kernel density estimation. Journal of the American Statistical Association, 88, 520–528.
Wand, M. P., Jones, M. C. (1994). Multivariate plug-in bandwidth selection. Computational Statistics, 9, 97–166.
Wu, T.-J. (1995). Adaptive root n estimates of integrated squared density derivatives. Annals of Statistics, 23, 1474–1495.
Wu, T.-J. (1997). Root n bandwidth selectors for kernel estimation of density derivatives. Journal of the American Statistical Association, 92, 536–547.
Wu, T.-J., Tsai, M. H. (2004). Root n bandwidth selectors in multivariate kernel density estimation. Probability Theory and Related Fields, 129, 537–558.
Acknowledgments
The authors are grateful to an Associate Editor and two referees for many valuable suggestions which significantly improved the quality and presentation of the paper. In particular, the critical suggestions by the referees of including the study of estimating the vector functionals (3), at both the theoretical- and practical-levels, led the authors to reconstruct the modified procedure of Sect. 2.3 that significantly improved its first version.
Author information
Authors and Affiliations
Corresponding author
Additional information
Research supported by a grant from the National Science Council of Taiwan and by the National Center for Theoretical Sciences (South) of Taiwan.
About this article
Cite this article
Wu, TJ., Hsu, CY., Chen, HY. et al. Root \(n\) estimates of vectors of integrated density partial derivative functionals. Ann Inst Stat Math 66, 865–895 (2014). https://doi.org/10.1007/s10463-013-0428-7
Received:
Revised:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10463-013-0428-7