Abstract
The paper is devoted to statistical nonparametric estimation of multivariate distribution density. The influence of data pre-clustering on the estimation accuracy of multimodal density is analyzed by means of the Monte Carlo method. It is shown that the soft clustering is more advantageous than the hard one. While a moderate increase in the number of clusters also increases the calculation time, it considerably reduces the estimation error.
Similar content being viewed by others
References
Aburdene, M.F.: Recursive computation of discrete Legendre polynomial coefficients. Multidimens. Syst. Signal Process. 7(2), 221–224 (1996)
Ćwik, J., Koronacki, J.: Multivariate density estimation: a comparative study. Neural Comput. Appl. 6(3), 173–185 (1997)
Duong, T.: Bandwidth matrices for multivariate kernel density estimation. PhD thesis (2004), p. 161
Friedman, J.H.: Exploratory projection pursuit. J. Am. Stat. Assoc. 82(397), 249–266 (1987)
Friedman, J.H., Stuetzle, W., Schroeder, A.: Projection pursuit density estimation. J. Am. Stat. Assoc. 79, 599–608 (1984)
Hall, P.: The Bootstrap and Edgeworth Expansion. Springer, New York (1992)
Huber, P.J.: Projection pursuit. Ann. Stat. 13(2), 435–475 (1985)
Hwang, J.N., Lay, S.R., Lippman, A.: Nonparametric multivariate density estimation: a comparative study. IEEE Trans. Signal Process. 42(10), 2795–2810 (1994)
Jeon, B., Landgrebe, D.A.: Fast parzen density estimation using clustering-based branch and bound. IEEE Trans. Pattern Anal. Mach. Intell. 16(9), 950–954 (1994)
Jones, M.C., Marron, J.S., Sheather, S.J.: A brief survey of bandwidth selection for density estimation. J. Am. Stat. Assoc. 91, 401–407 (1996)
Rudzkis, R., Radavičius, M.: Statistical estimations of a mixture of gaussian distributions. Acta Appl. Math. 38, 37–54 (1995)
Ruzgas, T., Rudzkis, R., Kavaliauskas, M.: Application of clustering in the non-parametric estimation of distribution density. Nonlinear Anal. Model. Control 11(4), 393–411 (2006)
Scott, D.W.: Multivariate Density Estimation: Theory, Practice, and Visualization. Wiley, New York (1992)
Silverman, B.W.: Density Estimation for Statistics and Data Analysis. Chapman and Hall, London (1986)
Stone, C.J., Hansen, M., Kooperberg, C., Truong, Y.K.: Polynomial splines and their tensor products in extended linear modeling. Ann. Stat. 25, 1371–1470 (1997)
Wong, M.: A bootstrap testing procedure for investigating the number of subpopulations. J. Stat. Comput. Simul. 22, 99–112 (1985)
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Rudzkis, R., Ruzgas, T. Clustering Effect on the Statistical Estimation Accuracy of Distribution Density. Acta Appl Math 97, 211–219 (2007). https://doi.org/10.1007/s10440-007-9127-9
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10440-007-9127-9