Abstract
In this paper we propose a smooth nonparametric estimation for the conditional probability density function based on a Bernstein polynomial representation. Our estimator can be written as a finite mixture of beta densities with data-driven weights. Using the Bernstein estimator of the conditional density function, we derive new estimators for the distribution function and conditional mean. We establish the asymptotic properties of the proposed estimators, by proving their asymptotic normality and by providing their asymptotic bias and variance. Simulation results suggest that the proposed estimators can outperform the Nadaraya-Watson estimator and, in some specific setups, the local linear kernel estimators. Finally, we use our estimators for modeling the income in Italy, conditional on year from 1951 to 1998, and have another look at the well known Old Faithful Geyser data.
Similar content being viewed by others
References
Azzalini, A., & Bowman, A. W. (1990). A look at some data on the Old Faithful geyser. Applied Statistics, 357–365.
Babu, G. J., Canty, A. J., & Chaubey, Y. P. (2002). Application of Bernstein polynomials for smooth estimation of a distribution and density function. Journal of Statistical Planing and Inference, 105, 377–392.
Babu, G. J., & Chaubey, Y. P. (2006). Smooth estimation of a distribution and density function on a hyper-cube using Bernstein polynomials for dependent random vectors. Statistics & Probability Letters, 76, 959–969.
Baiocchi, G. (2006). Economic applications of nonparametric methods (Ph.D. thesis), University of York.
Bashtannyk, D. M., & Hyndman, R. J. (2001). Bandwidth selection for kernel conditional density estimation. Computational Statistics & Data Analysis, 36, 279–298.
Belalia, M. (2016). On the asymptotic properties of the Bernstein estimator of the multivariate distribution function. Statistics & Probability Letters, 110(C), 249–256.
Belalia, M., Bouezmarni, T., & Leblanc, A. (2017). Smooth conditional distribution estimators using Bernstein polynomials. Computational Statistics & Data Analysis, 111, 166–182.
Bouezmarni, T., & Rolin, J. (2007). Bernstein estimator for unbounded density function. Journal of Nonparametric Statistics, 19, 145–161.
Efromovich, S. (2007). Conditional density estimation in a regression setting. The Annals of Statistics, 35(6), 2504–2535.
Efromovich, S. (2010). Dimension reduction and adaptation in conditional density estimation. Journal of the American Statistical Association, 105(490), 761–774.
Fan, J., & Gijbels, I. (1996). Local polynomial modelling and its applications. London: Chapman and Hall.
Fan, J., Yao, Q., & Tong, H. (1996). Estimation of conditional densities and sensitivity measures in nonlinear dynamical systems. Biometrika, 83(1), 189–206.
Fan, J., & Yim, T. H. (2004). A crossvalidation method for estimating conditional densities. Biometrika, 91(4), 819–834.
Faugeras, O. P. (2009). A quantile-copula approach to conditional density estimation. Journal of Multivariate Analysis, 100, 2083–2099.
Fu, G., Shih, F. Y., & Wang, H. (2011). A kernel-based parametric method for conditional density estimation. Pattern Recognition, 44, 284–294.
Gawronski, W., & Stadtmüller, U. (1981). Smoothing histograms by means of lattice- and continuous distributions. Metrika, 28, 155–164.
Ghosal, S. (2001). Convergence rates for density estimation with Bernstein polynomials. The Annals of Statistics, 28, 1264–1280.
Hall, P., Racine, J. S., & Li, Q. (2004). Cross-validation and the estimation of conditional probability densities. Journal of the American Statistical Association, 99(468), 1015–1026.
Hall, P., Wolff, R. C. L., & Yao, Q. (1999). Methods for estimating a conditional distribution function. Journal of the american Statistical Association, 94(445), 154–163.
Hansen, B. E. (2004). Nonparametric estimation of smooth conditional distributions: Technical report, Madison: University of Wisconsin.
Härdle, W. (2012). Smoothing techniques: with implementation in S. Springer Science & Business Media.
Hyndman, R. J., Bashtannyk, D. M., & Grunwald, G. K. (1996). Estimating and visualizing conditional densities. Journal of Computational and Graphical Statistics, 5(4), 315–336.
Hyndman, R. J., & Yao, Q. (2002). Nonparametric estimation and symmetry tests for conditional density functions. Journal of Nonparametric Statistics, 14(3), 259–278.
Igarashi, G., & Kakizawa, Y. (2014). On improving convergence rate of Bernstein polynomial density estimator. Journal of Nonparametric Statistics, 26, 61–84.
Janssen, P., Swanepoel, J., & Veraverbeke, N. (2014). A note on the asymptotic bevahior of the Bernstein estimator of the copula density. Journal of Multivariate Analysis, 124, 480–487.
Janssen, P., Swanepoel, J., & Veraverbeke, N. (2016). Bernstein estimation for a copula derivative with application to conditional distribution and regression functionals. TEST, 25(2), 351–374.
Kakizawa, Y. (2004). Bernstein polynomial probability density estimation. Journal of Nonparametric Statistics, 16, 709–729.
Leblanc, A. (2010). A bias-reduced approach to density estimation using Bernstein polynomials. Journal of Nonparametric Statistics, 22, 459–475.
Leblanc, A. (2012a). On estimating distribution functions using Bernstein polynomials. Annals of the Institute of Statistical Mathematics, 64, 919–943.
Leblanc, A. (2012b). On the boundary properties of Bernstein polynomial estimators of density and distribution functions. Journal of Statistical Planning and Inference, 142, 2762–2778.
Lehmann, E. L. (1966). Some concepts of dependence. The Annals of Mathematical Statistics, 1137–1153.
Li, Q., & Racine, J. S. (2007). Nonparametric econometrics: theory and practice. Princeton University Press.
Li, Q., & Racine, J. S. (2008). Nonparametric estimation of conditional CDF and quantile functions with mixed categorical and continuous data. Journal of Business & Economic Statistics, 26(4), 423–434.
Nadaraya, E. A. (1965). On nonparametric estimates of density functions and regression curves. Theory of Probability and its Applications, 10, 186–190.
Petrone, S. (1999). Bayesian Density estimation using Bernstein polynomials. The Canadian Journal of Statistics, 27, 105–126.
Rosenblatt, M. (1969). Conditional probability density and regression estimators. In P. Krishnaiah (Ed.), Multivariate analysis II (pp. 25–31). New York: Academic Press.
Tenbusch, A. (1994). Two-dimensional Bernstein polynomial density estimation. Metrika, 41, 233–253.
Tenbusch, A. (1997). Nonparametric curve estimation with Bernstein estimates. Metrika, 45, 1–30.
Turnbull, B. C., & Ghosh, S. K. (2014). Unimodal density estimation using Bernstein polynomials. Computational Statistics & Data Analysis, 72, 13–29.
Veraverbeke, N., Gijbels, I., & Omelka, M. (2014). Preadjusted non-parametric estimation of a conditional distribution function. Journal of the Royal Statistical Society. Series B., 76, 399–438.
Vitale, R. (1975). A Bernstein polynomial approach to density estimation. In M. Puri (Ed.), Statistical inference and related topics, Volume 2 (pp. 87–99). New York: Academic Press.
Wahba, G. (1971). A polynomial algorithm for density estimation. The Annals of Mathematical Statistics, 42(6), 1870–1886.
Watson, G. S. (1964). Smooth regression analysis. Sankhya A, 26(4), 359–372.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Belalia, M., Bouezmarni, T. & Leblanc, A. Bernstein conditional density estimation with application to conditional distribution and regression functions. J. Korean Stat. Soc. 48, 356–383 (2019). https://doi.org/10.1016/j.jkss.2019.05.005
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1016/j.jkss.2019.05.005