Bayesian bandwidth selection in discrete multivariate associated kernel estimators for probability mass functions

Journal of the Korean Statistical Society

Abstract

This paper proposes a nonparametric estimator for the probability mass function of multivariate data. The estimator is based on a discrete multivariate associated kernel without correlation structure. To choose the diagonal bandwidth matrix, we present a global Bayesian method as an alternative to likelihood cross-validation, and we use a Markov chain Monte Carlo (MCMC) method to derive the globally optimal bandwidth. We compare the proposed method with the cross-validation method. The performance of both methods is evaluated under the integrated squared error criterion through simulation studies based on univariate and multivariate models. We also present applications of the proposed methods to bivariate and trivariate real data. The results show that the Bayes global method performs better than the cross-validation one, even for the Poisson kernel which is the very bad discrete
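
To make the objects named in the abstract concrete, the following is a minimal sketch of a product (no-correlation) discrete associated kernel estimator with one bandwidth per coordinate, together with a leave-one-out likelihood cross-validation score. It is an illustration under stated assumptions, not the authors' implementation: the binomial associated kernel is chosen here as one example from the discrete associated kernel literature, the function names are hypothetical, and the Bayesian MCMC bandwidth selection is not shown.

```python
from math import comb, log, inf

def binomial_kernel(x, h, y):
    """Assumed kernel: Binomial(x + 1, (x + h) / (x + 1)) pmf evaluated at y.

    x is the target count, h in (0, 1] the bandwidth, y an observed count.
    The kernel is zero outside its support {0, ..., x + 1}.
    """
    if y < 0 or y > x + 1:
        return 0.0
    p = (x + h) / (x + 1)
    return comb(x + 1, y) * p**y * (1 - p) ** (x + 1 - y)

def pmf_estimate(x, data, h):
    """Product-kernel estimate at point x: the average over observations of
    the product of univariate kernels, one bandwidth h[j] per coordinate
    (the diagonal of the bandwidth matrix)."""
    total = 0.0
    for obs in data:
        weight = 1.0
        for xj, hj, yj in zip(x, h, obs):
            weight *= binomial_kernel(xj, hj, yj)
        total += weight
    return total / len(data)

def lcv_score(data, h):
    """Leave-one-out log-likelihood of the bandwidths h (larger is better)."""
    score = 0.0
    for i, obs in enumerate(data):
        rest = data[:i] + data[i + 1:]
        value = pmf_estimate(obs, rest, h)
        if value <= 0.0:
            return -inf  # some held-out point receives zero estimated mass
        score += log(value)
    return score

# Hypothetical bivariate count data, for illustration only.
data = [(1, 2), (0, 1), (2, 2)]
h = (0.3, 0.4)
estimate = pmf_estimate((1, 2), data, h)
score = lcv_score(data, h)
```

In this sketch, cross-validation would pick the bandwidths that maximise `lcv_score` over a grid in (0, 1]. The estimate is not renormalised, so its values need not sum to one over the support.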



Author information

Corresponding author

Correspondence to Nawal Belaid.

About this article

Cite this article

Belaid, N., Adjabi, S., Zougab, N. et al. Bayesian bandwidth selection in discrete multivariate associated kernel estimators for probability mass functions. J. Korean Stat. Soc. 45, 557–567 (2016). https://doi.org/10.1016/j.jkss.2016.04.001

  • DOI: https://doi.org/10.1016/j.jkss.2016.04.001
