Abstract
This work takes advantage of semiparametric modelling which improves significantly in many situations the estimation accuracy of the purely nonparametric approach. Herein for semiparametric estimations of probability mass function (pmf) of count data, and an unknown count regression function (crf), the kernel used is a binomial one and the bandiwdth selection is investigated by developing Bayesian approaches. About the latter, Bayes local and global bandwidth approaches are used to establish data-driven selection procedures in semiparametric framework. From conjugate beta prior distributions of the smoothing parameter and under the squared errors loss function, Bayes estimate for pmf is obtained in closed form. This is not available for the crf which is computed by the Markov Chain Monte Carlo technique. Simulation studies demonstrate that both proposed methods perform better than the classical cross-validation procedures, in particular the smoothing quality and execution times are optimized. All applications are made on real data sets.
Similar content being viewed by others
References
Abdous B, Kokonendji CC, Senga Kiessé T (2012) On semiparametric regression for count explanatory variables. J Stat Plan Inference 142:1537–1548
Brewer MJ (1998) A modelling approach for bandwidth selection in kernel density estimation. In: Proceedings of COMPSTAT 1998, Physica Verlag, Heidelberg, 203–208
Cuny HE, Senga Kiessé T (2014) On modeling wood formation using parametric and semiparametric regressions for count data. Commun Stat Simul Comput. doi:10.1080/03610918.2013.875570
Fan J, Wu Y, Feng Y (2009) Local quasi-likelihood with a parametric guide. Ann Stat 37:4153–4183
Gelman A, Roberts GO, Gilks WR (1996) Efficient metropolis jumping rules. Bayesian Stat 5:599–608
Glad IK (1998) Parametrically guided non-parametric regression. Scand J Stat 25:649–668
Hjort NL, Glad IK (1995) Nonparametric density estimation with a parametric start. Ann Stat 23:882–904
Hjort NL, Jones MC (1996) Locally parametric nonparametric density estimation. Ann Stat 24:1619–1647
Hu S, Poskitt DS, Zhang X (2012) Bayesian adaptive bandwidth kernel density estimation of irregular multivariate distributions. Comput Stat Data Anal 56:732–740
Kim S, Shephard N, Chib S (1998) Stochastic volatility: likelihood inference and comparison with ARCH models. Rev Econ Stud 65:361–393
Kokonendji CC (2014) Over- and underdispersion models. In: Balakrishnan N (ed) The wiley encyclopedia of clinical trials—methods and applications of statistics in clinical trials, vol. 2, planning, analysis, and inferential methods. Wiley, New York, pp 506–526 Chap. 30
Kokonendji CC, Pérez-Casany M (2012) A note on weighted count distributions. J Stat Theory Appl 11:337–352
Kokonendji CC, Senga Kiessé T (2011) Discrete associated kernels method and extensions. Stat Methodol 8:497–516
Kokonendji CC, Zocchi SS (2010) Extensions of discrete triangular distribution and boundary bias in kernel estimation for discrete functions. Stat Probab Lett 80:1655–1662
Kokonendji CC, Senga Kiessé T, Balakrishnan N (2009a) Semiparametric estimation for count data through weighted distributions. J Stat Plann Inference 139:3625–3638
Kokonendji CC, Senga Kiessé T, Demétrio CGB (2009b) Appropriate kernel regression on a count explanatory variable and applications. Adv Appl Stat 12:99–125
McCullagh P, Nelder JA (1989) Generalized linear models, 2nd edn. Chapman and Hall, London
Naito K (2004) Semiparametric density estimation by local L2-fitting. Ann Stat 32:1162–1191
Senga Kiessé T (2008) Nonparametric approach by discrete associated kernel for count data. Ph.D. of University of Pau, France. http://tel.archives-ouvertes.fr/tel-00372180/fr/
Senga Kiessé T, Cuny HE (2014) Discrete triangular associated kernel and bandwidth choices in semiparametric estimation for count data. J Stat Comput Simul 84:1813–1829
Senga Kiessé T, Mizère D (2012) Weighted Poisson and semiparametric kernel models applied for a parasite growth. Aust N Z J Stat 55:1–13
Senga Kiessé T, Rivoire M (2011) Discrete semiparametric regression models with associated kernel and applications. J Nonparametric Stat 23:927–941
Wansouwé WE, Kokonendji CC, Kolyang DT (2015) Disake: discrete associated kernel estimators. http://cran.r-project.org/web/packages/Disake/
Zhang X, Brooks RD, King ML (2009) A Bayesian approach to bandwidth selection for multivariate kernel regression with an application to state-price density estimation. J Econom 153:21–32
Zhang X, King ML, Shang HL (2014) A sampling algorithm for bandwidth estimation in a nonparametric regression model with a flexible error density. Comput Stat Data Anal 78:218–234
Zougab N, Adjabi S, Kokonendji CC (2012) Binomial kernel and Bayes local bandwidth in discrete functions estimation. J Nonparametric Stat 24:783–795
Zougab N, Adjabi S, Kokonendji CC (2014a) Bayesian approach in nonparametric count regression with binomial kernel. Commun Stat Simul Comput 43:1052–1063
Zougab N, Adjabi S, Kokonendji CC (2014b) Bayesian estimation of adaptive bandwidth matrices in multivariate kernel density estimation. Comput Stat Data Anal 75:28–38
Acknowledgments
The research and education chair of civil engineering and eco-construction is financed by the Chamber of Trade and Industry of Nantes and Saint-Nazaire cities, the CARENE (urban agglomeration of Saint-Nazaire), Charier, Architectes Ingénieurs Associés, Vinci construction, the Regional Federation of Buildings, and the Regional Federation of Public Works. The authors wish also to thank these partners for their patronage. The authors would like to thank two anonymous referees for their careful readings and helpful comments that led to a considerable improvement of the paper.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Senga Kiessé, T., Zougab, N. & Kokonendji, C.C. Bayesian estimation of bandwidth in semiparametric kernel estimation of unknown probability mass and regression functions of count data. Comput Stat 31, 189–206 (2016). https://doi.org/10.1007/s00180-015-0627-1
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00180-015-0627-1