Bayesian local bandwidths in a flexible semiparametric kernel estimation for multivariate count data with diagnostics

Somé, Sobom M.; Kokonendji, Célestin C.; Belaid, Nawel; Adjabi, Smail; Abid, Rahma

doi:10.1007/s10260-023-00682-5

Bayesian local bandwidths in a flexible semiparametric kernel estimation for multivariate count data with diagnostics

Original Paper
Published: 23 January 2023

Volume 32, pages 843–865, (2023)
Cite this article

Statistical Methods & Applications Aims and scope Submit manuscript

Sobom M. Somé ORCID: orcid.org/0000-0003-4454-5777^1,2,
Célestin C. Kokonendji^3,4,
Nawel Belaid⁵,
Smail Adjabi⁵ &
…
Rahma Abid⁶

579 Accesses
1 Citation
Explore all metrics

Abstract

In this paper, we consider a flexible semiparametric approach for estimating multivariate probability mass functions. The corresponding estimator is governed by a parametric starter, for instance a multivariate Poisson distribution with nonnegative cross correlations which is basically estimated through an expectation–maximization algorithm, and a nonparametric part which is an unknown weight discrete function to be smoothed through multiple binomial kernels. Our central focus is upon the selection matrix of bandwidths by the local Bayesian method. We additionally discuss the diagnostic model to enact an appropriate choice between the parametric, semiparametric and nonparametric approaches. Retaining a pure nonparametric method implies losing parametric benefices in this modelling framework. Practical applications, including a tail probability estimation, on multivariate count datasets are analyzed under several scenarios of correlations and dispersions. This semiparametic approach demonstrates superior performances and better interpretations compared to parametric and nonparametric ones.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Bayesian estimation of bandwidth in semiparametric kernel estimation of unknown probability mass and regression functions of count data

Article 20 October 2015

Bayesian approach to bandwidth selection for multivariate count regression function estimation by associated discrete kernel

Article 01 December 2017

Reduction of Potential Boundary Bias in Kernel Cumulative Distribution Estimation in Univariate and Multivariate Settings

Article 28 February 2024

Data availibility

The data that support the findings of this study are available in this published article, and also from the corresponding author upon request.

Code availability

The code that supports the findings of this study is available from the corresponding author upon request.

References

Abdous B, Kokonendji CC, Senga Kiessé T (2012) On semiparametric regression for count explanatory variables. J Stat Plan Inference 142:1537–1548
Article MathSciNet MATH Google Scholar
Aitchinson J, Ho CH (1989) The multivariate Poisson-log normal distribution. Biometrika 75:621–629
MathSciNet Google Scholar
Arab A, Holan SH, Wikle CK, Wildhaber ML (2011) Semiparametric bivariate zero-inflated Poisson models with application to studies of abundance for multiple species. Environmetrics 23:183–196
Article MathSciNet Google Scholar
Belaid N, Adjabi S, Zougab N, Kokonendji CC (2016a) Bayesian bandwidth selection in discrete multivariate associated kernel estimators for probability mass functions. J Korean Stat Soc 45:557–567
Belaid N, Adjabi S, Kokonendji CC, Zougab N (2016b) Bayesian local bandwidth selector in multivariate associated kernel estimator for joint probability mass functions. J Stat Comput Simul 86:3667–3681
Belaid N, Adjabi S, Kokonendji CC, Zougab N (2018) Bayesian adaptive bandwidth selector for multivariate binomial kernel estimator. Commun Stat Theory Methods 47:2988–3001
Article MATH Google Scholar
Berkhout P, Plug E (2004) A bivariate Poisson count data model using conditional probabilities. Stat Neerl 58:349–364
Article MathSciNet MATH Google Scholar
Corporación Favorita (2018) Grocery sales data. https://www.kaggle.com/c/favorita-grocery-salesforecasting/data. Accessed 12 Nov 2021
Cuenin J, Jørgensen B, Kokonendji CC (2016) Simulations of full multivariate Tweedie with flexible dependence structure. Comput Stat 31:1477–1492
Article MathSciNet MATH Google Scholar
Hall P, Marron JS (1991) Lower bounds for bandwidth selection in density estimation. Probab Theory Relat Fields 90:149–173
Article MathSciNet MATH Google Scholar
Harfouche L, Adjabi S, Zougab N, Funke B (2018) Multiplicative bias correction for discrete kernels. Stat Methods Appl 27:253–276
Article MathSciNet MATH Google Scholar
Huang A, Sippel L, Fung T (2022) Consistent second-order discrete kernel smoothing using dispersed Conway–Maxwell–Poisson kernels. Comput Stat 37:551–563
Article MathSciNet MATH Google Scholar
Johnson NL, Kotz S, Balakrishnan N (1997) Discrete multivariate distributions. Wiley, New York
MATH Google Scholar
Jørgensen B, Kokonendji CC (2016) Discrete dispersion models and their Tweedie asymptotics. AStA Adv Stat Anal 100:43–78
Article MathSciNet MATH Google Scholar
Kano K, Kawamura K (1991) On recurrence relations for the probability function of multivariate generalized Poisson distribution. Commun Stat Theory Methods 20:165–178
Article MathSciNet MATH Google Scholar
Karlis D (2003) An EM algorithm for multivariate Poisson distribution and related models. J Appl Stat 30:63–77
Article MathSciNet MATH Google Scholar
Karlis D, Ntzoufras J (2005) Bivariate Poisson and diagonal inflated bivariate Poisson regression Models in R. J Stat Softw 14(10):1–36
Article Google Scholar
Kocherlakota S, Kocherlakota K (1992) Bivariate discrete distributions. Marcel Dekker Inc, New York
MATH Google Scholar
Kokonendji CC, Puig P (2018) Fisher dispersion index for multivariate count distributions: a review and a new proposal. J Multivar Anal 165:180–193
Article MathSciNet MATH Google Scholar
Kokonendji CC, Senga Kiessé T (2011) Discrete associated kernels method and extensions. Stat Methodol 8:497–516
Article MathSciNet MATH Google Scholar
Kokonendji CC, Senga Kiessé T, Balakrishnan N (2009) Semiparametric estimation for count data through weighted distributions. J Stat Plan Inference 139:3625–3638
Article MathSciNet MATH Google Scholar
Kokonendji CC, Somé SM (2018) On multivariate associated kernels to estimate general density functions. J Korean Stat Soc 47:112–126
Article MathSciNet MATH Google Scholar
Kokonendji CC, Somé SM (2021) Bayesian bandwidths in semiparametric modelling for nonnegative orthant data with diagnostics. Stats 4:162–183
Article Google Scholar
Kokonendji CC, Touré AY, Sawadogo A (2020) Relative variation indexes for multivariate continuous distributions on $[0,\infty )^k$ and extensions. AStA Adv Stat Anal 104:285–307
Article MathSciNet MATH Google Scholar
Kokonendji CC, Zougab N, Senga Kiessé T (2017) Poisson-weighted estimation by discrete kernel with application to radiation biodosimetry. In: Ainsbury EA, Calle ML, Cardis E, Einbeck J, Gomez G, Puig P (eds) Biomedical big data & statistics for low dose radiation research-extended abstracts fall 2015, vol. VII, Part II, Chap. 19. Springer, Basel, pp 115–120
Chapter Google Scholar
Krummenauer F (1998) Efficient simulation of multivariate binomial and Poisson distributions. Biom J 40:823–832
Article MathSciNet MATH Google Scholar
Mellinger GD, Sylwester DL, Gaffey WR, Manheimer DI (1965) A mathematical model with application to a study of accident repeatedness among children. J Am Stat Assoc 60:1046–1059
Article Google Scholar
NBA. NBA All-Star Game, 2000–2016. https://www.kaggle.com/fmejia21/nba-all-star-game-20002016?. Accessed 12 Nov 2021
R Core Team (2021) R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna
Sellers KF, Li T, Wu Y, Balakrishnan N (2021) A flexible multivariate distribution for correlated count data. Stats 4:308–326
Article Google Scholar
Senga Kiessé T, Durrieu G (2020) Discrete optimal symmetric kernels for estimating count data distributions. Preprint hal-02503789
Somé SM, Kokonendji CC (2016) Effects of associated kernels in nonparametric multiple regressions. J Stat Theory Pract 10:456–471
Article MathSciNet MATH Google Scholar
Su P (2015) Generation of multivariate data with arbitrary marginals—Package ’NORTARA’. https://cran.r-project.org/web/packages/NORTARA/
Tsionas EG (1999) Bayesian analysis of the multivariate Poisson distribution. Commun Stat Theory Methods 28:431–451
Article MathSciNet MATH Google Scholar
Tsionas EG (2001) Bayesian multivariate Poisson regression. Commun Stat Theory Methods 30:243–255
Article MathSciNet MATH Google Scholar
White H (1982) Maximum likelihood estimation of misspecified models. Econometrica 50:1–26
Article MathSciNet MATH Google Scholar
Zougab N, Adjabi S, Kokonendji CC (2012) Binomial kernel and Bayes local bandwidth in discrete functions estimation. J Nonparametr Stat 24:783–795
Article MathSciNet MATH Google Scholar

Download references

Acknowledgements

This article is dedicated to, and in memory of Professor Longin Somé (1955-2022). The authors sincerely thank an Associate Editor and two anonymous referees for their valuable comments that improved the paper. Part of this work was carried out while the second author was at Research Unit LaMOS - University of Béjaia as a visiting scientist. For the second coauthor, this work is supported by the EIPHI Graduate School (contract ANR-17-EURE-0002).

Funding

The LmB (from the second author) receives support from the EIPHI Graduate School (contract ANR-17-EURE-0002).

Author information

Authors and Affiliations

Laboratoire Sciences et Technologies, Université Thomas SANKARA, 12 BP 417 Ouagadougou 12, Ouagadougou, Burkina Faso
Sobom M. Somé
Laboratoire d’Analyse Numérique d’Informatique et de BIOmathématique, Université Joseph KI-ZERBO, 03 B.P. 7021, Ouagadougou, Burkina Faso
Sobom M. Somé
Laboratoire de Mathématiques de Besançon UMR 6623 CNRS-UFC, Université de Franche-Comté, 16 Route de Gray, 25030, Besançon cedex, France
Célestin C. Kokonendji
Laboratoire de Mathématiques et Connexes de Bangui, Université de Bangui, B.P. 908, Bangui, République centrafricaine
Célestin C. Kokonendji
Research Unit LaMOS, University of Bejaia, Route de Targa-Ouzemour, 06000, Bejaïa, Algeria
Nawel Belaid & Smail Adjabi
Laboratory of Probability and Statistics, University of Sfax, Sfax, Tunisia and University Paris-Dauphine Tunis, Tunis, Tunisia
Rahma Abid

Authors

Sobom M. Somé
View author publications
You can also search for this author in PubMed Google Scholar
Célestin C. Kokonendji
View author publications
You can also search for this author in PubMed Google Scholar
Nawel Belaid
View author publications
You can also search for this author in PubMed Google Scholar
Smail Adjabi
View author publications
You can also search for this author in PubMed Google Scholar
Rahma Abid
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Sobom M. Somé.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendix

Proof of Theorem 1

From (6), it is enough to calculate $\textrm{Bias}[{\widetilde{w}}_{n}({\textbf{x}})]$ and $\textrm{Var}[{\widetilde{w}}_{n}({\textbf{x}})]$, since one has $\textrm{Bias}[{\widehat{f}}_n({\textbf{x}})]=p_{d}({\textbf{x}};\widehat{\varvec{\theta }}_{n}){\mathbb {E}}[{\widetilde{w}}_n({\textbf{x}})]-f({\textbf{x}})$ and $\textrm{var}[{\widehat{f}}_n({\textbf{x}})]=[p_{d}({\textbf{x}};\widehat{\varvec{\theta }}_{n})]^2 \textrm{var} [{\widetilde{w}}_n({\textbf{x}})]$. Hence,

$$\begin{aligned} {\mathbb {E}}({\widetilde{w}}_n({\textbf{x}}))={\mathbb {E}}\left( \prod _{j=1}^{d} K_{x_{j} h_{j}}\left( X_{1 j}\right) /p_{d}({\textbf{X}}_1;\,\widehat{\varvec{\theta }}_{n})\right) ={\mathbb {E}}\left[ w\left( {\textsf{Z}}_{x_{1}, h_{1}}, {\textsf{Z}}_{x_{2}, h_{2}}, \ldots , {\textsf{Z}}_{x_{d}, h_{d}}\right) \right] , \end{aligned}$$

(13)

where the random variables ${\textsf{Z}}_{x_{j},h_{j}}$ are independent with mean $\mu _j$ and variance $\sigma _j$.

Then, using a second order Taylor exapansion and finite differences, we obtain

$$\begin{aligned} & w\left( {\textsf{Z}}_{x_{1}, h_{1}}, {\textsf{Z}}_{x_{2}, h_{2}}, \ldots , {\textsf{Z}}_{x_{d}, h_{d}}\right) =w\left( \mu _{1}, \mu _{2}, \ldots , \mu _{d}\right) \\ & \quad +\sum _{j=1}^{d}\left( {\textsf{Z}}_{x_{j} h_{j}}-\mu _{j}\right) w_{j}^{(1)}+\frac{1}{2} \sum _{j=1}^{d}\left( {\textsf{Z}}_{x_{j}, h_{j}}-\mu _{j}\right) ^{2} w_{j j}^{(2)} \\ & \quad +\sum _{l \ne j}^{d}\left( {\textsf{Z}}_{x_{l},h_{l}}-\mu _{l}\right) \left( {\textsf{Z}}_{x_{j}, h_{j}}-\mu _{j}\right) w_{l j}^{(2)}+o\left( \sum _{j=1}^{d} h_{j}^{2}\right) \end{aligned}$$

and (13) becomes

$$\begin{aligned} {\mathbb {E}}({\widetilde{w}}_n({\textbf{x}}))&= w\left( \mu _{1}, \mu _{2}, \ldots , \mu _{d}\right) +\frac{1}{2} \sum _{j=1}^{d} {\text {Var}}\left( {\textsf{Z}}_{x_{j}, h_{j}}\right) w_{j j}^{(2)}+o\left( \sum _{j=1}^{d} h_{j}^{2}\right) \\&= \left[ w\left( x_{1}, \ldots , x_{d}\right) +\sum _{j=1}^{d} h_{j} w_{j}^{(1)}+\frac{1}{2} \sum _{j=1}^{d} {\text {Var}}\left( {\textsf{Z}}_{x_{j} h_{j}}\right) w_{j j}^{(2)}\right] \{1+o(1)+n^{-2}\} , \end{aligned}$$

and the desired Bias is deduced.

The pointwise variance is successively obtained with

$$\begin{aligned} {\text {Var}}({\widehat{f}}({\textbf{x}})))&= \frac{1}{n}{\text {Var}}\left( \prod _{j=1}^{d} K_{x_{j}, h_{j}}\left( X_{1j}\right) \{p_{d}({\textbf{X}}_1;\,\widehat{\varvec{\theta }}_{n})\}^{-1}\right) \\&= \frac{1}{n}\left[ f({\textbf{x}}) \{p_{d}({\textbf{x}};\,\widehat{\varvec{\theta }}_{n})\}^{-2}\left( \prod _{j=1}^{d} {\mathbb {P}}\left( {\textsf{Z}}_{x_{j}, h_{j}}=x_{j}\right) \right) ^{2}\right. \\ & \left. +\sum _{{\textbf{y}} \in {\mathbb {T}}_{d} \backslash {\textbf{x}}} f({\textbf{y}}) \{p_{d}({\textbf{y}};\,\widehat{\varvec{\theta }}_{n})\}^{-1}\left( \prod _{j=1}^{d} {\mathbb {P}}\left( {\textsf{Z}}_{x_{j}, h_{j}}=y_{j}\right) \right) ^{2}\right] \\ & -\frac{1}{n}\left[ \left( \prod _{j=1}^{d} f({\textbf{x}}) \{p_{d}({\textbf{x}};\,\widehat{\varvec{\theta }}_{n})\}^{-1} {\mathbb {P}}\left( {\textsf{Z}}_{x_{j} h_{j}}=x_{j}\right) \right) ^{2}\right. \\ & \left. +\left( \sum _{{\textbf{z}} \in {\mathbb {T}}_{d} \backslash {\textbf{x}}} f({\textbf{z}}) \{p_{d}({\textbf{z}};\,\widehat{\varvec{\theta }}_{n})\}^{-1} \prod _{j=1}^{d} {\mathbb {P}}\left( {\textsf{Z}}_{x_{j}, h_{j}}=z_{j}\right) \right) ^{2}\right] \\&= \frac{1}{n} f(\textrm{x}) \{p_{d}({\textbf{x}};\,\widehat{\varvec{\theta }}_{n})\}^{-2}(1-f(\textrm{x}))\left( \prod _{j=1}^{d} {\mathbb {P}}\left( {\textsf{Z}}_{x_{j}, h_{j}}=x_{j}\right) \right) ^{2}+R, \end{aligned}$$

where

$$\begin{aligned} R&= \frac{1}{n}\left[ \sum _{{\textbf{y}} \in {\mathbb {T}}_{d} \backslash {\textbf{x}}} f({\textbf{y}}) \{p_{d}({\textbf{y}};\,\widehat{\varvec{\theta }}_{n})\}^{-2}\left( \prod _{j=1}^{d} {\mathbb {P}}\left( {\textsf{Z}}_{x_{j} h_{j}}=y_{j}\right) \right) ^{2}\right. \\ & \left. -\left( \sum _{{\textbf{z}} \in {\mathbb {T}}_{d} \backslash {\textbf{x}}} f({\textbf{z}}) \{p_{d}({\textbf{z}};\,\widehat{\varvec{\theta }}_{n})\}^{-1} \prod _{j=1}^{d} {\mathbb {P}}\left( {\textsf{Z}}_{x_{j} h_{j}}=z_{j}\right) \right) ^{2}\right] \rightarrow o\left( \frac{1}{n}\right) . \end{aligned}$$

Hence, this concludes the proof. $\square$

Proof of Theorem 2

Expanding $(x+h)^y=\sum \limits _{k=0}^{y}x^k h^{y-k}y![k!(y-k)!]^{-1}$, and denoting by ${\textbf{L}}:={\widehat{f}}_{n}({\textbf{x}})\pi ({\textbf{H}})$, we successively express ${\textbf{L}}$ as

$$\begin{aligned} {\textbf{L}}&= \frac{1}{n}\sum _{i=1}^n \frac{p_{d}({\textbf{x}},\widehat{\varvec{\theta }}_n)}{p_{d}({\textbf{X}}_i,\widehat{\varvec{\theta }}_n)} \prod _{j=1}^d\frac{(x_j+1)!}{X_{ij}!(x_j+1-X_{ij})!}\left( \frac{x_j+h_j}{x_j+1}\right) ^{X_{ij}}\left( \frac{1-h_j}{x_j+1}\right) ^{x_j+1-X_{ij}} \\ & \times \frac{1}{{\textbf{B}}(\alpha ,\beta )}h_j^{\alpha -1}(1-h_j)^{\beta -1} \\&= \frac{1}{n{\textbf{B}}(\alpha ,\beta )^d}\sum _{i=1}^n \frac{p_{d}({\textbf{x}},\widehat{\varvec{\theta }}_n)}{p_{d}({\textbf{X}}_i,\widehat{\varvec{\theta }}_n)} \prod _{j=1}^d\sum _{k=0}^{X_{ij}}\frac{(x_j+1)!~x_j^k~h_j^{X_{ij}-k+\alpha -1}~(1-h_j)^{x_j-X_{ij}+\beta }}{(x_j+1-X_{ij})!~k!~(X_{ij}-k)!~(x_j+1)^{x_j+1}} \\&= \frac{1}{n{\textbf{B}}(\alpha ,\beta )^d}\times {\textbf{N}}. \end{aligned}$$

(14)

By direct calculation, the second term $\int _{{\mathcal {M}}}{\widehat{f}}_{n}({\textbf{x}})\pi ({\textbf{H}})d{\textbf{H}}$ of (9) becomes

$$\begin{aligned} \int _{(0,1]^d}{\widehat{f}}_{n}({\textbf{x}})\pi ({\textbf{H}})d{\textbf{H}}=\frac{1}{n{\textbf{B}}(\alpha ,\beta )^d}\times {\textbf{D}}. \end{aligned}$$

(15)

Combining (14) and (15) as in (9), we easily get the closed expression of the posterior distribution ${\widehat{\pi }}({\textbf{H}}|{\textbf{x}},{\textbf{X}}_1,\dots ,{\textbf{X}}_n)$ provided in (11).

The diagonal elements of the matrix of bandwidths are obtained as:

$$\begin{aligned} \widehat{{\textbf{H}}}({\textbf{x}})&= \int _{(0,1]^d}(h_1,\ldots ,h_d) {\widehat{\pi }}({\textbf{H}}|{\textbf{x}},{\textbf{X}}_1,\ldots ,{\textbf{X}}_n)dh_1,\ldots , dh_d\\&= \int _{(0,1]^d}h_1{\widehat{\pi }}({\textbf{H}}|{\textbf{x}},{\textbf{X}}_1,\ldots ,{\textbf{X}}_n)dh_1,\ldots ,dh_d,\ldots ,\\ & \quad \int _{(0,1]^d}h_d{\widehat{\pi }}({\textbf{H}}|{\textbf{x}},{\textbf{X}}_1,\ldots ,{\textbf{X}}_n)dh_1\ldots , dh_d. \end{aligned}$$

Then

$$\begin{aligned} {\widehat{h}}_j(x_j)&= \sum _{i=1}^n \frac{p_{d}({\textbf{x}},\widehat{\varvec{\theta }}_n)}{p_{d}({\textbf{X}}_i,\widehat{\varvec{\theta }}_n)} \left( \sum _{k=0}^{X_{ij}}\frac{(x_j+1)!x_j^{k}{\textbf{B}}(X_{ij}-k+\alpha +1,x_j-X_{ij}+\beta +1)}{(x_j+1-X_{ij})!k!(X_{ij}-k)!(x_j+1)^{x_j+1}}\right) \\ & \times \left( \prod \limits _{\begin{array}{c} m=1 \\ m \ne j \end{array}}^{d} \sum _{k=0}^{X_{im}}\frac{(x_m+1)!x_m^{k}{\textbf{B}}(X_{im}-k+\alpha ,x_m-X_{im}+\beta +1)}{(x_m+1-X_{im})!k!(X_{im}-k)!(x_m+1)^{x_m+1}}\right) \\ & \times \left( \sum _{i=1}^n \frac{p_{d}({\textbf{x}},\widehat{\varvec{\theta }}_n)}{p_{d}({\textbf{X}}_i,\widehat{\varvec{\theta }}_n)}\prod _{s=1}^d \sum _{k=0}^{X_{is}}\frac{(x_s+1)!x_s^k {\textbf{B}}(X_{is}-k+\alpha ,x_s-X_{is}+\beta +1)}{(x_s+1-X_{is})!k!(X_{is}-k)!(x_s+1)^{x_s+1}}\right) ^{-1}\\&= {\textbf{D}}^{-1}\sum \limits _{i=1}^{n} \frac{p_{d}({\textbf{x}},\widehat{\varvec{\theta }}_n)}{p_{d}({\textbf{X}}_i,\widehat{\varvec{\theta }}_n)} \left( \sum _{k=0}^{X_{ij}}{\textbf{A}}_{ijk}{\textbf{B}}(X_{ij}-k+\alpha +1,x_j-X_{ij}+\beta +1)\right) \\ & \times \left( \prod \limits _{\begin{array}{c} m=1 \\ m \ne j \end{array}}^{d}\sum _{k=0}^{X_{im}}{\textbf{A}}_{imk}{\textbf{B}}(X_{im}-k+\alpha ,x_m-X_{im}+\beta +1)\right) , \end{aligned}$$

which corresponds to Eq. (12). $\square$

Proof of Proposition 1

Using the property of the beta function and for fixed $j \in 1,\dots ,d$, ${\widehat{h}}_{j}(x_j)$ is written as

$$\begin{aligned} & {\widehat{h}}_j(x_j)=\sum \limits _{i=1}^{n} [p_{d}({\textbf{x}},\widehat{\varvec{\theta }}_n)/p_{d}({\textbf{X}}_i,\widehat{\varvec{\theta }}_n)]\times \\ & \qquad \left[ \sum _{k=0}^{X_{ij}}{\textbf{A}}_{ijk}{\textbf{B}}(X_{ij}-k+\alpha +1,x_j-X_{ij}+\beta _{n}+1)\right] \\ & \qquad \times \left[ \prod \limits _{\begin{array}{c} m=1 \\ m \ne j \end{array}}^{d}\sum _{k=0}^{X_{im}}{\textbf{A}}_{imk}{\textbf{B}}(X_{im}-k+\alpha ,x_m-X_{im}+\beta _{n}+1)\right] \\ & \qquad \times \left\{ \sum \limits _{i=1}^{n} \left[ p_{d}({\textbf{x}},\widehat{\varvec{\theta }}_n)/p_{d}({\textbf{X}}_i,\right. \right. \\ & \qquad \left. \left. \widehat{\varvec{\theta }}_n)\right] \left[ \prod _{j=1}^{d}\sum _{k=0}^{X_{ij}}{\textbf{A}}_{ijk}{\textbf{B}}(X_{ij}-k+\alpha ,x_j-X_{ij}+\beta _{n}+1)\right] \right\} ^{-1}\\ & \quad = \sum \limits _{i=1}^{n} [p_{d}({\textbf{x}},\widehat{\varvec{\theta }}_n)/p_{d}({\textbf{X}}_i,\widehat{\varvec{\theta }}_n)]\\ & \qquad \times \left[ \sum _{k=0}^{X_{ij}}{\textbf{A}}_{ijk}{\textbf{B}}(X_{ij}-k+\alpha ,x_j-X_{ij}+\beta _{n}+1)\right. \\ & \qquad \left. \times (X_{ij}-k+\alpha )/(x_{j}+\beta _{n}-k+\alpha +1)\right] \\ & \qquad \times \left[ \prod \limits _{\begin{array}{c} m=1 \\ m \ne j \end{array}}^{d}\sum _{k=0}^{X_{im}}{\textbf{A}}_{imk}{\textbf{B}}(X_{im}-k+\alpha ,x_m-X_{im}+\beta _{n}+1)\right] \\ & \qquad \times \left\{ \sum \limits _{i=1}^{n} [p_{d}({\textbf{x}}, \widehat{\varvec{\theta }}_n)/p_{d}({\textbf{X}}_i,\widehat{\varvec{\theta }}_n)]\right. \\ & \qquad \left. \left[ \prod _{j=1}^{d}\sum _{k=0}^{X_{ij}}{\textbf{A}}_{ijk}{\textbf{B}}(X_{ij}-k+\alpha ,x_j-X_{ij}+\beta _{n}+1)\right] \right\} ^{-1}. \end{aligned}$$

Hence, ${\widehat{h}}_j(x_j)$ can be bounded to the left-hand side as follows:

$$\begin{aligned} & {\widehat{h}}_j(x_j)\ge \left( \frac{\alpha }{\beta _{n}+\alpha +x+1} \right) \\ & \qquad \times \left\{ \sum \limits _{i=1}^{n} [p_{d}({\textbf{x}},\widehat{\varvec{\theta }}_n)/p_{d}({\textbf{X}}_i,\widehat{\varvec{\theta }}_n)]\right. \\ & \qquad \left. \left[ \prod _{j=1}^{d}\sum _{k=0}^{X_{ij}}{\textbf{A}}_{ijk}{\textbf{B}}(X_{ij}-k+\alpha ,x_j-X_{ij}+\beta _{n}+1)\right] \right\} \\ & \qquad \times \left\{ \sum \limits _{i=1}^{n} [p_{d}({\textbf{x}},\widehat{\varvec{\theta }}_n)/p_{d}({\textbf{X}}_i,\widehat{\varvec{\theta }}_n)]\right. \\ & \qquad \left. \left[ \prod _{j=1}^{d}\sum _{k=0}^{X_{ij}}{\textbf{A}}_{ijk}{\textbf{B}}(X_{ij}-k+\alpha ,x_j-X_{ij}+\beta _{n}+1)\right] \right\} ^{-1}\\ & \quad \ge \left( \frac{\alpha }{\beta _{n}+\alpha +x+1} \right) . \end{aligned}$$

Since $X_{ij}\le x_{j}$, the bandwidth ${\widehat{h}}_j(x_j)$ is bounded to the right-hand side by:

$$\begin{aligned} & {\widehat{h}}_j(x_j)\le \left( \frac{x+1+\alpha }{\beta _{n}} \right) \\ & \qquad \times \left\{ \sum \limits _{i=1}^{n} [p_{d}({\textbf{x}},\widehat{\varvec{\theta }}_n)/p_{d}({\textbf{X}}_i,\widehat{\varvec{\theta }}_n)]\right. \\ & \qquad \left. \left[ \prod _{j=1}^{d}\sum _{k=0}^{X_{ij}}{\textbf{A}}_{ijk}{\textbf{B}}(X_{ij}-k+\alpha ,x_j-X_{ij}+\beta _{n}+1)\right] \right\} \\ & \qquad \times \left\{ \sum \limits _{i=1}^{n} [p_{d}({\textbf{x}},\widehat{\varvec{\theta }}_n)/p_{d}({\textbf{X}}_i,\widehat{\varvec{\theta }}_n)]\right. \\ & \qquad \left. \left[ \prod _{j=1}^{d}\sum _{k=0}^{X_{ij}}{\textbf{A}}_{ijk}{\textbf{B}}(X_{ij}-k+\alpha ,x_j-X_{ij}+\beta _{n}+1)\right] \right\} ^{-1}\\ & \quad \le \left( \frac{x+1+\alpha }{\beta _{n}}\right) . \end{aligned}$$

Then, one gets

$$\begin{aligned} \left( \frac{\alpha }{\beta _{n}+\alpha +x+1} \right) \le {\widehat{h}}_j(x_j)\le \left( \frac{x+1+\alpha }{\beta _{n}}\right) {,} \end{aligned}$$

which leads to the desired result. $\square$

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Somé, S.M., Kokonendji, C.C., Belaid, N. et al. Bayesian local bandwidths in a flexible semiparametric kernel estimation for multivariate count data with diagnostics. Stat Methods Appl 32, 843–865 (2023). https://doi.org/10.1007/s10260-023-00682-5

Download citation

Accepted: 08 January 2023
Published: 23 January 2023
Issue Date: September 2023
DOI: https://doi.org/10.1007/s10260-023-00682-5

Keywords

Mathematics Subject Classification

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Bayesian local bandwidths in a flexible semiparametric kernel estimation for multivariate count data with diagnostics

Abstract

Access this article

Similar content being viewed by others

Bayesian estimation of bandwidth in semiparametric kernel estimation of unknown probability mass and regression functions of count data

Bayesian approach to bandwidth selection for multivariate count regression function estimation by associated discrete kernel

Reduction of Potential Boundary Bias in Kernel Cumulative Distribution Estimation in Univariate and Multivariate Settings

Data availibility

Code availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Appendix

Proof of Theorem 1

Proof of Theorem 2

Proof of Proposition 1

Rights and permissions

About this article

Cite this article

Keywords

Mathematics Subject Classification

Navigation

Bayesian local bandwidths in a flexible semiparametric kernel estimation for multivariate count data with diagnostics

Abstract

Access this article

Similar content being viewed by others

Bayesian estimation of bandwidth in semiparametric kernel estimation of unknown probability mass and regression functions of count data

Bayesian approach to bandwidth selection for multivariate count regression function estimation by associated discrete kernel

Reduction of Potential Boundary Bias in Kernel Cumulative Distribution Estimation in Univariate and Multivariate Settings

Data availibility

Code availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Appendix

Appendix

Proof of Theorem 1

Proof of Theorem 2

Proof of Proposition 1

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Mathematics Subject Classification

Search

Navigation