Appendix
Proof of Theorem 1
For \(y_{i} \sim NB(\mu _{i}, \alpha )\), the following basic results are repeatedly used:
$$\begin{aligned} E(y_{i} - \mu _{i})^2= & {} \mu _{i}(1 + \alpha \mu _{i}) \\ E(y_{i} - \mu _{i})^3= & {} \mu _{i}(1 + \alpha \mu _{i})(1 + 2\alpha \mu _{i}) \\ E(y_{i} - \mu _{i})^4= & {} \mu _{i}(1 + \alpha \mu _{i})(1 + 3\mu _{i} + 6\alpha \mu _{i} + 3\alpha \mu _{i}^{2} + 6\alpha ^2 \mu _{i}^2) \end{aligned}$$
From these expectations, we have
$$\begin{aligned} E \left( \frac{\partial \ell _{i}}{\partial \sigma _{v}^{2}} \right) ^{2}= & {} E \left\{ \frac{1}{2} \cdot \frac{(y_{i} - \mu _{i})^{2} - (\mu _{i} + \alpha \mu _{i} y_{i})}{(1 + \alpha \mu _{i})^{2}} \right\} ^{2} \\= & {} \frac{1}{4} \cdot \frac{E(y_{i} - \mu _{i})^{4} - 2 \alpha \mu _{i} E(y_{i} - \mu _{i})^{3} + [\alpha ^{2} \mu _{i}^2 - 2(\mu _{i} + \alpha \mu _{i}^{2})] E(y_{i} - \mu _{i})^{2} + (\mu _{i} + \alpha \mu _{i}^{2})^{2}}{(1 + \alpha \mu _{i})^{4}} \\= & {} \frac{1}{4} \cdot \frac{(1 + 2 \mu _{i} + 3 \alpha \mu _{i}) \mu _{i}}{(1 + \alpha \mu _{i})^{2}}, \\ E \left( \frac{\partial \ell _{i}}{\partial \sigma _{v}^{2}} \frac{\partial \ell _{i}}{\partial \beta ^{T}} \right)= & {} E \left\{ \frac{1}{2} \cdot \frac{(y_{i} - \mu _{i})^{2} - (\mu _{i} + \alpha \mu _{i} y_{i})}{(1 + \alpha \mu _{i})^{2}} \right\} \left\{ \frac{y_{i} - \mu _{i}}{1 + \alpha \mu _{i}} x_{i}^{T} \right\} \\= & {} \frac{1}{2} \cdot \frac{E(y_{i} - \mu _{i})^{3} - \mu _{i}E(y_{i} - \mu _{i}) - \alpha \mu _{i} (E(y_{i}^2) - \mu _{i}^2)}{(1 + \alpha \mu _{i})^{3}} x_{i}^{T} \\= & {} \frac{1}{2} \cdot \frac{\mu _{i}}{(1 + \alpha \mu _{i})} x_{i}^{T}, \\ E \left( \frac{\partial \ell _{i}}{\partial \sigma _{v}^{2}} \frac{\partial \ell _{i}}{\partial \alpha } \right)= & {} E \left( - \frac{\partial ^{2} \ell _{i}}{\partial \sigma _{v}^{2} \partial \alpha } \right) = E \left( \frac{\partial }{\partial \alpha } \left\{ - \frac{1}{2} \cdot \frac{(y_{i} - \mu _{i})^{2} - (\mu _{i} + \alpha \mu _{i} y_{i})}{(1 + \alpha \mu _{i})^{2}} \right\} \right) \\= & {} \frac{1}{2} \cdot \frac{\mu _{i} (1 + \alpha \mu _{i})^2 E(y_{i}) - [E(y_{i} - \mu _{i})^2 - \mu _{i} -\alpha \mu _{i} E(y_{i})] \cdot 2 (1 + \alpha \mu _{i}) \mu _{i}}{(1 + \alpha \mu _{i})^{4}} \\= & {} \frac{1}{2} \cdot \frac{\mu _{i}^2}{(1 + \alpha \mu _{i})^2}, \\ E \left( \frac{\partial \ell _{i}}{\partial \beta } \frac{\partial \ell _{i}}{\partial \beta ^{T}} \right)= & {} E \left\{ \frac{y_{i} - \mu _{i}}{1 
+ \alpha \mu _{i}} x_{i} \right\} \left\{ \frac{y_{i} - \mu _{i}}{1 + \alpha \mu _{i}} x_{i}^{T} \right\} = \frac{E(y_{i} - \mu _{i})^2}{(1 + \alpha \mu _{i})^2} x_{i} x_{i}^{T} \\= & {} \frac{\mu _{i}}{(1 + \alpha \mu _{i})} x_{i} x_{i}^{T}, \\ E \left( \frac{\partial \ell _{i}}{\partial \beta } \frac{\partial \ell _{i}}{\partial \alpha } \right)= & {} E \left( - \frac{\partial ^{2} \ell _{i}}{\partial \beta \partial \alpha } \right) = E \left( \frac{\partial }{\partial \alpha } \left\{ - \frac{y_{i} - \mu _{i}}{1 + \alpha \mu _{i}} x_{i}^{T} \right\} \right) \\= & {} - \frac{E( y_{i} - \mu _{i}) \mu _{i}}{(1 + \alpha \mu _{i})^{2}} x_{i}^{T} = 0. \end{aligned}$$
Following Lawless (1987), we have
$$\begin{aligned} E \left( \frac{\partial \ell _{i}}{\partial \alpha } \right) ^{2}= & {} E \left( - \frac{\partial ^{2} \ell _{i}}{\partial \alpha ^{2}} \right) = \alpha ^{-4} \left\{ E \left( \sum _{j=0}^{y_{i}-1}(\alpha ^{-1} + j)^{-2} \right) - \frac{\alpha \mu _{i}}{\mu _{i} + \alpha ^{-1}} \right\} \\= & {} \alpha ^{-4} \left( \sum _{j=0}^{\infty }(\alpha ^{-1} + j)^{-2} \Pr (y_{i} \ge j+1) - \frac{\alpha \mu _{i}}{\mu _{i} + \alpha ^{-1}} \right) . \end{aligned}$$
Under the regularity conditions for the central limit theorem applied to the score components and standard maximum likelihood theory (see Chesher 1984), asymptotic normality follows for \(T_A\). \(\square \)
Proof of Theorem 2
Using the basic results established in the proof of Theorem 1, we have
$$\begin{aligned} E \left( \frac{\partial \ell _{i}}{\partial \sigma _{v}^{2}} \right) ^{2}= & {} E \left\{ \frac{1}{2} \cdot \frac{(1 + \alpha )(y_{i} - \mu _{i})^{2} - \alpha y_{i}^{2} - y_{i}}{(1 + \alpha \mu _{i})^{2}} \right\} ^{2} \\= & {} \frac{1}{4} \cdot \frac{E(y_{i} - \mu _{i})^{4} - 2 (1 + 2 \alpha \mu _{i}) E(y_{i} - \mu _{i})^{3} + [(1 + 2 \alpha \mu _{i})^2 - 2(\mu _{i} + \alpha \mu _{i}^{2})] E(y_{i} - \mu _{i})^{2} + (\mu _{i} + \alpha \mu _{i}^{2})^{2}}{(1 + \alpha \mu _{i})^{4}} \\= & {} \frac{1}{4} \cdot \frac{2 \mu _{i}^{2} (1 + \alpha )}{(1 + \alpha \mu _{i})^{2}}, \\ E \left( \frac{\partial \ell _{i}}{\partial \sigma _{v}^{2}} \frac{\partial \ell _{i}}{\partial \beta ^{T}} \right)= & {} E \left\{ \frac{1}{2} \cdot \frac{(1 + \alpha )(y_{i} - \mu _{i})^{2} - \alpha y_{i}^{2} - y_{i}}{(1 + \alpha \mu _{i})^{2}} \right\} \left\{ \frac{y_{i} - \mu _{i}}{1 + \alpha \mu _{i}} x_{i}^{T} \right\} \\= & {} \frac{1}{2} \cdot \frac{(1 + \alpha ) E(y_{i} - \mu _{i})^{3} - \alpha E(y_{i}^{3}) + \alpha \mu _{i} E(y_{i}^{2}) - E(y_{i}^{2}) + \mu _{i} E(y_{i})}{(1 + \alpha \mu _{i})^{3}} x_{i}^{T} \\= & {} 0, \\ E \left( \frac{\partial \ell _{i}}{\partial \sigma _{v}^{2}} \frac{\partial \ell _{i}}{\partial \alpha } \right)= & {} E \left( - \frac{\partial ^{2} \ell _{i}}{\partial \sigma _{v}^{2} \partial \alpha } \right) = E \left( \frac{\partial }{\partial \alpha } \left\{ - \frac{1}{2} \cdot \frac{(1 + \alpha )(y_{i} - \mu _{i})^{2} - \alpha y_{i}^{2} - y_{i}}{(1 + \alpha \mu _{i})^{2}} \right\} \right) \\= & {} - \frac{1}{2} \cdot \frac{[E(y_{i} - \mu _{i})^2 - E(y_{i}^2)](1 + \alpha \mu _{i})^{2} - [(1 + \alpha ) E(y_{i} - \mu _{i})^{2} - \alpha E(y_{i}^{2}) - E(y_{i})] \cdot 2 \mu _{i} (1 + \alpha \mu _{i})}{(1 + \alpha \mu _{i})^{4}} \\= & {} \frac{1}{2} \cdot \frac{\mu _{i}^2}{(1 + \alpha \mu _{i})^2}, \\ E \left( \frac{\partial \ell _{i}}{\partial \beta } \frac{\partial \ell _{i}}{\partial \beta ^{T}} \right)= & {} E \left\{ \frac{y_{i} - \mu _{i}}{1 + \alpha 
\mu _{i}} x_{i} \right\} \left\{ \frac{y_{i} - \mu _{i}}{1 + \alpha \mu _{i}} x_{i}^{T} \right\} = \frac{E(y_{i} - \mu _{i})^2}{(1 + \alpha \mu _{i})^2} x_{i} x_{i}^{T} \\= & {} \frac{\mu _{i}}{(1 + \alpha \mu _{i})} x_{i} x_{i}^{T}, \\ E \left( \frac{\partial \ell _{i}}{\partial \beta } \frac{\partial \ell _{i}}{\partial \alpha } \right)= & {} E \left( - \frac{\partial ^{2} \ell _{i}}{\partial \beta \partial \alpha } \right) = E \left( \frac{\partial }{\partial \alpha } \left\{ - \frac{y_{i} - \mu _{i}}{1 + \alpha \mu _{i}} x_{i}^{T} \right\} \right) \\= & {} - \frac{E( y_{i} - \mu _{i}) \mu _{i}}{(1 + \alpha \mu _{i})^{2}} x_{i}^{T} = 0.\\ E \left( \frac{\partial \ell _{i}}{\partial \alpha } \right) ^{2}= & {} E \left( - \frac{\partial ^{2} \ell _{i}}{\partial \alpha ^{2}} \right) = \alpha ^{-4} \left( \sum _{j=0}^{\infty }(\alpha ^{-1} + j)^{-2} {\Pr (y_{i} \ge j + 1)} - \frac{\alpha \mu _{i}}{\mu _{i} + \alpha ^{-1}} \right) . \end{aligned}$$
As in the proof of Theorem 1, asymptotic normality follows for \(T\) under the same regularity conditions. \(\square \)
Proof of Theorem 3
Consider first the numerators of the two score statistics:
$$\begin{aligned}&S(\beta ,\alpha ) = \dfrac{1}{2} \displaystyle \sum \limits _{i} \frac{(1 + \alpha )(y_{i} - \mu _{i})^{2} - \alpha y_{i}^2 - y_{i}}{(1 + \alpha \mu _{i})^{2}}, \quad S_{A}(\beta ,\alpha ) = \dfrac{1}{2} \displaystyle \sum \limits _{i} \frac{(y_{i} - \mu _{i})^{2} - (\mu _{i} + \alpha \mu _{i} y_{i})}{(1 + \alpha \mu _{i})^{2}} \end{aligned}$$
Their difference becomes:
$$\begin{aligned} S(\beta ,\alpha ) - S_{A}(\beta ,\alpha )= & {} \frac{1}{2} \sum _{i} \frac{\alpha (y_{i} - \mu _{i})^{2} - \alpha y_{i}^2 - y_{i} + \mu _{i} + \alpha \mu _{i} y_{i}}{(1 + \alpha \mu _{i})^{2}} \\= & {} - \frac{1}{2} \sum _{i} \frac{(y_{i} - \mu _{i}) (1 + \alpha \mu _{i}) }{(1 + \alpha \mu _{i})^{2}} = - \frac{1}{2} \sum _{i} \frac{(y_{i} - \mu _{i}) }{(1 + \alpha \mu _{i})}. \end{aligned}$$
Suppose that the NB regression model includes the intercept. Then, the MLE for \(\beta _{0}\) satisfies
$$\begin{aligned} \sum _{i} \frac{\partial \ell _{i}}{\partial \beta _{0}}= & {} \sum _{i} \frac{(y_{i} - \widehat{\mu }_{i})}{(1 + \alpha \widehat{\mu }_{i})} = 0, \end{aligned}$$
which implies \(S(\widehat{\beta },\widehat{\alpha })=S_{A}(\widehat{\beta },\widehat{\alpha })\).
Now consider the denominator parts of the score statistics. Note that
$$\begin{aligned} V= & {} \frac{1}{4} \sum _{i} \frac{2 \widehat{\mu }_{i}^2 (1 + \widehat{\alpha })}{(1 + \widehat{\alpha } \widehat{\mu }_{i})^2} - \begin{pmatrix} 0&\frac{1}{2} \sum _{i} \frac{\widehat{\mu }_{i}^2}{(1 + \widehat{\alpha } \widehat{\mu }_{i})^{2}} \end{pmatrix} \begin{pmatrix} X^{T} W X&{} 0 \\ 0 &{} i(\widehat{\beta }, \widehat{\alpha }) \\ \end{pmatrix}^{-1}\\&\begin{pmatrix} 0 \\ \frac{1}{2} \sum _{i} \frac{\widehat{\mu }_{i}^2}{(1 + \widehat{\alpha } \widehat{\mu }_{i})^{2}} \end{pmatrix} \text {and} \\ V_{A}= & {} \frac{1}{4} \sum _{i} \frac{(1 + 2\widehat{\mu }_{i} + 3\widehat{\alpha }\widehat{\mu }_{i}) \widehat{\mu }_{i}}{(1 + \widehat{\alpha } \widehat{\mu }_{i})^2} - \begin{pmatrix} \frac{1}{2} \sum _{i} \frac{\widehat{\mu }_{i}}{1 + \widehat{\alpha } \widehat{\mu }_{i}} x^{T}_{i}&\frac{1}{2} \sum _{i} \frac{\widehat{\mu }_{i}^2}{(1 + \widehat{\alpha } \widehat{\mu }_{i})^{2}} \end{pmatrix}\\&\begin{pmatrix} X^{T} W X &{} 0 \\ 0 &{} i(\widehat{\beta }, \widehat{\alpha }) \\ \end{pmatrix}^{-1} \begin{pmatrix} \frac{1}{2} \sum _{i} \frac{\widehat{\mu }_{i}}{1 + \widehat{\alpha } \widehat{\mu }_{i}} x_{i} \\ \frac{1}{2} \sum _{i} \frac{\widehat{\mu }_{i}^2}{(1 + \widehat{\alpha } \widehat{\mu }_{i})^{2}} \end{pmatrix}. \end{aligned}$$
By simplifying the second terms in \(V\) and \(V_{A}\), we have
$$\begin{aligned} V= & {} \frac{1}{4} \sum _{i} \frac{2 \widehat{\mu }_{i}^2 (1 + \widehat{\alpha })}{(1 + \widehat{\alpha } \widehat{\mu }_{i})^2} - \frac{1}{4} \left( \sum _{i} \frac{\widehat{\mu }_{i}^2}{(1 + \widehat{\alpha } \widehat{\mu }_{i})^2} \right) ^{2} i(\widehat{\beta }, \widehat{\alpha })^{-1} \end{aligned}$$
and
$$\begin{aligned} V_{A}= & {} \frac{1}{4} \sum _{i} \frac{(1 + 2\widehat{\mu }_{i} + 3\widehat{\alpha }\widehat{\mu }_{i}) \widehat{\mu }_{i}}{(1 + \widehat{\alpha } \widehat{\mu }_{i})^2} - \frac{1}{4} \left( \sum _{i} \frac{\widehat{\mu }_{i}}{1 + \widehat{\alpha } \widehat{\mu }_{i}} x^{T}_{i} \right) \left( X^{T} W X \right) ^{-1} \\&\left( \sum _{i} \frac{\widehat{\mu }_{i}}{1 + \widehat{\alpha } \widehat{\mu }_{i}} x_{i} \right) - \frac{1}{4} \left( \sum _{i} \frac{\widehat{\mu }_{i}^2}{(1 + \widehat{\alpha } \widehat{\mu }_{i})^2} \right) ^{2} i(\widehat{\beta }, \widehat{\alpha })^{-1} \end{aligned}$$
Since
$$\begin{aligned}&\left( \sum _{i} \frac{\widehat{\mu }_{i}}{1 + \widehat{\alpha } \widehat{\mu }_{i}} x^{T}_{i} \right) \left( X^{T} W X \right) ^{-1} \left( \sum _{i} \frac{\widehat{\mu }_{i}}{1 + \widehat{\alpha } \widehat{\mu }_{i}} x_{i} \right) =1^{T}WX(X^{T}WX)^{-1}X^{T}W1\\&\quad =(X^{T}WX(X^{T}WX)^{-1}X^{T}WX)_{11}=(X^{T}WX)_{11}\\&\quad =1^{T}W1=\sum _{i} \frac{\widehat{\mu }_{i}}{(1 + \widehat{\alpha } \widehat{\mu }_{i})}, \end{aligned}$$
we have \(V=V_{A}\). Consequently, the two score statistics are equivalent. \(\square \)