On Testing for the Number of Components in a Mixed Poisson Model

Karlis, Dimitris; Xekalaki, Evdokia

doi:10.1023/A:1003839420071

Published: March 1999

Volume 51, pages 149–162 (1999)
Annals of the Institute of Statistical Mathematics

Abstract

Poisson mixtures are commonly used to describe overdispersed data. Finite Poisson mixtures arise in many practical situations, where it is often of interest to determine the number of components in the mixture, and doing so remains a difficult problem. The likelihood ratio test (LRT) is a general statistical procedure for this purpose. Unfortunately, a number of specific problems arise and the classical theory fails to hold. In this paper a new procedure is proposed, based on testing whether a new component can be added to a finite Poisson mixture; applied repeatedly, this test eventually leads to the number of components in the mixture. It is a sequential testing procedure based on the well-known LRT that utilises a resampling technique to construct the distribution of the test statistic. The application of the procedure to real data reveals some interesting features of the distribution of the test statistic.
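
The sequential idea described in the abstract can be illustrated with a small sketch. The code below is not the authors' implementation; it is a generic parametric-bootstrap LRT under stated assumptions: a single-start EM fit (which may stop at a local maximum), a simple Poisson sampler suited to moderate rates, and a stopping rule that accepts k components the first time k versus k + 1 is not rejected. All function names (`fit_mixture`, `bootstrap_pvalue`, `select_components`) and defaults (`alpha`, `B`, `k_max`) are hypothetical choices made for illustration.

```python
import math
import random
from collections import Counter

def poisson_draw(lam, rng):
    """Draw one Poisson(lam) variate by Knuth's method (fine for moderate lam)."""
    limit, k, p = math.exp(-lam), 0, rng.random()
    while p > limit:
        k += 1
        p *= rng.random()
    return k

def _log_terms(x, weights, rates):
    """Log of weight_j * Poisson(x; rate_j) for every component j."""
    return [math.log(w) + x * math.log(r) - r - math.lgamma(x + 1)
            for w, r in zip(weights, rates)]

def loglik(counts, weights, rates):
    """Mixture log-likelihood for data tabulated as {value: frequency}."""
    total = 0.0
    for x, c in counts.items():
        terms = _log_terms(x, weights, rates)
        m = max(terms)
        total += c * (m + math.log(sum(math.exp(t - m) for t in terms)))
    return total

def fit_mixture(data, k, iters=100):
    """Single-start EM for a k-component Poisson mixture (assumed starting values)."""
    counts, n = Counter(data), len(data)
    mean = max(sum(data) / n, 1e-6)
    weights = [1.0 / k] * k
    rates = [mean * 2 * (j + 1) / (k + 1) for j in range(k)]  # spread around the mean
    for _ in range(iters):
        nj, sx = [0.0] * k, [0.0] * k
        for x, c in counts.items():
            terms = _log_terms(x, weights, rates)
            m = max(terms)
            post = [math.exp(t - m) for t in terms]  # unnormalised posteriors
            s = sum(post)
            for j in range(k):
                nj[j] += c * post[j] / s
                sx[j] += c * x * post[j] / s
        # Small floors keep the logarithms finite if a component empties out.
        weights = [max(v / n, 1e-12) for v in nj]
        rates = [max(sx[j] / max(nj[j], 1e-12), 1e-12) for j in range(k)]
    return weights, rates, loglik(counts, weights, rates)

def lrt_statistic(data, k):
    """LRT statistic for k versus k + 1 components, plus the fitted k-component model."""
    w, r, ll_k = fit_mixture(data, k)
    ll_k1 = fit_mixture(data, k + 1)[2]
    return max(0.0, 2.0 * (ll_k1 - ll_k)), w, r

def bootstrap_pvalue(data, k, B, rng):
    """Resample from the fitted k-component model to approximate the null distribution."""
    observed, w, r = lrt_statistic(data, k)
    exceed = 0
    for _ in range(B):
        sample = [poisson_draw(rng.choices(r, weights=w)[0], rng) for _ in data]
        if lrt_statistic(sample, k)[0] >= observed:
            exceed += 1
    return observed, (1 + exceed) / (B + 1)

def select_components(data, alpha=0.05, B=99, k_max=5, seed=0):
    """Add components one at a time until the bootstrap LRT no longer rejects."""
    rng, k = random.Random(seed), 1
    while k < k_max:
        _, p = bootstrap_pvalue(data, k, B, rng)
        if p >= alpha:
            break
        k += 1
    return k
```

A call such as `select_components(counts_list, B=199)` returns the selected number of components for a list of non-negative integer counts. The smallest attainable p-value is 1 / (B + 1), so `B` has to be large enough for rejection at level `alpha` to be possible. In practice several EM starting values would be advisable, since a single start can understate the likelihood of the larger model.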

Author information

Authors and Affiliations

Department of Statistics, Athens University of Economics and Business, 76 Patission St., 10434, Athens, Greece
Dimitris Karlis & Evdokia Xekalaki

About this article

Cite this article

Karlis, D., Xekalaki, E. On Testing for the Number of Components in a Mixed Poisson Model. Annals of the Institute of Statistical Mathematics 51, 149–162 (1999). https://doi.org/10.1023/A:1003839420071

Issue Date: March 1999
DOI: https://doi.org/10.1023/A:1003839420071
