Exact test of goodness of fit for binomial distribution

Li, Benchong; Fu, Liya

doi:10.1007/s00362-016-0793-4

Exact test of goodness of fit for binomial distribution

Regular Article
Published: 07 July 2016

Volume 59, pages 851–860, (2018)
Cite this article

Statistical Papers Aims and scope Submit manuscript

Benchong Li¹ &
Liya Fu²

619 Accesses
3 Citations
Explore all metrics

Abstract

In this paper, we consider an exact test of goodness of fit for binomial distribution in sparse data situation. A conventional way is viewing this problem as an independence test problem of a two-way contingency table. We propose an approach to promote the efficiency of the Diaconis–Sturmfels (DS) algorithm when n (sample size) is much larger than m [the first parameter of a binomial distribution B(m, p)] through representing the data and then utilizing minimal Markov bases of the corresponding multinomial model. Simulation results and real data analysis indicate that our method makes the DS algorithm computationally faster.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Selection of statistics for a multinomial goodness-of-fit test and a test of independence for a multi-way contingency table when data are sparse

Article 03 January 2024

On testing the hidden heterogeneity in negative binomial regression models

Article 22 September 2018

Two-sample test for sparse high-dimensional multinomial distributions

Article 21 July 2018

References

Agresti A (2001) Exact inference for categorical data: recent advances and continuing controversies. Statist Med 20:2709–2722
Article Google Scholar
Aigner M, Ziegler GM (1998) Proofs from THE BOOK. Springer, Berlin, pp 141–146
Book MATH Google Scholar
Aoki S, Hara H, Takemura A (2012) Markov bases in algebraic statistics. Springer, New York
Book MATH Google Scholar
Baglivo J, Oliver D, Pagano M (1988) Methods for the analysis of contingency tables with large and small cell counts. J Am Stat Assoc 83:1006–1013
Article MathSciNet Google Scholar
Best DJ, Rayner JCW (1997) Goodness of fit for the binomial distribution. Austril J Statist 39(3):355–364
Article MathSciNet MATH Google Scholar
Best DJ, Rayner JCW (2006) Improved testing for the binomial distribution using chi-squared components with data-dependent cells. J Stat Comput Simul 76(1):75–81
Article MathSciNet MATH Google Scholar
Briales E, Campillo A, Marijuán C, Pisón P (1998) Minimal systems of generators for ideals of semigroups. J Pure Appl Alge 124:7–30
Article MathSciNet MATH Google Scholar
Cheon S, Liang F, Chen Y, Yu K (2014) Stochastic approximation Monte Carlo important sampling for approximating exact conditional probabilities. Stat Comput 24:505–520
Article MathSciNet MATH Google Scholar
Dalenius P, Reiss RS (1982) Data-swapping: a technique for disclosure control. J Stat Plan Inference 6:73–85
Article MathSciNet MATH Google Scholar
De Loera JA, Onn S (2006) Markov basis of three-way tables are arbitrarily complecated. J Symb Comput 41:173–181
Article MATH Google Scholar
Diaconis P, Sturmfels B (1998) Algebraic algorithms for sampling from conditional distributions. Ann Stat 26:363–397
Article MATH Google Scholar
Drton M, Sturmfels B, Sullivant S (2009) In: Lectures on algebraic statistics. Oberwolfach seminars, , vol 39. Birkh\(\ddot{\text{a}}\)user Verlag, Basel
Haberman SJ (1988) A warning on the use of chi-squared statistics with frequency tables with small expected cell counts. J Am Stat Assoc 83:555–560
Article MathSciNet MATH Google Scholar
Hara H, Takemura A, Yoshida R (2010) On connectivity of fibers with positive marginals in multiple logistic regression. J Multivar Anal 101:909–925
Article MathSciNet MATH Google Scholar
Klein M, Linton P (2013) On a comparison of tests of homogeneity of binomial proportions. J Stat Theo Appl 12(3):208–224
MathSciNet Google Scholar
Lindsay JK (1995) Modelling frequency and count data. Oxford University Press, Oxford
Google Scholar
Mehta CR, Patel NR (1983) A network algorithm for performing Fisher’s exact test in \(r\times c\) contingency tables. J Am Stat Asso 78:427–434
MATH Google Scholar
Mehta CR, Patel NR, Senchaudhuri P (1988) Importance sampling for estimating exact probabilities in permutational inference. J Am Stat Asso 83:999–1005
Article MathSciNet Google Scholar
Park S, Lim J (2015) On censored cumulative residual Kullback-Leibler information and goodness-of-fit test with type II censored data. Stat Pap 56:247–256
Article MathSciNet MATH Google Scholar
Patefield WM (1981) Algorithm AS 159: an efficient method of generating random \(R\times C\) tables with given row and column totals. J R Stat Soc C 30:91–97
MATH Google Scholar
Quinino EC, Ho LL, Suyama E (2013) Alternative estimator for the parameters of a mixture of two binomial distributions. Stat Pap 54:47–69
Article MathSciNet MATH Google Scholar
Takemura A, Aoki S (2004) Some characterizations of minimal Markov basis for sampling from discrete conditional distributions. Ann Inst Stat Math 56(1):1–17
Article MathSciNet MATH Google Scholar
Takken A (1999) Monte Carlo goodness-of-fit for discrete data. Ph.D. Dissertation, Department of Statistics, Stanford University
Zardasht V, Parsi S, Mousazadeh M (2015) On empirical cumulative residual entropy and a goodness-of-fit test for exponentiality. Stat Papers 56:677–688
Article MathSciNet MATH Google Scholar

Download references

Acknowledgments

The authors wish to thank the Editor, an Associate Editor, and two anonymous referees for their constructive comments on early versions of this work that lead to substantial improvements in the article. This research was supported by the National Natural Science Foundation of China (Grants 11201365, 11301408) and the Fundamental Research Funds for the Central Universities (Grants K5051370016, JB140701).

Author information

Authors and Affiliations

School of Mathematics and Statistics, Xidian University, Xi’an, 710126, People’s Republic of China
Benchong Li
School of Mathematics and Statistics, Xi’an Jiaotong University, Xi’an, China
Liya Fu

Authors

Benchong Li
View author publications
You can also search for this author in PubMed Google Scholar
Liya Fu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Benchong Li.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Li, B., Fu, L. Exact test of goodness of fit for binomial distribution. Stat Papers 59, 851–860 (2018). https://doi.org/10.1007/s00362-016-0793-4

Download citation

Received: 02 February 2015
Revised: 26 February 2016
Published: 07 July 2016
Issue Date: September 2018
DOI: https://doi.org/10.1007/s00362-016-0793-4

Keywords

Mathematics Subject Classification

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Exact test of goodness of fit for binomial distribution

Abstract

Access this article

Similar content being viewed by others

Selection of statistics for a multinomial goodness-of-fit test and a test of independence for a multi-way contingency table when data are sparse

On testing the hidden heterogeneity in negative binomial regression models

Two-sample test for sparse high-dimensional multinomial distributions

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Mathematics Subject Classification

Navigation

Exact test of goodness of fit for binomial distribution

Abstract

Access this article

Similar content being viewed by others

Selection of statistics for a multinomial goodness-of-fit test and a test of independence for a multi-way contingency table when data are sparse

On testing the hidden heterogeneity in negative binomial regression models

Two-sample test for sparse high-dimensional multinomial distributions

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Mathematics Subject Classification

Search

Navigation