Estimating the number of zero-one multi-way tables via sequential importance sampling
- 116 Downloads
In 2005, Chen et al. introduced a sequential importance sampling (SIS) procedure to analyze zero-one two-way tables with given fixed marginal sums (row and column sums) via the conditional Poisson (CP) distribution. They showed that compared with Monte Carlo Markov chain (MCMC)-based approaches, their importance sampling method is more efficient in terms of running time and also provides an easy and accurate estimate of the total number of contingency tables with fixed marginal sums. In this paper, we extend their result to zero-one multi-way (\(d\)-way, \(d \ge 2\)) contingency tables under the no \(d\)-way interaction model, i.e., with fixed \(d-1\) marginal sums. Also, we show by simulations that the SIS procedure with CP distribution to estimate the number of zero-one three-way tables under the no three-way interaction model given marginal sums works very well even with some rejections. We also applied our method to Samson’s monks data set.
KeywordsCategorical data analysis Conditional Poisson Counting problem No three-way interaction
The authors would like to thank Drs. Stephen Fienberg and Yuguo Chen for useful conversations.
- Blitzstein, J., Diaconis, P. (2010). A sequential importance sampling algorithm for generating random graphs with prescribed degrees. Internet Mathematics, 6(4), 489–522.Google Scholar
- Breiger, R., Boorman, S., Arabie, P. (1975). An algorithm for clustering relational data with applications to social network analysis and comparison with multidimensional scaling. Journal of Mathematical Psychology, 12, 328–383.Google Scholar
- Chen, Y., Diaconis, P., Holmes, S., Liu, J. S. (2005). Sequential monte carlo methods for statistical analysis of tables. Journal of the American Statistical Association, 100, 109–120.Google Scholar
- Chen, Y., Dinwoodie, I., Sullivant, S. (2006). Sequential importance sampling for multiway tables. The Annals of Statistics, 34(1), 523–545.Google Scholar
- De Loera, J., Haws, D., Hemmecke, R., Huggins, P., Tauzer, J., Yoshida, R. (2005). LattE, version 1.2. http://www.math.ucdavis.edu/~latte/.
- De Loera, J., Onn, S. (2006). All linear and integer programs are slim 3-way transportation programs. SIAM Journal on Optimization, 17, 806–821.Google Scholar
- Dinwoodie, I. H. (2008). Polynomials for classification trees and applications. Statistical and Applied Mathematical Sciences Institute Technical, Report 2008-7.Google Scholar
- Dinwoodie, I. H., Chen, Y. (2011). Sampling large tables with constraints. Statistica Sinica, 21, 1591–1609.Google Scholar
- Garey, M. R., Johnson, D. S. (1979). Computers and intractabihty, a guide to the theory of NP-completeness. San Francisco: Freeman & Co.Google Scholar
- R-Project-Team. (2011). R project. GNU software. http://www.r-project.org/.
- Sampson, S. (1969). Crisis in a cloister. Doctoral dissertation (unpublished).Google Scholar