Sequential Monte Carlo methods for statistical analysis of tables

成果类型:
Article
署名作者:
Chen, YG; Diaconis, P; Holmes, SR; Liu, JS
署名单位:
Duke University; Stanford University; Harvard University; Harvard University
刊物名称:
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION
ISSN/ISSBN:
0162-1459
DOI:
10.1198/016214504000001303
发表日期:
2005
页码:
109-120
关键词:
species cooccurrences contingency-tables fixed margins tests enumeration matrices number
摘要:
We describe a sequential importance sampling (SIS) procedure for analyzing two-way zero-one or contingency tables with fixed marginal sums. An essential feature of the new method is that it samples the columns of the table progressively according to certain special distributions. Our method produces Monte Carlo samples that are remarkably close to the uniform distribution, enabling one to approximate closely the null distributions of various test statistics about these tables. Our method compares favorably with other existing Monte Carlo-based algorithms, and sometimes is a few orders of magnitude more efficient. In particular, compared with Markov chain Monte Carlo (MCMC)-based approaches, our importance sampling method not only is more efficient in terms of absolute running time and frees one from pondering over the mixing issue, but also provides an easy and accurate estimate of the total number of tables with fixed marginal sums, which is far more difficult for an MCMC method to achieve.