A MULTINOMIAL BAYESIAN-APPROACH TO THE ESTIMATION OF POPULATION AND VOCABULARY SIZE

成果类型:
Article
署名作者:
BOENDER, CGE; KAN, AHGR
刊物名称:
BIOMETRIKA
ISSN/ISSBN:
0006-3444
发表日期:
1987
页码:
849856
关键词:
摘要:
We approach estimation of the size of a population or a vocabulary through a Bayesian analysis of the multinomial distribution. We view the sample as being generated from such a distribution with an unknown number of cells and unknown cell probabilities, and develop a Bayesian procedure to estimate the number of cells and the coverage of the sample. The prior distribution of the number of cells is arbitrary. Given that number, the cell probabilities are assumed to follow a symmetric Dirichlet prior. A two-stage approach is developed for use when the flattening constant of the latter prior cannot be specified in advance. Our procedures are applied to samples of butterflies, insect species and alleles, to the works of Shakespeare and Joyce, and to Eldridge''s sample of English words.