Nonparametric estimation of component distributions in a multivariate mixture
成果类型:
Article
署名作者:
Hall, P; Zhou, XH
署名单位:
Australian National University; US Department of Veterans Affairs; Veterans Health Administration (VHA); Vet Affairs Puget Sound Health Care System; University of Washington; University of Washington Seattle
刊物名称:
ANNALS OF STATISTICS
ISSN/ISSBN:
0090-5364
发表日期:
2003
页码:
201-224
关键词:
latent class analysis
diagnostic-tests
maximum-likelihood
proportions
models
geometry
摘要:
Suppose k-variate data are drawn from a mixture of two distributions, each having independent components. It is desired to estimate the univariate marginal distributions in each of the products, as well as the mixing proportion. This is the setting of two-class, fully parametrized latent models that has been proposed for estimating the distributions of medical test results when disease status is unavailable. The problem is one of inference in a mixture of distributions without training data, and until now it has been tackled only in a fully parametric setting. We investigate the possibility of using nonparametric methods. Of course, when k = 1 the problem is not identifiable from a nonparametric viewpoint. We show that the problem is almost identifiable when k = 2; there, the set of all possible representations can be expressed, in terms of any one of those representations, as a two-parameter family. Furthermore, it is proved that when k greater than or equal to 3 the problem is nonparametrically identifiable under particularly mild regularity conditions. In this case we introduce root-n consistent nonparametric estimators of the 2k univariate marginal distributions and the mixing proportion. Finite-sample and asymptotic properties of the estimators are described.