HYPOTHESIS TEST FOR NORMAL MIXTURE MODELS: THE EM APPROACH
成果类型:
Article
署名作者:
Chen, Jiahua; Li, Pengfei
署名单位:
University of British Columbia; University of Alberta
刊物名称:
ANNALS OF STATISTICS
ISSN/ISSBN:
0090-5364
DOI:
10.1214/08-AOS651
发表日期:
2009
页码:
2523-2542
关键词:
likelihood ratio test
structural parameter
variable selection
homogeneity
gene
asymptotics
population
components
number
摘要:
Normal mixture distributions are arguably the most important mixture models. and also the most technically challenging. The likelihood function of the normal mixture model is unbounded based oil a set of random samples, unless an artificial bound is placed oil its component variance parameter. Moreover, the model is not strongly identifiable so it is hard to differentiate between over dispersion caused by the presence of a mixture and that caused by a large variance, and it has infinite Fisher information with respect to mixing proportions. There has been extensive research oil finite normal mixture models, but much of it addresses merely consistency of the point estimation or useful practical procedures, and many, results require undesirable restrictions oil the parameter space. We show that an EM-test for homogeneity is effective at overcoming many challenges in the context of finite normal mixtures. We find that the limiting, distribution of the EM-test is a simple function of the 0.5 chi(2)(0) + 0.5 chi(1)(2) and chi(2)(1) distributions when the mixing variances are equal but unknown and the chi(2)(2) when variances are unequal and unknown. Simulations unknown and the show that the limiting distributions approximate the finite sample distribution satisfactorily. Two genetic examples are used to illustrate the application of the EM-test.