ESTIMATION OF THE NUMBER OF COMPONENTS OF NONPARAMETRIC MULTIVARIATE FINITE MIXTURE MODELS
成果类型:
Article
署名作者:
Kwon, Caleb; Mbakop, Eric
署名单位:
Harvard University; University of Calgary
刊物名称:
ANNALS OF STATISTICS
ISSN/ISSBN:
0090-5364
DOI:
10.1214/20-AOS2032
发表日期:
2021
页码:
2178-2205
关键词:
latent structure models
incomplete information
identification
inference
games
摘要:
We propose a novel estimator for the number of mixture components (denoted by M) in a nonparametric finite mixture model. The setting that we consider is one where the analyst has repeated observations of K >= 2 variables that are conditionally independent given a finitely supported latent variable withM support points. Under a mild assumption on the joint distribution of the observed and latent variables, we show that an integral operator T that is identified from the data has rank equal to M. We use this observation, in conjunction with the fact that singular values of operators are stable under perturbations, to propose an estimator of M, which essentially consists of a thresholding rule that counts the number of singular values of a consistent estimator of T that are greater than a data-driven threshold. We prove that our estimator of M is consistent, and establish nonasymptotic results, which provide finite sample performance guarantees for our estimator. We present a Monte Carlo study, which shows that our estimator performs well for samples of moderate size.
来源URL: