SHARP OPTIMAL RECOVERY IN THE TWO COMPONENT GAUSSIAN MIXTURE MODEL
成果类型:
Article
署名作者:
Ndaoud, Mohamed
署名单位:
ESSEC Business School
刊物名称:
ANNALS OF STATISTICS
ISSN/ISSBN:
0090-5364
DOI:
10.1214/22-AOS2178
发表日期:
2022
页码:
2096-2126
关键词:
摘要:
In this paper, we study the problem of clustering in the Two component Gaussian mixture model when the centers are separated by some Delta > 0. We present a nonasymptotic lower bound for the corresponding minimax Hamming risk improving on existing results. We also propose an optimal, efficient and adaptive procedure that is minimax rate optimal. The rate optimality is moreover sharp in the asymptotics when the sample size goes to infinity. Our procedure is based on a variant of Lloyd's iterations initialized by a spectral method. As a consequence of nonasymptotic results, we find a sharp phase transition for the problem of exact recovery in the Gaussian mixture model. We prove that the phase transition occurs around the critical threshold (Delta) over bar given by (Delta) over bar (-2) = sigma(2)(1 + root 1+ 2p/n log n) log n.