OPTIMAL PREDICTION IN THE LINEARLY TRANSFORMED SPIKED MODEL

成果类型:
Article
署名作者:
Dobriban, Edgar; Leeb, William; Singer, Amit
署名单位:
University of Pennsylvania; University of Minnesota System; University of Minnesota Twin Cities; Princeton University
刊物名称:
ANNALS OF STATISTICS
ISSN/ISSBN:
0090-5364
DOI:
10.1214/19-AOS1819
发表日期:
2020
页码:
491-513
关键词:
low-rank matrix cryo-em LARGEST EIGENVALUE optimal shrinkage completion components asymptotics noise POWER
摘要:
We consider the linearly transformed spiked model, where the observations Y-i are noisy linear transforms of unobserved signals of interest X-i: Y-i = A(i)X(i) + epsilon(i), for i = 1, ..., n. The transform matrices A(i) are also observed. We model the unobserved signals (or regression coefficients) X-i as vectors lying on an unknown low-dimensional space. Given only Y-i and A(i) how should we predict or recover their values? The naive approach of performing regression for each observation separately is inaccurate due to the large noise level. Instead, we develop optimal methods for predicting X-i by borrowing strength across the different samples. Our linear empirical Bayes methods scale to large datasets and rely on weak moment assumptions. We show that this model has wide-ranging applications in signal processing, deconvolution, cryo-electron microscopy, and missing data with noise. For missing data, we show in simulations that our methods are more robust to noise and to unequal sampling than well-known matrix completion methods.