ACCURACY ASSESSMENT FOR HIGH-DIMENSIONAL LINEAR REGRESSION

成果类型:
Article
署名作者:
Cai, T. Tony; Guo, Zijian
署名单位:
University of Pennsylvania; Rutgers University System; Rutgers University New Brunswick
刊物名称:
ANNALS OF STATISTICS
ISSN/ISSBN:
0090-5364
DOI:
10.1214/17-AOS1604
发表日期:
2018
页码:
1807-1836
关键词:
dantzig selector confidence-intervals sparse regression Minimax Rates Lasso shrinkage matrices balls bands RISK
摘要:
This paper considers point and interval estimation of the l(q) loss of an estimator in high-dimensional linear regression with random design. We establish the minimax rate for estimating the l(q) loss and the minimax expected length of confidence intervals for the l(q) loss of rate-optimal estimators of the regression vector, including commonly used estimators such as Lasso, scaled Lasso, square-root Lasso and Dantzig Selector. Adaptivity of confidence intervals for the l(q) loss is also studied. Both the setting of the known identity design covariance matrix and known noise level and the setting of unknown design covariance matrix and unknown noise level are studied. The results reveal interesting and significant differences between estimating the l(2) loss and l(q) loss with 1 <= q < 2 as well as between the two settings. New technical tools are developed to establish rate sharp lower bounds for the minimax estimation error and the expected length of minimax and adaptive confidence intervals for the l(q) loss. A significant difference between loss estimation and the traditional parameter estimation is that for loss estimation the constraint is on the performance of the estimator of the regression vector, but the lower bounds are on the difficulty of estimating its l(q) loss. The technical tools developed in this paper can also be of independent interest.