Variance estimation in high-dimensional linear models

Document type:

Article

Author:

Dicker, Lee H.

Affiliation:

Rutgers University System; Rutgers University New Brunswick

Journal:

BIOMETRIKA

ISSN/ISBN:

0006-3444

DOI:

10.1093/biomet/ast065

Publication date:

2014

Pages:

269-284

Keywords:

optimal rates; covariance; convergence

Abstract:

The residual variance and the proportion of explained variation are important quantities in many statistical models and model fitting procedures. They play an important role in regression diagnostics and model selection procedures, as well as in determining the performance limits in many problems. In this paper we propose new method-of-moments-based estimators for the residual variance, the proportion of explained variation and other related quantities, such as the ℓ2 signal strength. The proposed estimators are consistent and asymptotically normal in high-dimensional linear models with Gaussian predictors and errors, where the number of predictors d is proportional to the number of observations n; in fact, consistency holds even in settings where d/n → ∞. Existing results on residual variance estimation in high-dimensional linear models depend on sparsity in the underlying signal. Our results require no sparsity assumptions and imply that the residual variance and the proportion of explained variation can be consistently estimated even when d > n and the underlying signal itself is nonestimable. Numerical work suggests that some of our distributional assumptions may be relaxed. A real-data analysis involving gene expression data and single nucleotide polymorphism data illustrates the performance of the proposed methods.
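
The abstract does not state the estimators themselves, so the following is only a minimal sketch of how a method-of-moments estimator of this kind can be built. It makes a simplifying assumption that is not taken from the abstract: the rows of X are i.i.d. N(0, I_d), meaning the predictor covariance is the identity. Under that assumption, with y = Xβ + ε, τ² = ||β||² and error variance σ², the moment identities E||y||² = n(τ² + σ²) and E||Xᵀy||² = n(n + d + 1)τ² + ndσ² can be solved for σ² and τ². The names `mm_variance_estimates` and `simulate` are hypothetical, chosen for this illustration.

```python
import random


def mm_variance_estimates(X, y):
    """Return (sigma2_hat, tau2_hat, r2_hat) by matching two moments.

    Sketch only. Assumes the rows of X are i.i.d. N(0, I_d); the identity
    covariance is an assumption of this example, not a claim about the paper.
    X is a list of n rows, each a list of d floats; y is a list of n floats.
    """
    n, d = len(X), len(X[0])
    y_sq = sum(v * v for v in y)  # ||y||^2
    # ||X^T y||^2, accumulated one predictor (column) at a time
    xty_sq = sum(
        sum(X[i][j] * y[i] for i in range(n)) ** 2 for j in range(d)
    )
    c = n * (n + 1)
    sigma2 = (d + n + 1) / c * y_sq - xty_sq / c  # residual variance
    tau2 = -d / c * y_sq + xty_sq / c             # l2 signal strength
    total = y_sq / n                              # equals sigma2 + tau2
    r2 = tau2 / total if total > 0 else float("nan")
    return sigma2, tau2, r2


def simulate(n=100, d=200, sigma2=1.0, tau2=1.0, seed=0):
    """Draw one dataset with d > n and a dense (non-sparse) signal."""
    rng = random.Random(seed)
    beta = [(tau2 / d) ** 0.5] * d  # every coefficient nonzero, ||beta||^2 = tau2
    X = [[rng.gauss(0.0, 1.0) for _ in range(d)] for _ in range(n)]
    noise_sd = sigma2 ** 0.5
    y = [
        sum(x * b for x, b in zip(row, beta)) + rng.gauss(0.0, noise_sd)
        for row in X
    ]
    return mm_variance_estimates(X, y)


if __name__ == "__main__":
    print(simulate())
```

Neither quantity requires estimating β itself, which is why this construction still makes sense when d > n. In finite samples the estimates are noisy and can fall outside their natural ranges (for example, a negative τ² estimate), so a practical implementation would likely truncate them.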