Assessing linearity in high dimensions
成果类型:
Article
署名作者:
Owen, AB
署名单位:
Stanford University
刊物名称:
ANNALS OF STATISTICS
ISSN/ISSBN:
0090-5364
DOI:
10.1214/aos/1016120362
发表日期:
2000
页码:
1-19
关键词:
algorithms
摘要:
Some standard numerical problems become intractable in high dimensions. Yet successes are often achieved in practice. This may be explained in terms of the underlying target function being somehow simpler than assumed in the intractability arguments. A prototypical form of simplicity is approximate linearity. In moderate dimensions, linearity can be investigated by linear regression. In very high dimensions, this becomes computationally inefficient and eventually infeasible, as the cost of regression for n observations in d dimensions grows as nd(2). This paper presents a quasi-regression method for determining the degree of linearity in a function, where the cost grows only as nd. A bias-corrected version of quasi-regression is able to estimate the degree of linearity with a sample size of order d(2/3). An example is given of a function on [0, 1](1,000,000), for which the amount of linear variability is accurately estimated from only 100, 000 observations.