AFFINE-EQUIVARIANT INFERENCE FOR MULTIVARIATE LOCATION UNDER Lp LOSS FUNCTIONS

Document type:
Article
Authors:
Durre, Alexander; Paindaveine, Davy
Affiliations:
Universite Libre de Bruxelles; Universite Libre de Bruxelles
Journal:
ANNALS OF STATISTICS
ISSN/ISBN:
0090-5364
DOI:
10.1214/22-AOS2199
Publication date:
2022
Pages:
2616-2640
Keywords:
Hilbert spaces; sign; depth
Abstract:
We consider the fundamental problem of estimating the location of a $d$-variate probability measure under an $L_p$ loss function. The naive estimator, which minimizes the usual empirical $L_p$ risk, has a known asymptotic behavior but suffers from several deficiencies for $p \neq 2$, the most important one being the lack of equivariance under general affine transformations. In this work, we introduce a collection of $L_p$ location estimators $\hat{\mu}_n^{p,l}$ that minimize the size of suitable $l$-dimensional data-based simplices. For $l = 1$, these estimators reduce to the naive ones, whereas, for $l = d$, they are equivariant under affine transformations. Irrespective of $l$, these estimators reduce to the sample mean for $p = 2$, whereas for $p = 1$, the estimators provide the well-known spatial median and Oja median for $l = 1$ and $l = d$, respectively. Under very mild assumptions, we derive an explicit Bahadur representation result for $\hat{\mu}_n^{p,l}$ and establish asymptotic normality. We prove that, quite remarkably, the asymptotic behavior of the estimators does not depend on $l$ under spherical symmetry, so that the affine equivariance for $l = d$ is achieved at no cost in terms of efficiency. To allow for large sample size $n$ and/or large dimension $d$, we introduce a version of our estimators relying on incomplete U-statistics. Under a centro-symmetry assumption, we also define companion tests $\phi_n^{p,l}$ for the problem of testing the null hypothesis that the location $\mu$ of the underlying probability measure coincides with a given location $\mu_0$. For any $p$, affine invariance is achieved for $l = d$. For any $l$ and $p$, we derive explicit expressions for the asymptotic power of these tests under contiguous local alternatives, which reveals that asymptotic relative efficiencies with respect to traditional parametric Gaussian procedures for hypothesis testing coincide with those obtained for point estimation.
We illustrate finite-sample relevance of our asymptotic results through Monte Carlo exercises and also treat a real data example.
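
The two extreme cases described in the abstract ($l = 1$ and $l = d$) can be illustrated with a minimal Python sketch for $d = 2$. It rests on assumptions that the abstract does not confirm. The "size" of a simplex is taken to be its Euclidean length ($l = 1$) or triangle area ($l = 2$) raised to the power $p$. The minimizer is a crude derivative-free compass search, used only as a stand-in for whatever optimization the authors use. The incomplete U-statistic variant is imitated by summing over a random subset of pairs. All function names (`lp_norm_objective`, `lp_simplex_objective`, `random_pairs`, `compass_search`) are hypothetical.

```python
import itertools
import math
import random


def lp_norm_objective(mu, xs, p):
    """l = 1 case (assumed form): sum of Euclidean distances ||x_i - mu||^p."""
    return sum(math.hypot(x[0] - mu[0], x[1] - mu[1]) ** p for x in xs)


def lp_simplex_objective(mu, xs, p, pairs=None):
    """l = d = 2 case (assumed form): sum of area(mu, x_i, x_j)^p over pairs.

    If `pairs` is None, all pairs i < j are used (complete U-statistic);
    otherwise only the given list of index pairs is used.
    """
    if pairs is None:
        pairs = itertools.combinations(range(len(xs)), 2)
    total = 0.0
    for i, j in pairs:
        a, b = xs[i], xs[j]
        det = (a[0] - mu[0]) * (b[1] - mu[1]) - (a[1] - mu[1]) * (b[0] - mu[0])
        total += (0.5 * abs(det)) ** p  # triangle area raised to the power p
    return total


def random_pairs(n, m, rng):
    """Draw m distinct index pairs i < j (stand-in for an incomplete U-statistic)."""
    return rng.sample(list(itertools.combinations(range(n), 2)), m)


def compass_search(f, start, step=1.0, tol=1e-9, max_iter=10000):
    """Crude derivative-free minimizer in the plane: try the four axis
    directions, accept any improvement, halve the step when none improves."""
    mu, best = list(start), f(start)
    iterations = 0
    while step > tol and iterations < max_iter:
        improved = False
        for dx, dy in ((step, 0.0), (-step, 0.0), (0.0, step), (0.0, -step)):
            cand = [mu[0] + dx, mu[1] + dy]
            val = f(cand)
            if val < best:
                mu, best, improved = cand, val, True
        if not improved:
            step /= 2.0
        iterations += 1
    return mu


# Simulated bivariate sample (illustrative data, not from the paper).
rng = random.Random(0)
xs = [(rng.gauss(0.0, 1.0), rng.gauss(0.0, 1.0)) for _ in range(30)]
mean = [sum(x[0] for x in xs) / len(xs), sum(x[1] for x in xs) / len(xs)]

# l = 1, p = 1: approximate spatial median, started from the sample mean.
spatial = compass_search(lambda m: lp_norm_objective(m, xs, 1.0), mean)
# l = d = 2, p = 1.5: simplex-based estimate over all pairs.
simplex = compass_search(lambda m: lp_simplex_objective(m, xs, 1.5), mean)
# Same objective restricted to 100 random pairs.
subset = random_pairs(len(xs), 100, rng)
simplex_inc = compass_search(
    lambda m: lp_simplex_objective(m, xs, 1.5, subset), mean
)
print("sample mean:", mean)
print("l=1, p=1:", spatial)
print("l=2, p=1.5:", simplex)
print("l=2, p=1.5 (100 pairs):", simplex_inc)
```

Under these assumptions, an affine map $x \mapsto Ax + b$ applied to both the data and $\mu$ multiplies every triangle area by $|\det A|$, so the $l = d$ objective is rescaled by $|\det A|^p$ and its minimizer moves with the data. The $l = 1$ objective has no such property for general $A$. For non-smooth cases such as $p = 1$, compass search can stall away from the minimizer, so its output should be read as an approximation only.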