Coordinatewise Gaussianization: Theories and Applications

成果类型:
Article
署名作者:
Mai, Qing; He, Di; Zou, Hui
署名单位:
State University System of Florida; Florida State University; Nanjing University; University of Minnesota System; University of Minnesota Twin Cities
刊物名称:
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION
ISSN/ISSBN:
0162-1459
DOI:
10.1080/01621459.2022.2044825
发表日期:
2023
页码:
2329-2343
关键词:
Covariance Estimation DISCRIMINANT-ANALYSIS shrunken centroids kolmogorov filter Matrix Estimation linear-models regression association likelihood
摘要:
In statistical analysis, researchers often perform coordinatewise Gaussianization such that each variable is marginally normal. The normal score transformation is a method for coordinatewise Gaussianization and is widely used in statistics, econometrics, genetics and other areas. However, few studies exist on the theoretical properties of the normal score transformation, especially in high-dimensional problems where the dimension p diverges with the sample size n. In this article, we show that the normal score transformation uniformly converges to its population counterpart even when log p=o(n/ log n). Our result can justify the normal score transformation prior to any downstream statistical method to which the theoretical normal transformation is beneficial. The same results are established for the Winsorized normal transformation, another popular choice for coordinatewise Gaussianization. We demonstrate the benefits of coordinatewise Gaussianization by studying its applications to the Gaussian copula model, the nearest shrunken centroids classifier and distance correlation. The benefits are clearly shown in theory and supported by numerical studies. Moreover, we also point out scenarios where coordinatewise Gaussinization does not help and even causes damages. We offer a general recommendation on how to use coordinatewise Gaussianization in applications. for this article are available online.