您的位置: 首页 > 全球经管学术 > 顶刊追踪 > 顶尖期刊 > 统计学 > Journal of the Royal Statistical Society: Series B > 2021 > 4期

Covariate powered cross-weighted multiple testing

成果类型：

Article

署名作者：

Ignatiadis, Nikolaos; Huber, Wolfgang

署名单位：

Stanford University; European Molecular Biology Laboratory (EMBL)

刊物名称：

JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY

ISSN/ISSBN：

1369-7412

DOI：

10.1111/rssb.12411

发表日期：

2021

页码：

720-751

关键词：

false discovery rate increases detection power EMPIRICAL BAYES Mixture Model fdr error rates

摘要：

A fundamental task in the analysis of data sets with many variables is screening for associations. This can be cast as a multiple testing task, where the objective is achieving high detection power while controlling type I error. We consider m hypothesis tests represented by pairs ((Pi,Xi))1 <= i <= m of p-values Pi and covariates Xi, such that Pi perpendicular to Xi if Hi is null. Here, we show how to use information potentially available in the covariates about heterogeneities among hypotheses to increase power compared to conventional procedures that only use the Pi. To this end, we upgrade existing weighted multiple testing procedures through the independent hypothesis weighting (IHW) framework to use data-driven weights that are calculated as a function of the covariates. Finite sample guarantees, for example false discovery rate control, are derived from cross-weighting, a data-splitting approach that enables learning the weight-covariate function without overfitting as long as the hypotheses can be partitioned into independent folds, with arbitrary within-fold dependence. IHW has increased power compared to methods that do not use covariate information. A key implication of IHW is that hypothesis rejection in common multiple testing setups should not proceed according to the ranking of the p-values, but by an alternative ranking implied by the covariate-weighted p-values.

来源URL：

访问原文