Doubly robust conditional independence testing with generative neural networks

成果类型:
Article; Early Access
署名作者:
Zhang, Yi; Huang, Linjun; Yang, Yun; Shao, Xiaofeng
署名单位:
University of Illinois System; University of Illinois Urbana-Champaign; University System of Maryland; University of Maryland College Park; Washington University (WUSTL); Washington University (WUSTL)
刊物名称:
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY
ISSN/ISSBN:
1369-7412
DOI:
10.1093/jrsssb/qkaf047
发表日期:
2025
关键词:
dependence selection
摘要:
This article addresses the problem of testing the conditional independence of two generic random vectors X and Y given a third random vector Z, which plays an important role in statistical and machine learning applications. We propose a new non-parametric testing procedure that avoids explicitly estimating any conditional distributions but instead requires sampling from the two marginal conditional distributions of X given Z and Y given Z. We further propose using a generative neural network (GNN) framework to sample from these approximated marginal conditional distributions, which tends to mitigate the curse of dimensionality due to its adaptivity to any low-dimensional structures and smoothness underlying the data. Theoretically, our test statistic is shown to enjoy a double robustness property against GNN approximation errors, meaning that the test statistic retains all desirable properties of the oracle test statistic utilizing the true marginal conditional distributions, as long as the product of the two approximation errors decays to zero faster than the parametric rate. Asymptotic properties of our statistic and the consistency of a bootstrap procedure are derived under both null and local alternatives. Extensive numerical experiments and real data analysis illustrate the effectiveness and broad applicability of our proposed test.
来源URL: