Root cause discovery via permutations and Cholesky decomposition
成果类型:
Article; Early Access
署名作者:
Li, Jinzhou; Chu, Benjamin B.; Scheller, Ines F.; Gagneur, Julien; Maathuis, Marloes H.
署名单位:
National University of Singapore; Stanford University; Technical University of Munich; Helmholtz Association; Helmholtz-Center Munich - German Research Center for Environmental Health; Technical University of Munich; Technical University of Munich; Swiss Federal Institutes of Technology Domain; ETH Zurich
刊物名称:
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY
ISSN/ISSBN:
1369-7412
DOI:
10.1093/jrsssb/qkaf066
发表日期:
2025
关键词:
interventions
inference
selection
MODEL
摘要:
This work is motivated by the following problem: Can we identify the disease-causing gene in a patient affected by a monogenic disorder? This problem is an instance of root cause discovery. Specifically, we aim to identify the intervened variable in one interventional sample using a set of observational samples as reference. We consider a linear structural equation model where the causal ordering is unknown. We begin by examining a simple method that uses squared z-scores and characterize the conditions under which this method succeeds and fails, showing it generally cannot identify the root cause. We then prove, without additional assumptions, that the root cause is identifiable even if the causal ordering is not. Two key ingredients of this identifiability result are the use of permutations and the Cholesky decomposition, which allow us to exploit an invariant property across different permutations to discover the root cause. Furthermore, we characterize permutations that yield the correct root cause and, based on this, propose a valid method for root cause discovery. We also adapt this approach to high-dimensional settings. Finally, we evaluate our methods through simulations and apply the high-dimensional method to discover disease-causing genes in the gene expression dataset that motivates this work.
来源URL: