Robust Inference for Federated Meta-Learning
成果类型:
Article; Early Access
署名作者:
Guo, Zijian; Li, Xiudi; Han, Larry; Cai, Tianxi
署名单位:
Rutgers University System; Rutgers University New Brunswick; University of California System; University of California Berkeley; Northeastern University; Harvard University; Harvard T.H. Chan School of Public Health; Harvard University; Harvard Medical School
刊物名称:
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION
ISSN/ISSBN:
0162-1459
DOI:
10.1080/01621459.2024.2443246
发表日期:
2025
关键词:
confidence-intervals
invalid instruments
GROUP-PERFORMANCE
adaptive lasso
n-bootstrap
selection
摘要:
Synthesizing information from multiple data sources is critical to ensure knowledge generalizability. Integrative analysis of multi-source data is challenging due to the heterogeneity across sources and data-sharing constraints. In this article, we consider a general robust inference framework for federated meta-learning of data from multiple sites, enabling statistical inference for the prevailing model, defined as the one matching the majority of the sites. Statistical inference for the prevailing model is challenging since it requires a data-adaptive mechanism to select eligible sites and subsequently account for the selection uncertainty. We propose a novel sampling method to address the additional variation arising from the selection. Our devised confidence interval does not require sites to share individual-level data and is shown to be valid without requiring the selection of eligible sites to be error-free. The proposed robust inference for federated meta-learning (RIFL) methodology is broadly applicable and illustrated with three inference problems: aggregation of parametric models, high-dimensional prediction models, and inference for average treatment effects. We use RIFL to perform federated learning of mortality risk for patients hospitalized with COVID-19 using real-world EHR data from 15 healthcare centers representing 274 hospitals across four countries. Supplementary materials for this article are available online, including a standardized description of the materials available for reproducing the work.
来源URL: