DISTRIBUTED PROPORTIONAL LIKELIHOOD RATIO MODEL WITH APPLICATION TO DATA INTEGRATION ACROSS CLINICAL SITES

成果类型:
Article
署名作者:
Luo, Chongliang; Duan, Rui; Edmondson, Mackenzie; Shi, Jiasheng; Maltenfort, Mitchell; Morris, Jeffrey S.; Forrest, Christopher B.; Hubbard, Rebecca; Chen, Yong
署名单位:
Washington University (WUSTL); Harvard University; Harvard T.H. Chan School of Public Health; University of Pennsylvania; University of Pennsylvania; Pennsylvania Medicine; Childrens Hospital of Philadelphia
刊物名称:
ANNALS OF APPLIED STATISTICS
ISSN/ISSBN:
1932-6157
DOI:
10.1214/23-AOAS1779
发表日期:
2024
页码:
63-79
关键词:
regression-analysis longitudinal data health-care count data estimators multiple
摘要:
Real -world evidence synthesis through integration of data from distributed research networks has gained increasing attention in recent years. Due to privacy concerns and restrictions of sharing patient -level data, distributed algorithms that do not require sharing patient level information are in great need for facilitating multisite collaborations. On the other hand, data collected at multiple sites often come from diverse populations, and there exists a substantial amount of heterogeneity across sites in patient characteristics. Most of the existing distributed algorithms have ignored such betweensite heterogeneity. In this paper we aim to fill this methodological gap by proposing a general distributed algorithm. We develop our distributed algorithm based on a general semiparametric model, namely, the proportional likelihood ratio model (Biometrika 99 (2012) 211-222), which is a semiparametric extension of generalized linear model. We devise the proportional likelihood ratio model with site -specific baseline function, to account for between -site heterogeneity, and shared regression parameters to borrow information across sites. Under this flexible formulation, our distributed algorithm is designed to be privacy -preserving and communication -efficient (i.e., only one round of communication across sites is needed). We validate our method via simulation studies and demonstrate the utility of our method via a multisite study of pediatric avoidable hospitalization based on electronic health record data from a total of 354,672 patients across 26 different clinical sites within the Children's Hospital of Philadelphia health system.