JOINT INTEGRATIVE ANALYSIS OF MULTIPLE DATA SOURCES WITH CORRELATED VECTOR OUTCOMES
Document type:
Article
Authors:
Hector, Emily C.; Song, Peter X-K
Affiliations:
North Carolina State University; University of Michigan System; University of Michigan
Journal:
ANNALS OF APPLIED STATISTICS
ISSN/ISBN:
1932-6157
DOI:
10.1214/21-AOAS1563
Publication date:
2022
Pages:
1700-1717
Keywords:
confidence distribution
generalized-method
longitudinal data
sample properties
metaanalysis
statistics
parameter
Abstract:
We propose a distributed quadratic inference function framework to jointly estimate regression parameters from multiple potentially heterogeneous data sources with correlated vector outcomes. The primary goal of this joint integrative analysis is to estimate covariate effects on all outcomes through a marginal regression model in a statistically and computationally efficient way. We develop a data integration procedure for statistical estimation and inference of regression parameters that is implemented in a fully distributed and parallelized computational scheme. To overcome computational and modeling challenges arising from the high-dimensional likelihood of the correlated vector outcomes, we propose to analyze each data source using Qu, Lindsay and Li's (Biometrika 87 (2000) 823-836) quadratic inference functions and then to jointly reestimate parameters from each data source by accounting for correlation between data sources using a combined meta-estimator in a similar spirit to the generalized method of moments put forward by Hansen (Econometrica 50 (1982) 1029-1054). We show both theoretically and numerically that the proposed method yields efficiency improvements and is computationally fast. We illustrate the proposed methodology with the joint integrative analysis of the association between smoking and metabolites in a large multicohort study and provide an R package for ease of implementation.
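The combination step described in the abstract, pooling per-source estimates while accounting for correlation between data sources, can be illustrated with a much simpler stand-in: a generalized least squares (minimum-distance) combination of K correlated estimates of a single scalar parameter. This is a sketch under stated assumptions, not the authors' quadratic inference function procedure or their R package. The function names `solve` and `combine`, the scalar-parameter restriction, and the assumption that the between-source covariance matrix is known are all hypothetical simplifications introduced here.

```python
def solve(A, b):
    """Solve the linear system A x = b by Gaussian elimination with partial pivoting."""
    n = len(A)
    # Build the augmented matrix [A | b] as floats.
    M = [list(map(float, A[i])) + [float(b[i])] for i in range(n)]
    for col in range(n):
        # Pick the row with the largest pivot for numerical stability.
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        if abs(M[pivot][col]) < 1e-12:
            raise ValueError("covariance matrix is singular")
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            f = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= f * M[col][c]
    # Back substitution.
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        s = sum(M[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (M[i][n] - s) / M[i][i]
    return x


def combine(estimates, cov):
    """GLS combination of K correlated estimates of one scalar parameter.

    estimates: list of K per-source estimates (assumed unbiased).
    cov: K x K covariance matrix of those estimates (assumed known).
    Returns (combined_estimate, variance_of_combined_estimate).
    """
    k = len(estimates)
    w = solve(cov, [1.0] * k)   # w = V^{-1} 1
    total = sum(w)              # 1' V^{-1} 1
    est = sum(wi * bi for wi, bi in zip(w, estimates)) / total
    return est, 1.0 / total


if __name__ == "__main__":
    # Two sources, equal variances, correlation 0.5 between their estimates.
    est, var = combine([1.0, 3.0], [[1.0, 0.5], [0.5, 1.0]])
    print(est, var)
```

With independent sources of unit variance the combined variance in this toy setting would be 0.5; with correlation 0.5 it is 0.75, which shows why ignoring between-source correlation would overstate precision.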
Source URL: