Covariate-Assisted Bayesian Graph Learning for Heterogeneous Data

成果类型:
Article
署名作者:
Niu, Yabo; Ni, Yang; Pati, Debdeep; Mallick, Bani K.
署名单位:
University of Houston System; University of Houston; Texas A&M University System; Texas A&M University College Station
刊物名称:
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION
ISSN/ISSBN:
0162-1459
DOI:
10.1080/01621459.2023.2233744
发表日期:
2024
页码:
1985-1999
关键词:
breast-cancer subtypes ising-model selection variable selection pi3k/akt/mtor pathway convergence-rates Matrix Estimation expression regression inference mixtures
摘要:
In a traditional Gaussian graphical model, data homogeneity is routinely assumed with no extra variables affecting the conditional independence. In modern genomic datasets, there is an abundance of auxiliary information, which often gets under-utilized in determining the joint dependency structure. In this article, we consider a Bayesian approach to model undirected graphs underlying heterogeneous multivariate observations with additional assistance from covariates. Building on product partition models, we propose a novel covariate-dependent Gaussian graphical model that allows graphs to vary with covariates so that observations whose covariates are similar share a similar undirected graph. To efficiently embed Gaussian graphical models into our proposed framework, we explore both Gaussian likelihood and pseudo-likelihood functions. For Gaussian likelihood, a G-Wishart distribution is used as a natural conjugate prior, and for the pseudo-likelihood, a product of Gaussian-conditionals is used. Moreover, the proposed model has large prior support and is flexible to approximate any nu-Holder conditional variance-covariance matrices with nu is an element of(0,1] . We further show that based on the theory of fractional likelihood, the rate of posterior contraction is minimax optimal assuming the true density to be a Gaussian mixture with a known number of components. The efficacy of the approach is demonstrated via simulation studies and an analysis of a protein network for a breast cancer dataset assisted by mRNA gene expression as covariates. Supplementary materials for this article are available online.