Covariate-Assisted Bayesian Graph Learning for Heterogeneous Data
成果类型:
Article
署名作者:
Niu, Yabo; Ni, Yang; Pati, Debdeep; Mallick, Bani K.
署名单位:
University of Houston System; University of Houston; Texas A&M University System; Texas A&M University College Station
刊物名称:
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION
ISSN/ISSBN:
0162-1459
DOI:
10.1080/01621459.2023.2233744
发表日期:
2024
页码:
1985-1999
关键词:
breast-cancer subtypes
ising-model selection
variable selection
pi3k/akt/mtor pathway
convergence-rates
Matrix Estimation
expression
regression
inference
mixtures
摘要:
In a traditional Gaussian graphical model, data homogeneity is routinely assumed with no extra variables affecting the conditional independence. In modern genomic datasets, there is an abundance of auxiliary information, which often gets under-utilized in determining the joint dependency structure. In this article, we consider a Bayesian approach to model undirected graphs underlying heterogeneous multivariate observations with additional assistance from covariates. Building on product partition models, we propose a novel covariate-dependent Gaussian graphical model that allows graphs to vary with covariates so that observations whose covariates are similar share a similar undirected graph. To efficiently embed Gaussian graphical models into our proposed framework, we explore both Gaussian likelihood and pseudo-likelihood functions. For Gaussian likelihood, a G-Wishart distribution is used as a natural conjugate prior, and for the pseudo-likelihood, a product of Gaussian-conditionals is used. Moreover, the proposed model has large prior support and is flexible to approximate any nu-Holder conditional variance-covariance matrices with nu is an element of(0,1] . We further show that based on the theory of fractional likelihood, the rate of posterior contraction is minimax optimal assuming the true density to be a Gaussian mixture with a known number of components. The efficacy of the approach is demonstrated via simulation studies and an analysis of a protein network for a breast cancer dataset assisted by mRNA gene expression as covariates. Supplementary materials for this article are available online.