KERNEL-PENALIZED REGRESSION FOR ANALYSIS OF MICROBIOME DATA

成果类型:
Article
署名作者:
Randolph, Timothy W.; Zhao, Sen; Copeland, Wade; Hullar, Meredith; Shojaie, Ali
署名单位:
Fred Hutchinson Cancer Center; University of Washington; University of Washington Seattle
刊物名称:
ANNALS OF APPLIED STATISTICS
ISSN/ISSBN:
1932-6157
DOI:
10.1214/17-AOAS1102
发表日期:
2018
页码:
540-566
关键词:
diversity unifrac tests
摘要:
The analysis of human microbiome data is often based on dimensionreduced graphical displays and clusterings derived from vectors of microbial abundances in each sample. Common to these ordination methods is the use of biologically motivated definitions of similarity. Principal coordinate analysis, in particular, is often performed using ecologically defined distances, allowing analyses to incorporate context-dependent, non-Euclidean structure. In this paper, we go beyond dimension-reduced ordination methods and describe a framework of high-dimensional regression models that extends these distance-based methods. In particular, we use kernel-based methods to show how to incorporate a variety of extrinsic information, such as phylogeny, into penalized regression models that estimate taxon-specific associations with a phenotype or clinical outcome. Further, we show how this regression framework can be used to address the compositional nature of multivariate predictors comprised of relative abundances; that is, vectors whose entries sum to a constant. We illustrate this approach with several simulations using data from two recent studies on gut and vaginal microbiomes. We conclude with an application to our own data, where we also incorporate a significance test for the estimated coefficients that represent associations between microbial abundance and a percent fat.
来源URL: