ADAPTIVE GPCA: A METHOD FOR STRUCTURED DIMENSIONALITY REDUCTION WITH APPLICATIONS TO MICROBIOME DATA

成果类型:
Article
署名作者:
Fukuyama, Julia
署名单位:
Indiana University System; Indiana University Bloomington
刊物名称:
ANNALS OF APPLIED STATISTICS
ISSN/ISSBN:
1932-6157
DOI:
10.1214/18-AOAS1227
发表日期:
2019
页码:
1043-1067
关键词:
principal-components matrix communities sparsity unifrac
摘要:
Exploratory analysis is an important first step for discovering latent structure and generating hypotheses in large biological data sets. However, when the number of variables is large compared to the number of samples, standard methods such as principal components analysis give results that are unstable and difficult to interpret. Here, we present adaptive generalized principal components analysis (adaptive gPCA), a new method that solves these problems by incorporating information about the relationships among the variables. Adaptive gPCA gives a low-dimensional representation of the samples with axes that are interpretable in terms of groups of closely related variables. We show that adaptive gPCA does well at reconstructing true latent structure in simulated data and demonstrate its use on a study of the effect of antibiotics on the human gut microbiota.
来源URL: