TRUNCATED RANK-BASED TESTS FOR TWO-PART MODELS WITH EXCESSIVE ZEROS AND APPLICATIONS TO MICROBIOME DATA
成果类型:
Article
署名作者:
Wang, Wanjie; Chen, Eric; Li, Hongzhe
署名单位:
National University of Singapore; University of Pennsylvania
刊物名称:
ANNALS OF APPLIED STATISTICS
ISSN/ISSBN:
1932-6157
DOI:
10.1214/22-AOAS1688
发表日期:
2023
页码:
1663-1680
关键词:
gut microbiota
摘要:
High-throughput sequencing technology allows us to test the composi-tional difference of bacteria in different populations. One important feature of human microbiome data is that it often includes a large number of ze-ros. Such data can be treated as being generated from a two-part model that includes a zero-point mass. Motivated by analysis of such nonnegative data with excessive zeros, we introduce several truncated rank-based two-group and multigroup tests, including a truncated rank-based Wilcoxon rank-sum test for two-group comparison and two truncated Kruskal-Wallis tests for multigroup comparisons. We show, both analytically through asymptotic rela-tive efficiency analysis and by simulations, that the proposed tests have higher power than the standard rank-based tests in typical microbiome data settings, especially when the proportion of zeros in the data is high. The tests can also be applied to repeated measurements of compositional data via simple within -subject permutations. In a simple before-and-after treatment experiment, the within-subject permutation is similar to the paired rank test. However, the proposed tests handle the excessive zeros which leads to a better power. We apply the tests to compare the microbiome compositions of healthy children and pediatric Crohn's disease patients and to assess the treatment effects on microbiome compositions. We identify several bacterial genera that are missed by the standard rank-based tests.
来源URL: