BNP-Seq: Bayesian Nonparametric Differential Expression Analysis of Sequencing Count Data
成果类型:
Article
署名作者:
Dadaneh, Siamak Zamani; Qian, Xiaoning; Zhou, Mingyuan
署名单位:
Texas A&M University System; Texas A&M University College Station; University of Texas System; University of Texas Austin; University of Texas System; University of Texas Austin
刊物名称:
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION
ISSN/ISSBN:
0162-1459
DOI:
10.1080/01621459.2017.1328358
发表日期:
2018
页码:
81-94
关键词:
renal-cell carcinoma
rna-seq
tumor
genes
proliferation
normalization
bioconductor
patterns
package
models
摘要:
We perform differential expression analysis of high-throughput sequencing count data under a Bayesian nonparametric framework, removing sophisticated ad hoc pre-processing steps commonly required in existing algorithms. We propose to use the gamma (beta) negative binomial process, which takes into account different sequencing depths using sample-specific negative binomial probability (dispersion) parameters, to detect differentially expressed genes by comparing the posterior distributions of gene-specific negative binomial dispersion (probability) parameters. These model parameters are inferred by borrowing statistical strength across both the genes and samples. Extensive experiments on both simulated and real-world RNA sequencing count data show that the proposed differential expression analysis algorithms clearly outperform previously proposed ones in terms of the areas under both the receiver operating characteristic and precision-recall curves. Supplementary materials for this article are available online.