SEMIPARAMETRIC COVARIATE-MODULATED LOCAL FALSE DISCOVERY RATE FOR GENOME-WIDE ASSOCIATION STUDIES

成果类型:
Article
署名作者:
Zablocki, Rong W.; Levine, Richard A.; Schork, Andrew J.; Xu, Shujing; Wang, Yunpeng; Fan, Chun C.; Thompson, Wesley K.
署名单位:
California State University System; San Diego State University; California State University System; San Diego State University; Claremont Colleges; Claremont Graduate University; University of California System; University of California San Diego; University of California System; University of California San Diego; University of Oslo; Lundbeck Foundation Initiative for Integrative Psychiatric Research (iPSYCH); Lundbeckfonden
刊物名称:
ANNALS OF APPLIED STATISTICS
ISSN/ISSBN:
1932-6157
DOI:
10.1214/17-AOAS1077
发表日期:
2017
页码:
2252-2269
关键词:
schizophrenia BAYES heritability genes kegg
摘要:
While genome-wide association studies (GWAS) have discovered thousands of risk loci for heritable disorders, so far even very large meta-analyses have recovered only a fraction of the heritability of most complex traits. Recent work utilizing variance components models has demonstrated that a larger fraction of the heritability of complex phenotypes is captured by the additive effects of SNPs than is evident only in loci surpassing genome-wide significance thresholds, typically set at a Bonferroni-inspired p <= 5 x 10(-8). Procedures that control false discovery rate can be more powerful, yet these are still under-powered to detect the majority of nonnull effects from GWAS. The current work proposes a novel Bayesian semiparametric two-group mixture model and develops a Markov Chain Monte Carlo (MCMC) algorithm for a covariate-modulated local false discovery rate (cmfdr). The probability of being nonnull depends on a set of covariates via a logistic function, and the nonnull distribution is approximated as a linear combination of B-spline densities, where the weight of each B-spline density depends on a multinomial function of the covariates. The proposed methods were motivated by work on a large meta-analysis of schizophrenia GWAS performed by the Psychiatric Genetics Consortium (PGC). We show that the new cmfdr model fits the PGC schizophrenia GWAS test statistics well, performing better than our previously proposed parametric gamma model for estimating the nonnull density and substantially improving power over usual fdr. Using loci declared significant at cmfdr <= 0.20, we perform follow-up pathway analyses using the Kyoto Encyclopedia of Genes and Genomes (KEGG) Homo sapiens pathways database. We demonstrate that the increased yield from the cmfdr model results in an improved ability to test for pathways associated with schizophrenia compared to using those SNPs selected according to usual fdr.
来源URL: