-
Authors: Huang, Yen-Ning; Reich, Brian J.; Fuentes, Montserrat; Sankarasubramanian, A.
Affiliations: North Carolina State University; Virginia Commonwealth University; North Carolina State University
Abstract: Computer simulation models are central to environmental science. These mathematical models are used to understand complex weather and climate patterns and to predict the climate's response to different forcings. Climate models are of course not perfect reflections of reality, and so comparison with observed data is needed to quantify and to correct for biases and other deficiencies. We propose a new method to calibrate model output using observed data. Our approach not only matches the margina...
-
Authors: Huo, Zhiguang; Song, Chi; Tseng, George
Affiliations: State University System of Florida; University of Florida; University System of Ohio; Ohio State University; Pennsylvania Commonwealth System of Higher Education (PCSHE); University of Pittsburgh
Abstract: Due to the rapid development of high-throughput experimental techniques and fast-dropping prices, many transcriptomic datasets have been generated and accumulated in the public domain. Meta-analysis combining multiple transcriptomic studies can increase the statistical power to detect disease-related biomarkers. In this paper we introduce a Bayesian latent hierarchical model to perform transcriptomic meta-analysis. This method is capable of detecting genes that are differentially expressed (DE...
-
Authors: Regier, Jeffrey; Miller, Andrew C.; Schlegel, David; Adams, Ryan P.; McAuliffe, Jon D.; Prabhat
Affiliations: University of California System; University of California Berkeley; University of California System; University of California Berkeley; Columbia University; United States Department of Energy (DOE); Lawrence Berkeley National Laboratory; Princeton University
Abstract: We present a new, fully generative model for constructing astronomical catalogs from optical telescope image sets. Each pixel intensity is treated as a random variable with parameters that depend on the latent properties of stars and galaxies. These latent properties are themselves modeled as random. We compare two procedures for posterior inference. One procedure is based on Markov chain Monte Carlo (MCMC) while the other is based on variational inference (VI). The MCMC procedure excels at qu...
-
Authors: Singh, Susheela P.; Staicu, Ana-Maria; Dunn, Robert R.; Fierer, Noah; Reich, Brian J.
Affiliations: North Carolina State University
Abstract: The advent of high-throughput sequencing technologies has made data from DNA material readily available, leading to a surge of microbiome-related research establishing links between markers of microbiome health and specific outcomes. However, to harness the power of microbial communities we must understand not only how they affect us, but also how they can be influenced to improve outcomes. This area has been dominated by methods that reduce community composition to summary metrics, which can ...
-
Authors: Kim, Chanmin; Daniels, Michael J.; Hogan, Joseph W.; Choirat, Christine; Zigler, Corwin M.
Affiliations: Boston University; State University System of Florida; University of Florida; Brown University; University of Texas System; University of Texas Austin
Abstract: Emission control technologies installed on power plants are a key feature of many air pollution regulations in the US. While such regulations are predicated on the presumed relationships between emissions, ambient air pollution and human health, many of these relationships have never been empirically verified. The goal of this paper is to develop new statistical methods to quantify these relationships. We frame this problem as one of mediation analysis to evaluate the extent to which the effec...
-
Authors: Zhang, Youyi; Morris, Jeffrey S.; Aerry, Shivali Narang; Rao, Arvind U. K.; Baladandayuthapani, Veerabhadran
Affiliations: University of Texas System; UTMD Anderson Cancer Center; Johns Hopkins University; University of Michigan System; University of Michigan; University of Michigan System; University of Michigan
Abstract: Technological innovations have produced large multi-modal datasets that include imaging and multi-platform genomics data. Integrative analyses of such data have the potential to reveal important biological and clinical insights into complex diseases like cancer. In this paper, we present Bayesian approaches for integrative analysis of radiological imaging and multi-platform genomic data, wherein our goals are to simultaneously identify genomic and radiomic, that is, radiology-based imaging ma...
-
Authors: Jaeger, Byron C.; Long, D. Leann; Long, Dustin M.; Sims, Mario; Szychowski, Jeff M.; Min, Yuan-I; McClure, Leslie A.; Howard, George; Simon, Noah
Affiliations: University of Alabama System; University of Alabama Birmingham; University of Mississippi Medical Center; University of Mississippi; Drexel University; University of Washington; University of Washington Seattle
Abstract: We introduce and evaluate the oblique random survival forest (ORSF). The ORSF is an ensemble method for right-censored survival data that uses linear combinations of input variables to recursively partition a set of training data. Regularized Cox proportional hazard models are used to identify linear combinations of input variables in each recursive partitioning step. Benchmark results using simulated and real data indicate that the ORSF's predicted risk function has high prognostic value in c...
-
Authors: Liu, Lin; Qiu, Yuqi; Natarajan, Loki; Messer, Karen
Affiliations: University of California System; University of California San Diego
Abstract: It is common to encounter missing data among the potential predictor variables in the setting of model selection. For example, in a recent study we attempted to improve the US guidelines for risk stratification after screening colonoscopy (Cancer Causes Control 27 (2016) 1175-1185), with the aim to help reduce both overuse and underuse of follow-on surveillance colonoscopy. The goal was to incorporate selected additional informative variables into a neoplasia risk-prediction model, going beyon...
-
Authors: Johndrow, James E.; Lum, Kristian
Affiliations: Stanford University
Abstract: Predictive modeling is increasingly being employed to assist human decision-makers. One purported advantage of replacing or augmenting human judgment with computer models in high-stakes settings, such as sentencing, hiring, policing, college admissions, and parole decisions, is the perceived neutrality of computers. It is argued that because computer models do not hold personal prejudice, the predictions they produce will be equally free from prejudice. There is growing recognition that employin...
-
Authors: Sales, Adam C.; Pane, John F.
Affiliations: University of Texas System; University of Texas Austin; RAND Corporation
Abstract: Students in Algebra I classrooms typically learn at different rates and struggle at different points in the curriculum, a common challenge for math teachers. Cognitive Tutor Algebra I (CTA1), an educational computer program, addresses such student heterogeneity via what they term mastery learning, where students progress from one section of the curriculum to the next by demonstrating appropriate mastery at each stage. However, when students are unable to master a section's skills even after try...