-
作者:DiCiccio, Cyrus J.; Romano, Joseph P.
作者单位:Stanford University; Stanford University
摘要:Given a sample from a bivariate distribution, consider the problem of testing independence. A permutation test based on the sample correlation is known to be an exact level a test. However, when used to test the null hypothesis that the samples are uncorrelated, the permutation test can have rejection probability that is far from the nominal level. Further, the permutation test can have a large Type 3 (directional) error rate, whereby there can be a large probability that the permutation test ...
-
作者:Fang, Ethan X.; Li, Min-Dian; Jordan, Michael I.; Liu, Han
作者单位:Pennsylvania Commonwealth System of Higher Education (PCSHE); Pennsylvania State University; Pennsylvania State University - University Park; Harvard University; Harvard T.H. Chan School of Public Health; Harvard University; Harvard T.H. Chan School of Public Health; University of California System; University of California Berkeley; Princeton University
摘要:Characterizing the functional relevance of transcription factors (TFs) in different biological contexts is pivotal in systems biology. Given the massive amount of genornic data, computational identification of TFs is emerging as a useful approach to bridge functional genorriics with disease risk loch In this article, we use large-scale gene expression and chromatin immunoprecipitation (ChIP) data corpuses to conduct high-throughput TF-biological context association analysis. This work makes tw...
-
作者:Laber, Eric B.; Shedden, Kerby
作者单位:North Carolina State University; University of Michigan System; University of Michigan
-
作者:Li, Jialiang; Huang, Chao; Zhu, Hongtu
作者单位:National University of Singapore; University of North Carolina; University of North Carolina Chapel Hill; University of Texas System; UTMD Anderson Cancer Center
摘要:Motivated by the analysis of imaging data, we propose a novel functional varying-coefficient single-index model (FVCSIM) to carry out the regression analysis of functional response data on a set of covariates of interest. FVCSIM represents a new extension of varying-coefficient single-index models for scalar responses collected from cross-sectional and longitudinal studies. An efficient estimation procedure is developed to iteratively estimate varying coefficient functions, link functions, ind...
-
作者:Linderman, Scott W.; Blei, David M.
作者单位:Columbia University; Columbia University
-
作者:Tsionas, Mike G.
作者单位:Lancaster University; Athens University of Economics & Business
摘要:The issues of functional form, distributions of the error components, and endogeneity are for the most part still open in stochastic frontier models. The same is true when it comes to imposition of restrictions of mono tonicity and curvature, making efficiency estimation an elusive goal. In this article, we attempt to consider these problems simultaneously and offer practical solutions to the problems raised by Stone and addressed by Badunenko, Henderson and Kumbhakar. We provide major extensi...
-
作者:Wang, Tao; Zhao, Hongyu
作者单位:Shanghai Jiao Tong University; Yale University
摘要:Recent advances in DNA sequencing technology have enabled rapid advances in our understanding of the contribution of the human microbiome to many aspects of normal human physiology and disease. A major goal of human microbiome studies is the identification of important groups of microbes that are predictive of host phenotypes. However, the large number of bacterial taxa and the compositional nature of the data make this goal difficult to achieve using traditional approaches. Furthermore, the m...
-
作者:Zubizarreta, Jose R.; Keele, Luke
作者单位:Columbia University; Columbia University; Georgetown University; Georgetown University
摘要:A distinctive feature of a clustered observational study is its multilevel or nested data structure arising from the assignment of treatment, in a nonrandom manner, to groups or clusters of units or individuals. Examples are ubiquitous in the health,and social sciences including patients in hospitals, employees in firms, and students in schools. What is the optimal matching strategy in a clustered observational study? At first thought, one might start by matching clusters of individuals and th...
-
作者:Bhat, K. Sham; Mebane, David S.; Mahapatra, Priyadarshi; Storlie, Curtis B.
作者单位:United States Department of Energy (DOE); Los Alamos National Laboratory; West Virginia University; United States Department of Energy (DOE); National Energy Technology Laboratory - USA; Mayo Clinic
摘要:Uncertainties from model parameters and model discrepancy from small-scale models impact the accuracy and reliability of predictions of large-scale systems. Inadequate representation of these uncertainties may result in inaccurate and overconfident predictions during scale-up to larger systems. Hence, multiscale modeling efforts must accurately quantify the effect of the propagation of uncertainties during upscaling. Using a Bayesian approach, we calibrate a small-scale solid sorbent model to ...
-
作者:Chen, Hao; Friedman, Jerome H.
作者单位:University of California System; University of California Davis; Stanford University
摘要:Two-sample tests for multivariate data and especially for non-Euclidean data are not well explored. This article presents a novel test statistic based on a similarity graph constructed on the pooled observations from the two samples. It can be applied to multivariate data and non-Euclidean data as long as a dissimilarity measure on the sample space can be defined, which can usually be provided by domain experts. Existing tests based on a similarity graph lack power either for location or for s...