-
Authors: Wei, Peng; Pan, Wei
Affiliations: University of Texas System; University of Texas Health Science Center Houston; University of Texas School Public Health; University of Texas System; University of Texas Health Science Center Houston; University of Texas School Public Health; University of Minnesota System; University of Minnesota Twin Cities
Abstract: We consider integrative modeling of multiple gene networks and diverse genomic data, including protein-DNA binding, gene expression and DNA sequence data, to accurately identify the regulatory target genes of a transcription factor (TF). Rather than treating all the genes equally and independently a priori, as in existing joint modeling approaches, we incorporate the biological prior knowledge that neighboring genes on a gene network tend to be (or not to be) regulated together by a TF. A key cont...
-
Authors: Kim, Seyoung; Xing, Eric P.
Affiliations: Carnegie Mellon University
Abstract: We consider the problem of estimating a sparse multi-response regression function, with an application to expression quantitative trait locus (eQTL) mapping, where the goal is to discover genetic variations that influence gene-expression levels. In particular, we investigate a shrinkage technique capable of capturing a given hierarchical structure over the responses, such as a hierarchical clustering tree with leaf nodes for responses and internal nodes for clusters of related responses at mul...
-
Authors: Li, Shaoyu; Cui, Yuehua
Affiliations: Michigan State University; St Jude Children's Research Hospital
Abstract: Much of the natural variation for a complex trait can be explained by variation in DNA sequence levels. As part of sequence variation, gene-gene interaction has been ubiquitously observed in nature, where its role in shaping the development of an organism has been broadly recognized. The identification of interactions between genetic factors has been progressively pursued via statistical or machine learning approaches. A large body of currently adopted methods, either parametrically or nonpara...
-
Authors: Baggaley, Andrew W.; Boys, Richard J.; Golightly, Andrew; Sarson, Graeme R.; Shukurov, Anvar
Affiliations: Newcastle University - UK
Abstract: We consider parameter estimation for the spread of the Neolithic incipient farming across Europe using radiocarbon dates. We model the arrival time of farming at radiocarbon-dated, early Neolithic sites by a numerical solution to an advancing wavefront. We allow for (technical) uncertainty in the radiocarbon data, lack-of-fit of the deterministic model and use a Gaussian process to smooth spatial deviations from the model. Inference for the parameters in the wavefront model is complicated by t...
-
Authors: Liu, Chong; Ray, Surajit; Hooker, Giles; Friedl, Mark
Affiliations: Boston University; Cornell University; Cornell University; Boston University
Abstract: We present a new approach to factor rotation for functional data. This is achieved by rotating the functional principal components toward a predefined space of periodic functions designed to decompose the total variation into components that are nearly-periodic and nearly-aperiodic with a predefined period. We show that the factor rotation can be obtained by calculation of canonical correlations between appropriate spaces, which makes the methodology computationally efficient. Moreover, we demon...
-
Authors: Liu, Hai; Tu, Wanzhu
Affiliations: Indiana University System; Indiana University Bloomington
Abstract: This research examines the simultaneous influences of height and weight on longitudinally measured systolic and diastolic blood pressure in children. Previous studies have shown that both height and weight are positively associated with blood pressure. In children, however, the concurrent increases of height and weight have made it all but impossible to discern the effect of height from that of weight. To better understand these influences, we propose to examine the joint effect of height and ...
-
Authors: Wang, Yong; Ziedins, Ilze; Holmes, Mark; Challands, Neil
Affiliations: University of Auckland
Abstract: A new family of tree models is proposed, which we call differential trees. A differential tree model is constructed from multiple data sets and aims to detect distributional differences between them. The new methodology differs from the existing difference and change detection techniques in its nonparametric nature, model construction from multiple data sets, and applicability to high-dimensional data. Through a detailed study of an arson case in New Zealand, where an individual is known to ha...
-
Authors: Cooley, Daniel; Davis, Richard A.; Naveau, Philippe
Affiliations: Colorado State University System; Colorado State University Fort Collins; Columbia University; Centre National de la Recherche Scientifique (CNRS); Universite Paris Saclay
Abstract: Phenomena such as air pollution levels are of greatest interest when observations are large, but standard prediction methods are not specifically designed for large observations. We propose a method, rooted in extreme value theory, which approximates the conditional distribution of an unobserved component of a random vector given large observed values. Specifically, for Z = (Z_1, ..., Z_d)^T and Z_{-d} = (Z_1, ..., Z_{d-1})^T, the method approximates the conditional distribution of [Z_d ...
-
Authors: Aston, John A. D.; Kirch, Claudia
Affiliations: University of Warwick; Helmholtz Association; Karlsruhe Institute of Technology
Abstract: Functional magnetic resonance imaging (fMRI) is now a well-established technique for studying the brain. However, in many situations, such as when data are acquired in a resting state, it is difficult to know whether the data are truly stationary or if level shifts have occurred. To this end, change-point detection in sequences of functional data is examined where the functional observations are dependent and where the distributions of change-points from multiple subjects are required. Of parti...
-
Authors: Aksakalli, Vural; Ceyhan, Elvan
Affiliations: Istanbul Sehir University; Koc University
Abstract: We introduce the optimal obstacle placement with disambiguations problem wherein the goal is to place true obstacles in an environment cluttered with false obstacles so as to maximize the total traversal length of a navigating agent (NAVA). Prior to the traversal, the NAVA is given location information and probabilistic estimates of each disk-shaped hindrance (hereinafter referred to as disk) being a true obstacle. The NAVA can disambiguate a disk's status only when situated on its boundary. T...