-
作者:Purdom, Elizabeth
作者单位:University of California System; University of California Berkeley
摘要:In biological experiments researchers often have information in the form of a graph that supplements observed numerical data. Incorporating the knowledge contained in these graphs into an analysis of the numerical data is an important and nontrivial task. We look at the example of metagenomic data-data from a genomic survey of the abundance of different species of bacteria in a sample. Here, the graph of interest is a phylogenetic tree depicting the interspecies relationships among the bacteri...
-
作者:Clements, Robert Alan; Schoenberg, Frederic Paik; Schorlemmer, Danijel
作者单位:University of California System; University of California Los Angeles; University of Southern California; Helmholtz Association; Helmholtz-Center Potsdam GFZ German Research Center for Geosciences
摘要:Modern, powerful techniques for the residual analysis of spatial-temporal point process models are reviewed and compared. These methods are applied to California earthquake forecast models used in the Collaboratory for the Study of Earthquake Predictability (CSEP). Assessments of these earthquake forecasting models have previously been performed using simple, low-power means such as the L-test and N-test. We instead propose residual methods based on rescaling, thinning, superposition, weighted...
-
作者:Panaretos, Victor M.; Konis, Kjell
作者单位:Swiss Federal Institutes of Technology Domain; Ecole Polytechnique Federale de Lausanne
摘要:Single-particle electron microscopy is a modern technique that biophysicists employ to learn the structure of proteins. It yields data that consist of noisy random projections of the protein structure in random directions, with the added complication that the projection angles cannot be observed. In order to reconstruct a three-dimensional model, the projection directions need to be estimated by use of an ad-hoc starting estimate of the unknown particle. In this paper we propose a methodology ...
-
作者:Kaufman, Cari G.; Bingham, Derek; Habib, Salman; Heitmann, Katrin; Frieman, Joshua A.
作者单位:University of California System; University of California Berkeley; Simon Fraser University; United States Department of Energy (DOE); Argonne National Laboratory; United States Department of Energy (DOE); Los Alamos National Laboratory; United States Department of Energy (DOE); University of Chicago; Fermi National Accelerator Laboratory
摘要:Statistical emulators of computer simulators have proven to be useful in a variety of applications. The widely adopted model for emulator building, using a Gaussian process model with strictly positive correlation function, is computationally intractable when the number of simulator evaluations is large. We propose a new model that uses a combination of low-order regression terms and compactly supported correlation functions to recreate the desired predictive behavior of the emulator at a frac...
-
作者:Czogiel, Irina; Dryden, Ian L.; Brignell, Christopher J.
作者单位:Max Planck Society; University of Nottingham; University of South Carolina System; University of South Carolina Columbia
摘要:Statistical methodology is proposed for comparing unlabeled marked point sets, with an application to aligning steroid molecules in chemoinformatics. Methods from statistical shape analysis are combined with techniques for predicting random fields in spatial statistics in order to define a suitable measure of similarity between two marked point sets. Bayesian modeling of the predicted field overlap between pairs of point sets is proposed, and posterior inference of the alignment is carried out...
-
作者:Brentnall, Adam R.; Duffy, Stephen W.; Crowder, Martin J.; Gillan, Maureen G. C.; Astley, Susan M.; Wallis, Matthew G.; James, Jonathan; Boggis, Caroline R. M.; Gilbert, Fiona J.
作者单位:University of London; Queen Mary University London; Imperial College London; University of Aberdeen; University of Manchester; Cambridge University Hospitals NHS Foundation Trust; Addenbrooke's Hospital; University of Cambridge; Nottingham University Hospital NHS Trust; Nottingham City Hospital; Wythenshawe Hospital NHS Foundation Trust; Wythenshawe Hospital
摘要:When a model may be fitted separately to each individual statistical unit, inspection of the point estimates may help the statistician to understand between-individual variability and to identify possible relationships. However, some information will be lost in such an approach because estimation uncertainty is disregarded. We present a comparative method for exploratory repeated-measures analysis to complement the point estimates that was motivated by and is demonstrated by analysis of data f...
-
作者:Witten, Daniela M.
作者单位:University of Washington; University of Washington Seattle
摘要:In recent years, advances in high throughput sequencing technology have led to a need for specialized methods for the analysis of digital gene expression data. While gene expression data measured on a microarray take on continuous values and can be modeled using the normal distribution, RNA sequencing data involve nonnegative counts and are more appropriately modeled using a discrete count distribution, such as the Poisson or the negative binomial. Consequently, analytic tools that assume a Ga...
-
作者:Thioulouse, Jean
作者单位:VetAgro Sup; Centre National de la Recherche Scientifique (CNRS); CNRS - Institute of Ecology & Environment (INEE); Universite Claude Bernard Lyon 1
摘要:A pair of ecological tables is made of one table containing environmental variables (in columns) and another table containing species data (in columns). The rows of these two tables are identical and correspond to the sites where environmental variables and species data have been measured. Such data are used to analyze the relationships between species and their environment. If sampling is repeated over time for both tables, one obtains a sequence of pairs of ecological tables. Analyzing this ...