-
作者:Hunt, Gregory J.; Gagnon-Bartsch, Johann A.
作者单位:University of Michigan System; University of Michigan
摘要:Complex tissues are composed of a large number of different types of cells, each involved in a multitude of biological processes. Consequently, an important component to understanding such processes is understanding the cell-type composition of the tissues. Estimating cell-type composition using high-throughput gene expression data is known as cell-type deconvolution. In this paper we first summarize the extensive deconvolution literature by identifying a common regression-like approach to dec...
-
作者:Moran, Gemma E.; Rockova, Veronika; George, Edward, I
作者单位:Columbia University; University of Chicago; University of Pennsylvania
摘要:Biclustering methods simultaneously group samples and their associated features. In this way, biclustering methods differ from traditional clustering methods, which utilize the entire set of features to distinguish groups of samples. Motivating applications for biclustering include genomics data, where the goal is to cluster patients or samples by their gene expression profiles; and recommender systems, which seek to group customers based on their product preferences. Biclusters of interest of...
-
作者:Feng, Jean; DeWitt, William S., III; McKenna, Aaron; Simon, Noah; Willis, Amy D.; Matsen, Frederick A.
作者单位:University of California System; University of California San Francisco; University of Washington; University of Washington Seattle; Dartmouth College; University of Washington; University of Washington Seattle; Fred Hutchinson Cancer Center
摘要:CRISPR technology has enabled cell lineage tracing for complex multicellular organisms through insertion-deletion mutations of synthetic genomic barcodes during organismal development. To reconstruct the cell lineage tree from the mutated barcodes, current approaches apply general-purpose computational tools that are agnostic to the mutation process and are unable to take full advantage of the data's structure. We propose a statistical model for the CRISPR mutation process and develop a proced...
-
作者:Li, Fan; Mercatanti, Andrea; Makinen, Taneli; Silvestrini, Andrea
作者单位:Duke University; European Central Bank; Bank of Italy
摘要:Regression discontinuity (RD) is a widely used quasi-experimental design for causal inference. In the standard RD the assignment to treatment is determined by a continuous pretreatment variable (i.e., running variable) falling above or below a prefixed threshold. Recent applications increasingly feature ordered categorical or ordinal running variables which pose challenges to RD estimation due to the lack of a meaningful measure of distance. This paper proposes an RD approach for ordinal runni...
-
作者:Page, Garritt L.; Quintana, Fernando A.; Rosner, Gary L.
作者单位:Brigham Young University; Pontificia Universidad Catolica de Chile; Johns Hopkins University
摘要:Combination chemotherapy treatment regimens created for patients diagnosed with childhood acute lymphoblastic leukemia have had great success in improving cure rates. Unfortunately, patients prescribed these types of treatment regimens have displayed susceptibility to the onset of osteonecrosis. Some have suggested that this is due to pharmacokinetic interaction between two agents in the treatment regimen (asparaginase and dexamethasone) and other physiological variables. Determining which phy...
-
作者:Schauer, Jacob M.; Fitzgerald, Kaitlyn G.; Peko-Spicer, Sarah; Whalen, Mena C. R.; Zejnullahi, Rrita; Hedges, Larry, V
作者单位:Northwestern University; Feinberg School of Medicine; Northwestern University
摘要:Several programs of research have sought to assess the replicability of scientific findings in different fields, including economics and psychology. These programs attempt to replicate several findings and use the results to say something about large-scale patterns of replicability in a field. However, little work has been done to understand the analytic methods used to do this, including what they are assessing and what their statistical properties are. This article examines several methods t...
-
作者:Warren, Joshua L.; Miranda, Marie Lynn; Tootoo, Joshua L.; Osgood, Claire E.; Bell, Michelle L.
作者单位:Yale University; University of Notre Dame; University of Notre Dame; Yale University
摘要:We introduce spatial (DLfuse) and spatiotemporal (DLfuseST) distributed lag data fusion methods for predicting point-level ambient air pollution concentrations, using, as input, gridded average pollution estimates from a deterministic numerical air quality model. The methods incorporate predictive information from grid cells surrounding the prediction location of interest and are shown to collapse to existing downscaling approaches when this information adds no benefit. The spatial lagged para...
-
作者:Plumlee, Matthew; Asher, Taylor G.; Chang, Won; Bilskie, Matthew, V
作者单位:Northwestern University; University of North Carolina; University of North Carolina Chapel Hill; University System of Ohio; University of Cincinnati; University System of Georgia; University of Georgia
摘要:Probabilistic hurricane storm surge forecasting using a high-fidelity model has been considered impractical due to the overwhelming computational expense to run thousands of simulations. This article demonstrates that modern statistical tools enable good forecasting performance using a small number of carefully chosen simulations. This article offers algorithms that quickly handle the massive output of a surge model while addressing the missing data at unsubmerged locations. Also included is a...
-
作者:Li, Yicheng; Raftery, Adrian E.
作者单位:University of Washington; University of Washington Seattle
摘要:Smoking is one of the main risk factors that has affected human mortality and life expectancy over the past century. Smoking accounts for a large part of the nonlinearities in the growth of life expectancy and of the geographic and gender differences in mortality. As Bongaarts (Popul. Dev. Rev. 32 (2006) 605-628) and Janssen (Genus 74 (2018) 21) suggested, accounting for smoking could improve the quality of mortality forecasts due to the predictable nature of the smoking epidemic. We propose a...
-
作者:Xie, Shanghong; Zeng, Donglin; Wang, Yuanjia
作者单位:Columbia University; University of North Carolina; University of North Carolina Chapel Hill
摘要:The biomarker networks measured by different modalities of data (e.g., structural magnetic resonance imaging (sMRI), diffusion tensor imaging (DTI)) may share the same true underlying biological model. In this work we propose a nodewise biomarker graphical model to leverage the shared mechanism between multimodality data to provide a more reliable estimation of the target modality network and account for the heterogeneity in networks due to differences between subjects and networks of external...