-
作者:Ma, Rong; Tony Cai, T.; Li, Hongzhe
作者单位:University of Pennsylvania; University of Pennsylvania
摘要:Motivated by recent research on quantifying bacterial growth dynamics based on genome assemblies, we consider a permuted monotone matrix model , where the rows represent different samples, the columns represent contigs in genome assemblies and the elements represent log-read counts after preprocessing steps and Guanine-Cytosine (GC) adjustment. In this model, Theta is an unknown mean matrix with monotone entries for each row, pi is a permutation matrix that permutes the columns of Theta, and Z...
-
作者:Laga, Ian; Bao, Le; Niu, Xiaoyue
作者单位:Pennsylvania Commonwealth System of Higher Education (PCSHE); Pennsylvania State University; Pennsylvania State University - University Park
摘要:Estimating the size of hard-to-reach populations is an important problem for many fields. The network scale-up method (NSUM) is a relatively new approach to estimate the size of these hard-to-reach populations by asking respondents the question, How many X's do you know, where X is the population of interest (e.g., How many female sex workers do you know?). The answers to these questions form aggregated relational data (ARD). The NSUM has been used to estimate the size of a variety of subpopul...
-
作者:Chu, Chi Wing; Sit, Tony; Xu, Gongjun
作者单位:Columbia University; Chinese University of Hong Kong; University of Michigan System; University of Michigan
摘要:We propose a class of power-transformed linear quantile regression models for time-to-event observations subject to censoring. By introducing a process of power transformation with different transformation parameters at individual quantile levels, our framework relaxes the assumption of logarithmic transformation on survival times and provides dynamic estimation of various quantile levels. With such formulation, our proposal no longer requires the potentially restrictive global linearity assum...
-
作者:Tang, Francesca; Feng, Yang; Chiheb, Hamza; Fan, Jianqing
作者单位:Princeton University; New York University
摘要:With the severity of the COVID-19 outbreak, we characterize the nature of the growth trajectories of counties in the United States using a novel combination of spectral clustering and the correlation matrix. As the United States and the rest of the world are still suffering from the effects of the virus, the importance of assigning growth membership to counties and understanding the determinants of the growth is increasingly evident. For the two communities (faster versus slower growth traject...
-
作者:Zhang, Bo; Pu, Hongming
作者单位:University of Pennsylvania
-
作者:Chen, Haoyu; Lu, Wenbin; Song, Rui
作者单位:North Carolina State University
摘要:Online decision making aims to learn the optimal decision rule by making personalized decisions and updating the decision rule recursively. It has become easier than before with the help of big data, but new challenges also come along. Since the decision rule should be updated once per step, an offline update which uses all the historical data is inefficient in computation and storage. To this end, we propose a completely online algorithm that can make decisions and update the decision rule on...
-
作者:Gorfine, Malka; Keret, Nir; Ben Arie, Asaf; Zucker, David; Hsu, Li
作者单位:Tel Aviv University; Hebrew University of Jerusalem; Fred Hutchinson Cancer Center
摘要:The UK Biobank is a large-scale health resource comprising genetic, environmental, and medical information on approximately 500,000 volunteer participants in the United Kingdom, recruited at ages 40-69 during the years 2006-2010. The project monitors the health and well-being of its participants. This work demonstrates how these data can be used to yield the building blocks for an interpretable risk-prediction model, in a semiparametric fashion, based on known genetic and environmental risk fa...
-
作者:Heinrich, Claudio; Hellton, Kristoffer H.; Lenkoski, Alex; Thorarinsdottir, Thordis L.
摘要:Seasonal weather forecasts are crucial for long-term planning in many practical situations and skillful forecasts may have substantial economic and humanitarian implications. Current seasonal forecasting models require statistical postprocessing of the output to correct systematic biases and unrealistic uncertainty assessments. We propose a multivariate postprocessing approach using covariance tapering, combined with a dimension reduction step based on principal component analysis for efficien...
-
作者:Nattino, Giovanni; Lu, Bo; Shi, Junxin; Lemeshow, Stanley; Xiang, Henry
作者单位:University System of Ohio; Ohio State University; University System of Ohio; Ohio State University; Nationwide Childrens Hospital; Research Institute at Nationwide Children's Hospital; University System of Ohio; Ohio State University
摘要:Comparing outcomes across different levels of trauma centers is vital in evaluating regionalized trauma care. With observational data, it is critical to adjust for patient characteristics to render valid causal comparisons. Propensity score matching is a popular method to infer causal relationships in observational studies with two treatment arms. Few studies, however, have used matching designs with more than two groups, due to the complexity of matching algorithms. We fill the gap by develop...
-
作者:Yan, Bowei; Sarkar, Purnamrita
作者单位:University of Texas System; University of Texas Austin
摘要:In this article, we investigate community detection in networks in the presence of node covariates. In many instances, covariates and networks individually only give a partial view of the cluster structure. One needs to jointly infer the full cluster structure by considering both. In statistics, an emerging body of work has been focused on combining information from both the edges in the network and the node covariates to infer community memberships. However, so far the theoretical guarantees ...