-
Authors: Shah, Rajen D.; Samworth, Richard J.
Affiliations: University of Cambridge
-
Authors: Kong, Xiangrong; Wang, Mei-Cheng; Gray, Ronald
Affiliations: Johns Hopkins University; Johns Hopkins Bloomberg School of Public Health
Abstract: We consider a specific situation of correlated data where multiple outcomes are repeatedly measured on each member of a couple. Such multivariate longitudinal data from couples may exhibit multi-faceted correlations that can be further complicated if there are polygamous partnerships. An example is data from cohort studies on human papillomavirus (HPV) transmission dynamics in heterosexual couples. HPV is a common sexually transmitted disease with 14 known oncogenic types causing anogenital ca...
-
Authors: Jiang, Bo; Liu, Jun S.
Affiliations: Harvard University
Abstract: Expression quantitative trait loci (eQTLs) are genomic locations associated with changes of expression levels of certain genes. By assaying gene expressions and genetic variations simultaneously on a genome-wide scale, scientists wish to discover genomic loci responsible for expression variations of a set of genes. The task can be viewed as a multivariate regression problem with variable selection on both responses (gene expression) and covariates (genetic variations), including also multi-way ...
-
Authors: Hokayem, Charles; Bollinger, Christopher; Ziliak, James P.
Affiliations: University of Kentucky
Abstract: The Current Population Survey Annual Social and Economic Supplement (CPS ASEC) serves as the data source for official income, poverty, and inequality statistics in the United States. There is a concern that the rise in nonresponse to earnings questions could deteriorate data quality and distort estimates of these important metrics. We use a dataset of internal ASEC records matched to Social Security Detailed Earnings Records (DER) to study the impact of earnings nonresponse on estimates of pov...
-
Authors: Cui, Hengjian; Li, Runze; Zhong, Wei
Affiliations: Capital Normal University; Pennsylvania Commonwealth System of Higher Education (PCSHE); Pennsylvania State University; Pennsylvania State University - University Park; Xiamen University
Abstract: This work is concerned with marginal sure independence feature screening for ultrahigh dimensional discriminant analysis. The response variable is categorical in discriminant analysis. This enables us to use the conditional distribution function to construct a new index for feature screening. In this article, we propose a marginal feature screening procedure based on empirical conditional distribution function. We establish the sure screening and ranking consistency properties for the proposed...
-
Authors: Lai, Randy C. S.; Hannig, Jan; Lee, Thomas C. M.
Affiliations: University of California System; University of California Davis; University of North Carolina; University of North Carolina Chapel Hill
Abstract: In recent years, the ultrahigh-dimensional linear regression problem has attracted enormous attention from the research community. Under the sparsity assumption, most of the published work is devoted to the selection and estimation of the predictor variables with nonzero coefficients. This article studies a different but fundamentally important aspect of this problem: uncertainty quantification for parameter estimates and model choices. To be more specific, this article proposes methods for de...
-
Authors: Linero, Antonio R.; Daniels, Michael J.
Affiliations: State University System of Florida; University of Florida; University of Texas System; University of Texas Austin
Abstract: We develop a Bayesian nonparametric model for a longitudinal response in the presence of nonignorable missing data. Our general approach is to first specify a working model that flexibly models the missingness and full outcome processes jointly. We specify a Dirichlet process mixture of missing at random (MAR) models as a prior on the joint distribution of the working model. This aspect of the model governs the fit of the observed data by modeling the observed data distribution as the marginal...
-
Authors: Rosenbaum, Paul R.
Affiliations: University of Pennsylvania
Abstract: An observational study draws inferences about treatment effects when treatments are not randomly assigned, as they would be in a randomized experiment. The naive analysis of an observational study assumes that adjustments for measured covariates suffice to remove bias from nonrandom treatment assignment. A sensitivity analysis in an observational study determines the magnitude of bias from nonrandom treatment assignment that would need to be present to alter the qualitative conclusions of the ...
-
Authors: Jiang, Wenxin; Zhao, Yu
Affiliations: Shandong University; Northwestern University; Amazon.com
Abstract: A LIFT measure, such as the response rate, lift, or the percentage of captured response, is a fundamental measure of effectiveness for a scoring rule obtained from data mining, which is estimated from a set of validation data. In this article, we study how to construct confidence intervals of the LIFT measures. We point out the subtlety of this task and explain how simple binomial confidence intervals can have incorrect coverage probabilities, due to omitting variation from the sample percenti...
-
Authors: Liang, Faming; Song, Qifan; Qiu, Peihua
Affiliations: State University System of Florida; University of Florida; Purdue University System; Purdue University
Abstract: Gaussian graphical models (GGMs) are frequently used to explore networks, such as gene regulatory networks, among a set of variables. Under the classical theory of GGMs, the construction of Gaussian graphical networks amounts to finding the pairs of variables with nonzero partial correlation coefficients. However, this is infeasible for high-dimensional problems for which the number of variables is larger than the sample size. In this article, we propose a new measure of partial correlation co...