-
作者:Qiu, Jiaming; Dai, Xiongtao; Zhu, Zhengyuan
作者单位:Iowa State University; University of California System; University of California Berkeley
摘要:We consider the estimation of densities in multiple subpopulations, where the available sample size in each subpopulation greatly varies. This problem occurs in epidemiology, for example, where different diseases may share similar pathogenic mechanism but differ in their prevalence. Without specifying a parametric form, our proposed method pools information from the population and estimate the density in each subpopulation in a data-driven fashion. Drawing from functional data analysis, low-di...
-
作者:Ye, Ting; Brumback, Babette A.
作者单位:University of Washington; University of Washington Seattle
-
作者:Wong, Raymond K. W.; Zubizarreta, Jose R.; Stuart, Elizabeth A.; Small, Dylan S.; Rosenbaum, Paul R.
作者单位:Texas A&M University System; Texas A&M University College Station
-
作者:Seo, Insuk
作者单位:Seoul National University (SNU)
-
作者:Liu, Bingyuan; Zhang, Qi; Xue, Lingzhou; Song, Peter X. -K.; Kang, Jian
作者单位:Pennsylvania Commonwealth System of Higher Education (PCSHE); Pennsylvania State University; Pennsylvania State University - University Park; University of Michigan System; University of Michigan
摘要:It is important to develop statistical techniques to analyze high-dimensional data in the presence of both complex dependence and possible heavy tails and outliers in real-world applications such as imaging data analyses. We propose a new robust high-dimensional regression with coefficient thresholding, in which an efficient nonconvex estimation procedure is proposed through a thresholding function and the robust Huber loss. The proposed regularization method accounts for complex dependence st...
-
作者:Demirkaya, Emre; Fan, Yingying; Gao, Lan; Lv, Jinchi; Vossler, Patrick; Wang, Jingbo
作者单位:University of Tennessee System; University of Tennessee Knoxville; University of Southern California; Chinese University of Hong Kong
摘要:The weighted nearest neighbors (WNN) estimator has been popularly used as a flexible and easy-to-implement nonparametric tool for mean regression estimation. The bagging technique is an elegant way to form WNN estimators with weights automatically generated to the nearest neighbors; we name the resulting estimator as the distributional nearest neighbors (DNN) for easy reference. Yet, there is a lack of distributional results for such estimator, limiting its application to statistical inference...
-
作者:Klusowski, Jason M.; Tian, Peter M.
作者单位:Princeton University
摘要:This article shows that decision trees constructed with Classification and Regression Trees (CART) and C4.5 methodology are consistent for regression and classification tasks, even when the number of predictor variables grows sub-exponentially with the sample size, under natural 0-norm and 1-norm sparsity constraints. The theory applies to a wide range of models, including (ordinary or logistic) additive regression models with component functions that are continuous, of bounded variation, or, ...
-
作者:Mork, Daniel; Kioumourtzoglou, Marianthi-Anna; Weisskopf, Marc; Coull, Brent A.; Wilson, Ander
作者单位:Harvard University; Harvard T.H. Chan School of Public Health; Columbia University; Harvard University; Harvard T.H. Chan School of Public Health; Colorado State University System; Colorado State University Fort Collins
摘要:Children's health studies support an association between maternal environmental exposures and children's birth outcomes. A common goal is to identify critical windows of susceptibility-periods during gestation with increased association between maternal exposures and a future outcome. The timing of the critical windows and magnitude of the associations are likely heterogeneous across different levels of individual, family, and neighborhood characteristics. Using an administrative Colorado birt...
-
作者:Bormetti, Giacomo
作者单位:University of Bologna
-
作者:Wu, Xiao; Mealli, Fabrizia; Kioumourtzoglou, Marianthi-Anna; Dominici, Francesca; Braun, Danielle
作者单位:Columbia University; University of Florence; University of Florence; European University Institute; Columbia University; Harvard University; Harvard T.H. Chan School of Public Health; Harvard University; Harvard University Medical Affiliates; Dana-Farber Cancer Institute
摘要:In the context of a binary treatment, matching is a well-established approach in causal inference. However, in the context of a continuous treatment or exposure, matching is still underdeveloped. We propose an innovative matching approach to estimate an average causal exposure-response function under the setting of continuous exposures that relies on the generalized propensity score (GPS). Our approach maintains the following attractive features of matching: a) clear separation between the des...