-
作者:Kurtek, Sebastian; Srivastava, Anuj; Klassen, Eric; Ding, Zhaohua
作者单位:State University System of Florida; Florida State University; State University System of Florida; Florida State University; Vanderbilt University
摘要:Motivated by the problems of analyzing protein backbones, diffusion tensor magnetic resonance imaging (DT-MRI) fiber tracts in the human brain, and other problems involving curves, in this study we present some statistical models of parameterized curves, in R-3, in terms of combinations of features such as shape, location, scale, and orientation. For each combination of interest, we identify a representation manifold, endow it with a Riemannian metric, and outline tools for computing sample st...
-
作者:Leng, Chenlei; Tang, Cheng Yong
作者单位:National University of Singapore; Children's Hospital Colorado; University of Colorado System; University of Colorado Anschutz Medical Campus; University of Colorado Denver
摘要:Matrix-variate observations are frequently encountered in many contemporary statistical problems due to a rising need to organize and analyze data with structured information. In this article, we propose a novel sparse matrix graphical model for these types of statistical problems. By penalizing, respectively, two precision matrices corresponding to the rows and columns, our method yields a sparse matrix graphical model that synthetically characterizes the underlying conditional independence s...
-
作者:Sussman, Daniel L.; Tang, Minh; Fishkind, Donniell E.; Priebe, Carey E.
作者单位:Johns Hopkins University
摘要:We present a method to estimate block membership of nodes in a random graph generated by a stochastic blockmodel. We use an embedding procedure motivated by the random dot product graph model, a particular example of the latent position model. The embedding associates each node with a vector; these vectors are clustered via minimization of a square error criterion. We prove that this method is consistent for assigning nodes to blocks, as only a negligible number of nodes will be misassigned. W...
-
作者:Hai Nguyen; Cressie, Noel; Braverman, Amy
作者单位:California Institute of Technology; National Aeronautics & Space Administration (NASA); NASA Jet Propulsion Laboratory (JPL); University System of Ohio; Ohio State University; University System of Ohio; Ohio State University
摘要:Aerosols are tiny solid or liquid particles suspended in the atmosphere; examples of aerosols include windblown dust, sea salts, volcanic ash, smoke from wildfires, and pollution from factories. The global distribution of aerosols is a topic of great interest in climate studies since aerosols can either cool or warm the atmosphere depending on their location, type, and interaction with clouds. Aerosol concentrations are important input components of global climate models, and it is crucial to ...
-
作者:Picciotto, Sally; Hernan, Miguel A.; Page, John H.; Young, Jessica G.; Robins, James M.
作者单位:Harvard University; Harvard T.H. Chan School of Public Health; Harvard University; Harvard T.H. Chan School of Public Health; Harvard University; Harvard T.H. Chan School of Public Health
摘要:In the presence of time-varying confounders affected by prior treatment, standard statistical methods for failure time analysis may be biased. Methods that correctly adjust for this type of covariate include the parametric g-formula, inverse probability weighted estimation of marginal structural Cox proportional hazards models, and g-estimation of structural nested accelerated failure time models. In this article, we propose a novel method to estimate the causal effect of a time-dependent trea...
-
作者:Carroll, Raymond; Delaigle, Aurore; Hall, Peter
作者单位:Texas A&M University System; Texas A&M University College Station; University of Melbourne
摘要:In the present study, we consider the problem of classifying spatial data distorted by a linear transformation or convolution and contaminated by additive random noise. In this setting, we show that classifier performance can be improved if we carefully invert the data before the classifier is applied. However, the inverse transformation is not constructed so as to recover the original signal, and in fact, we show that taking the latter approach is generally inadvisable. We introduce a fully d...
-
作者:Danilov, Mike; Yohai, Victor J.; Zamar, Ruben H.
作者单位:Alphabet Inc.; Google Incorporated; University of Buenos Aires; University of British Columbia
摘要:Two main issues regarding data quality are data contamination (outliers) and data completion (missing data). These two problems have attracted much attention and research but surprisingly, they are seldom considered together. Popular robust methods such as S-estimators of multivariate location and scatter offer protection against outliers but cannot deal with missing data, except for the obviously inefficient approach of deleting all incomplete cases. We generalize the definition of S-estimato...
-
作者:Jin, Jiashun
作者单位:Carnegie Mellon University
-
作者:Huang, Chiung-Yu; Qin, Jing
作者单位:National Institutes of Health (NIH) - USA; NIH National Institute of Allergy & Infectious Diseases (NIAID)
摘要:The Canadian Study of Health and Aging (CSHA) employed a prevalent cohort design to study survival after onset of dementia, where patients with dementia were sampled and the onset time of dementia was determined retrospectively. The prevalent cohort sampling scheme favors individuals who survive longer. Thus, the observed survival times are subject to length bias. In recent years, there has been a rising interest in developing estimation procedures for prevalent cohort survival data that not o...
-
作者:Li, Runze; Zhong, Wei; Zhu, Liping
作者单位:Xiamen University; Pennsylvania Commonwealth System of Higher Education (PCSHE); Pennsylvania State University; Pennsylvania State University - University Park; Pennsylvania Commonwealth System of Higher Education (PCSHE); Pennsylvania State University; Pennsylvania State University - University Park; Shanghai University of Finance & Economics
摘要:This article is concerned with screening features in ultrahigh-dimensional data analysis, which has become increasingly important in diverse scientific fields. We develop a sure independence screening procedure based on the distance correlation (DC-SIS). The DC-SIS can be implemented as easily as the sure independence screening (SIS) procedure based on the Pearson correlation proposed by Fan and Lv. However, the DC-SIS can significantly improve the SIS. Fan and Lv established the sure screenin...