-
作者:Hui, Francis K. C.; Muller, Samuel; Welsh, A. H.
作者单位:Australian National University; University of Sydney
摘要:Multivariate data are commonly analyzed using one of two approaches: a conditional approach based on generalized linear latent variable models (GLLVMs) or some variation thereof, and a marginal approach based on generalized estimating equations (GEEs). With research on mixed models and GEEs having gone down separate paths, there is a common mindset to treat the two approaches as mutually exclusive, with which to use driven by the question of interest. In this article, focusing on multivariate ...
-
作者:Li, Lingzhu; Zhu, Xuehu; Zhu, Lixing
作者单位:Hong Kong Baptist University; University of Alberta; Beijing Normal University; Xi'an Jiaotong University
摘要:In model checking for regressions, nonparametric estimation-based tests usually have tractable limiting null distributions and are sensitive to oscillating alternative models, but suffer from the curse of dimensionality. In contrast, empirical process-based tests can, at the fastest possible rate, detect local alternatives distinct from the null model, yet are less sensitive to oscillating alternatives and rely on Monte Carlo approximation for critical value determination, which is costly in c...
-
作者:Chen, Jie; Stein, Michael L.
作者单位:International Business Machines (IBM); IBM USA; Rutgers University System; Rutgers University New Brunswick
摘要:Gaussian random fields (GRF) are a fundamental stochastic model for spatiotemporal data analysis. An essential ingredient of GRF is the covariance function that characterizes the joint Gaussian distribution of the field. Commonly used covariance functions give rise to fully dense and unstructured covariance matrices, for which required calculations are notoriously expensive to carry out for large data. In this work, we propose a construction of covariance functions that result in matrices with...
-
作者:Giordano, Sabrina
作者单位:University of Calabria
-
作者:Lei, Jing; Lin, Kevin Z.
作者单位:Carnegie Mellon University; University of Pennsylvania
摘要:We consider the problem of estimating common community structures in multi-layer stochastic block models, where each single layer may not have sufficient signal strength to recover the full community structure. In order to efficiently aggregate signal across different layers, we argue that the sum-of-squared adjacency matrices contain sufficient signal even when individual layers are very sparse. Our method uses a bias-removal step that is necessary when the squared noise matrices may overwhel...
-
作者:Dudek, Anna E.; Lenart, Lukasz
作者单位:AGH University of Krakow; Cracow University of Economics
摘要:We introduce a new approach for nonparametric spectral density estimation based on the subsampling technique, which we apply to the important class of nonstationary time series. These are almost periodically correlated sequences. In contrary to existing methods, our technique does not require demeaning of the data. On the simulated data examples, we compare our estimator of spectral density function with the classical one. Additionally, we propose a modified estimator, which allows to reduce t...
-
作者:Painsky, Amichai
作者单位:Tel Aviv University
摘要:Consider a finite sample from an unknown distribution over a countable alphabet. The missing mass refers to the probability of symbols that do not appear in the sample. Estimating the missing mass is a basic problem in statistics and related fields, which dates back to the early work of Laplace, and the more recent seminal contribution of Good and Turing. In this article, we introduce a generalized Good-Turing (GT) framework for missing mass estimation. We derive an upper-bound for the risk (i...
-
作者:Lee, Kuang-Yao; Li, Lexin; Li, Bing; Zhao, Hongyu
作者单位:Pennsylvania Commonwealth System of Higher Education (PCSHE); Temple University; University of California System; University of California Berkeley; Pennsylvania Commonwealth System of Higher Education (PCSHE); Pennsylvania State University; Pennsylvania State University - University Park; Yale University
摘要:In this article, we develop a nonparametric graphical model for multivariate random functions. Most existing graphical models are restricted by the assumptions of multivariate Gaussian or copula Gaussian distributions, which also imply linear relations among the random variables or functions on different nodes. We relax those assumptions by building our graphical model based on a new statistical object-the functional additive regression operator. By carrying out regression and neighborhood sel...
-
作者:Saha, Arkajyoti; Basu, Sumanta; Datta, Abhirup
作者单位:Johns Hopkins University; Cornell University
摘要:Spatial linear mixed-models, consisting of a linear covariate effect and a Gaussian process (GP) distributed spatial random effect, are widely used for analyses of geospatial data. We consider the setting where the covariate effect is nonlinear. Random forests (RF) are popular for estimating nonlinear functions but applications of RF for spatial data have often ignored the spatial correlation. We show that this impacts the performance of RF adversely. We propose RF-GLS, a novel and well-princi...
-
作者:Xie, Fangzheng; Xu, Yanxun
作者单位:Indiana University System; Indiana University Bloomington; Johns Hopkins University
摘要:We propose a one-step procedure to estimate the latent positions in random dot product graphs efficiently. Unlike the classical spectral-based methods, the proposed one-step procedure takes advantage of both the low-rank structure of the expected adjacency matrix and the Bernoulli likelihood information of the sampling model simultaneously. We show that for each vertex, the corresponding row of the one-step estimator (OSE) converges to a multivariate normal distribution after proper scaling an...