-
作者:Ray, Kolyan; Szabo, Botond
作者单位:Imperial College London; Vrije Universiteit Amsterdam
摘要:We study a mean-field spike and slab variational Bayes (VB) approximation to Bayesian model selection priors in sparse high-dimensional linear regression. Under compatibility conditions on the design matrix, oracle inequalities are derived for the mean-field VB approximation, implying that it converges to the sparse truth at the optimal rate and gives optimal prediction of the response vector. The empirical performance of our algorithm is studied, showing that it works comparably well as other...
-
作者:Liu, Yaowu; Li, Zilin; Lin, Xihong
作者单位:Southwestern University of Finance & Economics - China; Harvard University; Harvard T.H. Chan School of Public Health; Harvard University
摘要:Testing a global hypothesis for a set of variables is a fundamental problem in statistics with a wide range of applications. A few well-known classical tests include the Hotelling's T-2 test, the F-test, and the empirical Bayes based score test. These classical tests, however, are not robust to the signal strength and could have a substantial loss of power when signals are weak or moderate, a situation we commonly encounter in contemporary applications. In this article, we propose a minimax op...
-
作者:Fan, Yiwei; Lu, Xiaoling; Liu, Yufeng; Zhao, Junlong
作者单位:Renmin University of China; University of North Carolina; University of North Carolina Chapel Hill; Beijing Normal University
摘要:Hierarchical classification problems are commonly seen in practice. However, most existing methods do not fully use the hierarchical information among class labels. In this article, a novel label embedding approach is proposed, which keeps the hierarchy of labels exactly, and reduces the complexity of the hypothesis space significantly. Based on the newly proposed label embedding approach, a new angle-based classifier is developed for hierarchical classification. Moreover, to handle massive da...
-
作者:Yang, Jun; Zhou, Zhou
作者单位:University of Toronto
摘要:We develop a unified theory and methodology for the inference of evolutionary Fourier power spectra for a general class of locally stationary and possibly nonlinear processes. In particular, simultaneous confidence regions (SCR) with asymptotically correct coverage rates are constructed for the evolutionary spectral densities on a nearly optimally dense grid of the joint time-frequency domain. A simulation based bootstrap method is proposed to implement the SCR. The SCR enables researchers and...
-
作者:Banerjee, Sudipto
作者单位:University of California System; University of California Los Angeles
-
作者:Fiksel, Jacob; Datta, Abhirup; Amouzou, Agbessi; Zeger, Scott
作者单位:Johns Hopkins University; Johns Hopkins University
摘要:Quantification learning is the task of prevalence estimation for a test population using predictions from a classifier trained on a different population. Quantification methods assume that the sensitivities and specificities of the classifier are either perfect or transportable from the training to the test population. These assumptions are inappropriate in the presence of dataset shift, when the misclassification rates in the training population are not representative of those for the test po...
-
作者:Mai, Qing; Zhang, Xin; Pan, Yuqing; Deng, Kai
作者单位:State University System of Florida; Florida State University
摘要:Modern scientific studies often collect datasets in the form of tensors. These datasets call for innovative statistical analysis methods. In particular, there is a pressing need for tensor clustering methods to understand the heterogeneity in the data. We propose a tensor normal mixture model approach to enable probabilistic interpretation and computational tractability. Our statistical model leverages the tensor covariance structure to reduce the number of parameters for parsimonious modeling...
-
作者:Zhang, Likun; Shaby, Benjamin A.; Wadsworth, Jennifer L.
作者单位:Pennsylvania Commonwealth System of Higher Education (PCSHE); Pennsylvania State University; Pennsylvania State University - University Park; Colorado State University System; Colorado State University Fort Collins; Lancaster University
摘要:Flexible spatial models that allow transitions between tail dependence classes have recently appeared in the literature. However, inference for these models is computationally prohibitive, even in moderate dimensions, due to the necessity of repeatedly evaluating the multivariate Gaussian distribution function. In this work, we attempt to achieve truly high-dimensional inference for extremes of spatial processes, while retaining the desirable flexibility in the tail dependence structure, by mo...
-
作者:Linton, Oliver B.; Tang, Haihan
作者单位:University of Cambridge; Fudan University
-
作者:Frigau, Luca; Wu, Qiuyi; Banks, David
作者单位:University of Cagliari; University of Rochester; Duke University
摘要:Sometimes the Joint Statistical Meetings (JSM) is frustrating to attend, because multiple sessions on the same topic are scheduled at the same time. This article uses seeded latent Dirichlet allocation and a scheduling optimization algorithm to very significantly reduce overlapping content in the original schedule for the 2020 JSM program. Specifically, a measure based on total variation distance that ranges from 0 (random scheduling) to 1 (no overlapping content) finds that the original sched...