-
作者:Dubey, Paromita; Mueller, Hans-Georg
作者单位:University of California System; University of California Davis
摘要:We propose a method to infer the presence and location of change-points in the distribution of a sequence of independent data taking values in a general metric space, where change-points are viewed as locations at which the distribution of the data sequence changes abruptly in terms of either its Frechet mean, Frechet variance or both. The proposed method is based on comparisons of Frechet variances before and after putative change-point locations and does not require a tuning parameter, excep...
-
作者:Kohler, Michael; Langer, Sophie
作者单位:Technical University of Darmstadt
-
作者:Abbe, Emmanuel; Fan, Jianqing; Wang, Kaizheng; Zhong, Yiqiao
作者单位:Princeton University; Princeton University; Princeton University
摘要:Recovering low-rank structures via eigenvector perturbation analysis is a common problem in statistical machine learning, such as in factor analysis, community detection, ranking, matrix completion, among others. While a large variety of bounds are available for average errors between empirical and population statistics of eigenvectors, few results are tight for entrywise analyses, which are critical for a number of problems such as community detection. This paper investigates entrywise behavi...
-
作者:Paindaveine, Davy; Verdebout, Thomas
作者单位:Universite Libre de Bruxelles; Universite de Toulouse; Universite Toulouse 1 Capitole
摘要:Motivated by the fact that circular or spherical data are often much concentrated around a location theta, we consider inference about theta under high concentration asymptotic scenarios for which the probability of any fixed spherical cap centered at theta converges to one as the sample size n diverges to infinity. Rather than restricting to Fisher-von Mises-Langevin distributions, we consider a much broader, semiparametric, class of rotationally symmetric distributions indexed by the locatio...
-
作者:Cannings, Timothy I.; Berrett, Thomas B.; Samworth, Richard J.
作者单位:University of Edinburgh; University of Cambridge
摘要:We derive a new asymptotic expansion for the global excess risk of a local-k-nearest neighbour classifier, where the choice of k may depend upon the test point. This expansion elucidates conditions under which the dominant contribution to the excess risk comes from the decision boundary of the optimal Bayes classifier, but we also show that if these conditions are not satisfied, then the dominant contribution may arise from the tails of the marginal distribution of the features. Moreover, we p...
-
作者:Xue, Kaijie; Yao, Fang
作者单位:Nankai University; Peking University
摘要:We propose a two-sample test for high-dimensional means that requires neither distributional nor correlational assumptions, besides some weak conditions on the moments and tail properties of the elements in the random vectors. This two-sample test based on a nontrivial extension of the one-sample central limit theorem (Ann. Probab. 45 (2017) 2309-2352) provides a practically useful procedure with rigorous theoretical guarantees on its size and power assessment. In particular, the proposed test...
-
作者:Schmidt-Hieber, Johannes
作者单位:University of Twente
摘要:Consider the multivariate nonparametric regression model. It is shown that estimators based on sparsely connected deep neural networks with ReLU activation function and properly chosen network architecture achieve the minimax rates of convergence (up to log n-factors) under a general composition assumption on the regression function. The framework includes many well-studied structural constraints such as (generalized) additive models. While there is a lot of flexibility in the network architec...
-
作者:Cai, T. Tony; Wu, Yihong
作者单位:University of Pennsylvania; Yale University
摘要:This paper investigates the fundamental limits for detecting a high-dimensional sparse matrix contaminated by white Gaussian noise from both the statistical and computational perspectives. We consider p x p matrices whose rows and columns are individually k-sparse. We provide a tight characterization of the statistical and computational limits for sparse matrix detection, which precisely describe when achieving optimal detection is easy, hard or impossible, respectively. Although the sparse ma...
-
作者:Chatelain, Simon; Fougeres, Anne-Laure; Neslehova, Johanna G.
作者单位:Centre National de la Recherche Scientifique (CNRS); Ecole Centrale de Lyon; Institut National des Sciences Appliquees de Lyon - INSA Lyon; Universite Claude Bernard Lyon 1; Universite Jean Monnet; CNRS - National Institute for Mathematical Sciences (INSMI); McGill University
摘要:Archimax copula models can account for any type of asymptotic dependence between extremes and at the same time capture joint risks at medium levels. An Archimax copula is characterized by two functional parameters: the stable tail dependence function l, and the Archimedean generator psi which distorts the extreme-value dependence structure. This article develops semiparametric inference for Archimax copulas: a nonparametric estimator of l and a moment-based estimator of psi assuming the latter...
-
作者:Gao, Chao; Han, Fang; Zhang, Cun-Hui
作者单位:University of Chicago; University of Washington; University of Washington Seattle; Rutgers University System; Rutgers University New Brunswick
摘要:Consider a sequence of real data points X-1, ..., X-n with underlying means theta(1)*, ..., theta(n)*. This paper starts from studying the setting that theta(i)* is both piecewise constant and monotone as a function of the index i. For this, we establish the exact minimax rate of estimating such monotone functions, and thus give a nontrivial answer to an open problem in the shape-constrained analysis literature. The minimax rate under the loss of the sum of squared errors involves an interesti...