-
作者:Castillo, Ismael; Nickl, Richard
作者单位:Centre National de la Recherche Scientifique (CNRS); Sorbonne Universite; Universite Paris Cite; Centre National de la Recherche Scientifique (CNRS); University of Cambridge
摘要:We continue the investigation of Bernstein- von Mises theorems for non-parametric Bayes procedures from [Ann. Statist. 41 (2013) 1999-2028]. We introduce multiscale spaces on which nonparametric priors and posteriors are naturally defined, and prove Bernstein- von Mises theorems for a variety of priors in the setting of Gaussian nonparametric regression and in the i.i.d. sampling model. From these results we deduce several applications where posterior-based inference coincides with efficient f...
-
作者:Chernozhukov, Victor; Chetverikov, Dents; Kato, Kengo
作者单位:Massachusetts Institute of Technology (MIT); Massachusetts Institute of Technology (MIT); University of California System; University of California Los Angeles; University of Tokyo
摘要:Modern construction of uniform confidence bands for nonparametric densities (and other functions) often relies on the classical Smirnov-Bickel-Rosenblatt (SBR) condition; see, for example, Gine and Nickl [Probab. Theory Related Fields 143 (2009) 569-596]. This condition requires the existence of a limit distribution of an extreme value type for the supremum of a studentized empirical process (equivalently, for the supremum of a Gaussian process with the same covariance function as that of the ...
-
作者:Chakraborty, Anirvan; Chaudhuri, Probal
作者单位:Indian Statistical Institute; Indian Statistical Institute Kolkata
摘要:The spatial distribution has been widely used to develop various non-parametric procedures for finite dimensional multivariate data. In this paper, we investigate the concept of spatial distribution for data in infinite dimensional Banach spaces. Many technical difficulties are encountered in such spaces that are primarily due to the noncompactness of the closed unit ball. In this work, we prove some Glivenko-Cantelli and Donsker-type results for the empirical spatial distribution process in i...
-
作者:Feng, Xingdong; He, Xuming
作者单位:Shanghai University of Finance & Economics; Shanghai University of Finance & Economics; University of Michigan System; University of Michigan
摘要:The singular value decomposition is widely used to approximate data matrices with lower rank matrices. Feng and He [Ann. Appl. Stat. 3 (2009) 1634-1654] developed tests on dimensionality of the mean structure of a data matrix based on the singular value decomposition. However, the first singular values and vectors can be driven by a small number of outlying measurements. In this paper, we consider a robust alternative that moderates the effect of outliers in low-rank approximations. Under the ...
-
作者:Bhaskar, Anand; Song, Yun S.
作者单位:University of California System; University of California Berkeley; University of California System; University of California Berkeley
摘要:The sample frequency spectrum (SFS) is a widely-used summary statistic of genomic variation in a sample of homologous DNA sequences. It provides a highly efficient dimensional reduction of large-scale population genomic data and its mathematical dependence on the underlying population demography is well understood, thus enabling the development of efficient inference algorithms. However, it has been recently shown that very different population demographies can actually generate the same SFS f...
-
作者:Genon-Catalot, Valentine; Laredo, Catherine
作者单位:Centre National de la Recherche Scientifique (CNRS); CNRS - National Institute for Mathematical Sciences (INSMI); Universite Paris Cite; Centre National de la Recherche Scientifique (CNRS); CNRS - National Institute for Mathematical Sciences (INSMI); Universite Paris Cite; Universite Paris Saclay; INRAE; Universite Paris Saclay; Universite Paris Cite; Centre National de la Recherche Scientifique (CNRS); CNRS - National Institute for Mathematical Sciences (INSMI); INRAE; INRAE; Universite Paris Cite; Universite Paris Saclay
摘要:We prove a global asymptotic equivalence of experiments in the sense of Le Cam's theory. The experiments are a continuously observed diffusion with nonparametric drift and its Euler scheme. We focus on diffusions with nonconstant-known diffusion coefficient. The asymptotic equivalence is proved by constructing explicit equivalence mappings based on random time changes. The equivalence of the discretized observation of the diffusion and the corresponding Euler scheme experiment is then derived....
-
作者:Du, Lilun; Zhang, Chunming
作者单位:University of Wisconsin System; University of Wisconsin Madison; Nankai University
摘要:In the context of large-scale multiple testing, hypotheses are often accompanied with certain prior information. In this paper, we present a single-index modulated (SIM) multiple testing procedure, which maintains control of the false discovery rate while incorporating prior information, by assuming the availability of a bivariate p-value, (p(1), p(2)), for each hypothesis, where pi is a preliminary p-value from prior information and p(2) is the primary p-value for the ultimate analysis. To fi...
-
作者:Pati, Debdeep; Bhattacharya, Anirban; Pillai, Natesh S.; Dunson, David
作者单位:State University System of Florida; Florida State University; Texas A&M University System; Texas A&M University College Station; Harvard University; Duke University
摘要:Sparse Bayesian factor models are routinely implemented for parsimonious dependence modeling and dimensionality reduction in high-dimensional applications. We provide theoretical understanding of such Bayesian procedures in terms of posterior convergence rates in inferring high-dimensional covariance matrices where the dimension can be larger than the sample size. Under relevant sparsity assumptions on the true covariance matrix, we show that commonly-used point mass mixture priors on the fact...
-
作者:Sun, Yan; Yan, Hongjia; Zhang, Wenyang; Lu, Zudi
作者单位:Shanghai University of Finance & Economics; University of York - UK; University of Southampton
摘要:Stimulated by the Boston house price data, in this paper, we propose a semiparametric spatial dynamic model, which extends the ordinary spatial autoregressive models to accommodate the effects of some covariates associated with the house price. A profile likelihood based estimation procedure is proposed. The asymptotic normality of the proposed estimators are derived. We also investigate how to identify the parametric/nonparametric components in the proposed semiparametric model. We show how m...
-
作者:Lecue, Guillaume; Rigollet, Philippe
作者单位:Centre National de la Recherche Scientifique (CNRS); Institut Polytechnique de Paris; Ecole Polytechnique; Princeton University
摘要:We consider a general supervised learning problem with strongly convex and Lipschitz loss and study the problem of model selection aggregation. In particular, given a finite dictionary functions (learners) together with the prior, we generalize the results obtained by Dai, Rigollet and Zhang [Ann. Statist. 40 (2012) 1878-1905] for Gaussian regression with squared loss and fixed design to this learning setup. Specifically, we prove that the Q-aggregation procedure outputs an estimator that sati...