-
Authors: Ma, Xinwei; Wang, Jingshen; Wu, Chong
Affiliations: University of California System; University of California San Diego; University of California System; University of California Berkeley; University of Texas System; UTMD Anderson Cancer Center
Abstract: Developments in genome-wide association studies and the increasing availability of summary genetic association data have made the application of two-sample Mendelian Randomization (MR) with summary data increasingly popular. Conventional two-sample MR methods often employ the same sample for selecting relevant genetic variants and for constructing final causal estimates. Such a practice often leads to biased causal effect estimates due to the well-known "winner's curse" phenomenon. To a...
-
Authors: Schoen, Eric D.; Mee, Robert W.
Affiliations: KU Leuven; University of Tennessee System; University of Tennessee Knoxville
Abstract: The effect of the order in which a set of m treatments is applied can be modeled by relative-position factors that indicate whether treatment i is carried out before or after treatment j, or by the absolute position for treatment i in the sequence. A design with the same normalized information matrix as the design with all m! sequences is D- and G-optimal for the main-effects model involving the relative-position factors. We prove that such designs are also I-optimal for this model and D-optim...
-
Authors: Aamari, Eddie; Berenfeld, Clement; Levrard, Clement
Affiliations: Sorbonne Universite; Universite Paris Cite; University of Potsdam
Abstract: We study the estimation of the reach, a ubiquitous regularity parameter in manifold estimation and geometric data analysis. Given an i.i.d. sample from an unknown submanifold M, we provide optimal nonasymptotic bounds for the estimation of its reach. We build upon a formulation of the reach in terms of maximal curvature on one hand and geodesic metric distortion on the other. The derived rates are adaptive, with rates depending on whether the reach of M arises from curvature or from a bottleneck structure. In the process we deri...
-
Authors: Berrett, Thomas B.; Samworth, Richard J.
Affiliations: University of Warwick; University of Cambridge
Abstract: Given a set of incomplete observations, we study the nonparametric problem of testing whether data are Missing Completely At Random (MCAR). Our first contribution is to characterise precisely the set of alternatives that can be distinguished from the MCAR null hypothesis. This reveals interesting and novel links to the theory of Fréchet classes (in particular, compatible distributions) and linear programming, which allow us to propose MCAR tests that are consistent against all detectable altern...
-
Authors: Bhattacharjee, Satarupa; Müller, Hans-Georg
Affiliations: University of California System; University of California Davis
Abstract: Single index models provide an effective dimension reduction tool in regression, especially for high-dimensional data, by projecting a general multivariate predictor onto a direction vector. We propose a novel single-index model for regression models where metric space-valued random object responses are coupled with multivariate Euclidean predictors. The responses in this regression model include complex, non-Euclidean data, such as covariance matrices, graph Laplacians of networks and univa...
-
Authors: Wang, Jiayi; Qi, Zhengling; Wong, Raymond K. W.
Affiliations: University of Texas System; University of Texas Dallas; George Washington University; Texas A&M University System; Texas A&M University College Station
Abstract: Off-policy evaluation is considered a fundamental and challenging problem in reinforcement learning (RL). This paper focuses on value estimation of a target policy based on pre-collected data generated from a possibly different policy, under the framework of infinite-horizon Markov decision processes. Motivated by the recently developed marginal importance sampling method in RL and the covariate balancing idea in causal inference, we propose a novel estimator with approximately projected state...
-
Authors: Fujiwara, Akio; Yamagata, Koichi
Affiliations: University of Osaka; Research Organization of Information & Systems (ROIS); National Institute of Informatics (NII) - Japan
Abstract: We herein establish an asymptotic representation theorem for locally asymptotically normal quantum statistical models. This theorem enables us to study the asymptotic efficiency of quantum estimators, such as quantum regular estimators and quantum minimax estimators, leading to a universal tight lower bound beyond the i.i.d. assumption. This formulation complements the theory of quantum contiguity developed in the previous paper [Fujiwara and Yamagata, Bernoulli 26 (2020) 2105-2141], providing...
-
Authors: Doss, Natalie; Wu, Yihong; Yang, Pengkun; Zhou, Harrison H.
Affiliations: Yale University; Tsinghua University
Abstract: This paper studies the optimal rate of estimation in a finite Gaussian location mixture model in high dimensions without separation conditions. We assume that the number of components k is bounded and that the centers lie in a ball of bounded radius, while allowing the dimension d to be as large as the sample size n. Extending the one-dimensional result of Heinrich and Kahn (Ann. Statist. 46 (2018) 2844-2870), we show that the minimax rate of estimating the mixing distribution in Wasserstein d...
-
Authors: Awan, Jordan; Vadhan, Salil
Affiliations: Purdue University System; Purdue University
Abstract: f-DP has recently been proposed as a generalization of differential privacy allowing a lossless analysis of composition, post-processing, and privacy amplification via subsampling. In the setting of f-DP, we propose the concept of a canonical noise distribution (CND), the first mechanism designed for an arbitrary f-DP guarantee. The notion of CND captures whether an additive privacy mechanism perfectly matches the privacy guarantee of a given f. We prove that a CND always exists, and give ...
-
Authors: Ma, Cong; Pathak, Reese; Wainwright, Martin J.
Affiliations: University of Chicago; University of California System; University of California Berkeley; Massachusetts Institute of Technology (MIT)
Abstract: We study the covariate shift problem in the context of nonparametric regression over a reproducing kernel Hilbert space (RKHS). We focus on two natural families of covariate shift problems defined using the likelihood ratios between the source and target distributions. When the likelihood ratios are uniformly bounded, we prove that the kernel ridge regression (KRR) estimator with a carefully chosen regularization parameter is minimax rate-optimal (up to a log factor) for a large family of RKHS...