-
作者:Chatterjee, A.; Bhattacharya, B. B.
作者单位:University of Pennsylvania
摘要:The kernel two-sample test based on the maximum mean discrepancy is one of the most popular methods for detecting differences between two distributions over general metric spaces. In this paper we propose a method to boost the power of the kernel test by combining maximum mean discrepancy estimates over multiple kernels using their Mahalanobis distance. We derive the asymptotic null distribution of the proposed test statistic and use a multiplier bootstrap approach to efficiently compute the r...
-
作者:Zhang, Yaowu; Zhu, Liping
作者单位:Shanghai University of Finance & Economics; Renmin University of China
摘要:Testing independence between high-dimensional random vectors is fundamentally different from testing independence between univariate random variables. Taking the projection correlation as an example, it suffers from at least three problems. First, it has a high computational complexity of O{n3(p+q)}, where n, p and q are the sample size and dimensions of the random vectors; this limits its usefulness substantially when n is extremely large. Second, the asymptotic null distribution of the proje...
-
作者:Battey, H. S.; Mccullagh, Peter
作者单位:Imperial College London; University of Chicago
摘要:It is frequently observed in practice that the Wald statistic gives a poor assessment of the statistical significance of a variance component. This paper provides detailed analytic insight into the phenomenon by way of two simple models, which point to an atypical geometry as the source of the aberration. The latter can in principle be checked numerically to cover situations of arbitrary complexity, such as those arising from elaborate forms of blocking in an experimental context, or models fo...
-
作者:Wood, S. N.
-
作者:Zhou, Ying; Tang, Dingke; Kong, Dehan; Wang, Linbo
作者单位:University of Connecticut; University of Toronto
摘要:A key challenge in causal inference from observational studies is the identification and estimation of causal effects in the presence of unmeasured confounding. In this paper, we introduce a novel approach for causal inference that leverages information in multiple outcomes to deal with unmeasured confounding. An important assumption in our approach is conditional independence among multiple outcomes. In contrast to existing proposals in the literature, the roles of multiple outcomes in the co...
-
作者:Guedon, T.; Baey, C.; Kuhn, E.
作者单位:Universite Paris Saclay; INRAE; Universite de Lille; Centre National de la Recherche Scientifique (CNRS); CNRS - National Institute for Mathematical Sciences (INSMI)
摘要:We examine the problem of variance component testing in general mixed effects models using the likelihood ratio test. We account for the presence of nuisance parameters, ie, the fact that some untested variances might also be equal to zero. Two main issues arise in this context, leading to a nonregular setting. First, under the null hypothesis, the true parameter value lies on the boundary of the parameter space. Moreover, due to the presence of nuisance parameters, the exact locations of thes...
-
作者:Li, Yicheng; Zhang, Haobo; Lin, Qian
作者单位:Tsinghua University
摘要:One of the most interesting problems in the recent renaissance of the studies in kernel regression might be whether kernel interpolation can generalize well, since it may help us understand the 'benign overfitting phenomenon' reported in the literature on deep networks. In this paper, under mild conditions, we show that, for any epsilon>0, the generalization error of kernel interpolation is lower bounded by Omega(n(-epsilon)). In other words, the kernel interpolation generalizes poorly for a l...
-
作者:Hojsgaard, S.; Lauritzen, S.
作者单位:Aalborg University; University of Copenhagen
摘要:In Gaussian graphical models, the likelihood equations must typically be solved iteratively. This paper investigates two algorithms: a version of iterative proportional scaling, which avoids inversion of large matrices, and an algorithm based on convex duality and operating on the covariance matrix by neighbourhood coordinate descent, which corresponds to the graphical lasso with zero penalty. For large, sparse graphs, the iterative proportional scaling algorithm appears feasible and has simpl...
-
作者:Yang, Cheng-Han; Cheng, Yu-Jen
作者单位:University of Texas System; University of Texas Health Science Center Houston; National Tsing Hua University
摘要:We propose a model-free variable screening method for the optimal treatment regime with high-dimensional survival data. The proposed screening method provides a unified framework to select the active variables in a prespecified target population, including the treated group as a special case. Based on this framework, the optimal treatment regime is exactly the optimal classifier that minimizes a weighted misclassification error rate, with weights associated with survival outcome variables, the...
-
作者:Stensrud, M. J.; Laurendeau, J. D.; Sarvet, A. L.
作者单位:Swiss Federal Institutes of Technology Domain; Ecole Polytechnique Federale de Lausanne
摘要:We consider optimal regimes for algorithm-assisted human decision-making. Such regimes are decision functions of measured pre-treatment variables and, by leveraging natural treatment values, enjoy a superoptimality property whereby they are guaranteed to outperform conventional optimal regimes. When there is unmeasured confounding, the benefit of using superoptimal regimes can be considerable. When there is no unmeasured confounding, superoptimal regimes are identical to conventional optimal r...