-
作者:Cai, T. Tony; Guo, Zijian; Ma, Rong
作者单位:University of Pennsylvania; Rutgers University System; Rutgers University New Brunswick; Stanford University
摘要:This article develops a unified statistical inference framework for high-dimensional binary generalized linear models (GLMs) with general link functions. Both unknown and known design distribution settings are considered. A two-step weighted bias-correction method is proposed for constructing confidence intervals (CIs) and simultaneous hypothesis tests for individual components of the regression vector. Minimax lower bound for the expected length is established and the proposed CIs are shown t...
-
作者:Zito, Alessandro; Rigon, Tommaso; Ovaskainen, Otso; Dunson, David B.
作者单位:Duke University; University of Milano-Bicocca; University of Jyvaskyla; University of Helsinki; Norwegian University of Science & Technology (NTNU)
摘要:We aim at modeling the appearance of distinct tags in a sequence of labeled objects. Common examples of this type of data include words in a corpus or distinct species in a sample. These sequential discoveries are often summarized via accumulation curves, which count the number of distinct entities observed in an increasingly large set of objects. We propose a novel Bayesian method for species sampling modeling by directly specifying the probability of a new discovery, therefore, allowing for ...
-
作者:Dai, Xiaowu; Li, Lexin
作者单位:University of California System; University of California Berkeley; University of California System; University of California Berkeley; University of California System; University of California Berkeley
摘要:Multimodal imaging has transformed neuroscience research. While it presents unprecedented opportunities, it also imposes serious challenges. Particularly, it is difficult to combine the merits of the interpretability attributed to a simple association model with the flexibility achieved by a highly adaptive nonlinear model. In this article, we propose an orthogonalized kernel debiased machine learning approach, which is built upon the Neyman orthogonality and a form of decomposition orthogonal...
-
作者:Deng, Yujia; Tang, Xiwei; Qu, Annie
作者单位:University of Illinois System; University of Illinois Urbana-Champaign; University of Virginia; University of California System; University of California Irvine
摘要:Multi-dimensional tensor data have gained increasing attention in the recent years, especially in biomedical imaging analyses. However, the most existing tensor models are only based on the mean information of imaging pixels. Motivated by multimodal optical imaging data in a breast cancer study, we develop a new tensor learning approach to use pixel-wise correlation information, which is represented through the higher order correlation tensor. We proposed a novel semi-symmetric correlation ten...
-
作者:Henzi, Alexander; Kleger, Gian-Reto; Ziegel, Johanna F.
作者单位:University of Bern; Kantonsspital St. Gallen
摘要:A Distributional (Single) Index Model (DIM) is a semiparametric model for distributional regression, that is, estimation of conditional distributions given covariates. The method is a combination of classical single-index models for the estimation of the conditional mean of a response given covariates, and isotonic distributional regression. The model for the index is parametric, whereas the conditional distributions are estimated nonparametrically under a stochastic ordering constraint. We sh...
-
作者:Nie, Lizhen; Rockova, Veronika
作者单位:University of Chicago; University of Chicago
摘要:The impracticality of posterior sampling has prevented the widespread adoption of spike-and-slab priors in high-dimensional applications. To alleviate the computational burden, optimization strategies have been proposed that quickly find local posterior modes. Trading off uncertainty quantification for computational speed, these strategies have enabled spike-and-slab deployments at scales that would be previously unfeasible. We build on one recent development in this strand of work: the Spike-...
-
作者:Yang, Ying; Yao, Fang
作者单位:Peking University
摘要:Functional data analysis has attracted considerable interest and is facing new challenges, one of which is the increasingly available data in a streaming manner. In this article we develop an online nonparametric method to dynamically update the estimates of mean and covariance functions for functional data. The kernel-type estimates can be decomposed into two sufficient statistics depending on the data-driven bandwidths. We propose to approximate the future optimal bandwidths by a sequence of...
-
作者:Edwards, David J.; Mee, Robert W.
作者单位:Virginia Commonwealth University; University of Tennessee System; University of Tennessee Knoxville
摘要:Two-level fractional factorial designs are often used in screening scenarios to identify active factors. This article investigates the block diagonal structure of the information matrix of nonregular two-level designs. This structure is appealing since estimates of parameters belonging to different diagonal submatrices are uncorrelated. As such, the covariance matrix of the least squares estimates is simplified and the number of linear dependencies is reduced. We connect the block diagonal inf...
-
作者:Gasperoni, Francesca; Luati, Alessandra; Paci, Lucia; D'Innocenzo, Enzo
作者单位:University of Cambridge; MRC Biostatistics Unit; University of Bologna; Catholic University of the Sacred Heart
摘要:A simultaneous autoregressive score-driven model with autoregressive disturbances is developed for spatio-temporal data that may exhibit heavy tails. The model specification rests on a signal plus noise decomposition of a spatially filtered process, where the signal can be approximated by a nonlinear function of the past variables and a set of explanatory variables, while the noise follows a multivariate Student-t distribution. The key feature of the model is that the dynamics of the space-tim...
-
作者:Zhu, Wanchuang; Jiang, Yingkai; Liu, Jun S.; Deng, Ke
作者单位:Tsinghua University; Tsinghua University; Tsinghua University; Tsinghua University; Harvard University
摘要:Learning how to aggregate ranking lists has been an active research area for many years and its advances have played a vital role in many applications ranging from bioinformatics to internet commerce. The problem of discerning reliability of rankers based only on the rank data is of great interest to many practitioners, but has received less attention from researchers. By dividing the ranked entities into two disjoint groups, that is, relevant and irrelevant/background ones, and incorporating ...