-
作者:Chevallier, Frederic; Breon, Francois-Marie
作者单位:CEA; Centre National de la Recherche Scientifique (CNRS); Universite Paris Saclay; Universite Paris Cite
摘要:Based on the measurements of the OCO-2 satellite, Noel Cressie addresses a particularly hard challenge for Earth observation, arguably an extreme case in remote sensing. He is one of the very few who has expertise in most of the processing chain and his article brilliantly discusses the diverse underlying statistical challenges. In this comment, we provide a complementary view of the topic to qualify its prospects as drawn by N. Cressie at the end of his article. We first summarize the motivat...
-
作者:Braun, Danielle; Gorfine, Malka; Katki, Hormuzd A.; Ziogas, Argyrios; Parmigiani, Giovanni
作者单位:Harvard University; Harvard T.H. Chan School of Public Health; Harvard University; Harvard University Medical Affiliates; Dana-Farber Cancer Institute; Tel Aviv University; Technion Israel Institute of Technology; National Institutes of Health (NIH) - USA; NIH National Cancer Institute (NCI); NIH National Cancer Institute- Division of Cancer Epidemiology & Genetics; University of California System; University of California Irvine
摘要:Mismeasured time-to-event data used as a predictor in risk prediction models will lead to inaccurate predictions. This arises in the context of self-reported family history, a time-to-event predictor often measured with error, used in Mendelian risk prediction models. Using validation data, we propose a method to adjust for this type of error. We estimate the measurement error process using a nonparametric smoothed Kaplan-Meier estimator, and use Monte Carlo integration to implement the adjust...
-
作者:Duchi, John C.; Jordan, Michael I.; Wainwright, Martin J.
作者单位:Stanford University; Stanford University; University of California System; University of California Berkeley; University of California System; University of California Berkeley
摘要:Working under a model of privacy in which data remain private even from the statistician, we study the tradeoff between privacy guarantees and the risk of the resulting statistical estimators. We develop private versions of classical information-theoretical bounds, in particular those due to Le Cam, Fano, and Assouad. These inequalities allow for a precise characterization of statistical rates under local privacy constraints and the development of provably (minimax) optimal estimation procedur...
-
作者:Kong, Shengchun; Nan, Bin; Kalbfleisch, John D.; Saran, Rajiv; Hirth, Richard
作者单位:Gilead Sciences; University of Michigan System; University of Michigan; University of Michigan System; University of Michigan; University of Michigan System; University of Michigan
摘要:We consider a random effects model for longitudinal data with the occurrence of an informative terminal event that is subject to right censoring. Existing methods for analyzing such data include the joint modeling approach using latent frailty and the marginal estimating equation approach using inverse probability weighting; in both cases the effect of the terminal event on the response variable is not explicit and thus not easily interpreted. In contrast, we treat the terminal event time as a...
-
作者:Guo, Beibei; Yuan, Ying
作者单位:University of Texas System; UTMD Anderson Cancer Center
-
作者:Hilton, Ross P.; Zheng, Yuchen; Serban, Nicoleta
作者单位:University System of Georgia; Georgia Institute of Technology
摘要:We introduce a modeling approach for characterizing heterogeneity in healthcare utilization using massive medical claims data. We first translate the medical claims observed for a large study population and across five years into individual-level discrete events of care called utilization sequences. We model the utilization sequences using an exponential proportional hazards mixture model to capture heterogeneous behaviors in patients' healthcare utilization. The objective is to cluster patien...
-
作者:Chen, Kehui; Lei, Jing
作者单位:Pennsylvania Commonwealth System of Higher Education (PCSHE); University of Pittsburgh; Pennsylvania Commonwealth System of Higher Education (PCSHE); University of Pittsburgh; Carnegie Mellon University
摘要:The stochastic block model (SBM) and its variants have been a popular tool for analyzing large network data with community structures. In this article, we develop an efficient network cross-validation (NCV) approach to determine the number of communities, as well as to choose between the regular stochastic block model and the degree corrected block model (DCBM). The proposed NCV method is based on a block-wise node-pair splitting technique, combined with an integrated step of community recover...
-
作者:Miller, Jeffrey W.; Harrison, Matthew T.
作者单位:Harvard University; Brown University
摘要:A natural Bayesian approach for mixture models with an unknown number of components is to take the usual finite mixture model with symmetric Dirichlet weights, and put a prior on the number of componentsthat is, to use a mixture of finite mixtures (MFM). The most commonly used method of inference for MFMs is reversible jump Markov chain Monte Carlo, but it can be nontrivial to design good reversible jump moves, especially in high-dimensional spaces. Meanwhile, there are samplers for Dirichlet ...
-
作者:Rockova, Veronika; George, Edward I.
作者单位:University of Chicago; University of Pennsylvania
摘要:Despite the wide adoption of spike-and-slab methodology for Bayesian variable selection, its potential for penalized likelihood estimation has largely been overlooked. In this article, we bridge this gap by cross-fertilizing these two paradigms with the Spike-and-Slab LASSO procedure for variable selection and parameter estimation in linear regression. We introduce a new class of self-adaptive penalty functions that arise from a fully Bayes spike-and-slab formulation, ultimately moving beyond ...
-
作者:Kuha, Jouni; Butt, Sarah; Katsikatsou, Myrsini; Skinner, Chris J.
作者单位:University of London; London School Economics & Political Science; City St Georges, University of London
摘要:In survey interviews, Don't know (DK) responses are commonly treated as missing data. One way to reduce the rate of such responses is to probe initial DK answers with a follow-up question designed to encourage respondents to give substantive, non-DK responses. However, such probing can also reduce data quality by introducing additional or differential measurement error. We propose a latent variable model for analyzing the effects of probing on responses to survey questions. The model makes it ...