-
作者:Park, Seyoung; Xu, Hao; Zhao, Hongyu
作者单位:Sungkyunkwan University (SKKU); Yale University
摘要:Advances in high-throughput genomic technologies coupled with large-scale studies including The Cancer Genome Atlas (TCGA) project have generated rich resources of diverse types of omics data to better understand cancer etiology and treatment responses. Clustering patients into subtypes with similar disease etiologies and/or treatment responses using multiple omics data types has the potential to improve the precision of clustering than using a single data type. However, in practice, patient c...
-
作者:Tang, Xiwei; Xue, Fei; Qu, Annie
作者单位:University of Virginia; University of Pennsylvania
摘要:In this article, we propose a heterogeneous modeling framework which achieves individual-wise feature selection and heterogeneous covariates' effects subgrouping simultaneously. In contrast to conventional model selection approaches, the new approach constructs a separation penalty with multidirectional shrinkages, which facilitates individualized modeling to distinguish strong signals from noisy ones and selects different relevant variables for different individuals. Meanwhile, the proposed m...
-
作者:Ma, Rong; Cai, T. Tony; Li, Hongzhe
作者单位:University of Pennsylvania; University of Pennsylvania
摘要:High-dimensional logistic regression is widely used in analyzing data with binary outcomes. In this article, global testing and large-scale multiple testing for the regression coefficients are considered in both single- and two-regression settings. A test statistic for testing the global null hypothesis is constructed using a generalized low-dimensional projection for bias correction and its asymptotic null distribution is derived. A lower bound for the global testing is established, which sho...
-
作者:Izenman, Alan Julian
作者单位:Pennsylvania Commonwealth System of Higher Education (PCSHE); Temple University; Pennsylvania Commonwealth System of Higher Education (PCSHE); Temple University
摘要:Discrete Markov random fields are undirected graphical models in which the nodes of a graph are discrete random variables with values usually represented by colors. Typically, graphs are taken to be square lattices, although more general graphs are also of interest. Such discrete MRFs have been studied in many disciplines. We describe the two most popular types of discrete MRFs, namely the two-state Ising model and the q-state Potts model, and variations such as the cellular automaton, the cel...
-
作者:Jacob, Pierre E.; Gong, Ruobin; Edlefsen, Paul T.; Dempster, Arthur P.
作者单位:Harvard University; Rutgers University System; Rutgers University New Brunswick; Fred Hutchinson Cancer Center
摘要:We present a Gibbs sampler for the Dempster-Shafer (DS) approach to statistical inference for categorical distributions. The DS framework extends the Bayesian approach, allows in particular the use of partial prior information, and yields three-valued uncertainty assessments representing probabilities for, against, and don't know about formal assertions of interest. The proposed algorithm targets the distribution of a class of random convex polytopes which encapsulate the DS inference. The sam...
-
作者:Lee, DongHyuk; Zhu, Bin
作者单位:National Institutes of Health (NIH) - USA; NIH National Cancer Institute (NCI); NIH National Cancer Institute- Division of Cancer Epidemiology & Genetics
摘要:Cancers arise owing to somatic mutations, and the characteristic combinations of somatic mutations form mutational signatures. Despite many mutational signatures being identified, mutational processes underlying a number of mutational signatures remain unknown, which hinders the identification of interventions that may reduce somatic mutation burdens and prevent the development of cancer. We demonstrate that the unknown cause of a mutational signature can be inferred by the associated signatur...
-
作者:Pan, Yinghao; Zhao, Ying-Qi
作者单位:University of North Carolina; University of North Carolina Charlotte; Fred Hutchinson Cancer Center
摘要:Individualized treatment rules (ITRs) recommend treatment according to patient characteristics. There is a growing interest in developing novel and efficient statistical methods in constructing ITRs. We propose an improved doubly robust estimator of the optimal ITRs. The proposed estimator is based on a direct optimization of an augmented inverse-probability weighted estimator of the expected clinical outcome over a class of ITRs. The method enjoys two key properties. First, it is doubly robus...
-
作者:Qiu, Hongxiang; Carone, Marco; Sadikova, Ekaterina; Petukhova, Maria; Kessler, Ronald C.; Luedtke, Alex
作者单位:University of Washington; University of Washington Seattle; Harvard University; Harvard Medical School; University of Washington; University of Washington Seattle
摘要:There is an extensive literature on the estimation and evaluation of optimal individualized treatment rules in settings where all confounders of the effect of treatment on outcome are observed. We study the development of individualized decision rules in settings where some of these confounders may not have been measured but a valid binary instrument is available for a binary treatment. We first consider individualized treatment rules, which will naturally be most interesting in settings where...
-
作者:Ferrari, Federico; Dunson, David B.
作者单位:Duke University
摘要:This article is motivated by the problem of inference on interactions among chemical exposures impacting human health outcomes. Chemicals often co-occur in the environment or in synthetic mixtures and as a result exposure levels can be highly correlated. We propose a latent factor joint model, which includes shared factors in both the predictor and response components while assuming conditional independence. By including a quadratic regression in the latent variables in the response component,...
-
作者:Chau, Joris; von Sachs, Rainer
作者单位:Universite Catholique Louvain
摘要:Intrinsic wavelet transforms and wavelet estimation methods are introduced for curves in the non-Euclidean space of Hermitian positive definite matrices, with in mind the application to Fourier spectral estimation of multivariate stationary time series. The main focus is on intrinsic average-interpolation wavelet transforms in the space of positive definite matrices equipped with an affine-invariant Riemannian metric, and convergence rates of linear wavelet thresholding are derived for intrins...