-
作者:Zhang, Jian
作者单位:University of Kent
摘要:Most clustering methods assume that the data can be represented by mutually exclusive clusters, although this assumption may not be the case in practice. For example, in gene expression microarray studies, investigators have often found that a gene can play multiple functions in a cell and may, therefore, belong to more than one cluster simultaneously, and that gene clusters can be linked to each other in certain pathways. This article examines the effect of the above assumption on the likelih...
-
作者:Paindaveine, Davy; Van Bever, Germain
作者单位:Universite Libre de Bruxelles; Universite Libre de Bruxelles
摘要:Aiming at analyzing multimodal or nonconvexly supported distributions through data depth, we introduce a local extension of depth. Our construction is obtained by conditioning the distribution to appropriate depth-based neighborhoods and has the advantages, among others, of maintaining affine-invariance and applying to all depths in a generic way. Most importantly, unlike their competitors, which (for extreme localization) rather measure probability mass, the resulting local depths focus on ce...
-
作者:Oh, Dong Hwan; Patton, Andrew J.
作者单位:Duke University
摘要:This article considers the estimation of the parameters of a copula via a simulated method of moments (MM) type approach. This approach is attractive when the likelihood of the copula model is not known in closed form, or when the researcher has a set of dependence measures or other functionals of the copula that are of particular interest. The proposed approach naturally also nests MM and generalized method of moments estimators. Drawing on results for simulation-based estimation and on recen...
-
作者:Gorfine, Malka; Hsu, Li; Parmigiani, Giovanni
作者单位:Technion Israel Institute of Technology; Fred Hutchinson Cancer Center; Harvard University; Harvard University Medical Affiliates; Dana-Farber Cancer Institute; Harvard University; Harvard T.H. Chan School of Public Health
摘要:In evaluating familial risk for disease we have two main statistical tasks: assessing the probability of carrying an inherited genetic mutation conferring higher risk, and predicting the absolute risk of developing diseases over time for those individuals whose mutation status is known. Despite substantial progress, much remains unknown about the role of genetic and environmental risk factors, about the sources of variation in risk among families that carry high-risk mutations, and about the s...
-
作者:McShane, Blakeley B.; Jensen, Shane T.; Pack, Allan I.; Wyner, Abraham J.
作者单位:Northwestern University; University of Pennsylvania; University of Pennsylvania
-
作者:Reich, Brian J.; Bandyopadhyay, Dipankar; Bondell, Howard D.
作者单位:North Carolina State University; University of Minnesota System; University of Minnesota Twin Cities
摘要:Periodontal disease (PD) progression is often quantified by clinical attachment level (CAL) defined as the distance down a tooth's root that is detached from the surrounding bone. Measured at six locations per tooth throughout the mouth (excluding the molars), it gives rise to a dependent data setup. These data are often reduced to a one-number summary, such as the whole-mouth average or the number of observations greater than a threshold, to be used as the response in a regression to identify...
-
作者:McShane, Blakeley B.; Jensen, Shane T.; Pack, Allan I.; Wyner, Abraham J.
作者单位:Northwestern University; University of Pennsylvania; University of Pennsylvania
摘要:We develop methodology that combines statistical learning methods with generalized Markov models, thereby enhancing the former to account for time series dependence. Our methodology can accommodate very general and very long-term time dependence structures in an easily estimable and computationally tractable fashion. We apply our methodology to the scoring of sleep behavior in mice. As methods currently used to score sleep in mice are expensive, invasive, and labor intensive, there is consider...
-
作者:Zhou, Hua; Li, Lexin; Zhu, Hongtu
作者单位:North Carolina State University; University of North Carolina; University of North Carolina Chapel Hill; University of North Carolina; University of North Carolina Chapel Hill
摘要:Classical regression methods treat covariates as a vector and estimate a corresponding vector of regression coefficients. Modem applications in medical imaging generate covariates of more complex form such as multidimensional arrays (tensors). Traditional statistical and computational methods are proving insufficient for analysis of these high-throughput data due to their ultrahigh dimensionality as well as complex structure. In this article, we propose a new family of tensor regression models...
-
作者:Robbins, Michael W.; Ghosh, Sujit K.; Habiger, Joshua D.
作者单位:University of Missouri System; University of Missouri Columbia; North Carolina State University; Oklahoma State University System; Oklahoma State University - Stillwater
摘要:In this article, we consider imputation in the USDA's Agricultural Resource Management Survey (ARMS) data, which is a complex, high-dimensional economic dataset. We develop a robust joint model for ARMS data, which requires that variables are transformed using a suitable class of marginal densities (e.g., skew normal family). We assume that the transformed variables may be linked through a Gaussian copula, which enables construction of the joint model via a sequence of conditional linear model...
-
作者:Cattaneo, Matias D.; Crump, Richard K.; Jansson, Michael
作者单位:University of Michigan System; University of Michigan; Federal Reserve System - USA; Federal Reserve Bank - New York; University of California System; University of California Berkeley
摘要:With the aim of improving the quality of asymptotic distributional approximations for nonlinear functionals of nonparametric estimators, this article revisits the large-sample properties of an importantmember of that class, namely a kernel-based weighted average derivative estimator. Asymptotic linearity of the estimator is established under weak conditions. Indeed, we show that the bandwidth conditions employed are necessary in some cases. A bias-corrected version of the estimator is proposed...