-
作者:Kraus, David; Panaretos, Victor M.
作者单位:Swiss Federal Institutes of Technology Domain; Ecole Polytechnique Federale de Lausanne
摘要:Inferences related to the second-order properties of functional data, as expressed by covariance structure, can become unreliable when the data are non-Gaussian or contain unusual observations. In the functional setting, it is often difficult to identify atypical observations, as their distinguishing characteristics can be manifold but subtle. In this paper, we introduce the notion of a dispersion operator, investigate its use in probing the second-order structure of functional data, and devel...
-
作者:Sun, Tingni; Zhang, Cun-Hui
作者单位:Rutgers University System; Rutgers University New Brunswick
摘要:Scaled sparse linear regression jointly estimates the regression coefficients and noise level in a linear model. It chooses an equilibrium with a sparse regression method by iteratively estimating the noise level via the mean residual square and scaling the penalty in proportion to the estimated noise level. The iterative algorithm costs little beyond the computation of a path or grid of the sparse regression estimator for penalty levels above a proper threshold. For the scaled lasso, the algo...
-
作者:Shen, Xiaotong; Huang, Hsin-Cheng; Pan, Wei
作者单位:University of Minnesota System; University of Minnesota Twin Cities; Academia Sinica - Taiwan; University of Minnesota System; University of Minnesota Twin Cities
摘要:In this article, we propose a regression method for simultaneous supervised clustering and feature selection over a given undirected graph, where homogeneous groups or clusters are estimated as well as informative predictors, with each predictor corresponding to one node in the graph and a connecting path indicating a priori possible grouping among the corresponding predictors. The method seeks a parsimonious model with high predictive power through identifying and collapsing homogeneous group...
-
作者:Garthwaite, Paul H.; Critchley, Frank; Anaya-Izquierdo, Karim; Mubwandarikwa, Emmanuel
作者单位:Open University - UK
摘要:Two transformations are proposed that give orthogonal components with a one-to-one correspondence between the original vectors and the components. The aim is that each component should be close to the vector with which it is paired, orthogonality imposing a constraint. The transformations lead to a variety of new statistical methods, including a unified approach to the identification and diagnosis of collinearities, a method of setting prior weights for Bayesian model averaging, and a means of...
-
作者:Huang, Hanwen; Liu, Yufeng; Marron, J. S.
作者单位:University of Texas System; University of Texas Health Science Center Houston; University of North Carolina; University of North Carolina Chapel Hill
摘要:Linear classifiers are very popular, but can have limitations when classes have distinct subpopulations. General nonlinear kernel classifiers are very flexible, but do not give clear interpretations and may not be efficient in high dimensions. We propose the bidirectional discrimination classification method, which generalizes linear classifiers to two or more hyperplanes. This new family of classification methods gives much of the flexibility of a general nonlinear classifier while maintainin...
-
作者:Hall, Peter; Maiti, Tapabrata
作者单位:University of Melbourne; Michigan State University
摘要:In some problems involving functional data, it is desired to undertake prediction or classification before the full trajectory of a function is observed. In such cases, it is often preferable to suffer somewhat greater error in return for making a decision relatively early. The prediction and classification problems can be treated similarly, using mean squared prediction error, or classification error, respectively, as the means for quantifying performance, so in this paper we focus principall...
-
作者:Chan, Kwun Chuen Gary; Chen, Ying Qing; Di, Chong-Zhi
作者单位:University of Washington; University of Washington Seattle; Fred Hutchinson Cancer Center
摘要:To study disease association with risk factors in epidemiologic studies, cross-sectional sampling is often more focused and less costly for recruiting study subjects who have already experienced initiating events. For time-to-event outcome, however, such a sampling strategy may be length biased. Coupled with censoring, analysis of length-biased data can be quite challenging, due to induced informative censoring in which the survival time and censoring time are correlated through a common backw...
-
作者:Butler, Ronald W.; Bronson, Douglas A.
作者单位:Southern Methodist University; US Department of Veterans Affairs; Veterans Health Administration (VHA)
摘要:Transient semi-Markov processes have traditionally been used to describe the transitions of a patient through the various states of a multistate survival model. A survival distribution in this context is a sojourn through the states until passage to a fatal absorbing state or certain endpoint states. Using complete sojourn data, this paper shows how such survival distributions and associated hazard functions can be estimated nonparametrically and also how nonparametric bootstrap pointwise conf...
-
作者:Dai, James Y.; Kooperberg, Charles; Leblanc, Michael; Prentice, Ross L.
作者单位:Fred Hutchinson Cancer Center
摘要:Several two-stage multiple testing procedures have been proposed to detect gene-environment interaction in genome-wide association studies. In this article, we elucidate general conditions that are required for validity and power of these procedures, and we propose extensions of two-stage procedures using the case-only estimator of gene-treatment interaction in randomized clinical trials. We develop a unified estimating equation approach to proving asymptotic independence between a filtering s...
-
作者:Jiang, Binyan; Loh, Wei-Liem
作者单位:National University of Singapore
摘要:This article proposes a method of moments technique for estimating the sparsity of signals in a random sample. This involves estimating the largest eigenvalue of a large Hermitian trigonometric matrix under mild conditions. As illustration, the method is applied to two well-known problems. The first focuses on the sparsity of a large covariance matrix and the second investigates the sparsity of a sequence of signals observed with stationary, weakly dependent noise. Simulation shows that the pr...