-
作者:Khan, Jafar A.; Van Aelst, Stefan; Zamar, Ruben H.
作者单位:University of Dhaka; Ghent University; University of British Columbia
摘要:In this article we consider the problem of building a linear prediction model when the number of candidate predictors is large and the data possibly contain anomalies that are difficult to visualize and clean. We want to predict the nonoutlying cases; therefore, we need a method that is simultaneously robust and scalable. We consider the stepwise least angle regression (LARS) algorithm which is computationally very efficient but sensitive to outliers. We introduce two different approaches to r...
-
作者:Reiter, Jerome P.; Raghunathan, Trivellore E.
作者单位:Duke University; University of Michigan System; University of Michigan
摘要:Multiple imputation was first conceived as a tool that statistical agencies could use to handle nonresponse in large-sample public use surveys. In the last two decades, the multiple-imputation framework has been adapted for other statistical contexts. For example, individual researchers use multiple imputation to handle missing data in small samples, statistical agencies disseminate multiply-imputed data sets for purposes of protecting data confidentiality, and survey methodologists and epidem...
-
作者:Shen, Yu; Qin, Jing; Costantino, Joseph R.
作者单位:University of Texas System; UTMD Anderson Cancer Center; Howard Hughes Medical Institute; National Institutes of Health (NIH) - USA; NIH National Institute of Allergy & Infectious Diseases (NIAID); Pennsylvania Commonwealth System of Higher Education (PCSHE); University of Pittsburgh
摘要:The largest randomized, double-blind, placebo-controlled chemoprevention trial, the National Surgical Adjuvant Breast and Bowel Project's Breast Cancer Prevention Trial (NSABP-BCPT), evaluated the efficacy of tamoxifen in preventing breast cancer among women at high risk of developing the disease. The trial has reported a reduction of breast cancer incidence for the tamoxifen group; however, the effect of tamoxifen on the time to diagnosis of the disease over the 6-year follow-up of the trial ...
-
作者:Christman, Mary C.
作者单位:State University System of Florida; University of Florida
-
作者:Qiu, Peihua; Sun, Jingran
作者单位:University of Minnesota System; University of Minnesota Twin Cities
摘要:Gene microarray data are used in a wide variety of applications, including pharmaceutical and clinical research. By comparing gene expression in normal and abnormal cells, microarrays can be used to identify genes involved in particular diseases, and these genes then can be targeted by therapeutic drugs. Most gene expression data are produced from spotted microarray images. A spotted microarray image consists of thousands of spots, with individual DNA sequences first printed at each spot and t...
-
作者:Frey, Jesse; Ozturk, Omer; Deshpande, Jayant V.
作者单位:Villanova University; University System of Ohio; Ohio State University; Savitribai Phule Pune University
摘要:The ranked-set sampling literature includes both inference procedures that rely on the assumption of perfect rankings and inference procedures that are robust to violations of this assumption. Procedures that assume perfect rankings tend to be more efficient when rankings are in fact perfect, but they may be invalid when perfect rankings fail. As a result, users of ranked-set sampling must decide between efficiency and robustness, and there is at present little to guide their decision. In this...
-
作者:Baker, Stuart G.
作者单位:National Institutes of Health (NIH) - USA; NIH National Cancer Institute (NCI)
-
作者:Kaziska, David M.; Srivastava, Anuj
作者单位:United States Department of Defense; United States Air Force; US Air Force Research Laboratory; Air Force Institute of Technology (AFIT); State University System of Florida; Florida State University
-
作者:MacEachern, Steven N.; Rao, Youlan; Wu, Chunjie
作者单位:University System of Ohio; Ohio State University; Shanghai University of Finance & Economics
摘要:In practice, the cumulative sum (CUSUM) control chart is often used to detect small shifts in the mean of a normally distributed process, but it performs poorly for thick-tailed processes and for large shifts. This article provides a robust-likelihood cumulative sum (RLCUSUM) chart that discounts outliers and yet has the ability to detect large shifts quickly. The new chart is motivated by the likelihood underpinnings of the CUSUM. It is based on the likelihood of a variate constructed to ensu...
-
作者:Huang, Hsin-Cheng; Chen, Chun-Shu
作者单位:National Central University
摘要:In many fields of science, predicting variables of interest over a study region based on noisy data observed at some locations is an important problem. Two popular methods for the problem are kriging and smoothing splines. The former assumes that the underlying process is stochastic, whereas the latter assumes it is purely deterministic. Kriging performs better than smoothing splines in some situations, but is outperformed by smoothing splines in others. However, little is known regarding sele...