您的位置: 首页 > 全球经管学术 > 顶刊追踪 > 顶尖期刊 > 统计学 > Journal of the American Statistical Association > 1993 > 421期

CASE-CONTROL STUDIES WITH ERRORS IN COVARIATES

成果类型：

Article

署名作者：

CARROLL, RJ; GAIL, MH; LUBIN, JH

署名单位：

National Institutes of Health (NIH) - USA; NIH National Cancer Institute (NCI)

刊物名称：

JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION

ISSN/ISSBN：

0162-1459

DOI：

10.2307/2290713

发表日期：

1993

页码：

185-199

关键词：

likelihood estimation confidence-intervals binary regression linear-models IN-VARIABLES parameters statistics cancer

摘要：

We devise methods for estimating the parameters of a prospective logistic model with dichotomous response D and arbitrary covariates X from case-control data when these covariates are measured with error. We suppose that some fraction of the cases and controls provide only the error-prone covariate measurements, W (the ''incomplete'' or ''reduced'' data), whereas some of the cases and controls provide measurements on X and W (the ''complete'' data). We assume a measurement error density with a finite set of parameters alpha, namely f(W\XD)(w\x, d, alpha), and nondifferential error is treated as a special case of this model, f(W\X)(w\x, alpha). Our algorithm estimates both the logistic parameters and alpha from a pseudolikelihood. Because empirical distribution functions are used in place of needed distributions in the pseudolikelihoods, the required asymptotic theory is more elaborate than for pseudolikelihoods based on substitution for a finite number of nuisance parameters. We also examine computationally simpler methods under the assumptions that the disease is rare and that errors are nondifferential. Estimates of m(W) = E(X\W) are substituted for X in the logistic model when X is not available. Such estimates of m(W) can be obtained from the complete data described above or from an independent validation study. If measurements on X are not available, m(W) can still be estimated from replicated W measurements in some circumstances. A final approach uses approximate logistic regression techniques and is appropriate when a more accurate approximation is required than obtained by simply substituting m(W) for X. Asymptotic theory is presented for each of these procedures, and examples are used to illustrate the calculations.