CASE-CONTROL STUDIES WITH ERRORS IN COVARIATES
成果类型:
Article
署名作者:
CARROLL, RJ; GAIL, MH; LUBIN, JH
署名单位:
National Institutes of Health (NIH) - USA; NIH National Cancer Institute (NCI)
刊物名称:
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION
ISSN/ISSBN:
0162-1459
DOI:
10.2307/2290713
发表日期:
1993
页码:
185-199
关键词:
likelihood estimation
confidence-intervals
binary regression
linear-models
IN-VARIABLES
parameters
statistics
cancer
摘要:
We devise methods for estimating the parameters of a prospective logistic model with dichotomous response D and arbitrary covariates X from case-control data when these covariates are measured with error. We suppose that some fraction of the cases and controls provide only the error-prone covariate measurements, W (the ''incomplete'' or ''reduced'' data), whereas some of the cases and controls provide measurements on X and W (the ''complete'' data). We assume a measurement error density with a finite set of parameters alpha, namely f(W\XD)(w\x, d, alpha), and nondifferential error is treated as a special case of this model, f(W\X)(w\x, alpha). Our algorithm estimates both the logistic parameters and alpha from a pseudolikelihood. Because empirical distribution functions are used in place of needed distributions in the pseudolikelihoods, the required asymptotic theory is more elaborate than for pseudolikelihoods based on substitution for a finite number of nuisance parameters. We also examine computationally simpler methods under the assumptions that the disease is rare and that errors are nondifferential. Estimates of m(W) = E(X\W) are substituted for X in the logistic model when X is not available. Such estimates of m(W) can be obtained from the complete data described above or from an independent validation study. If measurements on X are not available, m(W) can still be estimated from replicated W measurements in some circumstances. A final approach uses approximate logistic regression techniques and is appropriate when a more accurate approximation is required than obtained by simply substituting m(W) for X. Asymptotic theory is presented for each of these procedures, and examples are used to illustrate the calculations.