Propriety of the posterior distribution and existence of the MLE for regression models with covariates missing at random

成果类型:
Article
署名作者:
Chen, MH; Ibrahim, JG; Shao, QM
署名单位:
University of Connecticut; University of North Carolina; University of North Carolina Chapel Hill; University of Oregon; National University of Singapore
刊物名称:
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION
ISSN/ISSBN:
0162-1459
DOI:
10.1198/016214504000000368
发表日期:
2004
页码:
421-438
关键词:
GENERALIZED LINEAR-MODELS Bayesian methods improper priors
摘要:
Characterizing model identifiability in the presence of missing covariate data is a very important issue in missing data problems. In this article. we characterize the propriety of the posterior distribution of the regression coefficients for some general classes of regression models, including the class of generalized linear models (GLM's) and parametric survival models with right-censored data. Toward this goal, we 14 derive some very general and easy-to-check conditions for the matrix of covariates. We also derive sufficient conditions for the existence of the maximum likelihood estimates and establish novel results for checking propriety of the posterior when the sample size is large. Several theorems are given to establish propriety of the posterior and the existence of the maximum likelihood estimator. The conditions reduce to solving a system of linear equations. which can be carried Out using software such as MAPLE, IMSL, or SAS. We assume that the missing covariates are missing at random and assume an improper uniform prior for the regression coefficients. In addition, we establish these results assuming a very general form for the covariate distribution, allowing for both missing categorical and/or continuous covariates. A small dataset is used to illustrate that the posterior can be improper based on complete cases while proper when all of the cases are used in the analysis. Two real datasets are presented to demonstrate verification of posterior propriety for GLM's and parametric survival models, and also to illustrate propriety for large datasets.
来源URL: