Predicting random effects from finite population clustered samples with response error

成果类型:
Article
署名作者:
Stanek, EJ III; Singer, JM
署名单位:
University of Massachusetts System; University of Massachusetts Amherst; Universidade de Sao Paulo
刊物名称:
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION
ISSN/ISSBN:
0162-1459
DOI:
10.1198/016214504000001718
发表日期:
2004
页码:
1119-1130
关键词:
regression-analysis cholesterol levels seasonal-variation unified approach
摘要:
In many situations there is interest in parameters (e.g., mean) associated with the response distribution of individual Clusters in a finite clustered population. We develop predictors of such parameters using a two-stage sampling probability model with response error. The probability model sterns directly from finite population sampling without additional assumptions, and thus is design-based. The predictors are closely related to best linear unbiased predictors (BLUP) that arise from common mixed-model methods, as well as to model-based predictors obtained via super population approaches for survey sampling, The context assumes Clusters of equal size and equal size sampling of units within clusters, Target parameters may correspond to clusters realized in the sample, as well as nonrealized clusters. In either case, the predictors are linear and unbiased, and minimize the expected mean squared error. They correspond to the sum of predictors of responses for realized and nonrealized units in the Cluster, accounting directly for the second-stage sampling fraction. In contrast, the BLUP commonly used in mixed models can be interpreted as predicting only the responses of second-stage units not observed for a cluster. not the cluster mean. The development reveals that two-stage sampling does riot give rise to it more general variance structure often assumed in superpopulation models, even when variances within clusters are heterogeneous. With response error present, we predict target random variables defined as an expected (or average) response over units in a cluster.