Inference for Multivariate Regression Model Based on Synthetic Data Generated Using Plug-in Sampling

成果类型:
Article
署名作者:
Moura, Ricardo; Klein, Martin; Zylstra, John; Coelho, Carlos A.; Sinha, Bimal
署名单位:
Universidade Nova de Lisboa; US Food & Drug Administration (FDA); University System of Maryland; University of Maryland Baltimore County; Universidade Nova de Lisboa
刊物名称:
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION
ISSN/ISSBN:
0162-1459
DOI:
10.1080/01621459.2021.1900860
发表日期:
2021
页码:
720-733
关键词:
microdata
摘要:
In this article, the authors derive the likelihood-based exact inference for singly and multiply imputed synthetic data in the context of a multivariate regression model. The synthetic data are generated via the Plug-in Sampling method, where the unknown parameters in the model are set equal to the observed values of their point estimators based on the original data, and synthetic data are drawn from this estimated version of the model. Simulation studies are carried out in order to confirm the theoretical results. The authors provide exact test procedures, which in case multiple synthetic datasets are permissible, are compared with the asymptotic results of Reiter. An application using 2000U.S. Current Population Survey public use data is discussed. Furthermore, properties of the proposed methodology are evaluated in scenarios where some of the conditions that were used to derive the methodology do not hold, namely for nonnormal and discrete distributed random variables, cases in which the inferential procedures developed still show very good performances.