A Bayesian Procedure for File Linking to Analyze End-of-Life Medical Costs
成果类型:
Article
署名作者:
Gutman, Roee; Afendulis, Christopher C.; Zaslavsky, Alan M.
署名单位:
Brown University; Harvard University; Harvard Medical School
刊物名称:
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION
ISSN/ISSBN:
0162-1459
DOI:
10.1080/01621459.2012.726889
发表日期:
2013
页码:
34-47
关键词:
record-linkage
adjusted weights
last year
concatenation
models
care
摘要:
End-of-life medical expenses are a significant proportion of all health care expenditures. These costs were studied using costs of services from Medicare claims and cause of death (CoD) from death certificates. In the absence of a unique identifier linking the two datasets, common variables identified unique matches for only 33% of deaths. The remaining cases formed cells with multiple cases (32% in cells with an equal number of cases from each file and 35% in cells with an unequal number). We sampled from the joint posterior distribution of model parameters and the permutations that link cases from the two files within each cell. The linking models included the regression of location of death on CoD and other parameters, and the regression of cost measures with a monotone missing data pattern on CoD and other demographic characteristics. Permutations were sampled by enumerating the exact distribution for small cells and by the Metropolis algorithm for large cells. Sparse matrix data structures enabled efficient calculations despite the large dataset (approximate to 1.7 million cases). The procedure generates m datasets in which the matches between the two files are imputed. The m datasets can be analyzed independently and results can be combined using Rubin's multiple imputation rules. Our approach can be applied in other file-linking applications. Supplementary materials for this article are available online.