Distribution-Free Prediction Sets for Two-Layer Hierarchical Models
成果类型:
Article
署名作者:
Dunn, Robin; Wasserman, Larry; Ramdas, Aaditya
署名单位:
Novartis; Novartis USA; Carnegie Mellon University; Carnegie Mellon University
刊物名称:
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION
ISSN/ISSBN:
0162-1459
DOI:
10.1080/01621459.2022.2060112
发表日期:
2023
页码:
2491-2502
关键词:
inference
metaanalysis
variables
patterns
摘要:
We consider the problem of constructing distribution-free prediction sets for data from two-layer hierarchical distributions. For iid data, prediction sets can be constructed using the method of conformal prediction. The validity of conformal prediction hinges on the exchangeability of the data, which does not hold when groups of observations come from distinct distributions, such as multiple observations on each patient in a medical database. We extend conformal methods to a hierarchical setting. We develop CDF pooling, single subsampling, and repeated subsampling approaches to construct prediction sets in unsupervised and supervised settings. We compare these approaches in terms of coverage and average set size. If asymptotic coverage is acceptable, we recommend CDF pooling for its balance between empirical coverage and average set size. If we desire coverage guarantees, then we recommend the repeated subsampling approach. Supplementary materials for this article are available online.