How to Sample and When to Stop Sampling: The Generalised Wald Problem and Minimax Policies*

成果类型:
Article; Early Access
署名作者:
Adusumilli, Karun
署名单位:
University of Pennsylvania
刊物名称:
REVIEW OF ECONOMIC STUDIES
ISSN/ISSBN:
0034-6527
DOI:
10.1093/restud/rdaf021
发表日期:
2025
关键词:
foundations MODEL
摘要:
We study sequential experiments where sampling is costly and a decision-maker aims to determine the best treatment for full-scale implementation by (1) adaptively allocating units between two possible treatments, and (2) stopping the experiment when the expected welfare (inclusive of sampling costs) from implementing the chosen treatment is maximised. Working under a continuous time limit, we characterise the optimal policies under the minimax regret criterion. We show that the same policies also remain optimal under both parametric and non-parametric outcome distributions in an asymptotic regime where sampling costs approach zero. The minimax optimal sampling rule is just the Neyman allocation: it is independent of sampling costs and does not adapt to observed outcomes. The decision-maker halts sampling when the product of the average treatment difference and the number of observations surpasses a specific threshold. The results derived also apply to the so-called best-arm identification problem, where the number of observations is exogenously specified.
来源URL: