Reusing Historical Trajectories in Natural Policy Gradient via Importance Sampling: Convergence and Convergence Rate
成果类型:
Article; Early Access
署名作者:
Lin, Yifan; Wang, Yuhao; Zhou, Enlu
署名单位:
University System of Georgia; Georgia Institute of Technology
刊物名称:
OPERATIONS RESEARCH
ISSN/ISSBN:
0030-364X
DOI:
10.1287/opre.2024.0854
发表日期:
2025
关键词:
摘要:
This article may be used only for the purposes of research, teaching, and/or private study. Commercial use or systematic downloading (by robots or other automatic processes) is prohibited without explicit Publisher approval, unless otherwise noted. For more information, contact permissions@informs.org. The Publisher does not warrant or guarantee the article's accuracy, completeness, merchantability, fitness inclusion of an advertisement in this article, neither constitutes nor implies a guarantee, endorsement, or support
来源URL: