Dynamic Programming Principle for Automatic Negotiations

成果类型:
Article
署名作者:
Phogat, Karmvir Singh; Tsumura, Koji
署名单位:
University of Tokyo
刊物名称:
IEEE TRANSACTIONS ON AUTOMATIC CONTROL
ISSN/ISSBN:
0018-9286
DOI:
10.1109/TAC.2023.3285859
发表日期:
2024
页码:
1281-1287
关键词:
Dynamic Programming Search problems Protocols programming Technological innovation optimization Heuristic algorithms Automatic negotiations optimal bidding strategies Sequential bargaining
摘要:
In automatic negotiation, an automated agent is trained to negotiate on behalf of a human negotiator. We consider that the domain of negotiations is known to both the agent and its opponents. In this setting, a greedy concession algorithm (GCA) is employed to find an optimal policy for the agent when the agent's belief about the opponent is known. However, the GCA is computationally expensive to certain class of policies. In this article, we propose a reverse GCA that is computationally less expensive for such class of policies. We also provide an alternate proof to establish that the GCA is a dynamic programming principle.