...
首页> 外文期刊>Knowledge-Based Systems >Multi-issue negotiation with deep reinforcement learning
【24h】

Multi-issue negotiation with deep reinforcement learning

机译:与深增强学习的多项问题谈判

获取原文
获取原文并翻译 | 示例
           

摘要

Negotiation is a process where agents work through disputes and maximize surplus. This paper investigates the use of deep reinforcement learning in the domain of negotiation, evaluating its ability to exploit, adapt, and cooperate. Two actor-critic networks were trained for the bidding and acceptance strategy, against time-based agents, behavior-based agents, and through self-play. Results reveal four key findings. First, neural agents learn to exploit time-based agents, achieving clear transitions in decision values. The primary barriers are the change in marginal utility (second derivative) and cliff-walking resulting from negotiation deadlines. Second, the Cauchy distribution emerges as suitable for sampling offers, due to its peaky center and heavy tails. Third, neural agents demonstrate adaptive behavior against behavior-based agents. Fourth, neural agents learn to cooperate during self-play. Agents learn non-credible threats, which resemble reputation-based strategies in the evolutionary game theory literature. (C) 2020 Elsevier B.V. All rights reserved.
机译:谈判是代理通过纠纷工作并最大限度地提高盈余的过程。本文调查了在谈判领域中的深度加固学习的使用,评估其利用,适应和合作的能力。两个演员 - 评论家网络接受了招标和接受战略,针对基于时间的代理商,基于行为的代理商以及通过自我扮演的培训。结果显示了四个关键结果。首先,神经代理商学习利用基于时间的代理,在决策价值中实现明确的转换。初级障碍是谈判截止日期的边际效用(第二衍生物)和悬崖行走的变化。其次,由于其峰值中心和沉重的尾部,Cauchy分布会出现适用于抽样优惠。第三,神经剂展示了对基于行为的代理的适应性行为。第四,神经代理商学会在自我播放期间合作。代理商学习不可信的威胁,类似于进化博弈论文学中的基于信誉的战略。 (c)2020 Elsevier B.v.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号