Multi-issue negotiation with deep reinforcement learning

Chang Ho-Chun Herbert

Abstract

Negotiation is a process where agents work through disputes and maximize surplus. This paper investigates the use of deep reinforcement learning in the domain of negotiation, evaluating its ability to exploit, adapt, and cooperate. Two actor-critic networks were trained for the bidding and acceptance strategy, against time-based agents, behavior-based agents, and through self-play. Results reveal four key findings. First, neural agents learn to exploit time-based agents, achieving clear transitions in decision values. The primary barriers are the change in marginal utility (second derivative) and cliff-walking resulting from negotiation deadlines. Second, the Cauchy distribution emerges as suitable for sampling offers, due to its peaky center and heavy tails. Third, neural agents demonstrate adaptive behavior against behavior-based agents. Fourth, neural agents learn to cooperate during self-play. Agents learn non-credible threats, which resemble reputation-based strategies in the evolutionary game theory literature. (C) 2020 Elsevier B.V. All rights reserved.
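
The abstract's second finding, that the Cauchy distribution suits offer sampling because of its heavy tails, can be illustrated with a minimal sketch. This is not the paper's implementation: the function names, the [0, 1] offer range, the center 0.5, the scale 0.05, and the Gaussian baseline are all assumptions chosen for illustration. The sketch draws offers from a Cauchy distribution by inverse-CDF sampling and compares how often they land far from the center relative to a Gaussian with the same scale.

```python
import math
import random


def sample_cauchy_offer(center, scale, rng, low=0.0, high=1.0):
    """Draw one offer from Cauchy(center, scale), clipped to [low, high].

    Uses inverse-CDF sampling: x = center + scale * tan(pi * (u - 0.5)).
    The offer range [low, high] is an illustrative assumption.
    """
    u = rng.random()  # uniform in [0, 1)
    raw = center + scale * math.tan(math.pi * (u - 0.5))
    return min(high, max(low, raw))


def sample_gaussian_offer(center, scale, rng, low=0.0, high=1.0):
    """Gaussian baseline with the same center and scale, clipped the same way."""
    raw = rng.gauss(center, scale)
    return min(high, max(low, raw))


def far_fraction(samples, center, threshold):
    """Fraction of samples lying more than `threshold` away from `center`."""
    return sum(abs(s - center) > threshold for s in samples) / len(samples)


rng = random.Random(0)  # fixed seed so the run is repeatable
n = 20000
cauchy_offers = [sample_cauchy_offer(0.5, 0.05, rng) for _ in range(n)]
gauss_offers = [sample_gaussian_offer(0.5, 0.05, rng) for _ in range(n)]

# Heavy tails: the Cauchy sampler explores offers far from its center
# much more often than the Gaussian sampler does.
cauchy_far = far_fraction(cauchy_offers, 0.5, 0.25)
gauss_far = far_fraction(gauss_offers, 0.5, 0.25)
print(f"Cauchy far-offer fraction:   {cauchy_far:.4f}")
print(f"Gaussian far-offer fraction: {gauss_far:.4f}")
```

Under these assumed parameters, the Cauchy sampler places roughly one offer in eight more than 0.25 away from its center, while the Gaussian sampler almost never does. That occasional large deviation is one plausible reading of why heavy tails help an agent keep exploring the offer space.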

Bibliographic record

Source: Knowledge-Based Systems, 2021, issue 9, pp. 106544.1-106544.12 (12 pages)
Author: Chang Ho-Chun Herbert
Affiliations: Univ Edinburgh Sch Informat 10 Crichton St Edinburgh Midlothian Scotland; Univ Southern Calif Annenberg Sch Commun & Journalism Los Angeles CA 90007 USA
Original format: PDF
Language: English
Keywords: Deep reinforcement learning; Negotiation; Game theory