Learning non-cooperative dialogue behaviours

机译：学习非合作对话行为

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Non-cooperative dialogue behaviour has been identified as important in a variety of application areas, including education, military operations, video games and healthcare. However, it has not been addressed using statistical approaches to dialogue management, which have always been trained for co-operative dialogue. We develop and evaluate a statistical dialogue agent which learns to perform non-cooperative dialogue moves in order to complete its own objectives in a stochastic trading game. We show that, when given the ability to perform both cooperative and non-cooperative dialogue moves, such an agent can learn to bluff and to lie so as to win games more often - against a variety of adversaries, and under various conditions such as risking penalties for being caught in deception. For example, we show that a non-cooperative dialogue agent can learn to win an additional 15.47% of games against a strong rule-based adversary, when compared to an optimised agent which cannot perform non-cooperative moves. This work is the first to show how an agent can learn to use non-cooperative dialogue to effectively meet its own goals.

机译：非合作对话行为已被认为在包括教育，军事行动，视频游戏和医疗保健在内的许多应用领域中都很重要。但是，尚未使用统计方法进行对话管理来解决该问题，而对话管理始终经过训练以进行合作对话。我们开发和评估一个统计对话代理，该代理学习执行非合作对话动作，以完成其在随机交易游戏中的目标。我们证明，只要具备执行合作和非合作对话动作的能力，这样的特工就可以学会虚张声势和撒谎，以便更频繁地赢得比赛-对抗各种对手，并在各种条件下（例如冒险）被欺骗的处罚。例如，我们显示，与不能执行非合作动作的优化代理相比，非合作对话代理可以学会赢得更多的15.47％的游戏，以对抗强大的基于规则的对手。这项工作是第一个展示代理如何学习非合作对话以有效实现其目标的方法。

著录项

来源
《Annual meeting of the Special Interest Group on Discourse and Dialogue》|2014年|60-68|共9页
会议地点
作者
Ioannis Efstathiou; Oliver Lemon;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Energy emergency supply chain collaboration optimization with group consensus through reinforcement learning considering non-cooperative behaviours [J] . Liu Xiang Energy . 2020,第Nova1期

机译：通过考虑非合作行为，通过加强学习与集团共识的能源应急供应链协作优化
2. Teacher-student dialogue: transforming teacher interpersonal behaviour and pedagogical praxis through co-teaching and co-generative dialogue [J] . Yuli Rahmawati, Rekha Koul, Darrell Fisher Learning Environments Research . 2015,第3期

机译：师生对话：通过协同教学和协同对话，改变教师的人际行为和教学实践
3. Spoken Dialogue System for Information Navigation based on Statistical Learning of Semantic and Dialogue Structure [J] . 吉野幸一郎人工知能: 人工知能学会誌 . 2015,第1期

机译：基于语义和对话结构统计学习的信息导航口语对话系统
4. Learning non-cooperative dialogue behaviours [C] . Ioannis Efstathiou, Oliver Lemon Annual meeting of the Special Interest Group on Discourse and Dialogue . 2014

机译：学习非合作对话行为
5. Min-Max Inverse Reinforcement Learning for Learning Bi-Modal Dialogue Policies [D] . Patil, Gandharv. 2020

机译：用于学习双模对话策略的最大最大逆钢筋学习
6. Reinforcement Learning-Based Satellite Attitude Stabilization Method for Non-Cooperative Target Capturing [O] . Zhong Ma, Yuejiao Wang, Yidai Yang, 2018

机译：基于强化学习的非合作目标捕获卫星姿态稳定方法
7. Learning non-cooperative dialogue behaviours [O] . Ioannis Efstathiou, Oliver Lemon 2015

机译：学习非合作对话行为

Learning non-cooperative dialogue behaviours

摘要

著录项

相似文献

相关主题

期刊订阅